[Next] [Up] [Previous]
Next: Adapting the Tagger Up: Probabilistic Tagging in Previous: The Algorithm

Client/Server

The tagger is implemented in a client-server model. The knowledge base resides on the server, and the tagging engine establishes a network connection to the server and retrieves the relevant data for tags and tag sequence probabilities from it. This has several advantages:

  1. The resource data is in a well defined place and can be shared between different invocations of the tagger
  2. The workload can be distributed across different machines
  3. The memory usage of the tagger decreases enormously
  4. The tagger can be used on small machines
  5. The server can easily be used by other applications as well
The server is implemented in C, while the client is written in Java. That way it can run on all kinds of machines, provided there is a server available.



Multext-East