myCAT originally comes with the following alignment maps:
It is possible to add the following language pairs:
Download them and unzip them to C:\MYCAT\map. Then add to your corpus a few document pairs that include the new language and run C:\MYCAT\run\UPDATE and RESTART.bat (the paths shown are for the Windows version).
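The unzip step can also be scripted. The following is a minimal sketch in Python, not part of the myCAT distribution: the function name `install_alignment_map` and the archive name in the usage comment are hypothetical, and only the target folder C:\MYCAT\map comes from the instructions above.

```python
import zipfile
from pathlib import Path


def install_alignment_map(archive: Path, map_dir: Path) -> list[str]:
    """Unzip a downloaded alignment-map archive into the myCAT map folder.

    Returns the names of the archive members that were extracted.
    """
    if not zipfile.is_zipfile(archive):
        raise ValueError(f"{archive} is not a valid zip archive")
    # Create the map folder if it does not exist yet.
    map_dir.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(archive) as zf:
        zf.extractall(map_dir)
        return zf.namelist()


# Example call (hypothetical archive name, Windows paths from the text above):
# install_alignment_map(Path(r"C:\Downloads\new_language_pair.zip"),
#                       Path(r"C:\MYCAT\map"))
```

After the archive is extracted, the document pairs still have to be added to the corpus and the update script run as described above.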
Other languages are supported as well (see the installation manual at the top of this page), but they will be aligned using a simple geometric algorithm.
This myCAT distribution comes with a very small test corpus of 48 documents that lets you test the following six languages: English, French, Spanish, Arabic, Russian and Chinese. These documents are organized in three collections: UNO, WIPO and WTO. They are all public documents downloaded from those organizations' websites.
The server required to run myCAT should comply with the following specifications.
myCAT must be deployed on a dedicated server, which can be either a physical or a virtual machine. Please note that the server needs to be 64-bit.
CPU: one quad-core CPU, 64-bit
RAM: 6 to 8 GB
Disk space: depends on your corpus size. myCAT needs about 10 GB for all the applications (see below), and the additional free space on the disk should be about four times the size of your corpus: the corpus is converted to a TXT corpus (about a tenth of the initial corpus size), and alignment maps are built for each document pair (almost four times the corpus size). For example, if your initial corpus size is 50 GB, the following disk space would be needed:
It is also wise to provision some extra space, because the corpus grows over time: 25% to 50% more, depending on how fast your corpus grows, would be reasonable.
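The sizing rule above can be turned into a rough calculator. The sketch below is an illustration only, under stated assumptions: the original corpus is stored on the same disk, the alignment maps take the full four times the corpus size (the text says "almost"), and the growth margin is applied to the corpus-related space but not to the 10 GB of applications. The function name `estimate_disk_gb` and its parameters are hypothetical.

```python
def estimate_disk_gb(corpus_gb: float,
                     growth_margin: float = 0.25,
                     apps_gb: float = 10.0) -> dict:
    """Rough disk-space estimate (in GB) for a myCAT server.

    Assumptions (not from the myCAT documentation):
    - the original corpus sits on the same disk as myCAT;
    - alignment maps are counted at exactly 4x the corpus size;
    - the growth margin applies only to corpus-related space.
    """
    txt_gb = corpus_gb / 10       # TXT corpus: about a tenth of the corpus
    maps_gb = corpus_gb * 4       # alignment maps: almost four times the corpus
    corpus_related_gb = corpus_gb + txt_gb + maps_gb
    margin_gb = corpus_related_gb * growth_margin
    return {
        "applications": apps_gb,
        "corpus": corpus_gb,
        "txt_corpus": txt_gb,
        "alignment_maps": maps_gb,
        "growth_margin": margin_gb,
        "total": apps_gb + corpus_related_gb + margin_gb,
    }


# Example: the 50 GB corpus from the text, with a 25% growth margin.
estimate = estimate_disk_gb(50, growth_margin=0.25)
for item, size in estimate.items():
    print(f"{item:>15}: {size:8.2f} GB")
```

Under these assumptions, a 50 GB corpus comes out at roughly 330 GB with a 25% margin and roughly 390 GB with a 50% margin. Treat these as order-of-magnitude figures and not as official requirements.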
The following applications will be installed on the server:
The software owned by Olanto is distributed under the GNU Affero General Public License Version 3 (AGPL v3).