Software > myCAT > Documentation


Installation of myCAT

  • myCAT for Windows
    The procedure to perform a standard installation of myCAT on a dedicated server running Windows is described here.
  • myCAT for GNU/Linux
    The procedure to perform a standard installation of myCAT on a dedicated server running Ubuntu 12.04 LTS is described here.

 



Adding Alignment Maps

myCAT originally comes with the following alignment maps:

  • English-Arabic
  • English-French
  • English-Portuguese
  • English-Russian
  • English-Spanish

It is possible to add the following language pairs:

Simply download them and unzip them to C:\MYCAT\map (for the Windows version). Then add to your corpus a few document pairs which include the new language and run C:\MYCAT\run\UPDATE and RESTART.bat (for the Windows version).

Other languages are supported as well (see the installation manual on top of this page) but they will be aligned according to a simple geometric algorithm.
 



Document Naming Rules

The corpus of bi-text documents to be used with myCAT should comply with the following specifications.

This myCAT distribution comes with a very small test corpus of 48 documents allowing to test the following six languages: English, French, Spanish, Arabic, Russian and Chinese. These documents are organized in three collections: UNO, WIPO and WTO. They are all public documents which were downloaded from those organization's websites.
 



Server Specifications

The server required to run myCAT should comply with the following specifications.

myCAT must be deployed on a dedicated server, which can be either a physical or a virtual machine. Please note that the server needs to be 64-bit.
 

CPU:

One CPU (Quadricore), 64-bit

RAM:

6 to 8 Gb

Disk space:

Depends on your corpus size. myCAT needs about 10 Gb for all the applications (see below) but the additional free space on the disk should be about four times the size of your corpus, because we convert it to a TXT corpus (this is about a tenth of the initial corpus size) and we build alignment maps for each document pair (this is almost four times the corpus size). So, for example if your initial corpus size is 50 Gb, we would need the following disk space:

  • Applications: 10 Gb
  • Corpus: 50 Gb
  • Converted corpus and maps: 200 Gb
    TOTAL: 260 Gb

And of course it would be wise to provide for some additional space because the corpus grows over time, so maybe 25% to 50% more space (depending on how fast your corpus grows) would be reasonable.

Operating system:
  • Windows Server 2008 or Windows 7, 64-bit version
  • or GNU/Linux Ubuntu 12.04 LTS, 64-bit version
Applications:

The applications to be installed on it will be the following:

  • Java JRE 6 or 7
  • Apache Tomcat 6.0 (to publish the application online)
  • OpenOffice 3 (for the conversions)
  • myCAT (Java classes distributed in a .war file through the Tomcat)

myCAT is available both for Windows (tested on Windows 7 and Windows 2008 Server) and for GNU/Linux (tested on Ubuntu 12.04 LTS).

The software owned by Olanto are distributed under the GNU Affero General Public License Version 3, or AGPL V3.

Olanto CAT Suite

Available for Windows and GNU/Linux