12/21/2023 0 Comments Russisk oversetter![]() Languages: English, Estonian, Latvian and Lithuanian.įormat: The Tokenization component works with plaintext data that is encoded in UTF -8. The Linguistic tool API provides functionality for the following tasks: text tokenisation, sentence breaking, morphological analysis, part of speech (and for morphologically rich languages also morpho-syntactic) tagging, and language detection.ĭomains: The component is domain independent. With Tilde’s Linguistic tool API, users can access linguistic processing components of text data. Texts can be in various formats: HTML, docx, etc. Identified terms in texts are enriched with translation equivalents acquired from terminology resources and databases. Text enrichment with term translation equivalents Online terminology extraction from parallel and comparable data sources found on the web and directed by users.Statistical Data Base (SDB) - a large offline resource of automatically extracted multilingual terminology, which is refined by Tilde Terminology users whenever translation equivalents are validated.Translation equivalent candidates can then be acquired from parallel and comparable corpora acquired from the web using: This service looks up identified terms in existing terminology resources. Supported document formats: PDF, Microsoft Word, Microsoft Excel, Microsoft PowerPoint, Text (.txt), Rich Text (.rtf), XLIFF, HTML, XML, MIF.Įxtracts identified terms in document and assembles glossaries of term sets. Identify terminology in documents and sentences using state-of-the-art linguistically, statistically, and reference corpora motivated term extraction methods. These services can be used to build comprehensive terminology solutions. With the Terminology API, Tilde provides services that keep terminology organized by identifying terms in documents, finding relevant translations, and assembling term glossaries. Tilde’s online terminology services ensure clear, consistent communication with customers across the globe. The MT systems are hosted in the cloud and can be integrated into any platform or application.Įnglish to Bulgarian, Czech, Danish, Dutch, Estonian, Finnish, French, German, Greek, Hungarian, Italian, Latvian, Lithuanian, Polish, Portuguese, Romanian, Russian, Slovenian, Spanish, SwedishĬurrently the system supports these file formats: DOC, DOCX, XLSX, PPTX, ODT, ODP, ODS, HTML, HTM, XHTML, XHT, TXT, TMX, XLIFF, SDLXLIFF, and TTX. With Tilde’s Translation API, users can access MT systems in multiple language pairs and domains.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |