Showing posts with label TBX. Show all posts
Showing posts with label TBX. Show all posts

Sunday, March 12, 2023

Conversion tools and difference checkers

Conversion tools:
TBX convert: On this page, you can convert between several glossary filetypes: UTX-Simple, GlossML, TBXGlossary,
OLIF. TBX (TermBase eXchange) is a family of XML-based languages for the interchange of
terminological information (called TMLs, for Terminological Markup Language; also informally called “dialects” of TBX). All of TBX shares a core structure, in which information is represented on one of three structural levels: concept, language, and term.
UTF-16 to UTF-8 Converter
Glossary converter allows to convert between MultiTerm Termbases and other terminology formats by simple drag and drop, with minimal user interaction. It supports xls, xlsx, csv, txt, tbx, utx, multiterm export files and tmx.
TBX Utilities: This is a collection of tools to be used in working with Term Base eXchange (TBX); an open, XML based standard for exchanging structured terminological data submitted for adoption under ISO 30042 Technical Committee 37.
TBX Resources: TBX Resources is dedicated to helping you use the industry-standard TBX format with your terminological data. Here you’ll find tutorials and tools for using and converting to and from TBX.
Other TBX downloads and tools
Converting TBX files to XLS/CSV format
TXT
AntFile Converter: A freeware tool to convert PDF and Word (DOCX) files into plain text for use in corpus tools like AntConc.
EncodeAnt is a freeware character encoding detection and conversion tool. EncodeAnt takes an input list of text files (e.g. .txt) and attempts to auto-detect the character encoding that the files use. The character encoding can also be set manually. EncodeAnt also has an option to auto-convert the character encoding of the files to UTF-8, which is a standard used in most corpus research. The converted files are saved in a separate folder leaving the original files untouched.
Difference checkers:
Winmerge.org: WinMerge is an Open Source differencing and merging tool for Windows. WinMerge can
compare both folders and files, presenting differences in a visual text format that is easy to understand and handle.
DiffEngineX is a fast and scalable compare utility that finds the differences between the formulae, constants, defined names, cell comments and Visual Basic VBA code contained in either two whole Excel workbooks or selected worksheets on Windows. It can align similar rows and columns across two different Excel spreadsheets. It works with xls, xlsx, xlsm and xlsb files. xla and xlam add-ins need to be converted first into xls and xlsm files before DiffEngineX can compare them. Excel 2003, 2007, 2010 or 2013 is required for this spreadsheet comparison tool to work.
ExcelDiff analyzes multiple Microsoft Excel(.csv, .xls, .xlsx, .xlsm, .xlsb) files and shows their differences graphically, even clarifies cell-level.
KDiff3

Source: inmyownterms.com

Sunday, February 11, 2018

Regex to convert every second paragraph mark to TAB

After downloading the TBX from IATE, I managed to get a simple text file with the entries separated by new lines.

Verwaltungsvorschriften
norme administrative
optische Datenträger
suporți optici
Feuerwerkskörper
focuri de artificii

I wanted to get a TSV, a tab-delimited file out of it. Normally I would have opened the file in Word and convert to a table with the paragraph delimiter and then paste the text back into Notepad++, but with RegEx it is simpler and quicker. Make sure you select Regular Expressions in the find-replace mask.

Find:
\r\n(.*)\r\n
Replace with:
\t$1\r\n

The result is:
Zuständigkeit der Mitgliedstaaten    competența statelor membre ale Uniunii Europene
Verwaltungsvorschriften    norme administrative
optische Datenträger    suporți optici

Friday, March 17, 2017

Software with TBX support

This is a list of software claiming TBX support of some kind. It contains links to the software website, a brief description of the extent of its support of TBX, and other information (type, price, etc.). We are currently in the process of validating the TBX output of each of the listed products. The results of the validation will be shown in the final column as it is finished.
The widespread adoption of TBX would be a boon to the translation industry. By sharing a list of software with TBX support, we hope to further encourage its use both in further software development and personal data storage. Using TBX means easy data sharing between each of the products listed here: a non-trivial benefit!
Click the column headers to sort by that column.
Name and Website Tool Type Web-based? Nature of TBX Support Claims Price Compliance
Content Checker no Checks valid core structure, and then adherence to an XCS file. open source
Translation Environment Tool (TeNT) no
import/export
open source TBD
TeNT no import/export TBX-Basic only (see this document) $599 TBD
SDL Multiterm Terminology management no
import/export
$300
Translation Management System (TMS) no
import/export
open source TBD
Terminology management yes import/export ? TBD
TeNT no Convert from CSV or MARTIF to TBX
import/export
298-598 TBD
TeNT/TMS yes import/export
€0-199
TBD
Terminology Management optional import/export pro- $795
web- ?
web extension module: $4200
TBD
Terminotix Synchroterm Term extraction no export $420 TBD
Maxprograms Anchovy Terminology management/extraction no import/export "TBX with default XCS or proprietary extensions, cusomizable using XSL stylesheets" Swordfish: €260 (see also student/site prices) TBD
Mneme TeNT no import/export free TBD
Terminology Management (OWL) based no import (conversion to OWL) free TBD
Quality Assurance no import (only?)
€249-2500
TBD
Terminology management optional import/export suite-?
cloud-?
TBD
TMS yes
import/export/conversion (via Translate Toolkit) open source TBD
TEnT no
import/export/conversion (via Translate Toolkit) open source TBD
Toolkit/API for file conversion QA, and more general functions no
import/export/conversion open source TBD; see here for progress.
XBench QA and Terminology Management no import/export
€39/yr; varies by residency; beta is free
TBD
Lingotek TeNT yes import/export ? TBD
Metatexis CAT no import/export Word: $50-180
Server: ? (free for training purposes)
TBD
Okapi Checkmate QA no import/export open source TBD
TeNT/localization no import/export ? TBD
TeNT no import/export (via plugins)
€95-2900
TBD
Termbases Terminology Management yes import/export Personal-free
€30/mo 300/yr
TBD
Wordbee TeNT yes import (and export?) starts at $178/6 mo $323/yr TBD
Star Transit/TermStar CAT no ? ? TBD
Idiom Worldserver TeNT no import/export free? TBD
Across TeNT no import/export free for freelancers TBD
Wordfast TeNT no import/export
€200-500
TBD
Acrolinx Authoring/Terminology Management no import/export ? TBD
RC-WinTrans localization no imports Microsoft glossaries (which are TBX) $795-4575 TBD
Lingobit Localizer Enterprise localization no import/export $1950 TBD
localization/TeNT no import (and export?)
€620
TBD
TeNT cloud:yes import/export Cloud: free - $230/mo
Editor: free
Server: ?
TBD
JiveFusion CAT no import/export ? TBD
OpenTM2 TeNT no import/export open source TBD
Fluency TeNT/other optional import/export Translation suite starts at $349 TBD
Text United TeNT/other yes import/export (not for client's projects) single user- free

TBD
DVX2 TeNT no import/export 590 €

TBD
MultiTrans Prism TMS no import/export 590 €

TBD
Terminator Terminology Management yes import/export Open Source

TBD
QA-Distiller Quality Assurance no export Free using plugin

TBD
Weblate CAT yes import/export Free

TBD
The Microsoft Language Portal Downloads also deserve mention because they offer sizable terminology downloads in TBX format, for free.
Source: http://www.tbxconvert.gevterm.net