Showing posts with label software. Show all posts
Showing posts with label software. Show all posts

Sunday, March 12, 2023

Conversion tools and difference checkers

Conversion tools:
TBX convert: On this page, you can convert between several glossary filetypes: UTX-Simple, GlossML, TBXGlossary,
OLIF. TBX (TermBase eXchange) is a family of XML-based languages for the interchange of
terminological information (called TMLs, for Terminological Markup Language; also informally called “dialects” of TBX). All of TBX shares a core structure, in which information is represented on one of three structural levels: concept, language, and term.
UTF-16 to UTF-8 Converter
Glossary converter allows to convert between MultiTerm Termbases and other terminology formats by simple drag and drop, with minimal user interaction. It supports xls, xlsx, csv, txt, tbx, utx, multiterm export files and tmx.
TBX Utilities: This is a collection of tools to be used in working with Term Base eXchange (TBX); an open, XML based standard for exchanging structured terminological data submitted for adoption under ISO 30042 Technical Committee 37.
TBX Resources: TBX Resources is dedicated to helping you use the industry-standard TBX format with your terminological data. Here you’ll find tutorials and tools for using and converting to and from TBX.
Other TBX downloads and tools
Converting TBX files to XLS/CSV format
TXT
AntFile Converter: A freeware tool to convert PDF and Word (DOCX) files into plain text for use in corpus tools like AntConc.
EncodeAnt is a freeware character encoding detection and conversion tool. EncodeAnt takes an input list of text files (e.g. .txt) and attempts to auto-detect the character encoding that the files use. The character encoding can also be set manually. EncodeAnt also has an option to auto-convert the character encoding of the files to UTF-8, which is a standard used in most corpus research. The converted files are saved in a separate folder leaving the original files untouched.
Difference checkers:
Winmerge.org: WinMerge is an Open Source differencing and merging tool for Windows. WinMerge can
compare both folders and files, presenting differences in a visual text format that is easy to understand and handle.
DiffEngineX is a fast and scalable compare utility that finds the differences between the formulae, constants, defined names, cell comments and Visual Basic VBA code contained in either two whole Excel workbooks or selected worksheets on Windows. It can align similar rows and columns across two different Excel spreadsheets. It works with xls, xlsx, xlsm and xlsb files. xla and xlam add-ins need to be converted first into xls and xlsm files before DiffEngineX can compare them. Excel 2003, 2007, 2010 or 2013 is required for this spreadsheet comparison tool to work.
ExcelDiff analyzes multiple Microsoft Excel(.csv, .xls, .xlsx, .xlsm, .xlsb) files and shows their differences graphically, even clarifies cell-level.
KDiff3

Source: inmyownterms.com

Friday, September 11, 2020

Soffice from Command Line - MAN Pages

 

Name

libreoffice - LibreOffice office suite

SYNOPSIS

libreoffice [--accept=accept-string] [--base] [--calc] [--convert-to output_file_extension[:output_filter_name] [--outdir output_dir] file]... [--display display] [--draw] [--global] [--headless] [--help|-h|-?] [--impress] [--invisible] [--infilter="<filter>"] [--math] [--minimized] [-n file]... [--nodefault] [--nolockcheck] [--nologo] [--norestore] [-o file]... [-p file...] [--print-to-file [--printer-name printer_name] [--outdir output_dir] file]... [--pt printername file...] [--show Impress file]... [--unaccept=accept-string] [--terminate_after_init] [--view file]... [--web] [--writer] [file...]
lobase
localc
lodraw
lofromtemplate
loimpress
lomath
loweb
lowriter

DESCRIPTION

LibreOffice (LO for short) is a multi-platform office productivity suite. It was derived from OpenOffice.org 3.3 Beta on September 28, 2010.

libreoffice is a shell script that sets up the environment and passes the command line arguments to the soffice.bin binary.

Alternatively, the following helper scripts start the respective module:

sbase, scalc, sdraw, simpress, smath, sofficerc, swriter

OPTIONS

--accept=accept-string
Specify a UNO connect-string to create a UNO acceptor through which other programs can connect to access the API.
--base
Starts the wizard for a new Base document.
--calc
Starts with a new Calc document.
--convert-to output_file_extension[:output_filter_name] [--outdir output_dir] file...
Batch converts files. If --outdir is not specified then the current working directory is used as the output directory for the converted files. It implies --headless.

Examples:

--convert-to pdf *.doc

Converts all .doc files to PDFs.

--convert-to pdf:writer_pdf_Export --outdir /home/user *.doc

Converts all .doc files to PDFs using the settings in the Writer PDF export dialog and saving them in /home/user.

--display display
This option specifies the X server to use; see X(7)
--draw
Starts with a new Draw document.
--global
Starts with a new Global document.
--headless
Starts in "headless mode", which allows using the application without user a interface.

This special mode can be used when the application is controlled by external clients via the API.

It implies --invisible and strictly ignores any GUI environment. --quickstart does not work with this parameter.

--help|-h|-?
Lists LibreOffice command line parameters.
--impress
Starts with a new Impress document.
--invisible
Starts in invisible mode.

Neither the start-up logo nor the initial program window will be visible. LO can be controlled and documents and dialogs can be opened via the API.

When started with this parameter, it can only be quit using the taskmanager (Windows) or the kill command (UNIX based systems).

--quickstart does not work with this parameter.

--infilter="<filter>"
Force an input filter type if possible. For example --infilter="Calc Office Open XML" only
--math
Starts with a new Math document.
--minimized
Keeps the splash screen minimized.
-n template...
Creates the a new document from the given templates.
--nodefault
Starts LO without creating a new document. The next time you start LO, the welcome screen is shown.

It's used together with --nologo by quick starters. Note that --quickstart has no longer been supported since OpenOffice.org 2.0.0.

--nolockcheck
Disables the check for remote instances using the installation.
--nologo
Disables the splash screen at program start.
--norestore
Disables restart and file recovery after a system crash. It is possible that LO will try to restore a file it keeps crashing on, if that happens --norestore is the only way to start LO.
--nosplash
Disables the splash screen at program start.
-o file...
Opens the given files for editing, even templates.

Without -o a template file would create a new document derived from that template.

-p file...
Prints the given files to the default printer and ends. The splash screen does not appear.

If the file name contains spaces, then it must be enclosed in quotation marks.

--print-to-file [--printer-name printer_name] [--outdir output_dir] file...
Batch print files to file. If --printer-name is not specified the default printer is used. If --outdir is not specified then the current working directory is used as the output directory for the converted files.

Examples:

--print-to-file *.doc

Prints all .doc files to the current working directory using the default printer.

--print-to-file --printer-name nasty_lowres_printer --outdir /home/user *.doc

Prints all .doc files to /home/user directory using the nasty_lowres_printer.

--pt printername file...
Prints the given files to the printer printername and ends. The splash screen does not appear.

If a file name contains spaces, then it must be enclosed in quotation marks.

--quickstart --quickstart=no
Starts LO with it's quick starter. --quickstart disable the quick starter.

Does not work with --invisible or --headless.

--show Impress file...
Opens the given Impress files, starts the presentation and quits after they have finished.
--unaccept=accept-string
Closes an acceptor that was created with --accept option.

Use --unaccept=all to close all open acceptors.

--terminate_after_init
Starts LO and terminates after it registers some UNO services. Doesn't show the splash during startup.
--view file...
Opens the given files read-only creating a temporary copy of them at $TMPDIR.
--web
Starts with a new HTML document.
--writer
Starts with a new Writer document.

Wednesday, January 29, 2020

Excel formula to determine age from date of birth

Birth date Age
15.07.1996 24
07.06.1996 24
05.07.2002 18
04.12.2005 15
19.07.1992 28
23.11.1998 22
28.10.2000 20
02.10.2001 19
23.09.1996 24
09.05.2001 19
27.09.1997 23

Sometimes you might want to get the age of somebody, for instance, în order to determine whether they are minor or major (18 years old).
The German formula is: =(JAHR(JETZT())-JAHR(A2))

An English formula may be: ==INT(YEARFRAC(A2,TODAY()))
or
=IF(INT(YEARFRAC(A1,TODAY()))<18,"Minor","Adult")
or
=DATEDIF(A1, TODAY(), "y")
or with months and days
=DATEDIF(A1,TODAY(),"Y") & " Years, " & DATEDIF(A1,TODAY(),"YM") & " Months, " & DATEDIF(A1,TODAY(),"MD") & " Days"

You may use conditional formatting (bedingte Formatierung) to highlight figures under 18.

Thursday, November 1, 2018

You-Get - Cli Downloader

Most of you may used or heard about Youtube-dl, a command line program to download videos from youtube and other 100+ websites. I just stumbled upon a similar utility named “You-Get”. It is also a CLI downloader written in Python. It allows you to download images, audios and videos from popular websites like Youtube, Facebook, Twitter and a lot more. Currently, there are 80+ websites are supported. Click here to read the full list of supported sites.
You-Get is not only a downloader, but also can stream the online videos in your media player. It even allows you to search for videos on google. Just pass the search term and You-Get will google it and download the most relevant videos. Another notable feature, it allows you to pause and resume the downloads. It is completely free, open source and cross-platform application that on Linux, Mac OS and Windows.

Install You-Get

Make sure you have installed the following prerequisites.
You-Get can be installed in many ways. The officially recommended method is using Pip package manager. If you haven’t installed PIP yet, refer the following link.
Please note that you must install Python 3 version of pip.
Now, run the following command to install You-Get:
$ pip3 install you-get
You can upgrade You-Get to its latest version using command:
$ pip3 install --upgrade you-get

Getting Started With You-Get

The usage is pretty much same as Youtube-dl utility.
Download Videos
To download a video, just run:
$ you-get https://www.youtube.com/watch?v=HXaglTFJLMc
Sample output:
site: YouTube
title: The Last of The Mohicans by Alexandro Querevalú
stream:
 - itag: 22
 container: mp4
 quality: hd720
 size: 56.9 MiB (59654303 bytes)
 # download-with: you-get --itag=22 [URL]

Downloading The Last of The Mohicans by Alexandro Querevalú.mp4 ...
 100% ( 56.9/ 56.9MB) ├███████████████████████████████████████████████████████┤[1/1] 752 kB/s
You may want to view the details of the video before downloading. You-Get can do that for using “–info” or “-i” flag. This option will get you all available quality and formats of the given video.
$ you-get -i https://www.youtube.com/watch?v=HXaglTFJLMc
Or,
$ you-get -info https://www.youtube.com/watch?v=HXaglTFJLMc
Sample output would be:
site: YouTube
title: The Last of The Mohicans by Alexandro Querevalú
streams: # Available quality and codecs
 [ DASH ] ____________________________________
 - itag: 137
 container: mp4
 quality: 1920x1080
 size: 101.9 MiB (106816582 bytes)
 # download-with: you-get --itag=137 [URL]

- itag: 248
 container: webm
 quality: 1920x1080
 size: 90.3 MiB (94640185 bytes)
 # download-with: you-get --itag=248 [URL]

- itag: 136
 container: mp4
 quality: 1280x720
 size: 56.9 MiB (59672392 bytes)
 # download-with: you-get --itag=136 [URL]

- itag: 247
 container: webm
 quality: 1280x720
 size: 52.6 MiB (55170859 bytes)
 # download-with: you-get --itag=247 [URL]

- itag: 135
 container: mp4
 quality: 854x480
 size: 32.2 MiB (33757856 bytes)
 # download-with: you-get --itag=135 [URL]

- itag: 244
 container: webm
 quality: 854x480
 size: 28.0 MiB (29369484 bytes)
 # download-with: you-get --itag=244 [URL]

[ DEFAULT ] _________________________________
 - itag: 22
 container: mp4
 quality: hd720
 size: 56.9 MiB (59654303 bytes)
 # download-with: you-get --itag=22 [URL]
By default, You-Get will download the format marked with DEFAULT. If you don’t like that format or quality, you can pick any other format you like. Use the itag value given in the each format.
$ you-get --itag=244 https://www.youtube.com/watch?v=HXaglTFJLMc
Download Audios
The following command will download an audio from soundcloud website.
$ you-get 'https://soundcloud.com/uiceheidd/all-girls-are-same-999-prod-nick-mira'
Site: SoundCloud.com
Title: ALL GIRLS ARE THE SAME (PROD. NICK MIRA)
Type: MP3 (audio/mpeg)
Size: 2.58 MiB (2710046 Bytes)

Downloading ALL GIRLS ARE THE SAME (PROD. NICK MIRA).mp3 ...
 100% ( 2.6/ 2.6MB) ├███████████████████████████████████████████████████████┤[1/1] 983 kB/s
To view the details of the audio file, use -i flag.
$ you-get -i 'https://soundcloud.com/uiceheidd/all-girls-are-same-999-prod-nick-mira'
Download Images
To download an image, run:
$ you-get https://pixabay.com/en/mountain-crumpled-cyanus-montanus-3393209/
You-Get can also download all images from a web page.
$ you-get https://www.ostechnix.com/pacvim-a-cli-game-to-learn-vim-commands/
Search Videos
You-Get doesn’t even a valid URL. You can just pass a random search terms. You-Get will google it and download the most relevant video based on your search string.
$ you-get 'Micheal Jackson'
Google Videos search:
Best matched result:
site: YouTube
title: Michael Jackson - Beat It (Official Video)
stream:
 - itag: 43
 container: webm
 quality: medium
 size: 29.4 MiB (30792050 bytes)
 # download-with: you-get --itag=43 [URL]

Downloading Michael Jackson - Beat It (Official Video).webm ...
 100% ( 29.4/ 29.4MB) ├███████████████████████████████████████████████████████┤[1/1] 2 MB/s
Watch Videos
You-Get can able to stream the online videos in your media player or browser, just without ads or comment section.
To watch videos in a media player, for example VLC, run the following command:
$ you-get -p vlc https://www.youtube.com/watch?v=HXaglTFJLMc
Or,
$ you-get --player vlc https://www.youtube.com/watch?v=HXaglTFJLMc
Similarly, to stream the videos in your browser, for example chromium, use:
$ you-get -p chromium https://www.youtube.com/watch?v=HXaglTFJLMc

As you can see in the above screenshot, there is no ads, comment section. Just a plain page with the video.
Set path and file name for downloaded videos
By default, the videos will be downloaded in the current working directory with default video titles. You can, of course, change them as per your liking using –output-dir/-o flag to set the path and –output-filename/-O to set the name of the downloaded file.
$ you-get -o ~/Videos -O output.mp4 https://www.youtube.com/watch?v=HXaglTFJLMc
Pause and resume downloads
Press CTRL+C to cancel a download. A temporary .download file will be saved in the output directory. Next time you run you-get with the same arguments, the download process will resume from the last session.
In case the file is completely downloaded, the temporary .download extension will be gone, and you-get will just skip the download. To enforce re-downloading, use the –force/-f option.
For more details, refer the help section by running the following command.
$ you-get --help
Source: https://www.ostechnix.com 

Monday, February 12, 2018

Free Xliff Editors

OmegaT

OmegaT is a Java-based translation tool that supports many file formats, including XLIFF documents.

Open Language Tools - Olanto

The Open Language Tools project provides a Java-based XLIFF editor, along with filters for various file formats.

Qt Linguist

Qt Linguist is the translation tool for the Qt environment. It is designed to work with Qt TS files, but supports also PO and XLIFF documents.

Virtaal

Virtaal is the translation tool of the Translate Toolkit. It is designed to work with PO files, but can also work with XLIFF documents and a number of other formats.

FelixCat XLIFF Translator

(Not open-source, but free) XLIFF Translator is a free XLIFF editor part of the Felix TM system.

Lokalize

Lokalize is a KDE application designed as an XLIFF editor. Lokalize can run under Windows with the whole KDE environment installed. The handbook for Lokalize is at: http://docs.kde.org/development/en/kdesdk/lokalize/index.html

Poedit

Beginning with version 2.2, Poedit also supports the version 1.2 and the new XLIFF 2 format, both in the free and open-source version and the PRO paid version.

Ocelot

Xliffie

Xliff-translator-tool

brightec Online XLIFF Editor (Web App)

www.beyondf.com/tranzapp/translate (Web)

Weblate

Heartsome

Other online tools: SmartCat, MateCat, MemSource

Saturday, January 13, 2018

Python Scripts to translate using Microsoft

#install additional libraries to add coloured text to output
!pip install termcolor
!pip install bs4
from termcolor import colored
from bs4 import BeautifulSoup
import requests
#Using Python for Text Translation with Microsoft Cognitive Services
# Specify the subscription Key
subscriptionKey = "ENTER YOUR COGNITIVE API KEY"
#Specify URLs for Cognitive Services - Translator Text API
translateUrl = 'https://api.microsofttranslator.com/v2/http.svc/Translate'
cognitiveServiceUrl = 'https://api.cognitive.microsoft.com/sts/v1.0/issueToken'
# Request Access Token
requestHeader = {'Ocp-Apim-Subscription-Key': subscriptionKey}
responseResult = requests.post(cognitiveServiceUrl, headers=requestHeader)
token = responseResult.text
print ("Access Token")
print (token)
# Original Text
text = "Créez des applications intelligentes et stratégiques avec une plateforme de base de données évolutive et hybride qui intègre tout ce qu'il vous faut : performances in-memory et sécurité avancée pour les analyses au sein de la base de données."
print(text)
# Specify source and target language
srcLanguage = "fr"
targetLanguage = "en"
# Define Parameters
params = {'appid': 'Bearer '+token, 'text': text, 'from': srcLanguage, 'to': targetLanguage}
requestHeader = {'Accept': 'application/xml'}
# Invoke Cognitive Services to perform translation
responseResult = requests.get(translateUrl, params=params, headers=requestHeader )
# Show original and target text
print(colored('Original Text\n', 'green'))
print(colored(text,'green'))
print ("\n")
print(colored('Translated Text\n', 'blue'))
soup = BeautifulSoup(responseResult.text,"lxml")
print(colored(soup.get_text(), 'blue'))
 
Or, even easier:
 
# -*- coding: utf-8 -*-

import http.client, urllib.parse

# *** Update or verify the following values. ***

# Replace the subscriptionKey string value with your valid subscription key.
subscriptionKey = 'ENTER KEY HERE'

host = 'api.microsofttranslator.com'
path = '/V2/Http.svc/Translate'

target = 'fr-fr'
text = 'Hello'

params = '?to=' + target + '&text=' + urllib.parse.quote (text)

def get_suggestions ():

    headers = {'Ocp-Apim-Subscription-Key': subscriptionKey}
    conn = http.client.HTTPSConnection(host)
    conn.request ("GET", path + params, None, headers)
    response = conn.getresponse ()
    return response.read ()

result = get_suggestions ()
print (result.decode("utf-8"))

Translate response
A successful response is returned in XML, as shown in the following example:
XML
<string xmlns="http://schemas.microsoft.com/2003/10/Serialization/">Salut</string>
 

Microsoft Translator Customization Features


Until a few years ago, automatic translation solutions only offered two approaches when it came to translating your content — use a default translation engine that powers major translation sites and apps such as Bing.com/translator, or build your own customized system painfully from scratch.
In 2012, Microsoft Translator broke this inflexible model with the launch of the Microsoft Translator Hub. This is just one instance of a broader class of work Microsoft is pursuing around artificial intelligence, and our vision for more personal computing experiences and enhanced productivity aided by systems that increasingly can see, hear, speak, understand and even begin to reason. The Hub allowed users to create as many custom systems as needed by combining Microsoft's enormous translation corpus with their own previously translated documents, such as internal or external websites, brochures, white papers, etc.
There are 4 general levels of customization now available to Microsoft Translator API users, with corresponding increases in resource investment and translation quality.

  1. New: Use a Standard Category instead of the default one - Our new standard categories allow you to easily customize the context of your translation by narrowing the scope of the statistical analysis that Microsoft Translator uses to translate your text. Simply speaking, with standard categories, you can tell Microsoft Translator what type of content is being translated in order to improve its accuracy. The first two standard categories we are announcing today are "tech" and "speech", with more on the way.
    • The "tech" category will improve translation quality on all computer-related content (software, hardware, networking...) and has been built with the vast amount of data collected over the years within Microsoft as we translated product help files, documentation, and customer support for our users, and from other sources such as TAUS. The list of languages for which the tech category is supported can be found here.
    • The "speech" category was developed in the last 18 months as we built Skype Translator. For Skype Translator to work properly, it was critical to be able to translate spoken text, which in most cases can be very different from the written text. The languages that are supported in this category are the same speech translation languages that are available for Skype Translator and Microsoft Translator apps for iOS and Android. As new speech languages are released for these applications, the equivalent "speech" category will become available for text translation in our core Translator API as well.
    It's easy to start using standard categories in your translations — just set the value to "tech" or "speech" for the "category" parameter of your translation method if you are using the API, or in the Category ID box in any of our supported products, such as the Document Translator. The default value "general", can be omitted — just select your new standard category to begin receiving your customized translations. These neural network models are available for all speech languages through the Microsoft Translator Speech API, on the try and compare site translate.ai, and through the text API by using the ‘generalnn’ category ID.
    In addition to standard categories, we also developed a "social media" filter that we can enable server-side upon demand. This Client ID level filter has been developed to convert texts and instant messages to proper English to improve translations quality. For instance, once passed through the filter, "R u here?" would become "Are you here?" — which will obviously translate much better than the original. Please note that, for now, only an English texting filter exists.
  2. New: Upload a Custom Dictionary - You can customize your translations further with dictionaries. Dictionaries allow you to make your own foreign language word lists so that the terminology that is unique to your business or industry will translate just the way you want. For instance, if you have a product name that you want translated in a certain way in French, (or not translated at all, if it's a brand name) just add the product name and the corresponding French translation to your Hub dictionary. Every time you use the Microsoft Translator API with the custom category ID obtained from the Translator Hub, you will get your customized translation. To get your translations up and running, all you need to do is upload a simple Excel spreadsheet with your word list to the Translator Hub website and train the system. You can start with as little as one dictionary entry. The custom category you create with your dictionary can be built on top of the general or the standard (speech or tech) categories and remains valid even when you customize your system with one of the following options.
  3. New: Train a System with 1,000 - 5,000 Parallel Sentences - The third level of customization is to add pre-translated content to your custom category. Today, we are introducing the ability to train a system with as few as 1,000 parallel sentences (pre-translated sentences in the original and target language). By training a system with parallel sentences, you can go beyond just a simple list of translated words and phrases. Instead, the Hub tunes all of its internal parameters to produce translations that are similar to the test sentences you provided.By providing the Hub with at least 1,000 parallel sentences, you can help the Hub choose translations that match your organization's terminology and tone better than the standard categories. If you have created content in another language, such as webpages or documentation, you can use it to improve your translations. Obviously, the more sentences you have, the better the translations. You can use this customization mechanism alone or in combination with a custom dictionary.
  4. Train a system with more than 5,000 Parallel Sentences - As was possible since the Hub launched, but now starting with only 5,000 sentences rather than 10,000 previously, you can use any amount of parallel sentences above 5,000 to customize your translations. With more than 5,000 parallel sentences you can begin to create a system that is learning new terms and phrases in the right context and tone of your business. This leads to a better, more customized translation. Add a dictionary for even better results if you have a corpus of less than 50,000 parallel sentences.If you have more than 50,000 parallel sentences, you will be able to build a system that can give fully customized results. At this level, the machine has learned your terminology in context through parallel sentences, so the dictionary will be less helpful, and can be reduced to the new terms as you develop new topics in your source content.
With more than 50,000 parallel sentences, ideally in the 100s of thousands of sentences, the Hub enables you to create brand new language systems. Many of the Microsoft Translator supported languages were developed by Community Partners including the languages Hmong Daw, Yucatec Maya, Queretaro Otomi, Welsh, and Kiswahili.
Once you have trained and deployed your new customized system, it is available to use in all category ID-enabled Microsoft Translator products, such as the on-premise version of SharePoint, Office apps for PowerPoint and Word, the Document Translator, and the Multilingual App Toolkit, and many translation memory tools from our partners. The Hub can help improve translation quality for a wide variety of scenarios such as web localization, customer support, and internal communications, whether online or in apps.
After your translated content is published, you can engage your community of users to refine the translation by using the Collaborative Translation Framework (CTF). CTF allows you to use human translation to edit the output of the translated content or to manage crowdsourced edits to your content so that you can refine it over time. The Hub can import these human corrections easily so you can incorporate them in training a better-customized translation system.
To start using the Translator Hub to customize your system, simply visit www.microsoft.com/translator/hub.aspx, and register a workspace. You can invite as many other people as you like into your workspace to collaborate on improving your translation system. When you are ready to deploy a custom system, you will need to sign up for an account with Microsoft Translator. You can register for a free 2 million character per month subscription to get you started. After you have registered, you can go to the Translator Hub website to start customizing!
Source: blogs.msdn.microsoft.com