CTranslate

CTranslate is a C++ implementation of OpenNMT's translate.lua script with no LuaTorch dependencies. It facilitates the use of OpenNMT models in existing products and on various platforms using Eigen as a backend.

CTranslate provides optimized CPU translation and optionally offloads matrix multiplication on a CUDA-compatible device using cuBLAS. It only supports OpenNMT models released with the release_model.lua script.

Dependencies

Eigen >= 3.3
Boost (program_options, when -DLIB_ONLY=OFF)

Optional

CUDA for matrix multiplication offloading on a GPU
Intel® MKL for an alternative BLAS backend

Compiling

CMake and a compiler that supports the C++11 standard are required to compile the project.

git submodule update --init
mkdir build
cd build
cmake ..
make

It will produce the dynamic library libonmt.so (or .dylib on Mac OS, .dll on Windows) and the translation client cli/translate.

CTranslate also bundles OpenNMT's Tokenizer which provides the tokenization tools lib/tokenizer/cli/tokenize and lib/tokenizer/cli/detokenize.

Options

To give hints about Eigen location, use the -DEIGEN3_ROOT=<path to Eigen library> option.
To compile only the library, use the -DLIB_ONLY=ON flag.
To disable OpenMP, use the -DWITH_OPENMP=OFF flag.

Performance tips

Unless you are cross-compiling for a different architecture, add -DCMAKE_CXX_FLAGS="-march=native" to the cmake command above to optimize for speed.
Consider installing Intel® MKL when you are targetting Intel®-powered platforms. If found, the project will automatically link against it.

Using

Clients

See --help on the clients to discover available options and usage. They have the same interface as their Lua counterpart.

Library

This project is also a convenient way to load OpenNMT models and translate texts in existing software.

Here is a very simple example:

#include <iostream>

#include <onmt/onmt.h>

int main()
{
  // Create a new Translator object.
  auto translator = onmt::TranslatorFactory::build("enfr_model_release.t7");

  // Translate a tokenized sentence.
  std::cout << translator->translate("Hello world !") << std::endl;

  return 0;
}

For a more advanced usage, see:

include/onmt/TranslatorFactory.h to instantiate a new translator
include/onmt/ITranslator.h (the Translator interface) to translate sequences or batch of sequences
include/onmt/TranslationResult.h to retrieve results and attention vectors
include/onmt/Threads.h to programmatically control the number of threads to use

Also see the headers available in the Tokenizer that are accessible when linking against CTranslate.

Supported features

CTranslate focuses on supporting model configurations that are likely to be used in production settings. It covers models trained with the default options, plus some variants:

additional input or output word features
brnn encoder (with sum or concat merge policy)
dot attention
residual connections
no input feeding

Additionally, CTranslate misses some advanced features of translate.lua:

gold data score
best N hypotheses
hypotheses filtering
beam search normalization

Name		Name	Last commit message	Last commit date
Latest commit History 119 Commits
cli		cli
cmake		cmake
include/onmt		include/onmt
lib		lib
src		src
.gitignore		.gitignore
.gitmodules		.gitmodules
.travis.yml		.travis.yml
CHANGELOG.md		CHANGELOG.md
CMakeLists.txt		CMakeLists.txt
LICENSE.md		LICENSE.md
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CTranslate

Dependencies

Optional

Compiling

Options

Performance tips

Using

Clients

Library

Supported features

About

Releases

Packages

Languages

License

QwantResearch/CTranslate

Folders and files

Latest commit

History

Repository files navigation

CTranslate

Dependencies

Optional

Compiling

Options

Performance tips

Using

Clients

Library

Supported features

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages