Posts by Tags

Computer Vision

How does image analysis work in contract management?

less than 1 minute read

Published: on by Romain Vial

In the contract analysis process developed by Dilitrust, image analysis is often the first step. Most of the documents we process are scans in which the text is not directly accessible. An image analysis stage is therefore needed to solve the following problems:

Interview

Behind the scenes of Dilitrust’s Machine Learning team

less than 1 minute read

Published: on by

Dilitrust is a SaaS solution for contract management. Since its creation, Dilitrust has been developing its own artificial intelligence, and it is this AI that makes our service possible. It analyses contracts and extracts the important data they contain to make your daily work easier.

Language Models

Training a state-of-the-art French language model on legal contracts

less than 1 minute read

Published: on by Ahmed Touila

Most natural language processing modules rely on some technique for representing text at the character, word, or sequence level (sentence, paragraph, or document). As a result, the effectiveness of these modules depends heavily on the quality of the embeddings they are built on.

MLOps

6 reasons why you should adopt MLOps

less than 1 minute read

Published: on by Subaandh Sambharathan V K

Artificial intelligence adoption in enterprises is growing steadily. According to a recent survey, 35% of companies reported using AI in their business, and 42% reported experimenting with it. As adoption increases, Machine Learning models need to be maintained and monitored. Machine Learning Operations (MLOps) is a set of processes that aims to track, deploy and monitor Machine Learning models in production.

NLP

Long Sequences Transformers: a review of the SOTA

less than 1 minute read

Published: on by Achraff Adjileye

Much work has gone into processing long documents, to lift the limitation of BERT-like models, which can only process sequences of up to 512 tokens. This has led to the release of several variants of these models designed for long documents. The main idea behind most of them is to make the attention mechanism of the Transformer (see "Attention Is All You Need") scale linearly with the input sequence length, rather than quadratically, in both time and memory.
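To make the quadratic-versus-linear distinction concrete, here is a minimal sketch that counts how many query-key scores an attention layer has to compute. The local-window pattern is an assumption chosen for illustration (it is one common way to obtain linear scaling), and the function names and the window size of 64 are hypothetical, not taken from the post.

```python
def full_attention_pairs(n: int) -> int:
    """Query-key scores in full self-attention: every token attends to every token."""
    return n * n


def windowed_attention_pairs(n: int, window: int) -> int:
    """Query-key scores when each token attends only to tokens at most
    `window` positions away on either side (illustrative local pattern)."""
    total = 0
    for i in range(n):
        first = max(0, i - window)      # clip the window at the start of the sequence
        last = min(n - 1, i + window)   # clip the window at the end of the sequence
        total += last - first + 1
    return total


if __name__ == "__main__":
    # 512 is the BERT-style limit mentioned above; 4096 is a hypothetical long input.
    for n in (512, 4096):
        full = full_attention_pairs(n)
        local = windowed_attention_pairs(n, window=64)
        print(f"n={n}: full={full}, windowed={local}, ratio={full / local:.1f}")
```

Going from 512 to 4096 tokens multiplies the full-attention count by 64, but the windowed count by only about 8, which is why a fixed window keeps time and memory roughly proportional to the sequence length.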

Training a state-of-the-art French language model on legal contracts

less than 1 minute read

Published: on by Ahmed Touila

Most natural language processing modules rely on some technique for representing text at the character, word, or sequence level (sentence, paragraph, or document). As a result, the effectiveness of these modules depends heavily on the quality of the embeddings they are built on.

Transformers

Long Sequences Transformers: a review of the SOTA

less than 1 minute read

Published: on by Achraff Adjileye

Much work has gone into processing long documents, to lift the limitation of BERT-like models, which can only process sequences of up to 512 tokens. This has led to the release of several variants of these models designed for long documents. The main idea behind most of them is to make the attention mechanism of the Transformer (see "Attention Is All You Need") scale linearly with the input sequence length, rather than quadratically, in both time and memory.