JPMorgan Announces DocLLM for Multimodal Document Understanding

J.P. Morgan has introduced DocLLM, a generative language model designed for multimodal document understanding. DocLLM stands out as a lightweight extension to LLMs for analysing enterprise documents such as forms, invoices, reports, and contracts, which carry rich semantics at the intersection of the textual and spatial modalities.

Unlike existing multimodal LLMs, DocLLM strategically avoids expensive image encoders and relies exclusively on bounding-box information to incorporate a document's spatial layout structure. The model introduces a disentangled spatial attention mechanism that decomposes the attention score of a classical transformer into separate matrices capturing text-to-text, text-to-layout, layout-to-text, and layout-to-layout interactions.
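To make the idea concrete, here is a minimal single-head sketch of such a disentangled attention score in PyTorch. It is an illustration based on the description above, not J.P. Morgan's released implementation: the projection names, the learnable lambda weights, and the choice to draw value vectors from the text stream alone are assumptions.

import torch
import torch.nn as nn

class DisentangledSpatialAttention(nn.Module):
    """Single-head causal attention that mixes a text stream with a
    spatial (bounding-box) stream via four disentangled score terms,
    in the spirit of DocLLM. Names and hyperparameters are illustrative."""

    def __init__(self, d_model: int):
        super().__init__()
        # Separate projections for the text stream...
        self.q_text = nn.Linear(d_model, d_model)
        self.k_text = nn.Linear(d_model, d_model)
        self.v_text = nn.Linear(d_model, d_model)
        # ...and for the spatial stream built from embedded bounding boxes.
        self.q_spatial = nn.Linear(d_model, d_model)
        self.k_spatial = nn.Linear(d_model, d_model)
        # Scalar weights balancing the cross-modal score terms (assumption).
        self.lam_ts = nn.Parameter(torch.tensor(1.0))
        self.lam_st = nn.Parameter(torch.tensor(1.0))
        self.lam_ss = nn.Parameter(torch.tensor(1.0))
        self.scale = d_model ** -0.5

    def forward(self, text: torch.Tensor, spatial: torch.Tensor) -> torch.Tensor:
        # text, spatial: (batch, seq_len, d_model). The spatial input comes
        # from embedded OCR bounding boxes, not from an image encoder.
        qt, kt, vt = self.q_text(text), self.k_text(text), self.v_text(text)
        qs, ks = self.q_spatial(spatial), self.k_spatial(spatial)

        # Four disentangled score matrices: text-text, text-layout,
        # layout-text, and layout-layout interactions.
        scores = (
            qt @ kt.transpose(-2, -1)
            + self.lam_ts * (qt @ ks.transpose(-2, -1))
            + self.lam_st * (qs @ kt.transpose(-2, -1))
            + self.lam_ss * (qs @ ks.transpose(-2, -1))
        ) * self.scale

        # Causal mask for autoregressive generation.
        seq_len = text.size(1)
        mask = torch.triu(torch.ones(seq_len, seq_len, dtype=torch.bool,
                                     device=text.device), diagonal=1)
        scores = scores.masked_fill(mask, float("-inf"))
        return torch.softmax(scores, dim=-1) @ vt

In a full model this head would be replicated across layers and heads, with the spatial inputs produced by embedding each token's bounding-box coordinates; keeping the two streams in separate projections is what lets the model weigh layout and text interactions independently.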

DocLLM was pre-trained on data gathered from two primary sources: the IIT-CDIP Test Collection 1.0 and DocBank. The former comprises over five million documents related to legal proceedings against the tobacco industry in the 1990s, while the latter consists of 500,000 documents, each with a distinct layout.

Read full article: https://analyticsindiamag.com/jpmorgan-announces-docllm-for-multimodal-document-understanding/
