Infrrd Research Team wins the DocVQA challenge by ICDAR

By
Infrrd
September 8, 2021

Engineers from Infrrd's Research team emerged as the winners of the 2021 edition of the DocVQA challenge (Document Collection DocVQA task) organized by ICDAR 2021.

San Jose, CA - ICDAR 2021, a premier international event for scientists and practitioners involved in document analysis and recognition held its 2021 Edition of the DocVQA (Document Visual Question Answering) challenge earlier this year. In the Document Collection VQA task of the challenge, California-based Infrrd’s team of engineers made it to the top of the leaderboard and won the challenge with their ingenious approach.

ICDAR’s DocVQA challenge was created with a sophisticated vision. Contemporary Document Analysis and Recognition (DAR) research, which tends to focus on generic information extraction tasks, fails to provide human-ready insights from the extracted data. The DocVQA challenge tasks inspired participants to take this a step ahead where the document content is extracted and used to answer high-level tasks defined by the human consumers of this information.

Infrrd’s team of engineers from their AI research lab solved this task with the Infrrd-RADAR (Retrieval of Answers by Document Analysis and Re-ranking) method. The solution method authored by JiangLong He, Aditya Kumar Sarda, Deepak Kumar, Cesar Duran was able to solve the DocVQA challenge task in the most effective and efficient way. Infrrd’s team beat PingAn OneConnect GammaLab, one of China’s top FinTech companies, iFLYTEK research and engineers from the tech giant Amazon to race up the leaderboard.

The team was honored to present their method at the 16th International Conference on Document Analysis and Recognition in Lausanne, Switzerland held on September 5-10, 2021.

Read more about the ICDAR’s DocVQA challenge and the results here.

ABOUT INFRRD

Infrrd, headquartered in San Jose, California, is driving innovation ahead in the AI space with its proprietary Intelligent Document Processing (IDP) platform. Our platform allows enterprises to classify and extract data from semi-structured and unstructured documents. It offers pre-built, out-of-the-box models for use cases such as invoices, claims, and bank statements for easy adoption, while also allowing enterprises to create and train custom models. The platform has advanced cognitive capabilities, a user-friendly interface, and the flexibility to process different data types, contributing to its rapid growth in the market.

ABOUT ICDAR

ICDAR is the premier international event for scientists and practitioners involved in document analysis and recognition, a field of growing importance in the current age of digital transition. In 2021, the 16th edition of this flagship conference will be held for the first time in Switzerland. Read more about the conference here.

Frequently asked questions

What does your pricing model look like?

We price based on the annual volume of pages and complexity of document type.  We can get you preliminary pricing once we outlined a solution.  Let's do this.

To know more, book a 15-min session with an IDP expert

How can I try Infrrd before I commit to a full deployment?

Sure.  The first step is to schedule a guided demo where you get to jump into the thick of it.  After you explore our solution you can try a proof of concept. When you're ready, you can deploy the system to one use case.  Then more use cases.  Then across your enterprise.

To know more, book a 15-min session with an IDP expert

How does your system integrate with others in my enterprise?

We play nice.  Our solutions are API-based.  Your documents are feed into the solution using APIs. And extracted data is sent out through APIs.  We use REST APIs.

To know more, book a 15-min session with an IDP expert

Does your solution run in the cloud or on premise?

Our solution is cloud-native but is also design for premise deployments.  Your choice on how you want to deploy it.

To know more, book a 15-min session with an IDP expert

Does Infrrd run on mobile or desktop device?

Glad you asked.  Our data extraction process runs on servers.  We have found performance and accuracy decline when running on a desktop or mobile device. (Remember Infrrd is running a powerful AI stack).

To know more, book a 15-min session with an IDP expert

Does your system work out of the box or does it require training?

Common documents and use cases work out of the box.  The cool thing is your solution will improve as the system learns from your documents upfront and over time.

To know more, book a 15-min session with an IDP expert

How does your solution handle corrections?

Did you know no system is 100% accurate all the time?  When extraction errors occur you want to correct them.  We provide a simple UI that your business analyst will use to make corrections.

To know more, book a 15-min session with an IDP expert

Does your solution work with handwriting?

Our solution excels at data extraction from handwriting.  We've got proprietary methods and techniques that do the trick.  It's pretty cool.  See for yourself.

To know more, book a 15-min session with an IDP expert