The Story of Akshay and the 3-Year-Long Path to a Patent

By
Anusha Venkatesh
IDP Evangelist

Welcome to our Careers Blog

It’s a place to talk about what brings people here, what they’re learning, and why they think others might like it here, too.

Infrrd has a strong culture of transparency, mentorship, and giving individuals the power to make decisions. Tech wizards from around the world join Infrrd to develop algorithms that make the impossible possible when it comes to unlocking the hidden value in data. They like the platform, the challenge, and the culture of curiosity. Plus the people are pretty great.

Meet Akshay Uppal, one of Infrrd’s brilliant machine learning engineers. He was just granted his very first patent, for a system that extracts data from unconventional images. And not just any images: stamps and other complex images found on documents.

His invention has unlocked new pathways for Intelligent Document Processing. We had a chance to interview him, and here’s how it went.

Interviewer: Tell us about your journey to the patent.

Akshay: This was one of the very first projects I picked up when I joined Infrrd 3 years ago. At the time, the team was working on a requirement to extract information from documents such as invoices and receipts, and as a subset of that use case, we had to extract important information from the stamps present on these documents. I was handed this subset of the problem, which soon turned into tackling the bottlenecks of OCR itself in capturing data from images.

Interviewer: What were the bottlenecks?

Akshay: So here’s what happens when extracting data from images with stamps.

None of the existing OCR solutions can robustly extract text from images that present any of these three bottlenecks:

  • Random orientation of the images: Logos, stamps, and even scanned documents do not always have a fixed orientation. Since every image could be oriented differently, the OCR cannot account for this randomness and is unable to produce accurate results. Traditional image processing techniques, and some high-end OCR solutions, can correct the orientation to some extent in some cases, but this is not a robust method because documents vary so much, especially when it comes to logos, stamps, etc.
  • Occlusion in text: OCR solutions are not yet capable of fully isolating background from foreground. This means that if some background text overlaps the target text, the OCR cannot produce proper results.
  • Unconventional text: OCR solutions mostly expect text that is nearly horizontal or uniformly oriented. With stamps, logos, and banners this condition does not always hold, and the existing OCR solutions cannot handle such cases.

Pretty soon we understood that there were no existing solutions to these OCR bottlenecks and that we had to develop one.

Interviewer: And how did that go?

Akshay: We used deep learning models to separate the stamp out and predict its orientation. Based on that, the system extracts the numbers and letters from the stamp.

This not only helped us separate the stamps from the original text underneath, but also let us successfully extract data from the stamps, including locations, pincodes, etc.

We further extended the solution to help extract data from other unconventional images like logos and more.

Today, this invention has created an entire solution suite for Infrrd’s proprietary platform.
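To make the orientation step Akshay describes a little more concrete, here is a minimal sketch of what "undoing" a predicted rotation can look like. It is not Infrrd's patented method: the function names, the box coordinates, and the idea that a model returns a single angle in degrees are all assumptions made for illustration. It only shows the geometry of rotating a text box back to upright once an angle has been predicted.

```python
import math

def rotate_point(x, y, angle_deg, cx=0.0, cy=0.0):
    """Rotate (x, y) by angle_deg around (cx, cy), counter-clockwise
    in the usual mathematical convention (y axis pointing up)."""
    theta = math.radians(angle_deg)
    dx, dy = x - cx, y - cy
    return (
        cx + dx * math.cos(theta) - dy * math.sin(theta),
        cy + dx * math.sin(theta) + dy * math.cos(theta),
    )

def upright_box(corners, predicted_angle_deg, center):
    """Undo a predicted rotation so a text box is axis-aligned again.

    `predicted_angle_deg` stands in for the output of an orientation
    model (hypothetical here); `corners` is a list of (x, y) tuples.
    """
    cx, cy = center
    return [
        rotate_point(x, y, -predicted_angle_deg, cx, cy)
        for x, y in corners
    ]

# Simulate a stamp box that appears rotated by 30 degrees on the page.
original = [(0.0, 0.0), (2.0, 0.0), (2.0, 1.0), (0.0, 1.0)]
center = (1.0, 0.5)
rotated = [rotate_point(x, y, 30, *center) for x, y in original]

# If the orientation is predicted correctly, the box is restored.
restored = upright_box(rotated, 30, center)
```

In a real pipeline the same correction would be applied to the stamp's pixels before text recognition, and the quality of the result would depend on how accurately the orientation is predicted.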

Interviewer: Of all the places, why Infrrd?

Akshay: The work that I do at Infrrd is pretty challenging. There is a lot of research that goes into deep learning and machine learning. There is a significant gap between the industry and said research. At Infrrd, we’re trying to bridge this gap. The work I do here is intellectually stimulating and keeps me on my toes.

Another thing about Infrrd is that we are a close-knit community. We are continuously updating our platform. We all have the same big picture in mind that we are working towards together. That sense of inclusion and seeing your work make an impact is encouraging. That’s what is keeping me at Infrrd.

In the end:

That was Akshay Uppal in his own words. Akshay, who joined us 3 years ago, is now an integral part of the team. His passion to innovate and enthusiasm to execute are infectious. Kudos, Akshay! You made your Infrrd family proud.

Frequently asked questions

What does your pricing model look like?

We price based on the annual volume of pages and the complexity of the document type.  We can get you preliminary pricing once we have outlined a solution.  Let's do this.

To know more, book a 15-min session with an IDP expert

How can I try Infrrd before I commit to a full deployment?

Sure.  The first step is to schedule a guided demo where you get to jump into the thick of it.  After you explore our solution you can try a proof of concept. When you're ready, you can deploy the system to one use case.  Then more use cases.  Then across your enterprise.

How does your system integrate with others in my enterprise?

We play nice.  Our solutions are API-based.  Your documents are fed into the solution using APIs, and extracted data is sent out through APIs.  We use REST APIs.
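As a rough idea of what feeding a document in over REST can look like, here is a minimal sketch using only the Python standard library. Everything specific in it is an assumption made for illustration: the base URL, the `/documents` path, the bearer-token header, and the JSON field names are hypothetical and are not taken from Infrrd's actual API. The sketch only builds the request; it does not send it.

```python
import base64
import json
import urllib.request

def build_upload_request(base_url, api_key, document_bytes, filename):
    """Build (but do not send) a POST request carrying one document.

    The endpoint path, auth scheme, and payload fields below are
    hypothetical placeholders, not a documented API.
    """
    payload = {
        "filename": filename,
        # Binary content is base64-encoded so it fits in a JSON body.
        "content_base64": base64.b64encode(document_bytes).decode("ascii"),
    }
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/documents",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": "Bearer " + api_key,
        },
        method="POST",
    )

req = build_upload_request(
    "https://api.example.com", "demo-key", b"%PDF-1.4", "invoice.pdf"
)
```

Sending it would be a call to `urllib.request.urlopen(req)`, with the extracted data coming back in the response body or through a later request, depending on how the real service is designed.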

Does your solution run in the cloud or on-premises?

Our solution is cloud-native but is also designed for on-premises deployments.  It's your choice how you want to deploy it.

Does Infrrd run on mobile or desktop devices?

Glad you asked.  Our data extraction process runs on servers.  We have found that performance and accuracy decline when running on a desktop or mobile device.  (Remember, Infrrd is running a powerful AI stack.)

Does your system work out of the box or does it require training?

Common documents and use cases work out of the box.  The cool thing is your solution will improve as the system learns from your documents upfront and over time.

How does your solution handle corrections?

Did you know no system is 100% accurate all the time?  When extraction errors occur you want to correct them.  We provide a simple UI that your business analyst will use to make corrections.

Does your solution work with handwriting?

Our solution excels at data extraction from handwriting.  We've got proprietary methods and techniques that do the trick.  It's pretty cool.  See for yourself.
