Train your own ML System to extract information from documents

Aaron Richiger, Sara Wick (turicode)


About 80 percent of all business-relevant information is unstructured and locked inside documents such as scans, PDFs, or e-mails. To extract information, the relevant data needs to be copy-pasted or typed out of documents manually. In recent years, turicode has developed an information retrieval engine to unlock the potential of documents.

In this hands-on session you will
- Learn about the state-of-the-art information extraction from Documents
- Learn about the potential and limits of current technology
- Get to train your machine learning system to extract data from Documents from scratch, there are no programming skills required!
- see the effects of quality and size of training material on the results
We are looking forward to exploring information extraction from different documents and learning more about training a machine learning system.

PLEASE BRING YOUR OWN LAPTOP, we will bring the rest.


About Turicode

turicode is a young technology company located in Winterthur. We are experts in extracting unstructured information from documents and making it available for analytics or use in further automated processing.

About the presenters:

Aaron Richiger is a passionate entrepreneur and Head of Machine Learning at turicode AG, which he co-founded. He employs his persistent scientific curiosity to develop novel software solutions in the area of information retrieval from documents. He completed both his bachelor’s and master’s degree in computer science at ETH Zurich

Sara Wick is a Business Solution Manager at turicode AG. Sara is passionate about languages and technology and helps customers reach true digitalization with their documents. Her strength lies in bridging the gap between business and technology She holds a master’s in English and Multilingual Text Analysis from the University of Zurich.