[Illinois] CSE 2013: Making Sense of Unstructured Data

By Dan Roth

University of Illinois at Urbana-Champaign

View Resource (HTM)

Licensed according to this deed.

Published on


Recent studies have shown that over 85% of the information people and organizations deal with is unstructured – the vast majority of which is text in different forms. A multitude of techniques has to be used in order to enable intelligent access to this information and support transforming it to forms that allow sensible use of the information. The fundamental issue that all these techniques have to address is that of semantics – there is a need to move toward understanding the text at an appropriate level, beyond the word level, in order to support access, knowledge extraction and synthesis. We will discuss two key issues: (1) From Data to Meaning: There are several dimensions of text understanding that can facilitate access to information and the extraction of knowledge from unstructured text, and (2) Trustworthiness of Information: While we can locate and extract information quite reliably, we lack ready means to determine whether we should actually believe them.


Prof. Dan Roth, Computer Science and Beckman Institute, University of Illinois

Dan Roth is a Professor in the Department of Computer Science and the Beckman Institute at the University of Illinois at Urbana-Champaign and a University of Illinois Scholar. He is the director of a DHS Center for Multimodal Information Access & Synthesis (MIAS) and also has faculty positions in Statistics, Linguistics and at the School of Library and Information Sciences. Roth is a Fellow of the ACM, AAAI and the ACL, for his contributions to machine learning and natural language processing. He has published broadly in machine learning, natural language processing, knowledge representation and reasoning, and learning theory, and has developed advanced machine learning based tools for natural language applications that are being used widely by the research community. Roth is the Associate Editor-in-Chief of the Journal of Artificial Intelligence Research (JAIR) and will serve as Editor-in-Chief for a two-year term beginning in 2015. Prof. Roth has given keynote talks in major conferences, including AAAI, EMNLP, and ECML and presented several tutorials in universities and conferences including at ACL, NAACL and EACL. Roth was the program chair of AAAI'11, ACL'03 and CoNLL'02, is or has been on the editorial board of several journals in his research areas and has won several teaching and paper awards.

Prof. Roth received his B.A Summa cum laude in Mathematics from the Technion, Israel, and his Ph.D. in Computer Science from Harvard University in 1995.

Cite this work

Researchers should cite this work as follows:

  • Dan Roth (2013), "[Illinois] CSE 2013: Making Sense of Unstructured Data," http://nanohub.org/resources/18168.

    BibTex | EndNote



NCSA Auditorium, University of Illinois at Urbana-Champaign, Urbana, IL


NanoBio Node, Adeeb Yunus

University of Illinois at Urbana-Champaign