Cognitive Class

Text Analytics 101

From social media to news articles to machine logs, text data is everywhere. Are you interested in learning how to derive valuable insights from text data? Enroll today and learn about the hot topic of text analytics from experts at IBM.

Start the Free Course

About This Course

From social media to news articles to machine logs, text data is everywhere. This class will teach you about Information Extraction: how to extract structured data from text in order to derive valuable insights. You will learn about information extraction applications in various domains, such as social media, healthcare analytics, and financial risk analysis. You will explore common text analytics tasks, including entity, relation, and event extraction, as well as sentiment analysis. Finally, you will dive into "Declarative Information Extraction", a powerful method for doing high-performance and high-quality text analytics, and gain hands-on experience writing your own extractors.

 

Course Syllabus

  • Module 1 - Getting to Know Information Extraction

  • Module 2 - Limitations in Information Extraction

  • Module 3 - Getting to Know SystemT

  • Module 4 - Information Extraction with AQL

  • Module 5 - AQL Basics

  • Module 6 - Advanced AQL

  • Module 7 - Declarative Information Extraction and the SystemT Optimizer

  • Module 8 - Best Practices

General Information

  • This course is free.
  • It is self-paced.
  • It can be taken at any time.
  • It can be audited as many times as you wish.

Recommended skills prior to taking this course

  • None

Requirements

  • None

Course Staff

 

Yunyao

Yunyao Li

Yunyao Li is a Principle Research Staff Member and Senior Research Manager at  IBM Almaden Research Center where she manages the Scalable Knowledge Intelligence department. She is also a Master Inventor and a member of IBM Academy of Technology.  Her expertise is in the interdisciplinary areas of natural language processing, databases, human-computer interaction, and information retrieval.  She is a founding member of SystemT, a state-of-the-art NLP system currently powering multiple IBM products, and numerous projects. She received her PhD and master degrees from  the University of Michigan Ann Arbor and undergraduate degrees from Tsinghua University, Beijing, China.  You can read about Yunyao's inspiring story from small-town China to Silicon Valley hereFollow her on Twitter @yunyao_li.

 

laura

Laura Chiticariu

Laura Chiticariu is the Chief Architect of Watson Knowledge and Language Foundation, with technical leadership responsibilities over Watson Natural Language Understanding, Watson Knowledge Studio and Watson Knowledge Graph. Laura is a core member of the SystemT, R&D team, and strongly believes in the notion of "Transparent NLP": leveraging machine learning techniques, while ensuring that the NLP system remains transparent - easy to comprehend, debug and enhance. She holds a Ph.D. in Computer Science, and has been teaching NLP across universities within and outside the U.S.

 

HuaiyuZhu

Huaiyu Zhu

Huaiyu Zhu is a Research Staff Member in the Scalable Knowledge Intelligence group at IBM Almaden Research Center. His main research focus is on text analytics, natural language processing, machine learning and statistical information processing.

 

Atsushi Ono

Atsushi Ono is a software engineer at Tokyo Software & Systems Development Lab (TSDL), IBM Japan. After several years of experience on business partner technical enablement missions, he has been working as a front-end developer on various projects, including contributing to the open source Dojo Mobile project. He has worked on the development of IBM Watson Knowledge Studio since the project’s inception.

 

Yuka Nomura

Yuka Nomura is a software engineer working on front-end development of IBM Watson Knowledge Studio at Tokyo Software & Systems Development Lab (TSDL), IBM Japan. She has contributed to user interface design and product development from her very first project start-up. She also specializes in robot application programming that runs on communication robots such as Pepper.

 

Chikako Oyanagi

Chikako Oyanagi is a front-end software developer of IBM Watson Knowledge Studio at Tokyo Software & Systems Development Lab (TSDL), IBM Japan.

 

Teruki Tauchi

Teruki Tauchi is a front-end software developer of IBM Watson Knowledge Studio at Tokyo Software & Systems Development Lab (TSDL), IBM Japan. He joined IBM after obtaining a Master of Engineering degree in Computer Science from University College London in 2015.