Towards automated mining of chemical structures in Chinese patents
Daniel Bonniot (ChemAxon, Hungary)
In 2011, China overtook the United States in the number of patent applications. This shift raises at least two challenges. First, it represents an enormous amount of data to monitor and search. Second, non-Chinese companies face the additional difficulty of the language barrier. While translation services exist, they have limitations, especially in specialized areas such as chemical nomenclature. We present ChemAxon's efforts to extend its English chemical name-to-structure conversion tool to support Chinese chemical names. We also describe integrated solutions for extracting chemical structures from patents and other types of documents, and for automated chemical indexing in document management systems such as SharePoint and Documentum.