KMap
Hong Cui's research focuses on machine learning applications for semantic annotation of semi-structured information, with a current focus on biodiversity literature. She develops and evaluates machine learning and natural language processing algorithms for converting born-digital and digitized taxonomic descriptions into new Semantic Web formats. More recently her research has led to ontology building in biology domain. Her work has an explicit impact on how scientific information can be retrieved and used in the digital era by turning the wealth of human-readable scientific information into something that can be understood and read by computers. She is the principal investigator or co-PI of a number of National Science Foundation-funded projects. The methodology developed by Dr. Cui has been adopted by several other research groups in the US and abroad. She leads the biosemantics research group in the iSchool.

VOSviewer

Courses
  • FI
    Foundations of Information

  • DMD
    Data Mining and Discovery

  • IAT
    Introduction to Applied Technology

  • DSSW
    Data Standards for the Semantic Web

  • CV
    Controlled Vocabularies

  • IIR
    Issues in Information Resources

  • OI
    Organization of Information

Grants
  • Funding agency logo
    ABI Innovation: Authors in the Driver's Seat: Fast, Consistent, Computable Phenotype Data and Ontology Production

    Principal Investigator (PI)

    2017

    $641.8K
  • Funding agency logo
    Microbial Tree of Life

    Principal Investigator (PI)

    2017

    $54.2K
  • Funding agency logo
    BCSP: Collaborative Research: ABI Development: Exploring Taxon Concepts (ETC) through Analyzing Fine-Grained Semantic Markup of Descriptive Literature

    Principal Investigator (PI)

    2016

    $92.3K
  • Funding agency logo
    Collabroative Research: Building a Comprehensive Evolutionary History of Flagellate Plants

    Principal Investigator (PI)

    2016

    $53.2K
  • Funding agency logo
    BCSP: Collaborative Research: ABI Development: Exploring Taxon Concepts (ETC) Through Analyzing Fine-Grained Semantic Markup of Descriptive Literature

    Principal Investigator (PI)

    2012

    $1.1M
  • Funding agency logo
    LaSCALA: Latino Scholars Cambio Leadership Academy

    Co-Investigator (COI)

    2012

    $116.5K
  • Funding agency logo
    Collaborative Research: ABI Development: Ontology-enabled reasoning across phenotypes from evolution and model organisms

    Principal Investigator (PI)

    2011

    $29.7K
News
  • Award-Winning Teacher Hong Cui Joins SIRLS Faculty

    2007

Publications (63)
Recent
  • Enabling Authors to Produce Computable Phenotype Measurements: Usability Studies on the Measurement Recorder.

    2020

  • Innovative UX Methods for Information Access Based on Interdisciplinary Approaches: Practical Lessons from Academia and Industry.

    2019

  • Model Patterns for a Common Semantic Data Model for Phenotype Knowledge Graphs Bioinformatics.

    2019

  • Using shallow semantic analysis to implement automated quality assessment of web health care information.

    2018

  • Bringing a Semantic MediaWiki Flora to Life

    2018

  • Identifying bacterial biotope entities using sequence labeling: performance and feature analysis

    2018

  • Semiautomatic extraction of phenotypic traits from taxonomic descriptions using a Natural Language Processing approach.

    2018

  • Resolving “orphaned” parts using machine learning and natural language processing methods

    2018

  • Where are iSchools heading?

    2018

  • Incentivizing use of structured language in biological descriptions: author-driven phenotype data and ontology production

    2018

  • Modifier Ontologies for freqency, certainty, degree and coverage pheontoype modifiers.

    2018

  • A Natural Language Processing Pipeline to Extract Phenotypic Data from Formal Taxonomic Descriptions with a Focus on Flagellate Plants.

    2018

  • An automated approach for rating the content quality of Web healthcare information: A case study on depression treatment Web pages.

    2018

  • Building the “Plant Glossary”—A controlled botanical vocabulary using terms extracted from the Floras of North America and China.

    2017

  • Gold standard evaluation of machine- and human-generated annotations of biodiverse phenotypes

    2017

  • Natural Language Processing pipeline, a useful tool for the biology, taxonomy or systematics classroom or laboratory

    2017

  • User study of the ETC ontology building tool. (targeting BMC Bioinformatics)

    2017

  • A New Approach to Teach Taxonomy and Scientific Research Skills using Natural Language Processing

    2017

  • MultiLayerMatrix: Visualizing large taxonomic datasets [full paper]

    2016

  • Introducing Explorer of Taxon Concepts with a Case Study on Spider Measurement Matrix Building

    2016

  • Microbial Phenomics Information Extractor (MicroPIE): A Natural Language Processing Tool for the Automated Acquisition of Prokaryotic Phenotypic Characters from Text Sources.

    2016

  • Using taxonomic descriptions to build and evaluate a standardized Plant Glossary.

    2016

  • MicrO - an ontology of prokaryotic phenotypic and metabolic characters

    2016

  • Finding Our Way through Phenotypes

    2015

  • CharaPaser+EQ: Performance Evaluation Without Gold Standard. Nov 6-10, St Louis, Missouri, 2015. (Full paper, acceptance rate: 36.%)

    2015

  • Annotating phenotypes using ontological concepts: Inter-curator consistency as a baseline for evaluating the performance of a natural language processing system

    2015

  • The Biological Spatial Ontology: anatomical descriptors for spatial and topological aspects of biological structures. Journal of Biomedical Semantics

    2014

  • A machine learning approach for rating the quality of depression treatment web pages.

    2014

  • Next generation phenomics for the Tree of Life.

    2013

  • An overview of the BioCreative 2012 Workshop Track III: interactive text mining task

    2013

  • Heuristics based semantic annotation of biodiversity documents in Chinese

    2013

  • Semantic Annotation of Species Description Text in Chinese Literature by Naive Bayes Classifier.

    2012

  • An Overview of the BioCreative 2012 Workshop Track III: Interactive Text Mining Task.

    2012

  • Applications of Natural Language Processing in Biodiversity Science

    2012

  • CharaParser for fine-grained semantic annotation of organism morphological descriptions.

    2012

  • PCS for Phylogenetic Systematic Literature Curation

    2012

  • Mapping of glossary terms from the Flora of North America to the Plant Ontology enhances both resources

    2012

  • Evaluating the botanical coverage of PATO using an unsupervised learning algorithm

    2012

  • Study on Semantic Markup of Species Description Text in Chinese Based on Auto-Learned Rules

    2012

  • Workflow of CharaParser and Phenex: Turning character descriptions to EQ statements

    2012

  • OTO: Ontology Term Organizer

    2012

  • Fine-Grained Semantic Markup of Descriptive Data

    2011

  • CharaParser for Fine-Grained Semantic Annotation of Taxonomic Descriptions

    2011

  • Machine learning based semantic markup of biodiversity literature in English

    2011

  • Combine Unsupervised Learning and Heuristic Rules to Annotate Morphological Characters

    2011

  • Floras in the 21st Century: The Flora of North America saga

    2011

  • Semantic Annotation of Morphological Descriptions: An Overall Strategye

    2010

  • From Text to RDF Triple Store: An Application for Biodiversity Literature

    2010

  • Semantic annotation of morphological descriptions: an overall strategy

    2010

  • Semantic Annotation of Biosystematics Literature without Training Examples

    2010

  • Tools for Semantic Annotation of Taxonomic Descriptions

    2010

  • Evaluating Plant Character Ontologies Against Domain Literature

    2010

  • Linking Corpus Characteristics to Performance of Semantic Annotation Systems for Biosystematic Descriptions

    2010

  • Unsupervised Extraction of Text Segments from Heterogeneous Document Collections

    2010

  • Fine-Grained Semantic Annotation of Descriptive Data for Knowledge Application in Biodiversity

    2009

  • Semantic Annotation of Biosystematics Literature without Training Examples.

    2009

  • Evaluating Plant Character Ontologies Against Domain Literature.

    2009

  • Application of Semantic Annotation for Quality Insurance in Biosystematics Publishing

    2009

  • Converting Taxonomic Descriptions to New Digital Formats

    2008

  • Unsupervised Learning for Semantic Markup of Biodiversity Literature

    2008

  • Approaches to Semantic Mark up for Natural Heritage Literature

    2008

  • Use Server2Go to Teach IT Courses for LIS Students

    2007

  • The reusability of induced knowledge for the automatic semantic markup of taxonomic descriptions

    2007

Grants
Citations
H-Index
Patents
News
Books
Opportunities