KMap
I joined the University of Arizona School of Information in August 2016, after three years as an assistant professor in Computer and Information Science at the University of Alabama at Birmingham. I previously worked as a postdoctoral researcher at Stanford University's Natural Language Processing group, Johns Hopkins University's Human Language Technology Center of Excellence, KULeuven's Language Intelligence and Information Retrieval group in Belgium, and the University of Colorado's Center for Language and Education Research.

VOSviewer

Courses
  • SNLP
    Statistical Natural Language Processing

  • NN
    Neural Networks

  • SFIA
    Statistical Foundations for the Information Age

  • IRM
    Information Research Methods

Grants
  • Funding agency logo
    Learning Science Concepts Through Metaphor Comprehension, Production, and Conversation: Behavioral, Neural and Artificial Intelligence Measures

    Co-Investigator (COI)

    2022

    $352.8K
    Active
  • Funding agency logo
    Using Natural Language Processing to Determine Predictors of Healthy Diet and Physical Activity Behavior Change in Ovarian Cancer Survivors

    Principal Investigator (PI)

    2022

    $129.4K
    Active
  • Funding agency logo
    Extended Methods and Software Development for Health NLP

    Principal Investigator (PI)

    2021

    $224.0K
    Active
  • Funding agency logo
    VADER: Voice Assistant for Data Entry and Recording

    Principal Investigator (PI)

    2021

    $56.6K
    Active
  • Funding agency logo
    Temporal Relation Discovery for Clinical Text

    Principal Investigator (PI)

    2019

    $227.1K
    Active
  • Funding agency logo
    RIDIR: Collaborative Research: A Data Science Platform and Mechanisms for Its Sustainability

    Co-Investigator (COI)

    2018

    $1.5M
    Active
  • Funding agency logo
    Using Natural Language Processing to Determine Predictors of Healthy Diet and Physical Activity Behavior Change in Ovarian Cancer Survivors

    Multiple Principal Investigator (MPI)

    2021

    $209.3K
  • Funding agency logo
    Automated Domain Adaptation for Clinical Natural Language Processing

    Principal Investigator (PI)

    2018

    $213.3K
  • Funding agency logo
    GRASP: Global Reading and Assembly for Semantic, Probabilistic World Models

    Co-Investigator (COI)

    2017

    $9.6M
  • Funding agency logo
    GRASP: Global Reading and Assembly for Semantic, Probabilistic World Models

    Co-Investigator (COI)

    2017

    $1.1M
Books
  • Proceedings of the 4th Clinical Natural Language Processing Workshop

    2022

  • Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

    2021

  • Proceedings of the 3rd Clinical Natural Language Processing Workshop

    2020

  • Proceedings of the 2nd Clinical Natural Language Processing Workshop

    2019

  • Proceedings of The 12th International Workshop on Semantic Evaluation (SemEval-2018)

    2018

  • Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017)

    2017

  • Proceedings of the Clinical Natural Language Processing Workshop (ClinicalNLP)

    2016

  • Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016)

    2016

  • Proceedings of the EACL 2014 Workshop on Computational Approaches to Causality in Language (CAtoCL)

    2014

  • 51st Annual Meeting of the Association for Computational Linguistics Proceedings of the Student Research Workshop

    2013

  • Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing

    2013

News
  • UArizona to Host DC Center for Outreach & Collaboration's Inaugural Event

    2021

Publications (160)
Recent
  • Toward NEPA performance: A framework for assessing EIAs

    2022

  • Taxonomy Builder: a Data-driven and User-centric Tool for Streamlining Taxonomy Construction

    2022

  • Exploring Text Representations for Generative Temporal Relation Extraction

    2022

  • Proceedings of the 4th Clinical Natural Language Processing Workshop

    2022

  • Ensemble-based Fine-Tuning Strategy for Temporal Relation Extraction from the Clinical Narrative

    2022

  • TEAM-Atreides at SemEval-2022 Task 11: On leveraging data augmentation and ensemble to recognize complex Named Entities in Bangla

    2022

  • A Comparison of Strategies for Source-Free Domain Adaptation

    2022

  • Lessons Learned from a Secondary Analysis Using Natural Language Processing and Machine Learning from a Lifestyle Intervention

    2022

  • UA-KO at SemEval-2022 Task 11: Data Augmentation and Ensembles for Korean Named Entity Recognition

    2022

  • Exploring transformers and time lag features for predicting changes in mood over time

    2022

  • Engagement with partisan Russian troll tweets during the 2016 U.S. presidential election: a social identity perspective

    2022

  • If You Want to Go Far Go Together: Unsupervised Joint Candidate Evidence Retrieval for Multi-hop Question Answering

    2021

  • Explainable Multi-hop Verbal Reasoning Through Internal Monologue

    2021

  • Consumer Cynicism Identification for Spanish Reviews using a Spanish Transformer Model

    2021

  • Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

    2021

  • SemEval-2021 Task 10: Source-Free Domain Adaptation for Semantic Processing

    2021

  • Simplifying annotation of intersections in time normalization annotation: exploring syntactic and semantic validation

    2021

  • Detection of Puffery on the English Wikipedia

    2021

  • EntityBERT: Entity-centric Masking Strategy for Model Pretraining for the Clinical Domain

    2021

  • Domain adaptation in practice: Lessons from a real-world information extraction pipeline

    2021

  • Assessing the Russian Troll Efforts to Sow Discord on Twitter during the 2016 U.S. Election

    2021

  • NEPA performance: conceptualizing multi-dimensional policy objectives

    2021

  • Do pretrained transformers infer telicity like humans?

    2021

  • The University of Arizona at SemEval-2021 Task 10: Applying Self-training, Active Learning and Data Augmentation to Source-free Domain Adaptation

    2021

  • Proceedings of the 3rd Clinical Natural Language Processing Workshop

    2020

  • Fine-tuning for multi-domain and multi-label uncivil language detection

    2020

  • Having Your Cake and Eating It Too: Training Neural Retrieval for Language Inference without Losing Lexical Match

    2020

  • How does BERT's attention change when you fine-tune? An analysis methodology and a case study in negation scope

    2020

  • Defining and Learning Refined Temporal Relations in the Clinical Narrative

    2020

  • Unsupervised Alignment-based Iterative Evidence Retrieval for Multi-hop Question Answering

    2020

  • A BERT-based One-Pass Multi-Task Model for Clinical Temporal Relation Extraction

    2020

  • Assisting Undergraduate Students in Writing Spanish Methodology Sections

    2020

  • Unified Medical Language System resources improve sieve-based generation and Bidirectional Encoder Representations from Transformers (BERT)–based ranking for concept normalization

    2020

  • A Dataset and Evaluation Framework for Complex Geographical Description Parsing

    2020

  • A Generate-and-Rank Framework with Semantic Type Regularization for Biomedical Concept Normalization

    2020

  • Rethinking domain adaptation for machine learning over clinical language

    2020

  • Does BERT need domain adaptation for clinical negation detection?

    2020

  • TTUI at SemEval-2020 Task 11: Propaganda Detection with Transfer Learning and Ensembles

    2020

  • Quick and (not so) Dirty: Unsupervised Selection of Justification Sentences for Multi-hop Question Answering

    2019

  • Proceedings of the 2nd Clinical Natural Language Processing Workshop

    2019

  • Incivility Detection in Online Comments

    2019

  • University of Arizona at SemEval-2019 Task 12: Deep-Affix Named Entity Recognition of Geolocation Entities

    2019

  • A BERT-based Universal Model for Both Within- and Cross-sentence Clinical Temporal Relation Extraction

    2019

  • Inferring missing metadata from environmental policy texts

    2019

  • Eidos, INDRA Delphi: From Free Text to Executable Causal Models

    2019

  • Pre-trained Contextualized Character Embeddings Lead to Major Improvements in Time Normalization: a Detailed Analysis

    2019

  • Alignment over Heterogeneous Embeddings for Question Answering

    2019

  • A Model for Identifying Steps in Undergraduate Thesis Methodology

    2019

  • Eidos, INDRA, & Delphi: From Free Text to Executable Causal Models

    2019

  • From Characters to Time Intervals: New Paradigms for Evaluation and Neural Parsing of Time Normalizations

    2018

  • Measuring the Latency of Depression Detection in Social Media

    2018

  • UArizona at the MADE1.0 NLP Challenge

    2018

  • Deep Affix Features Improve Neural Named Entity Recognizers

    2018

  • CUILESS2016: a clinical corpus applying compositional normalization of text mentions

    2018

  • Self-training improves Recurrent Neural Networks performance for Temporal Relation Extraction

    2018

  • A Survey on Recent Advances in Named Entity Recognition from Deep Learning models

    2018

  • Proceedings of The 12th International Workshop on Semantic Evaluation (SemEval-2018)

    2018

  • Embedding User Behavioral Aspect in TF-IDF Like Representation

    2018

  • SemEval 2018 Task 6: Parsing Time Normalizations

    2018

  • UArizona at the CLEF eRisk 2017 Pilot Task: Linear and Recurrent Models for Early Depression Detection

    2017

  • Infusing Latent User-Concerns from User Reviews into Collaborative Filtering

    2017

  • Unsupervised Domain Adaptation for Clinical Negation Detection

    2017

  • SemEval-2017 Task 12: Clinical TempEval

    2017

  • Towards generalizable entity-centric clinical coreference resolution

    2017

  • Improving Implicit Semantic Role Labeling by Predicting Semantic Frame Arguments

    2017

  • Neural Temporal Relation Extraction

    2017

  • Recurrent Neural Network Architectures for Event Extraction from Italian Medical Reports

    2017

  • Representations of Time Expressions for Temporal Relation Extraction with Convolutional Neural Networks

    2017

  • Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017)

    2017

  • SemEval-2016 Task 12: Clinical TempEval

    2016

  • Visualizing the Content of a Children's Story in a Virtual World: Lessons Learned

    2016

  • Analysis of Anxious Word Usage on Online Health Forums

    2016

  • Proceedings of the Clinical Natural Language Processing Workshop (ClinicalNLP)

    2016

  • Facing the most difficult case of Semantic Role Labeling: A collaboration of word embeddings and co-training

    2016

  • Semi-supervised CLPsych 2016 Shared Task System Submission

    2016

  • Improving Temporal Relation Extraction with Training Instance Augmentation

    2016

  • Domain Adaptation for Authorship Attribution: Improved Structural Correspondence Learning

    2016

  • Towards Extracting Coherent User Concerns and Their Hierarchical Organization from User Reviews

    2016

  • Extracting Hierarchy of Coherent User-Concerns to Discover Intricate User Behavior from User Reviews

    2016

  • Efficient identification of nationally mandated reportable cancer cases using natural language processing and machine learning

    2016

  • Why Do They Leave: Modeling Participation in Online Depression Forums

    2016

  • A Semantically Compositional Annotation Scheme for Time Normalization

    2016

  • Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016)

    2016

  • Multilayered temporal modeling for the clinical domain

    2016

  • DLS@CU at SemEval-2016 Task 1: Supervised Models of Sentence Similarity

    2016

  • Age and Gender Prediction on Health Forum Data

    2016

  • Domain Adaptation in Semantic Role Labeling Using a Neural Language Model and Linguistic Resources

    2015

  • Developing Language-tagged Corpora for Code-switching Tweets

    2015

  • DLS@CU: Sentence Similarity from Word Alignment and Semantic Vector Composition

    2015

  • SemEval-2015 Task 6: Clinical TempEval

    2015

  • Feature-Rich Two-Stage Logistic Regression for Monolingual Alignment

    2015

  • A survey on the application of recurrent neural networks to statistical language modeling

    2015

  • Extracting Time Expressions from Clinical Text

    2015

  • Predicting Continued Participation in Online Health Forums

    2015

  • Not All Character N-grams Are Created Equal: A Study in Authorship Attribution

    2015

  • CUAB: Supervised Learning of Disorders and their Attributes using Relations

    2015

  • Adapting Coreference Resolution for Narrative Processing

    2015

  • Dense Event Ordering with a Multi-Pass Architecture

    2014

  • An Annotation Framework for Dense Event Ordering

    2014

  • Towards automatic identification of core concepts in educational resources

    2014

  • Cross-Topic Authorship Attribution: Will Out-Of-Topic Data Help?

    2014

  • Overview for the First Shared Task on Language Identification in Code-Switched Data

    2014

  • Easy Does It: More Usable CAPTCHAs

    2014

  • Descending-Path Convolution Kernel for Syntactic Structures

    2014

  • The Stanford CoreNLP Natural Language Processing Toolkit

    2014

  • Text Mining for Open Domain Semi-Supervised Semantic Role Labeling

    2014

  • Temporal Annotation in the Clinical Domain

    2014

  • DLS@CU: Sentence Similarity from Word Alignment

    2014

  • ClearTK 2.0: Design Patterns for Machine Learning in UIMA

    2014

  • Proceedings of the EACL 2014 Workshop on Computational Approaches to Causality in Language (CAtoCL)

    2014

  • Back to Basics for Monolingual Alignment: Exploiting Word Similarity and Contextual Evidence

    2014

  • Identifying Weak Sentences in Student Drafts: A Tutoring System

    2014

  • SemEval-2013 Task 3: Spatial Role Labeling

    2013

  • Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing

    2013

  • Normalization and standardization of electronic health records for high-throughput phenotyping: the SHARPn consortium

    2013

  • Discovering body site and severity modifiers in clinical texts

    2013

  • ClearTK-TimeML: A minimalist approach to TempEval 2013

    2013

  • Characterizing and Predicting the Multifaceted Nature of Quality in Educational Web Resources

    2013

  • Automatic extraction of core learning goals and generation of pedagogical sequences through a collection of digital library resources

    2013

  • Discovering Temporal Narrative Containers in Clinical Text

    2013

  • CU: Computational Assessment of Short Free Text Answers - A Tool for Evaluating Students' Understanding

    2013

  • DLS@CU-CORE: A Simple Machine Learning Model of Semantic Textual Similarity

    2013

  • 51st Annual Meeting of the Association for Computational Linguistics Proceedings of the Student Research Workshop

    2013

  • A Synchronous Context Free Grammar for Time Normalization

    2013

  • SemEval-2012 Task 3: Spatial Role Labeling

    2012

  • Identifying science concepts and student misconceptions in an interactive essay writing tutor

    2012

  • Citation-based bootstrapping for large-scale author disambiguation

    2012

  • Annotating Story Timelines as Temporal Dependency Structures

    2012

  • Extracting Narrative Timelines as Temporal Dependency Structures

    2012

  • Skip N-grams and Ranking Functions for Predicting Script Events

    2012

  • Using Query Patterns to Learn the Duration of Events

    2011

  • Model-Portability Experiments for Textual Temporal Analysis

    2011

  • How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation

    2010

  • Semantically Informed Machine Translation (SIMT)

    2010

  • Crowdsourcing and language studies: the new generation of linguistic data

    2010

  • Early ERP Effects of the Metaphorical Profile of a Word

    2010

  • Automatically assessing resource quality for educational digital libraries

    2009

  • ClearTK: A Framework for Statistical Natural Language Processing

    2009

  • Automatically characterizing resource quality for educational digital libraries

    2009

  • Towards Temporal Relation Discovery from the Clinical Narrative

    2009

  • Building Test Suites for UIMA Components

    2009

  • Topic model methods for automatically identifying out-of-scope resources

    2009

  • Decaptcha: Breaking 75\% of eBay Audio CAPTCHAs

    2009

  • Topic Model Analysis of Metaphor Frequency for Psycholinguistic Stimuli

    2009

  • Instructor's Solution Manual for Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition (Second Edition)

    2008

  • Building a Corpus of Temporal-Causal Structure

    2008

  • ClearTK: A UIMA Toolkit for Statistical Natural Language Processing

    2008

  • Semantic Role Labeling for Protein Transport Predicates

    2008

  • CU-TMP: Temporal Relation Classification Using Syntactic and Semantic Features

    2007

  • Semantic integration in learning from text

    2007

  • Timelines from Text: Identification of Syntactic Temporal Relations

    2007

  • Finding Temporal Structure in Text: Machine Learning of Syntactic Temporal Relations

    2007

  • Finding Event, Temporal and Causal Structure in Text: A Machine Learning Approach

    2007

  • Identification of Event Mentions and their Semantic Class

    2006

  • Extracting opinion propositions and opinion holders using syntactic and lexical cues

    2005

  • Automatic extraction of opinion propositions and their holders

    2004

  • Building a foundation system for producing short answers to factual questions

    2002

Grants
Citations
H-Index
Patents
News
Books
Opportunities