Member of the Graduate Faculty | Associate Dean, Research and Graduate Academic Affairs | Professor, School of Information
P. Bryan Heidorn's areas of research include natural language processing, text mining for metadata and information retrieval, particularly in biodiversity literature, and museum informatics. Heidorn became the Director of the School of Information when the School of Information Resources and Library Science (SIRLS) merged with the School of Information Science, Technology and Arts (SISTA) in July 2015. From 2009 to 2015 he was the SIRLS Director. Dr. Heidorn stepped down as Director in January 2019 and now serves as the Associate Dean of Research and the Associate Dean for Graduate Academic Affairs. Prior to coming to the UA, Heidorn was a faculty member of the Graduate School of Library and Information Science at the University of Illinois at Urbana-Champaign. For the last two years he also served as a program manager of the Division of Biological Infrastructure at the National Science Foundation. Bryan served in a number of posts including President, VP and Grants Committee Chair of the JRS Biodiversity Foundation which funds work in Africa and South America for biodiversity informatics projects for 7 years ending 2013. He has served to several years on the Organization for Tropical Studies Science and Informatics Committees. He is currently the the Chair of the Informatics Committee and on the OTS Board of Directors.
VR of Historical Places - Shared Churches after the 30-years war For students who have completed Virtual Reality and simulation game classes with tools such as the Unity Engine. A global team of scholars is collecting information including 360 degree (photosphere) images of simultaneum mixtum, shared Catholic and Protestant churches from 16th century and later. We are looking for students to explore methods of making this information available in VR as well as WWW environments. End users should be able to explore an interior space in VR and point to teleport to a new location or bring up additional information such as images, text, and audio describing objects or spaces in the scene. https://sharedchurches.arizona.edu/
iSchool Capstone or Directed Study Topic Analysis and Financial Modeling of Federal Grant Programs How have federal funding patterns for research changed over the past 25 years? How many funds have gone to each topic? What topics are likely to be funded next and at what levels? How successful have the funding programs been in terms of papers published and data produced per unit investment? We have previously analyzed data from a couple of programs for just a few years at the National Science Foundation. We would like to scale in terms of programs at NSF and other agencies and look further back in time. Our initial analysis used Latent Dirichlet Allocation (LDA), a Bayesian network to do analysis. We’d like to try out other topic analysis techniques such as Doc2Vec, Bert, correlated topic models, and others for these data sets. Other students have moved the tools to a supercomputing network but more could be done to enable larger models. In this capstone project, you could analyze different data from NSF or data from NASA or the Department of Energy. You could work on different algorithms or visualization techniques. The final project depends on your interests and skills.
Ecological Image Processing and Analysis iSchool Capstone or Directed Study Professor Heidorn has a couple of datasets and problems that may be appropriate for the MS capstone projects. The data selection process has largely been worked out and most data cleaning tools have been developed. This should allow you to advance a little faster in the single-semester course. While Dr. Heidorn can help you to understand the data and the types of problems that can be solved with the data, each student is expected to develop their own unique capstone project that will demonstrate the skills that they have when approaching graduation from our program. This is different from an internship because you need to develop your own project and project report. Should any of this work eventually lead to publication, students will be appropriately acknowledged. Professor Heidorn has a couple of datasets and problems that may be appropriate for the MS capstone projects. The data selection process has largely been worked out and most data cleaning tools have been developed. This should allow you to advance a little faster in the single-semester course. While Dr. Heidorn can help you to understand the data and the types of problems that can be solved with the data, each student is expected to develop their own unique capstone project that will demonstrate the skills that they have when approaching graduation from our program. This is different from an internship because you need to develop your own project and project report. Should any of this work eventually lead to publication, students will be appropriately acknowledged. We wish to be able to automatically track plant phenophase from camera images. In this capstone project, you would develop tools to improve this process. This might be through statistics, supervised or unsupervised machine learning, data integration, and/or visualization techniques. In the face of climate change, plants and animals are having varied success in adapting to changing environments. One way that this can be studied is through the evaluation of changes in phenophase. These changes can have cascading impacts on ecosystems. For example, the timing of when leaves form on trees drives the populations of insects that eat the leaves. In temperate environments, many birds time their migration back to the region in the spring to coincide with increasing insect populations. If the leaves emerge sooner, the insects may increase their numbers sooner. Unfortunately, the birds that time their migration on hours of sunlight in a day may not migrate back north in time to eat the insects and have sufficient nutrition to feed their young. Traditionally data on the phenophase for the trees have been collected by human observers. However, this is labor intensive and there are not enough people collecting data. About 5 years ago a consortium of researchers and government officials started placing cameras on towers and at ground level at hundreds of locations around the country and the world in the Phenocam project. The cameras take photographs every 5-15 minutes 24 hours a day in visible light and infra-red. There is a growing database of hundreds of thousands of images that can be used in this capstone project. The USA National Phenology Network provides a database of human observations of phenophase for many but not all of the camera locations. Developing image analysis tools is challenging. Most machine learning methods for images have been for human-built landscapes and not the natural environment. While there are many images their depth in time is only about 5 years so deep learning methods that need a lot of data do not work in a straightforward manner. In this capstone project, you would build tools to identify changes in phenophase from the images. You would need to develop your own approach to some subproblems in this space and report on that.