Tools and data
Software, web apps, and datasets built and released by members of the lab. Everything here is open and free to use.
autogradeR is expected to help instructors grade pre-defined assignments using a particular and relatively simple set of functions. Authograder generates basic templates that can only be accessed by instructors using a pre-defined encrypted password. The current release is based on RMarkdown files.
Barrio Map offers fundamental functionalities like finding locations by conducting searches or selecting coordinates. Users can select from predefined scales and export the scaled maps to particular page sizes. Maps generated using Barrio Map are formal descriptions of particular sites with defined values of resolution, scales, and explicit information on distances. Barrio Map, in conjunction with OpenStreetMaps, is written in R, with functionalities largely leveraged from packages such as Leaflet and Shiny.
CityComp is a Shiny app developed to support the use of the comparative framework developed in Arechiga et al. (2023). In the app, users can directly visualize the results presented in the same paper. Specifically, a graphical and interactive version of Fig. 6 in the same paper is presented in the app. Users can further define a target city and estimate their similarity to cities across the globe. Dynamic filtering and mapping are implemented within the app.
Bayesian Additive Regression Networks. We apply Bayesian Additive Regression Tree (BART) principles to training an ensemble of small neural networks for regression tasks. Using Markov Chain Monte Carlo, Bayesian Additive Regression Net- works (BARN) samples from the space of single hidden layer neural networks that are conditioned on their fit to data. You can find more information in barnpy and its associated documentation
BayClump is a Shiny Dashboard application that is associated with the bayclumpr R package. All the functions implemented in BayClump are sourced from bayclumpr. We eveloped BayClump to allow users with less coding experience to be able to access standarized resources to calibrate and derive reconstructions using clumped isotope datasets.
bayclumpr is a self-contained R package that supports the use of Bayesian models and the analytical framework developed in Román-Palacios et al. (2022) for clumped isotope calibration, temperature reconstructions, and facilitatation of comparisons on Bayesian and classical models. bayclumpr fits both frequentist and Bayesian linear regressions to calibration datasets and performs temperature reconstructions under both frameworks.
This database includes seven datasets surrounding freshwater fish diversity used within the 2020 paper by our PI, Cristian Roman, and Elizabeth Miller.
The Urban Green Space (UGS) is a dataset used for improving public health and the sustainability of cities. This dataset was used by lab members Ian Estacio and Cristian Román-Palacios for a paper surrounding the walkability of cities.
The dataset includes haploid chromosome numbers for 80% of species in the most comprehensive species-level chronogram for the Brassicaceae. It supports analyses of polyploidy’s influence on diversification rates, species richness, and long-term evolutionary patterns within the family.
The dataset includes survey data from 538 plant and animal species over time, with 44% experiencing local extinctions. It supports analyses of climate change impacts on biodiversity, identifying key climatic drivers of extinction and assessing the roles of dispersal and niche shifts in species survival projections.
The dataset includes species richness and phylogenetic diversity data across terrestrial, marine, and freshwater habitats in animals and plants. It supports analyses of habitat-related diversification rates, ancestral habitat reconstructions, and the evolutionary origins of biodiversity patterns.
Enhannced PPE2 is a Human-Machine Interface (HMI) tailored for real-time object detection, with a specific focus on hard hat compliance monitoring in industrial settings. Although this project is tailored for hard hat compliance monitoring, the modular architecture allows easy adaptation for other applications like gengeral PPE detection including masks, goggles, vest and so on. This project focuses on developing the UI associated to the ML model detection algorithm.
This repo hosts a simple Firefox extension that checks on whether the requierements for a given program (or subplan) are being met by the student. This version focuses on Data Science and Information Science programs in InfoSci. However, this extension can be easily modified to enable other programs or subplans. It can also be adjusted in the requirements it check for. I did not implement an automatic submission.
This Neural Network-based tool facilitates extraction of building facade color patterns from Google Street View Images.
The phruta R package is designed to simplify the basic phylogenetic pipeline. All the code is run within the same program and data from intermediate steps are saved in independent folders (optional). phruta retrieves gene sequences, combines newly downloaded to local gene sequences, performs sequence alignments, and basic phylogenetic inference.
This is a basic shiny dashboard to perform pairwise comparison of different file submissions. The app is intended to help identify instances of potential plagiarism in coding-based assignments.
Chorus is a live polling platform designed for conferences, events, workshops, meetings, and similar events. Admins can create unique rooms, design questions, and enable access for attendees via unique tokens. Attendees can suggest answers and vote in real-time. Admins can manage questions, voting settings, and get summary statistics.
Salphycon is a shiny app that extends the functionalities of the (phruta) R package. Salphycon is able to (1) find potentially (phylogenetically) relevant gene regions for a given set of taxa on GenBank, (2) retrieve gene sequences and curate taxonomic information from the same database, (3) combine downloaded and local gene sequences, and (4) perform sequence alignment, phylogenetic inference, and basic tree dating tasks. Both phruta and salphycon are focused on species-level analyses.
This tool predicts Escherichia coli (E. coli) levels in the Upper Santa Cruz River using a model trained on public data from 2009-2022. It is intended as a warning of possible high bacterial loads; water quality must still be verified using coliform testing procedures to ensure public safety.
ssarp (Species-/Speciation-Area Relationship Projector) is an R package that provides a suite of functions to help users create speciation- and species- area relationships for island-dwelling taxa using occurrence data from GBIF (Global Biodiversity Information Facility) or the user’s own occurrence data.
The ACC is the largest database of animal chromosomal counts. We have curated chromosome numbers across the animal Tree of Life to make data accessible for people interested in understanding the potential links between biological processes and patterns to broad chromosomal changes in animals.
The Animal Culture Database (ACDB) is a database of cultural behaviors across nonhuman animal species. It synthesizes current literature on intra-species behavioral variation in contexts such as communication, foraging, and migration, along with data on effects of human disturbances to the environment, to facilitate comparative research on interactions between animal social learning and climate change.
The data.table package enables high-performance extended functionality for data tables in R. treedata.table is a wrapper for data.table for phylogenetic analyses that matches a phylogeny to the data.table, and preserves matching during data.table operations.
This research is being conducted with the following goals in mind:
- Facilitate evolutionary genomic analysis in the eukaryotic domain
- Make public genomic data more accessible by increasing information content per unit volume of data
- Develop improved methods and algorithms for genome informatics
- Streamline downstream analyses in genomic workflows
- Analyze patterns of genome evolution during the process of adaptation, speciation, diversification and domestication
The Southwest Center on Resilience for Climate Change and Health (SCORCH) brings together transdisciplinary research groups to conduct solutions-oriented team-science projects responsive to the health needs of arid lands communities adapting to climate change. The Integrated Data Visualization Core (IDVC) supports the overall Center mission through the provision of data science, visualization, and management expertise.