Data Analyst in Bioinformatics and Ecology at IGB

Theodore Hermann, Ph.D.

2025-08-21

Background

  • 🏞️ Ph.D. in freshwater biology
  • 💻 Biologist with a programmer’s mindset
  • 🌐 Reproducibility & open practices
  • 🤝 Service-focused

Experience

🐟 Postdoc: FHAST (UC Santa Cruz)

  • Open-source agent-based modeling toolkit for fish habitat analysis → Allowed research groups and agencies to run habitat simulations reproducibly and without advanced coding expertise.

🔌 Consultation: EPRI Heat Exchanger ML Pipeline

  • Machine learning pipeline for heat exchanger fault detection → Helped engineers rapidly identify component failures, reducing downtime and maintenance costs.

📝 Freelance: Technical Editing & Writing Support

  • Editor for scholars & companies → Helped clients publish, secure grants, and communicate clearly.

Science Gets Messy

Solutions that Work

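# Pipeline definition (by convention, the _targets.R file)
library(targets)
library(tarchetypes)
library(here)

# Packages attached before each target runs
tar_option_set(
  packages = c("tidyverse", "here")
)

# Source helper functions from the R/ directory (the tar_source() default);
# read_data() and analyze_data() are assumed to be defined there
tar_source()

list(
    # Track the input file so downstream targets rerun when it changes
    tar_target(
        name = path,
        command = here("path/to/data.csv"),
        format = "file"
    ),
    # Import the raw data
    tar_target(
        name = raw_data,
        command = read_data(path)
    ),
    # Run the analysis on the imported data
    tar_target(
        name = analyzed_data,
        command = analyze_data(raw_data)
    ),
    # Render the Quarto manuscript; tar_quarto() handles .qmd sources,
    # whereas tar_render() is meant for R Markdown
    tar_quarto(
        name = manuscript,
        path = here("path/to/manuscript.qmd")
    )
)

  • Pipelines → teammates can rerun analyses without guesswork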

  • Controlled environments → collaborators get the same results every time

  • Version control → workflows open for team and community to reuse

  • Documentation → future researchers can build on the work without me
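
The pipeline above calls read_data() and analyze_data() without showing them. Below is a minimal sketch of what such helpers might look like, using base R only. The column names `site` and `count` and the per-site mean are hypothetical placeholders for illustration, not the actual analysis.

```r
# Hypothetical helpers of the kind tar_source() would load from R/.
# Column names `site` and `count` are made up for this sketch.

read_data <- function(path) {
  # Read the CSV and fail early if expected columns are missing
  data <- read.csv(path, stringsAsFactors = FALSE)
  required <- c("site", "count")
  missing_cols <- setdiff(required, names(data))
  if (length(missing_cols) > 0) {
    stop("Missing columns: ", paste(missing_cols, collapse = ", "))
  }
  data
}

analyze_data <- function(raw_data) {
  # Mean count per site, returned as a data frame
  aggregate(count ~ site, data = raw_data, FUN = mean)
}

# Usage with a small temporary file standing in for real data
tmp <- tempfile(fileext = ".csv")
write.csv(
  data.frame(site = c("A", "A", "B"), count = c(2, 4, 10)),
  tmp,
  row.names = FALSE
)
result <- analyze_data(read_data(tmp))
print(result)
```

Keeping each step a plain function with one input and one return value is what lets the pipeline decide which targets are out of date, and it lets a teammate test a step without running everything else.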

Reality of Research

How I Can Contribute

  • 🏞️ Freshwater expertise
  • 💻 Technical range (R/Python, pipelines)
  • 🌐 Open, shareable workflows
  • 🤝 Reproducibility as a service

Questions?