Frazier N. Baker

PhD Student · Agentic AI for Scientific Discovery

My research focuses on agentic AI for scientific applications. I believe AI should possess reasoning and decision-making capabilities grounded in scientific principles to accelerate discovery across disciplines.

Research

AI for Chemistry

MMORF — Multi-agent Multi-Objective Retrosynthesis Framework

A modular framework for constructing multi-agent systems for retrosynthesis planning under multiple objectives.

LARC — Language Agentic framework for Retrosynthesis planning under Constraints

Agentic system integrating evaluation directly into retrosynthesis planning for constrained tasks.

LIDDiA — Language-based Intelligent Drug Discovery Agent

Agentic system for automating key stages of preclinical drug discovery through language-guided reasoning.

RLSynC — Reinforcement Learning for Synthon Completion

Offline-online reinforcement learning framework for improving synthon completion in retrosynthesis workflows.

AI for Biology

CoeViz

Web-based platform for interactive visualizations of amino acid coevolution.

Coevolution-based Metal Binding Site Prediction

Deep learning models for predicting metal-binding sites in proteins from amino acid coevolution data.

Comparative Analysis of Gene Expression Profiles Perturbed by Lupus Inducing Drugs

Unsupervised analysis of gene expression changes in drugs to identify etiological patterns for drug-induced lupus.

AI for Government Applications (Selected)

MASS — Multi-platform Autonomy Simulation Suite

Simulation testbed configurable via GUI for evaluating coordination across multiple autonomous platforms in complex environments.

Built using SCRIMMAGE · Used in AFRL (US) / DSTL (UK) SSAI Competition
Population Health Surveillance

Data analysis pipeline for anomaly detection and distribution shifts in health signals from heterogeneous sensor inputs.

Lead: Mike Farrell, GTRI
Language-model-assisted Annotation Systems

Protocol for finetuning and evaluating language models to align with human annotations, accelerating research workflows.

Lead: Courtney Crooks, GTRI
Metagenomics Dashboard

Bioinformatics platform for visualizing and analyzing metagenomic data.

Lead: Rebecca Hutchins, GTRI

Selected Collaborations

LlaSMol

Instruction-tuned large language model for chemistry tasks using large-scale curated datasets.

Lead: Botao Yu, OSU
ScienceAgentBench

Benchmark for evaluating language agents in scientific discovery tasks.

Lead: Ziru Chen, OSU

Connect

Email: lastname dot 3239 at buckeyemail dot osu dot edu

Download full CV