LIAM KOZMA

MS Statistics  ·  BS Biochemical Engineering

High-dimensional statistics, protein language models, and HPC-scale machine learning. I build rigorous, reproducible pipelines at the boundary of computation and biology.

Selected Work 09 / 09
01

Recovery Thresholds in Protein Language Models

Scaling-law regression on PLM recovery under distribution shift. GMM-driven dispersion parameterized by δ, N=36 observations, executed on the Sapelo2 cluster with Python and Nextflow.

MS Thesis  ·  Scaling Laws  ·  PLM  ·  HPC  ·  Nextflow
2025
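
For readers unfamiliar with scaling-law regression, here is a minimal sketch of one common form: fitting y = a * x^b by ordinary least squares on log-transformed data. It is not the thesis model. The function name, the demo values, and the noise-free synthetic data are all assumptions for illustration, and the GMM-driven dispersion and δ parameterization are not represented.

```python
import math


def fit_power_law(xs, ys):
    """Fit y = a * x**b by least squares on (log x, log y).

    Returns (a, b). Assumes all xs and ys are strictly positive.
    """
    log_x = [math.log(x) for x in xs]
    log_y = [math.log(y) for y in ys]
    n = len(log_x)
    mean_x = sum(log_x) / n
    mean_y = sum(log_y) / n
    sxx = sum((u - mean_x) ** 2 for u in log_x)
    sxy = sum((u - mean_x) * (v - mean_y) for u, v in zip(log_x, log_y))
    exponent = sxy / sxx                       # slope in log-log space
    prefactor = math.exp(mean_y - exponent * mean_x)
    return prefactor, exponent


# Hypothetical, noise-free demo data (36 points); not thesis data.
xs_demo = [10 ** (3 * i / 35) for i in range(36)]
ys_demo = [2.0 * x ** -0.5 for x in xs_demo]
a_hat, b_hat = fit_power_law(xs_demo, ys_demo)
```

On noise-free input the fit recovers the generating prefactor and exponent; with real observations the log transform also changes the error model, which is one reason a more careful dispersion treatment can matter.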
02

Chimera Trajectory Filter

Custom UCSF Chimera filter for rendering molecular dynamics trajectories. Frame-level processing tuned for high-fidelity structural movies.

Molecular Dynamics  ·  Chimera  ·  Rendering
2026
03

L-Asparaginase Strain Engineering

E. coli engineered for elevated L-asparaginase yield. Fed-batch reactor mass balance optimized in Python with SciPy and DEAP. Year-long capstone; top-three at UGA Quick Pitch.

Metabolic Eng.  ·  Python  ·  SciPy / DEAP  ·  Project Lead
2023
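
A fed-batch mass balance of the kind named above can be sketched with SciPy's `solve_ivp`. This is an illustrative model only, assuming Monod growth, a constant feed rate, and a constant yield; every parameter value below is hypothetical, and neither product formation nor the DEAP optimization layer is included.

```python
from scipy.integrate import solve_ivp


def fed_batch_rhs(t, state, feed_rate, s_feed, mu_max, k_s, y_xs):
    """Biomass X (g/L), substrate S (g/L), and volume V (L) balances."""
    x, s, v = state
    s_pos = max(s, 0.0)                      # guard against tiny negative S
    mu = mu_max * s_pos / (k_s + s_pos)      # Monod specific growth rate (assumed)
    dilution = feed_rate / v
    dx = mu * x - dilution * x               # growth minus dilution by feed
    ds = dilution * (s_feed - s) - mu * x / y_xs
    dv = feed_rate                           # constant-rate feed (assumed)
    return [dx, ds, dv]


# Hypothetical parameters, chosen only to make the sketch run.
FEED_RATE, S_FEED, MU_MAX, K_S, Y_XS = 0.1, 100.0, 0.5, 0.5, 0.5
X0, S0, V0 = 0.5, 10.0, 1.0
T_END = 10.0

sol = solve_ivp(
    fed_batch_rhs,
    (0.0, T_END),
    [X0, S0, V0],
    args=(FEED_RATE, S_FEED, MU_MAX, K_S, Y_XS),
    rtol=1e-8,
    atol=1e-10,
)
final_x, final_s, final_v = sol.y[:, -1]

# Closure check: V * (S + X / Y_xs) should grow by exactly the substrate fed.
closure = final_v * (final_s + final_x / Y_XS)
expected = V0 * (S0 + X0 / Y_XS) + FEED_RATE * S_FEED * T_END
```

The closure check is a cheap sanity test worth keeping before wrapping a model like this in an optimizer: if the balance does not close, optimized feed profiles are not meaningful.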
04

Senior Bioprocess Lab Series

Four-unit series centered on ethanol production: microbial growth under varied air/N2 ratios, Ziegler-Nichols controller tuning, base-conversion kinetics, and fractional distillation.

Process Control  ·  Kinetics  ·  Distillation
2023
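
As a reference for the controller-tuning unit, the classic closed-loop Ziegler-Nichols rule for a PID controller can be sketched as below. Whether the lab used the closed-loop (ultimate gain) or open-loop (reaction curve) variant is not stated here, so the closed-loop form is an assumption; the function name and example inputs are hypothetical.

```python
def ziegler_nichols_pid(ku, tu):
    """Classic closed-loop Ziegler-Nichols PID settings.

    ku: ultimate gain at which the loop shows sustained oscillation.
    tu: period of that oscillation (any consistent time unit).
    """
    if ku <= 0 or tu <= 0:
        raise ValueError("ku and tu must be positive")
    kp = 0.6 * ku          # proportional gain
    ti = tu / 2.0          # integral time
    td = tu / 8.0          # derivative time
    return {
        "Kp": kp,
        "Ti": ti,
        "Td": td,
        "Ki": kp / ti,     # parallel-form integral gain
        "Kd": kp * td,     # parallel-form derivative gain
    }


# Hypothetical measurements, for illustration only.
gains = ziegler_nichols_pid(ku=10.0, tu=4.0)
```

These settings are usually treated as a starting point; the rule targets roughly quarter-amplitude decay, so further detuning is common where overshoot matters.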
05

Biofilter Reactor Kinetics

Kinetic model for a VOC-scrubbing biofiltration system. The model predicted effluent concentration at a flow rate of 220,000 gal/min with 95% removal of the rate-limiting compound, alongside a 5% cost reduction.

Reactor Design  ·  Kinetic Modeling  ·  VOC
2022
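
To make the 95% removal target concrete, here is a minimal sketch under one simple assumption: first-order degradation in an ideal plug-flow bed, so that C_out / C_in = exp(-k * tau). The project's actual rate law is not stated here, and the rate constant below is hypothetical.

```python
import math


def effluent_fraction(rate_constant, residence_time):
    """C_out / C_in for first-order decay in ideal plug flow (assumed model)."""
    return math.exp(-rate_constant * residence_time)


def residence_time_for_removal(removal, rate_constant):
    """Residence time needed to reach a given fractional removal."""
    if not 0.0 < removal < 1.0:
        raise ValueError("removal must be strictly between 0 and 1")
    if rate_constant <= 0:
        raise ValueError("rate_constant must be positive")
    return -math.log(1.0 - removal) / rate_constant


K_DEMO = 0.05  # 1/s, hypothetical first-order rate constant
tau_95 = residence_time_for_removal(0.95, K_DEMO)  # ln(20) / k, about 59.9 s
```

Under this model, bed volume scales as volumetric flow times residence time, which is where a design flow rate would enter the sizing.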
06

Metabolic & Synthetic Biology Lab

PCR amplification of a target gene, transformation into E. coli, recombinant protein expression, and downstream isolation to application-grade purity.

PCR  ·  Cloning  ·  Protein Purification
2022
07

Junior Transport & Kinetics Lab Series

Five-unit series spanning Fourier conduction, diffusion-constant measurement, reaction-order determination by spectrophotometry, ion-exchange deionization, and PID tuning.

Heat Transfer  ·  Diffusion  ·  Spectrophotometry
2022
08

Monoclonal Antibody Bioprocess

Simulated 100 g/day mAb facility, from murine myeloma and immunized B-cell culture through filtration, purification, and lyophilization, with full process economics.

Bioprocess Design  ·  Simulation  ·  Downstream
2021
09

Skateboard Reverse-Engineering

Full teardown and dimensional reconstruction of a skateboard deck and trucks in AutoCAD.

AutoCAD  ·  Engineering Graphics
2020
About

MS in Statistics, BS in Biochemical Engineering. Work sits at the intersection of high-dimensional statistics, bioinformatics, HPC, and machine learning, with recent focus on protein language models under data distribution shift. Equally at home in wet-lab bioprocess, cluster-scale regression, and reproducible Nextflow pipelines.

On Rotation