Hi, Iβm Edwin β an AI / Machine Learning Engineer with 3+ years of experience building and deploying production ML systems. I specialize in applied machine learning, ML pipelines, and LLM-powered tools data-intensive applications, with a focus on reliability, scalability, and real-world impact.
|
Production AI assistant integrated into the GenePattern platform, supporting multimodal data analysis, workflow automation, and researcher productivity at scale. Stack: Python Β· LangChain Β· AWS Β· LLMs |
Interactive portal for exploring extrachromosomal DNA (ecDNA) across cancer samples. Stack: Django Β· MongoDB Β· JS |
- Production ML pipelines and reproducible workflows
- Applied ML for biomedical and health data
- GPU acceleration and performance optimization
- LLM integration into real-world systems
π Selected publications and production engineering contributions in large-scale biomedical ML systems
Luebeck J, Huang E, et al.
AmpliconSuite: Analyzing Focal Amplifications in Cancer Genomes.
Genomics and Informatics, 2024.
π ScienceDirect
β Deployed ML pipelines and a cloud-hosted repository for large-scale ecDNA analysis, enabling reproducible research across 20 + projects (funded >$25 M).
Liefeld T, Huang E, Wenzel A, Yoshimoto K, Sharma A, Sicklick J, Mesirov J, Reich M.
NMF Clustering: Accessible NMF-based Clustering Utilizing GPU Acceleration.
Genomics and Informatics, 2024.
π Fortune Journals
β Implemented RAPIDS AI, CuPy, and custom CUDA kernels to achieve 27Γ runtime speedup on HPC clusters.
Reich M, Tabor T, Liefeld J, Joshi J, Kim F, Huang E, Thorvaldsdottir H, Blankenberg D, Mesirov J.
Genomics to Notebook (g2nb): Extending the Electronic Notebook to Address the Challenges of Bioinformatics Analysis.
Genomics and Informatics, 2024.
π Fortune Journals
β Contributed to extending Jupyter-based infrastructure (g2nb) for scalable bioinformatics workflows.
Reich M, Tabor T, Liefeld J, Huang E, Kim F, Mesirov J.
The GenePattern Ecosystem for Cancer Bioinformatics.
AACR Cancer Research (Abstract 7426), 2024.
π AACR Abstract
β Presented cloud-based GenePattern workflows supporting cancer informatics and LLM integration.


