Software Engineer

LLM Systems, RAG & AI Infrastructure

C++ / Python

Contact Me

New York & Singapore

Open to New-Grad AI/ML & Software Engineering Roles from June 2026

LLM Systems

AI Infrastructure

I build AI systems that earn trust in production

Latency, grounding, and reliability decide whether an AI product gets used or ignored. I care about the full stack behind that outcome, from ingestion and retrieval quality to evaluation, model behavior, and the product experience around every answer.

Core focus

LLM Systems

RAG Pipelines

AI Infrastructure

Backend Engineering

Retrieval & Evaluation

Applied ML

From ingestion to reliable answers

Experience

AdvancedGRC

Software Engineer (AI/ML Systems)

New York City | May 2025 - Present

Primary engineer on a PDF-backed RAG support product, owning ingestion, retrieval, reranking, evaluation, and grounded UX across a FastAPI backend and React/Vite surfaces. Shipped Osmos for daily internal use and cut repetitive support tickets by 36%.

DigiPen Institute of Technology Singapore

Teaching Assistant

Singapore | Aug 2023 - Dec 2024

Led labs, debugging sessions, office hours, and grading across programming, software engineering, and math-heavy computer science modules. Helped students build stronger fundamentals through structured 1:1 support and clear evaluation standards.

DT Asia Pte Ltd

Software Engineer

Singapore | Jul 2019 - Jul 2020

Converted from intern to full-time engineer, building enterprise log-management and security workflows with Syslog-ng, Azure, and CloudWatch integrations. Also represented the team through live product demos and customer-facing technical presentations.

Projects

Tech stack

01

LLM / Retrieval Systems

Grounded answers, measurable iteration

Production-minded RAG systems for PDF-grounded question answering, retrieval tuning, reranking, confidence gating, and benchmark-driven iteration.

Python

PyTorch

Transformers

vLLM

Ollama

Chroma

FAISS

CUDA

Multi-KB RAG

Embeddings

Reranking

Confidence Gating

GPU Inference

Benchmark Suites

02

Backend / AI Product Delivery

APIs, jobs, UX delivery

FastAPI services and React/Vite interfaces for ingestion workflows, persistence, internal tooling, and embeddable AI product surfaces.

FastAPI

SQLite

React

Vite

Docker

Linux

Git

PDF Ingestion

Session Persistence

Widget + Full App

Operational Tooling

03

Computer Vision / Numerical

Dynamic programming, visualization

Interactive algorithm tooling for content-aware image resizing, visual diagnostics, threaded desktop workflows, and side-by-side algorithm comparisons.

Python

NumPy

OpenCV

Tkinter UI

Scharr Energy

Seam Insertion + Removal

Pause / Step Controls

Runtime Metrics

04

C++ / Graphics Systems

Rendering, tooling, geometry

Custom engines, render pipelines, spatial data structures, and procedural graphics work spanning editors, runtime tooling, and gameplay-facing systems.

C++

C#

OpenGL

Mono

FFmpeg

FMOD

ECS

Deferred Rendering

BVH / Octrees / BSP

Poisson Sampling

Perlin Noise

Editor Tooling

Contact

Let's talk.

Seeking new-grad software engineering opportunities starting June 2026, with a strong interest in AI/ML, LLM systems, retrieval infrastructure, and performance-critical systems. I'd be glad to connect about roles building practical, production-grade products across AI and software engineering.

Contact Me

yankhang.koh@gmail.com

linkedin.com/in/yan-khang
