Software Engineer

LLM Systems, RAG & AI Infrastructure

C++ / Python

New York
& Singapore

Open to New-Grad AI/ML &
Software Engineering Roles from June 2026

LLM
Systems

AI
Infrastructure

Contact Me

I build

AI systems

that earn

trust

in production

Latency, grounding, and reliability decide whether an AI product gets used or ignored. I care about the full stack behind that outcome, from ingestion and retrieval quality to evaluation, model behavior, and the product experience around every answer.

Core focus

LLM Systems
RAG Pipelines
AI Infrastructure
Backend Engineering
Retrieval & Evaluation
Applied ML

Experience

AdvancedGRC logo

AdvancedGRC

Software Engineer (AI/ML Systems)

New York City Metropolitan Area | May 2025 - Present

Design and ship applied AI systems for GRC workflows, focused on LLM/RAG products, retrieval infrastructure, document ingestion, evaluation, and internal developer tooling.

DigiPen Institute of Technology Singapore logo

DigiPen Institute of Technology Singapore

Teaching Assistant

Singapore | Aug 2023 - Dec 2024

Supported undergraduate computer science courses through labs, office hours, grading, and structured debugging help across programming, software engineering, and math-heavy modules.

DT Asia Pte Ltd logo

DT Asia Pte Ltd

Software Engineer

Singapore | Jul 2019 - Jul 2020

Started as an intern and was promoted to full-time, building cybersecurity and log-management systems for enterprise environments with secure pipelines, automation, and operational tooling.

Projects

Tech stack

The tools I reach for most often across model behavior, retrieval, backend delivery, and full-stack product work.

01

ML / LLM

Models, embeddings, eval

Applied model work for embeddings, ranking, semantic retrieval, and benchmark-driven iteration in production-minded support systems.

PyTorch
PyTorch
Transformers
Transformers
RAGRerankingEmbeddingsSemantic SearchLLM Evaluation
02

Inference / Retrieval

Serving, search, confidence

Serving and retrieval tooling for grounded answers, multi-KB search, vector indices, and confidence-aware answer routing.

vLLM
vLLM
Ollama
Ollama
Chroma
Chroma
FAISS
FAISS
CUDA
CUDA
Confidence Gating
03

Backend / Deployment

APIs, jobs, persistence

FastAPI services, ingestion pipelines, containerized local development, and durable data flows for AI products that need to ship cleanly.

FastAPI
FastAPI
Docker
Docker
SQLite
SQLite
Linux
Linux
Git
Git
04

Languages / Frontend

Core languages, UI delivery

Core implementation languages plus the frontend tools I use to ship full-stack AI interfaces, widgets, and internal engineering tools.

Python
Python
C++
C++
C
C
C#
C#
React
React
Vite
Vite

Contact

Let's
talk.

Open to new-grad AI/ML and software engineering roles from June 2026. If you're hiring for LLM systems, backend infrastructure, or applied ML work, I'd be glad to talk.