AI Engineer · Healthcare & Operations

I turn messy, real-world data into production AI.

Patient records, lab reports, operational data — I build the LLM, multi-agent, document-AI, and computer-vision systems that turn them into clean, decision-ready output. All of it runs in production.

View my work Get in touch

Open to opportunities 📍 Kuala Lumpur · Cyberjaya, Malaysia

AI Engineer @ N1 Healthcare

−75%

LLM API calls cut — same accuracy, lower cost

coordinated agents generating healthcare reports

pilot stores running production computer vision

≈RM 5k

hardware cost avoided per new store (≈USD 1.1k)

About

Built for the data you actually have.

I'm an AI Engineer focused on LLM-powered systems that hold up in production — document understanding, structured medical extraction, and multi-agent workflows.

At N1 Healthcare, I build the pipeline that turns complex, messy patient records into clean structured data and Comprehensive Healthcare Reports: document routers, parser-evaluation loops, clinical extraction, and LangGraph multi-agent systems built on LLMs, VLMs, OCR, and careful prompt & context engineering.

Before healthcare, I built computer vision and cloud-AI systems for a convenience-retail chain (FamilyMart Malaysia) over two years — object counting, on-shelf availability, product-availability monitoring, and real-time loss prevention, running on Kubernetes and AWS.

What I care about: reliable, cost-efficient AI that works with real-world data — not the data you wish you had — and genuinely supports decisions in healthcare and operations.

Capabilities

What I build with.

A pragmatic toolkit — chosen per task, and measured before it ships.

LLM & Agentic AI

LangGraphRAG / Agentic SearchMulti-agentContext EngineeringPrompt Engineering

LLM Ops & Evaluation

LangfuseLangSmithPrompt VersioningEval PipelinesObservability

Models & Inference

GPTClaudeGeminiQwenGLMvLLMDeepSeekKimi

Document AI

OCRVLMsMedical Doc ParsingMinerUDoclingPaddleOCR-VL

Computer Vision

YOLO / YOLOv8DeepSORTRe-IDOpenCVROS2 · SLAM

Cloud & Tools

DockerKubernetesAWSSQLPower BIGitLinux

Programming

PythonC++JavaKotlinFlutter / DartBash

Experience

Where I've shipped.

AI Engineer · N1 Healthcare

Full-time · Cyberjaya, Selangor · On-site

Sep 2025 — Present

Built LLM-powered systems for medical document parsing, structured extraction, and automated patient health summaries — N1's Comprehensive Healthcare Reports.

Built a medical document router that cut downstream parser LLM calls by ~75% with no loss in extraction accuracy.
Raised extraction quality through evaluation pipelines — deduplicating and normalising biomarkers across labs (so the same test from two providers maps to one clean record), adding medication coverage, and engineering prompts and context.
Designed a LangGraph multi-agent system that generates these patient summaries from raw records — coordinating specialist agents for clinical analysis, literature research, narrative generation, and fact validation.
Benchmarked 10+ LLMs, VLMs, and parsing tools (GPT, Claude, Gemini, Qwen, MinerU, Docling, PaddleOCR-VL) — choosing the right model per task on measured accuracy and cost, not assumptions.
Ran prompt iteration, observability, and model comparison through Langfuse and LangSmith.

LangGraphRAGVLMs / OCRLangfuseContext Eng.

AI Engineer · QL Maxincome — FamilyMart Malaysia

Internship → Part-time · Computer Vision & Cloud AI

Sep 2023 — Aug 2025

Computer vision and operations AI for a convenience-retail chain — from edge cameras to live Power BI dashboards.

Architected and deployed a Ready-To-Eat product-availability system across 2 pilot stores, with a scalable rollout underway.
Containerised inference on Kubernetes-on-AWS-EC2 with a central GPU cluster — avoiding ≈RM 5,000 of hardware per new store.
Built an end-to-end data pipeline (edge → S3 → Python ETL → Power BI) giving managers live replenishment KPIs.
Shipped on-shelf availability detection, customer & product flow counting (multi-object tracking with re-identification to avoid double-counts), a food-fraud prototype, and cash-transaction recognition.
Prototyped an autonomous shelf-auditing robot — ROS2 navigation with SLAM map-building for aisle patrol — plus an internal policy Q&A assistant using retrieval-augmented generation (RAG).

YOLOv8DeepSORT · Re-IDKubernetes · AWSPower BIROS2 · SLAM

Selected work

Production work & research.

Filter by where each one lives — agentic LLM systems, computer vision, or ML research.

Multi-agent healthcare report system diagram

Agentic · LLM2025

Multi-Agent Healthcare Report Generator

A LangGraph multi-agent system that turns raw patient records into full health summaries (N1's Comprehensive Healthcare Reports) — specialist agents for analysis, research, generation, and validation.

Orchestration 4 coordinated agents

LangGraphRAGMulti-agent

Agentic · LLM2025

Medical Document Router

Routes each document to the right parser so the expensive LLM only runs when it's actually needed — accuracy held, cost cut hard.

Impact −75% parser LLM calls

RoutingLLMCost-opt

Agentic · LLM2025

Parser Evaluation Pipeline

Re-checks extraction quality after every prompt or model change — so improvements don't quietly break what already worked — with biomarker dedup and name-normalisation for consistency across labs.

Reliability accuracy held across changes

EvalLangfuseCanonicalization

▶ Demo · 0:42

Agentic · LLM2025

HiVision — Voice-Only Phone Assistant

A Flutter app that streams live frames from a low-cost ESP32 camera to Google Gemini, which parses spoken commands and triggers on-screen actions automatically — hands-free phone control, end to end.

▶ Watch on YouTube ↗

Flutter / DartGeminiESP32-S3AutoGUI

Retrieval-augmented generation system diagram

Agentic · LLM2024

Retrieval-Augmented Assistant

An early RAG assistant on OpenAI / Gemini with a vector store, answering policy questions with grounded citations instead of guesses.

Grounding cited answers

RAGVector DBOpenAI / Gemini

Computer Vision2024

Ready-To-Eat Availability

Edge cameras estimate hot-food fullness and sync to AWS / Kubernetes, turning shelf images into waste-reduction and replenishment insight.

Deployment 2 pilot stores

YOLOEdgeAWS · K8s

Computer Vision2024

Object Counting & Re-ID

Multi-object tracking and re-identification measuring in-store and outdoor foot traffic — feeding new-store planning and site-selection models.

Tracking YOLOv8 + DeepSORT

YOLOv8DeepSORTRe-ID

Computer Vision2024

Food-Fraud Detection

A CCTV prototype that flags unauthorized consumption in real time, plus on-shelf availability alerts — both built for internal loss prevention.

Use case loss prevention

CCTVDetectionOpenCV

Computer Vision2024

Autonomous Shelf-Checking Robot

A ROS2-navigated robot using SLAM to patrol aisles and audit stock automatically — a moving sensor for shelf state.

Robotics ROS2 + SLAM

ROS2SLAMNavigation

ML Research2024

“Sibyl” Pre-Crime Prediction

A multimodal pipeline — ViT visual features, LSTM temporal modeling, and auxiliary cues (emotion, pose, motion) — to flag pre-criminal behaviour in surveillance footage, trained on the UCF-Crime benchmark.

Multimodal ViT + LSTM

Vision TransformerLSTMMultimodal

Productized AI

Not just prototypes — solutions.

Cortex · ERP-AI Solution

A 24/7 AI employee for your ERP.

A company-specific decision system that sits on top of an existing ERP — answering questions in plain language, writing the reports, and catching anomalies before anyone has to ask. The system of record stays the truth; the AI just makes it usable.

Ask in plain language → sourced answer in seconds
Auto-generated daily / weekly / monthly reports
24/7 anomaly watch, routed to the right person
Connects data across departments — not locked to a fixed dashboard

View the full solution →

Cortex · live example

“Why did revenue drop this month?”

Revenue down 15%, concentrated in two estates — traced to delivery-recognition timing, not lower production.

→ review delivery timing first

Education

Foundations & beyond.

2021 — 2025

B.Eng — Computer Engineering (AI)

UCSI University · Kuala Lumpur

Computer engineering with an AI specialisation, graduating with first-class honours.

★ CGPA 3.83 ★ Dean's List · all eligible semesters ★ UCSI Trust Scholarship

Leadership & languages

Beyond the work

President — UCSI Wolverines Muay Thai Club (2023–24). Led 30+ members across competition prep, weekly technical training, and coach coordination — growing the club's inter-varsity participation.

          EnglishMandarinMalayCantonese
        

Get in touch

Let's build something that works.

Open to AI / ML engineering roles and collaborations — especially where messy real-world data meets real decisions.

junantann@gmail.com linkedin.com/in/junantannn

📍 Kuala Lumpur · Cyberjaya, Selangor · Malaysia