J Tan Jun An
AI Engineer · Healthcare & Operations

I turn messy, real-world data into production AI.

Patient records, lab reports, operational data — I build the LLM, multi-agent, document-AI, and computer-vision systems that turn them into clean, decision-ready output. All of it runs in production.

Open to opportunities 📍 Kuala Lumpur · Cyberjaya, Malaysia
Tan Jun An AI Engineer @ N1 Healthcare
−75%
LLM API calls cut — same accuracy, lower cost
4
coordinated agents generating healthcare reports
2
pilot stores running production computer vision
≈RM 5k
hardware cost avoided per new store (≈USD 1.1k)
About

Built for the data you actually have.

I'm an AI Engineer focused on LLM-powered systems that hold up in production — document understanding, structured medical extraction, and multi-agent workflows.

At N1 Healthcare, I build the pipeline that turns complex, messy patient records into clean structured data and Comprehensive Healthcare Reports: document routers, parser-evaluation loops, clinical extraction, and LangGraph multi-agent systems built on LLMs, VLMs, OCR, and careful prompt & context engineering.

Before healthcare, I built computer vision and cloud-AI systems for a convenience-retail chain (FamilyMart Malaysia) over two years — object counting, on-shelf availability, product-availability monitoring, and real-time loss prevention, running on Kubernetes and AWS.

What I care about: reliable, cost-efficient AI that works with real-world data — not the data you wish you had — and genuinely supports decisions in healthcare and operations.

Capabilities

What I build with.

A pragmatic toolkit — chosen per task, and measured before it ships.

LLM & Agentic AI

LangGraphRAG / Agentic SearchMulti-agentContext EngineeringPrompt Engineering

LLM Ops & Evaluation

LangfuseLangSmithPrompt VersioningEval PipelinesObservability

Models & Inference

GPTClaudeGeminiQwenGLMvLLMDeepSeekKimi

Document AI

OCRVLMsMedical Doc ParsingMinerUDoclingPaddleOCR-VL

Computer Vision

YOLO / YOLOv8DeepSORTRe-IDOpenCVROS2 · SLAM

Cloud & Tools

DockerKubernetesAWSSQLPower BIGitLinux

Programming

PythonC++JavaKotlinFlutter / DartBash
Experience

Where I've shipped.

AI Engineer · N1 Healthcare

Full-time · Cyberjaya, Selangor · On-site
Sep 2025 — Present

Built LLM-powered systems for medical document parsing, structured extraction, and automated patient health summaries — N1's Comprehensive Healthcare Reports.

  • Built a medical document router that cut downstream parser LLM calls by ~75% with no loss in extraction accuracy.
  • Raised extraction quality through evaluation pipelines — deduplicating and normalising biomarkers across labs (so the same test from two providers maps to one clean record), adding medication coverage, and engineering prompts and context.
  • Designed a LangGraph multi-agent system that generates these patient summaries from raw records — coordinating specialist agents for clinical analysis, literature research, narrative generation, and fact validation.
  • Benchmarked 10+ LLMs, VLMs, and parsing tools (GPT, Claude, Gemini, Qwen, MinerU, Docling, PaddleOCR-VL) — choosing the right model per task on measured accuracy and cost, not assumptions.
  • Ran prompt iteration, observability, and model comparison through Langfuse and LangSmith.
LangGraphRAGVLMs / OCRLangfuseContext Eng.

AI Engineer · QL Maxincome — FamilyMart Malaysia

Internship → Part-time · Computer Vision & Cloud AI
Sep 2023 — Aug 2025

Computer vision and operations AI for a convenience-retail chain — from edge cameras to live Power BI dashboards.

  • Architected and deployed a Ready-To-Eat product-availability system across 2 pilot stores, with a scalable rollout underway.
  • Containerised inference on Kubernetes-on-AWS-EC2 with a central GPU cluster — avoiding ≈RM 5,000 of hardware per new store.
  • Built an end-to-end data pipeline (edge → S3 → Python ETL → Power BI) giving managers live replenishment KPIs.
  • Shipped on-shelf availability detection, customer & product flow counting (multi-object tracking with re-identification to avoid double-counts), a food-fraud prototype, and cash-transaction recognition.
  • Prototyped an autonomous shelf-auditing robot — ROS2 navigation with SLAM map-building for aisle patrol — plus an internal policy Q&A assistant using retrieval-augmented generation (RAG).
YOLOv8DeepSORT · Re-IDKubernetes · AWSPower BIROS2 · SLAM
Selected work

Production work & research.

Filter by where each one lives — agentic LLM systems, computer vision, or ML research.

Multi-agent healthcare report system diagram
Agentic · LLM2025

Multi-Agent Healthcare Report Generator

A LangGraph multi-agent system that turns raw patient records into full health summaries (N1's Comprehensive Healthcare Reports) — specialist agents for analysis, research, generation, and validation.

Orchestration 4 coordinated agents
LangGraphRAGMulti-agent
Cost-aware document router diagram
Agentic · LLM2025

Medical Document Router

Routes each document to the right parser so the expensive LLM only runs when it's actually needed — accuracy held, cost cut hard.

Impact −75% parser LLM calls
RoutingLLMCost-opt
Parser evaluation pipeline diagram
Agentic · LLM2025

Parser Evaluation Pipeline

Re-checks extraction quality after every prompt or model change — so improvements don't quietly break what already worked — with biomarker dedup and name-normalisation for consistency across labs.

Reliability accuracy held across changes
EvalLangfuseCanonicalization
HiVision voice-only assistant
▶ Demo · 0:42
Agentic · LLM2025

HiVision — Voice-Only Phone Assistant

A Flutter app that streams live frames from a low-cost ESP32 camera to Google Gemini, which parses spoken commands and triggers on-screen actions automatically — hands-free phone control, end to end.

▶ Watch on YouTube ↗
Flutter / DartGeminiESP32-S3AutoGUI
Retrieval-augmented generation system diagram
Agentic · LLM2024

Retrieval-Augmented Assistant

An early RAG assistant on OpenAI / Gemini with a vector store, answering policy questions with grounded citations instead of guesses.

Grounding cited answers
RAGVector DBOpenAI / Gemini
Ready-to-eat hot food counter
Computer Vision2024

Ready-To-Eat Availability

Edge cameras estimate hot-food fullness and sync to AWS / Kubernetes, turning shelf images into waste-reduction and replenishment insight.

Deployment 2 pilot stores
YOLOEdgeAWS · K8s
Object counting and tracking demo
Computer Vision2024

Object Counting & Re-ID

Multi-object tracking and re-identification measuring in-store and outdoor foot traffic — feeding new-store planning and site-selection models.

Tracking YOLOv8 + DeepSORT
YOLOv8DeepSORTRe-ID
Food fraud detection illustration
Computer Vision2024

Food-Fraud Detection

A CCTV prototype that flags unauthorized consumption in real time, plus on-shelf availability alerts — both built for internal loss prevention.

Use case loss prevention
CCTVDetectionOpenCV
Autonomous shelf-checking robot with SLAM map
Computer Vision2024

Autonomous Shelf-Checking Robot

A ROS2-navigated robot using SLAM to patrol aisles and audit stock automatically — a moving sensor for shelf state.

Robotics ROS2 + SLAM
ROS2SLAMNavigation
Pre-crime behaviour prediction research
ML Research2024

“Sibyl” Pre-Crime Prediction

A multimodal pipeline — ViT visual features, LSTM temporal modeling, and auxiliary cues (emotion, pose, motion) — to flag pre-criminal behaviour in surveillance footage, trained on the UCF-Crime benchmark.

Multimodal ViT + LSTM
Vision TransformerLSTMMultimodal
Productized AI

Not just prototypes — solutions.

Cortex · ERP-AI Solution

A 24/7 AI employee for your ERP.

A company-specific decision system that sits on top of an existing ERP — answering questions in plain language, writing the reports, and catching anomalies before anyone has to ask. The system of record stays the truth; the AI just makes it usable.

  • Ask in plain language → sourced answer in seconds
  • Auto-generated daily / weekly / monthly reports
  • 24/7 anomaly watch, routed to the right person
  • Connects data across departments — not locked to a fixed dashboard
View the full solution →
Cortex · live example
“Why did revenue drop this month?”
Revenue down 15%, concentrated in two estates — traced to delivery-recognition timing, not lower production.
→ review delivery timing first
Education

Foundations & beyond.

2021 — 2025

B.Eng — Computer Engineering (AI)

UCSI University · Kuala Lumpur

Computer engineering with an AI specialisation, graduating with first-class honours.

CGPA 3.83 Dean's List · all eligible semesters UCSI Trust Scholarship

Leadership & languages

Beyond the work

President — UCSI Wolverines Muay Thai Club (2023–24). Led 30+ members across competition prep, weekly technical training, and coach coordination — growing the club's inter-varsity participation.

EnglishMandarinMalayCantonese
Get in touch

Let's build something that works.

Open to AI / ML engineering roles and collaborations — especially where messy real-world data meets real decisions.

📍 Kuala Lumpur · Cyberjaya, Selangor · Malaysia