John Smith

Mochamad Khaairi

Passionate about

Skills & Stack

Languages
AI Frameworks & Libraries
Data Processing
Data Visualization
Web Application
Databases
Backend & Tools

Projects

Undergraduate Thesis: Student Facial Expression Recognition System

Developed as a final-year thesis, this project focuses on building an efficient and accurate facial expression recognition system using a Hybrid MobileNetV3 and Vision Transformer (ViT) architecture. To ensure domain applicability, a specialized dataset of student expressions in classroom settings was constructed for rigorous model testing. The system is deployed via a Streamlit web interface, featuring an integrated LLM (Large Language Model) pipeline to analyze emotional trends from video inputs and generate actionable pedagogical recommendations for teachers.

Repository

CineBot: Intelligent AI Movie Companion

Designed and built CineBot, an AI-powered conversational agent using Streamlit and Google Gemini. This application leverages an Agentic Workflow via LangChain to fetch real-time movie data from the TMDB API and display it through rich visual cards. Key features include customizable AI personas, a smart watchlist management system, and a Retrieval-Augmented Generation (RAG) pipeline powered by Qdrant and HuggingFace embeddings, allowing users to upload and interactively analyze movie scripts or PDFs. Additionally, the bot integrates the Google Search API to provide real-time 'Now Playing' cinema schedules and showtimes specific to Indonesian theaters.

Repository

Multimodal RAG Recipe Assistant

Designed and built ChefBot, an AI-powered culinary assistant using Streamlit and Google Gemini. This application leverages a Multimodal RAG pipeline to retrieve precise cooking advice from PDF cookbooks, utilizing IBM Docling for layout-aware document parsing that accurately separates ingredients from instructions. Key features include an ingredient detection system powered by Computer Vision (Gemini Vision/LLaVA) that suggests recipes from user photos, and a robust Hybrid Vector Architecture that automatically fails over from Qdrant Cloud to local ChromaDB for high availability. Additionally, the system supports privacy-focused local inference via Ollama, orchestrated seamlessly through LangChain.

Repository

AI-Powered Job-CV Matching with Vector Search

Developed for the Compfest AI Innovation Challenges 2024 Preliminary, this project streamlines the recruitment process for IT professionals through an advanced AI-driven recommendation engine. The system leverages a fine-tuned SBERT model and Vector Search technology (powered by Zilliz Milvus) to perform high-precision similarity matching on a custom dataset of over 85,000 entries. Key features include seamless resume uploading and real-time job recommendations, ensuring highly relevant matches between candidates and job descriptions.

Repository

Clothing Multilabel Classification

Developed for the Hology Data Mining 2024 Preliminary, this project fine-tuned a Vision Transformer (ViT) for multi-label classification to predict clothing type and color. Achieving an Exact Match Ratio (EMR) of 0.9829, the project secured a top-8 position out of 196 teams, showcasing its high accuracy and innovative approach.

Repository

Multiclass Text Classification

Developed for the Satria Data 2024 preliminary round, this project focuses on multiclass text classification for election-related content across 8 distinct categories. By addressing data challenges like slang and class imbalance through rigorous preprocessing (using IndoNLP) and fine-tuning the indobenchmark/indobert-base-p1 model, the project achieved a Balanced Accuracy of 0.7085 on the test set.

Repository

SILELA: Sistem Pelayanan Elektronik Isola

A mobile application designed to digitize administrative correspondence processes for the Kelurahan Isola government office. As a Backend Developer, I designed and implemented the MySQL database schema and built robust RESTful APIs using FastAPI to handle complex business logic.

Repository is currently private

Certifications and Achievements

DCML
Bangkit 2024 Batch 1 Machine Learning Cohort
DCML
Dev Certification for Machine Learning with TensorFlow (DCML)
Capture Project
Hology 7.0 2024 Finalist of Data Mining
Capture Project
Qdrant Hackathon 2025 Best-in-Caregory Twelvelabs