Generative AI Tool

AI CV Optimizer

A full-stack, local-first web application designed to evaluate and rewrite resume bullet points into highly professional, metric-driven statements using a custom fine-tuned Qwen2.5-1.5B model.

1.5B Param Model Mac GPU Accelerated 100-pt Heuristic Engine < 20s Latency
CV Improver Interface
Role

Solo Full-Stack Developer

Timeline

Sep 2024 – Feb 2025

Domain

LLMs, NLP, Web App

Core Tech

Flask, PyTorch, Qwen2.5-1.5B, LoRA

01 · Problem The Challenge

Job seekers frequently struggle to articulate their professional experience in a way that is impactful enough to pass Applicant Tracking Systems (ATS). While cloud-based AI tools exist, they often generate generic "AI-speak" and require sending personal data over the internet. The goal was to build a local-first, highly specialized application that could run entirely on a Mac's unified memory, transforming amateur text into metric-driven statements suitable for tech recruiting, while mathematically proving the improvement.

02 · Solution System Architecture

I engineered a Flask-based web application with a specialized Qwen/Qwen2.5-1.5B-Instruct model at its core. Instead of relying on generic base model capabilities, I injected a custom Parameter-Efficient Fine-Tuning (PEFT) LoRA adapter, fine-tuned on a proprietary synthetic dataset of tech resumes. To ensure speed, the weights are permanently fused upon boot, eliminating abstraction layers and utilizing PyTorch's Metal Performance Shaders (MPS) for Mac GPU acceleration.

To quantify the text transformation, I developed a rigid 100-point Heuristic Scoring Engine. This algorithm evaluates Flesch Reading Ease (penalizing overly complex or conversational text), enforces strict grammar rules via a local Java-based LanguageTool server, analyzes sentiment confidence using a HuggingFace pipeline, and measures action verb density against a proprietary keyword list.

03 · Engineering Technical Highlights

LoRA Fine-Tuning

Locally trained a custom LoRA adapter (r=16, alpha=32) targeting attention layers to transform the massive 1.5B parameter model into an "Expert Tech Recruiter" persona.

Hardware Optimization

Leveraged Apple Silicon unified memory by offloading tensor computations to the GPU via MPS, and reduced inference latency to under 20 seconds using early-stopping tokens.

100-Point Scoring

Engineered a multi-layered NLP scoring algorithm combining readability metrics, strict grammar checks, and proprietary keyword matching to evaluate CV quality.

04 · Results The Outcome

The AI CV Optimizer successfully rewrites amateur resume bullet points into highly professional statements. By running entirely locally with customized fine-tuning and hardware acceleration, it provides a fast, private, and mathematically proven tool for job seekers aiming for tech roles.

1.5B
Model Parameters
Qwen2.5-1.5B-Instruct
< 20s
Inference Latency
Early-stopping tokens
100
Point Scoring
Rigid heuristic system
100%
Local Execution
Mac GPU acceleration (MPS)

05 · Stack Tech Stack & Infrastructure

AI & Machine Learning
Qwen2.5-1.5B LoRA / PEFT PyTorch (MPS) TRL
Application & Scoring
Python / Flask LanguageTool textstat Vanilla JS
Previous Project
Next Project Discord Bot
View All Projects