Mobile + ML inference

StudyAI

AI-native iOS study companion for live lecture capture

Built an iOS study companion that turns in-class live notes and voice capture into structured summaries and machine-generated quizzes, powered by Meta's BART seq2seq model on a FastAPI inference layer with offline-first Firestore and Core Data sync.

Role: Solo Developer
Timeframe: May 2025
Status: Not shipped (iOS prototype)
Links: Video demo

Product screenshots

Context and problem

Students struggle to convert raw lecture capture—live notes, voice memos, and unstructured material—into review-ready summaries and formative quizzes fast enough to reinforce learning while content is still fresh.

Challenge

Close the gap between live lecture delivery and effective self-review by converting unstructured in-class input into retention-ready summaries and assessments on a mobile-first, latency-sensitive path.

Constraints

Mobile-first UX with intermittent campus connectivity—offline access required for post-lecture review
Inference latency must stay low enough for interactive study sessions during and after class
Quiz quality depends on summarization fidelity from noisy, real-time speech and note input

Key contributions

Built in-class live note capture with speech-to-text ingestion, streaming lecture content into the processing pipeline without disrupting session flow.
Deployed Meta BART (`facebook/bart-large-cnn`) for abstractive summarization and quiz-item generation—producing study briefs and formative assessment prompts grounded in captured material.
Architected an offline-first SwiftUI client with Firestore + Core Data dual persistence and a FastAPI layer targeting p95 inference latency under 400ms.

Architecture

Step 1SwiftUI iOS clientLive note capture, voice input, quiz/flashcard UI
→
Step 2FastAPI inference serviceBART summarization and quiz-generation pipelines
→
Step 3Firestore + Core DataReal-time sync with offline-first local persistence

Key engineering decisions

Selected Meta BART (`facebook/bart-large-cnn`) for bounded seq2seq summarization and text generation instead of open-ended LLM calls—better cost and latency control for structured study outputs
Kept live capture lightweight on-device while delegating summarization and quiz synthesis to the backend inference path
Used dual-layer persistence (Firestore + Core Data) so students can review generated material without continuous network access

Security and reliability

Offline-first sync for note and quiz access across intermittent campus connectivity
Optimized inference paths with p95 response times under 400ms for interactive study workflows

Outcome

Built an iOS study companion prototype that turns in-class live notes and voice capture into BART-powered summaries and auto-generated quizzes—structured review material from raw lecture input in a single mobile workflow.

Limitations

Not shipped; iOS prototype only—no App Store release
Quiz quality varies with speech recognition accuracy and lecture audio conditions
Model scope limited to summarization and generation—no multimodal or long-document RAG pipeline

Next steps

Add retrieval grounding for course materials (syllabi, slides) to improve quiz relevance
Expand evaluation harnesses for summary and quiz quality across subject domains

Tech stack

SwiftUI
FastAPI
BART (facebook/bart-large-cnn)
Firebase
Firestore
Core Data
Speech-to-text