A mobile-first coffee tracking application that integrates multiple AI providers (Groq/Llama, Google Gemini, Tesseract OCR) to automatically extract, structure, and enrich coffee data from bag label photos.
Live: the-bean-keeper.onrender.com
This started as a personal tool and became a proving ground for multi-provider AI integration. Each AI provider was selected for what it does best:
- Groq (Llama 3.1 8B) for structured data extraction: fast inference, JSON mode, low cost per call. Chosen over GPT/Claude for this task because extraction needs speed, not reasoning depth.
- Google Gemini Pro for video generation pipeline: multimodal capabilities for converting screen recordings into polished product demos.
- Tesseract.js for client-side OCR: runs in the browser, zero API cost, handles multilingual text (English + Chinese).
- ElevenLabs for voice synthesis in the video pipeline.
The architecture uses fallback logic between providers. If AI extraction fails, regex-based extraction catches common patterns. If cloud storage fails, local filesystem takes over. Every external dependency has a graceful degradation path.
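The fallback pattern described above can be sketched in a few lines. This is an illustrative sketch, not the project's actual code; the function and field names are hypothetical:

```typescript
// Illustrative provider-fallback chain: try each extractor in order,
// treating a thrown error or a null result as "move on to the next one".
type Extractor = (text: string) => { roaster?: string; origin?: string } | null;

function extractWithFallback(text: string, extractors: Extractor[]) {
  for (const extract of extractors) {
    try {
      const result = extract(text);
      if (result) return result; // first provider to succeed wins
    } catch {
      // swallow the provider error and fall through to the next one
    }
  }
  return null; // every provider failed
}

// Example chain: AI extraction first, regex patterns as the safety net
const aiExtract: Extractor = () => {
  throw new Error("AI provider unavailable");
};
const regexExtract: Extractor = (text) => {
  const m = text.match(/Roaster:\s*(.+)/i);
  return m ? { roaster: m[1].trim() } : null;
};

const result = extractWithFallback("Roaster: Onyx", [aiExtract, regexExtract]);
// result.roaster is "Onyx" even though the AI step threw
```

The same shape applies to the storage fallback (Cloudinary first, local filesystem second).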
- AI-Powered Extraction: Upload coffee bag photos. AI extracts roaster, origin, variety, process method, roast level, and more.
- Bilingual Interface: Full English and Traditional Chinese support with auto-detection
- Notion OAuth: Multi-user auth where each user gets their own isolated Notion database
- Guest Mode: Browse the owner's collection without logging in (read-only)
- Advanced Filtering: Filter by roast level, rating, origin with dynamic sort
- AI Video Pipeline: Converts screen recordings into product demos (Remotion + Gemini + ElevenLabs) at $0 cost
- Mobile-First: Dual photo upload (camera + file picker), responsive 2-5 column grid
| Layer | Provider | Why This Provider |
|---|---|---|
| OCR | Tesseract.js (client-side) | Zero API cost, browser-native, multilingual |
| Data extraction | Groq AI (Llama 3.1 8B) | Fast inference, JSON mode, structured output |
| Video generation | Gemini Pro (Google) | Multimodal, long context for video scripts |
| Voice synthesis | ElevenLabs | Natural speech for product demos |
| Fallback | Regex patterns | Graceful degradation when AI fails |
- Frontend: React 18, TypeScript, Vite, TanStack Query, shadcn/ui, Tailwind CSS
- Backend: Express.js, TypeScript, Notion SDK, Cloudinary
- Auth: Notion OAuth 2.0 with session management
- Deployment: Render.com with Cloudinary for persistent photo storage
- Node.js 18+
- Groq API key (groq.com)
- Notion Internal Integration (notion.so/my-integrations)
- Google Maps API key (optional)
# Clone the repository
git clone https://github.com/YFC-ophey/The-Bean-Keeper.git
cd the-bean-keeper
# Install dependencies
npm install
# Copy environment variables
cp .env.example .env
# Edit .env with your API keys
# Run development server
npm run dev

Visit http://localhost:5000
Create a .env file with:
# Required
GROQ_API_KEY=your_groq_api_key
NOTION_API_KEY=your_notion_internal_integration_token
NOTION_DATABASE_ID=your_notion_database_id
# Optional
VITE_GOOGLE_MAPS_API_KEY=your_google_maps_key
PORT=5000

See .env.example for details.
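A startup check for the required variables can fail fast with a clear message instead of a mid-request crash. This is a hypothetical sketch (the helper name and error wording are illustrative, not from the project):

```typescript
// Verify the required environment variables before the server starts.
const REQUIRED = ["GROQ_API_KEY", "NOTION_API_KEY", "NOTION_DATABASE_ID"];

function missingEnv(env: Record<string, string | undefined>): string[] {
  // returns the names of any required variables that are missing or empty
  return REQUIRED.filter((name) => !env[name]);
}

// Example: NOTION_DATABASE_ID is absent, so it is reported
const missing = missingEnv({
  GROQ_API_KEY: "gsk_example",
  NOTION_API_KEY: "secret_example",
});
// missing is ["NOTION_DATABASE_ID"]
```

In the real server this would run against `process.env` at boot and throw if the list is non-empty.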
# Create a page in Notion and get its ID
# Then run:
npx tsx create-database.ts <notion-page-id>

See NOTION_DATABASE_STRUCTURE.md for the complete schema.
Full deployment guide: DEPLOYMENT.md
Quick start: DEPLOY_QUICK_START.md
# 1. Push to GitHub
git add .
git commit -m "Ready for deployment"
git push
# 2. Create web service on Render
# 3. Connect GitHub repository
# 4. Add environment variables
# 5. Deploy!

Auto-deploys on every git push to the main branch.
# Development server
npm run dev
# Type checking
npm run check
# Production build
npm run build
# Production server
npm start
# Test Groq AI extraction
npx tsx test-groq.ts
# Test Notion connection
npx tsx test-notion-setup.ts

Mobile-first grid layout with Instagram-style coffee cards
Upload photo → AI extracts roaster, origin, variety, process, roast level
Toggle between English and Traditional Chinese
Track your coffee journey with collection insights
The-Bean-Keeper/
├── client/                   # React frontend
│   ├── src/
│   │   ├── components/       # UI components
│   │   ├── pages/            # Page components
│   │   ├── i18n/             # Translations (EN/ZH)
│   │   └── lib/              # API client
│   └── public/               # Static assets
├── server/                   # Express backend
│   ├── index.ts              # Server entry
│   ├── routes.ts             # API endpoints
│   ├── groq.ts               # Groq AI client
│   ├── notion.ts             # Notion operations
│   └── notion-storage.ts     # Storage layer
├── shared/                   # Shared types
│   └── schema.ts             # TypeScript + Zod schemas
├── DEPLOYMENT.md             # Full deployment guide
├── DEPLOY_QUICK_START.md     # Quick deployment steps
└── CLAUDE.md                 # Development guide
- OCR: Tesseract.js extracts raw text from photos
- AI Processing: Groq Llama 3.1 8B structures the data
- Smart Detection: Automatically identifies roast level, origin, variety
- Graceful Fallback: Regex extraction if AI fails
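The "smart detection" step can be pictured as simple keyword matching over the OCR text. A minimal sketch, with illustrative keywords only (the real detection logic and vocabulary live in the project's extraction code):

```typescript
// Keyword-based roast-level detection, usable as a regex fallback
// when structured AI extraction fails.
function detectRoastLevel(text: string): "light" | "medium" | "dark" | null {
  const t = text.toLowerCase();
  if (/\blight\b/.test(t)) return "light";
  if (/\bmedium\b/.test(t)) return "medium";
  if (/\b(dark|french|espresso)\b/.test(t)) return "dark";
  return null; // nothing recognizable on the label
}

detectRoastLevel("Ethiopia Yirgacheffe, Light Roast, Washed"); // "light"
```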
- Full bilingual support (EN + ZH 繁體中文)
- Automatic language detection
- LocalStorage persistence
- 6 translation namespaces
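Auto-detection with localStorage persistence boils down to "saved choice wins, otherwise follow the browser". A hedged sketch, with illustrative names (the project may structure this differently):

```typescript
// Resolve the UI language: a stored user preference takes priority;
// otherwise fall back to the browser's language tag.
function resolveLanguage(
  stored: string | null, // e.g. localStorage.getItem("lang")
  browserLang: string    // e.g. navigator.language
): "en" | "zh" {
  if (stored === "en" || stored === "zh") return stored; // saved choice wins
  return browserLang.toLowerCase().startsWith("zh") ? "zh" : "en";
}

resolveLanguage(null, "zh-TW"); // "zh" — detected from the browser
resolveLanguage("en", "zh-TW"); // "en" — saved preference wins
```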
- Responsive 2-5 column grid
- Dual photo upload methods
- Touch-optimized interactions
- Vintage coffee journal aesthetic
- Advanced filtering (roast, rating, origin)
- Multiple sort options
- Duplicate detection
- Statistics dashboard
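Duplicate detection can be as simple as comparing a normalized key built from roaster and bean name. An illustrative sketch (the project's actual matching may be more sophisticated):

```typescript
// Build a case- and whitespace-insensitive key for duplicate checks.
function dedupeKey(roaster: string, name: string): string {
  const norm = (s: string) => s.trim().toLowerCase().replace(/\s+/g, " ");
  return `${norm(roaster)}|${norm(name)}`;
}

// Same coffee entered twice with different casing/spacing → same key
dedupeKey("Onyx Coffee", "Geometry") === dedupeKey(" onyx  coffee ", "GEOMETRY"); // true
```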
This is a portfolio project, but suggestions are welcome!
MIT License - See LICENSE file for details
Ophelia Chen
- Portfolio: Coming Soon
- LinkedIn: https://www.linkedin.com/in/opheliandata/
- GitHub: @YFC-ophey
- Claude Code - My favorite vibe coding tool
- Groq - Lightning-fast AI inference
- Notion - Database and API
- Tesseract.js - OCR engine
- shadcn/ui - UI components
- Clash Display - Typography
- Render - Cloud Application Platform
- Cloudinary - Image and Media API Platform
Built with ☕ and AI | Powered by Groq + Notion
