qalarc.com · computer vision · AI systems

Computer Vision Suite

8 working systems. All local-first. All live.

all demos live · fivelidz.com · github.com/fivelidz
Computer Vision Health Tech Real Estate Developer Tools
8 projects · all working

What I build

👓
Smart Glasses
Face recognition + spoken memory recall. Press a button, hear who you're looking at.
smart-glasses-face-tracker
👁
Eye Tracking
Gaze + fixation + micro-expressions. Any site, script tag, zero hardware.
parallax-viewer
📦
Object Parallax
3D objects orbit your head. Off-axis projection. Three.js WebGL.
parallax-threejs
Gesture Control
21-landmark hands → audio, UI, games. Includes a full castle defense game.
gesture-control
🎯
Gesture SDK
GestureEngine as a reusable npm package. Any gesture → any output. Browser-native, zero hardware.
npm install gesture-engine
🩺
Patient Voice
AI GP consultation. Voice triage, drug checks, structured clinical export.
healthconnect.vip
🏠
HiddenHome
3D real estate marketplace. Head-tracked rooms. Direct bidding. No agents.
hiddenhome.io
gmux
Gesture-aware AI terminal multiplexer. Run 10 agents. Navigate by hand and voice.
pip install gmux
Computer Vision · Hardware in hand

👓 Smart Glasses Memory Assistant

Face tracker
478-landmark FaceMesh · ArcFace recognition · 30fps
Glasses 12MP cam → MediaPipe 478 landmarks @ 30fps
InsightFace ArcFace 512-d · 8ms CPU · 99.7% LFW
Cosine match → SQLite WAL: persons, sightings, transcripts
Whisper ASR + pyannote diarization → who said what
AVRCP button → Claude recall → TTS → glasses speaker
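A minimal sketch of the cosine-match step in the pipeline above, assuming embeddings are stored as float32 BLOBs in a `persons` table. The schema, the helper names (`open_db`, `enroll`, `identify`) and the 0.5 threshold are illustrative assumptions, not the project's actual code.

```python
import math
import sqlite3
import struct

def pack(vec):
    """Serialize a float vector to a BLOB of little-endian float32 values."""
    return struct.pack(f"<{len(vec)}f", *vec)

def unpack(blob):
    """Inverse of pack(): BLOB back to a tuple of floats."""
    return struct.unpack(f"<{len(blob) // 4}f", blob)

def cosine(a, b):
    """Cosine similarity; returns 0.0 if either vector has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0

def open_db(path=":memory:"):
    db = sqlite3.connect(path)
    # WAL only takes effect for file-backed databases; in-memory ones ignore it.
    db.execute("PRAGMA journal_mode=WAL")
    db.execute("CREATE TABLE IF NOT EXISTS persons "
               "(id INTEGER PRIMARY KEY, name TEXT, embedding BLOB)")
    db.execute("CREATE TABLE IF NOT EXISTS sightings "
               "(person_id INTEGER, seen_at TEXT DEFAULT CURRENT_TIMESTAMP)")
    return db

def enroll(db, name, embedding):
    cur = db.execute("INSERT INTO persons (name, embedding) VALUES (?, ?)",
                     (name, pack(embedding)))
    db.commit()
    return cur.lastrowid

def identify(db, embedding, threshold=0.5):
    """Return (name, score) for the best match, or (None, score) below threshold."""
    best_id, best_name, best_score = None, None, -1.0
    for pid, name, blob in db.execute("SELECT id, name, embedding FROM persons"):
        score = cosine(embedding, unpack(blob))
        if score > best_score:
            best_id, best_name, best_score = pid, name, score
    if best_id is None or best_score < threshold:
        return None, best_score
    # Log the sighting so later recall can count how often someone was seen.
    db.execute("INSERT INTO sightings (person_id) VALUES (?)", (best_id,))
    db.commit()
    return best_name, best_score
```

A linear scan like this is fine for a personal contact list; a larger gallery would likely want an approximate nearest-neighbour index instead.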
The moment

"Press play looking at someone. Glasses say: Alex Chen. Met 3 times. March meetup. Talked about her startup."

AWS unlocks

DynamoDB — multi-device sync, works away from home
Transcribe Streaming — 200ms ASR, speaker ID built in
Bedrock — Claude in AWS infra, IAM auth
Lambda — recall API when laptop is off

"A cognitive prosthetic for social memory."
Computer Vision · Browser Native · Zero Install

📦 Object Parallax + Eye Tracking

Head-tracked 3D object viewer · off-axis projection · webcam only
object viewer · three.js room · path walker v2
Layer 1
3D parallax
Off-axis projection
Three.js WebGL
Layer 2
Calibrated gaze
Fixation engine
Kalman filter
Layer 3
FACS micro-expr
7 emotions
40–200ms window
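The off-axis projection in Layer 1 can be sketched as an asymmetric frustum computed from the tracked head position. This sketch assumes the screen is centred at the origin in the z = 0 plane and the eye position is given in the same units as the screen size; the function name and defaults are hypothetical.

```python
def off_axis_projection(eye, screen_w, screen_h, near=0.1, far=100.0):
    """Row-major 4x4 OpenGL-style projection for an eye at (ex, ey, ez).

    The screen rectangle is [-w/2, w/2] x [-h/2, h/2] at z = 0 and the eye
    sits in front of it (ez > 0). The scene must also be translated by
    -eye in the view matrix for the parallax effect to be complete.
    """
    ex, ey, ez = eye
    if ez <= 0:
        raise ValueError("eye must be in front of the screen (z > 0)")
    # Project the screen edges onto the near plane as seen from the eye.
    scale = near / ez
    left = (-screen_w / 2 - ex) * scale
    right = (screen_w / 2 - ex) * scale
    bottom = (-screen_h / 2 - ey) * scale
    top = (screen_h / 2 - ey) * scale
    return [
        [2 * near / (right - left), 0.0, (right + left) / (right - left), 0.0],
        [0.0, 2 * near / (top - bottom), (top + bottom) / (top - bottom), 0.0],
        [0.0, 0.0, -(far + near) / (far - near), -2 * far * near / (far - near)],
        [0.0, 0.0, -1.0, 0.0],
    ]
```

With the head centred the frustum is symmetric and this reduces to an ordinary perspective matrix; as the head moves sideways the skew terms in the third column grow, which is what makes the objects appear to sit behind the screen. Transpose the result if the renderer expects column-major storage.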
Analytics today: Hotjar / FullStory show where users clicked
This system: where they looked + what they felt
"Any website. Script tag. Done."
AWS unlocks

Lambda + API Gateway → drop-in ingest endpoint
Kinesis → real-time gaze stream, thousands of users
SageMaker → population-level calibration model
Personalize → gaze + emotion → adaptive content

Input Layer · Browser Native · Zero Hardware

✋ Gesture Control — Hand of the King

Gesture
21-landmark hand tracking · gesture classification · 30fps
gesture mapper · hand of the king
Theremin
Y→pitch · X→reverb
instant wow factor
Mapper v4.7
any source→target
JSON export
Castle Defense
Full gesture game
KV leaderboard
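The theremin mapping (Y→pitch, X→reverb) can be sketched as below. It assumes normalized hand coordinates in [0, 1] with y increasing downward, as in MediaPipe's image-space landmarks; the 220–880 Hz range and the function name are illustrative choices, not values from the demo.

```python
def theremin_params(x, y, f_min=220.0, f_max=880.0):
    """Map a normalized hand position to (frequency_hz, reverb_mix).

    Raising the hand (smaller y) raises the pitch. The mapping is
    exponential so equal hand movement gives equal musical intervals.
    """
    x = min(max(x, 0.0), 1.0)  # clamp: tracking can briefly leave the frame
    y = min(max(y, 0.0), 1.0)
    height = 1.0 - y           # 0 at the bottom of the frame, 1 at the top
    frequency = f_min * (f_max / f_min) ** height
    reverb_mix = x             # 0 = dry, 1 = fully wet
    return frequency, reverb_mix
```

The two returned values would then feed an oscillator frequency and a wet/dry gain in whatever audio backend is in use.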
🕹 Hand of the King
A full tower-defense game controlled entirely by hand gestures in the browser. No mouse. No keyboard. The same GestureEngine class powers this showcase's pinch-scroll.
Combined interface
Gaze — eye tracking selects the target
Gesture — hand confirms or acts
Expression — face signals intent
AWS unlocks

Lambda — sync preset libraries across devices
Personalize — per-user gesture profiles
IVS — stream live gesture performances

Developer Tools · Gesture + Voice + AI · pip install gmux

⚡ gmux — Gesture-Aware AI Terminal

1: doofing · ◉ working 5/7
2: AI_diary · ◉ working 6/6
3: claude_TUI · ● waiting 3/4
4: gmux · ! permission 2/5
5: goblin · ◉ working 1/1
6: shell · ○ idle
👋 swipe → navigate 🎤 "next red" → jump to 🔴
pip install gmux · paru -S gmux
What it does
Status bar — 🟢🔴🟡 per pane, todo progress
Gesture nav — point 600ms to select, swipe to navigate
Voice commands — "next red", "kalarc fix this"
Phone remote — web dashboard, accept/deny
Session restore — re-launches all agents
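The "point 600ms to select" behaviour above is a dwell timer. A minimal sketch, assuming the gesture tracker reports which pane the finger points at on every frame; the class and method names are hypothetical, not gmux's API.

```python
class DwellSelector:
    """Fire a selection once a target has been pointed at continuously."""

    def __init__(self, dwell_ms=600):
        self.dwell_ms = dwell_ms
        self._target = None   # pane currently pointed at, or None
        self._since = None    # timestamp when pointing at it began
        self._fired = False   # True once this dwell has already selected

    def update(self, target, now_ms):
        """Feed one frame; returns the target on the frame it gets selected."""
        if target != self._target:
            # Pointing moved: restart the timer for the new target.
            self._target = target
            self._since = now_ms
            self._fired = False
            return None
        if target is None or self._fired:
            return None
        if now_ms - self._since >= self.dwell_ms:
            self._fired = True  # do not re-fire until the pointer leaves
            return target
        return None
```

Firing only once per dwell avoids repeated selections while the hand rests on a pane; pointing away and back starts a fresh timer.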
Why nobody else has this
Comparable tools include herdr (214★), recon (192★), cmux (13.3k★, macOS only) and Codeman (289★). None of them offers hand-gesture pane control, voice-navigated multiplexing, wake-word terminal control, or TTS responses from agents. gmux combines all four on Linux.
AWS unlocks

Bedrock — cloud AI backend, no local model needed
Lambda — sync session state and bindings
CodeWhisperer — voice/gesture triggered suggestions

Computer Vision · Real-time · Webcam only

📷 Camera Analysis Suite

📷
Face Tracker v10
qalarc.com/projects/demos/smart-glasses-face-tracker/
478 face landmarks
30fps continuous video mode
8ms ArcFace inference
99.7% LFW accuracy
What the camera sees
478 landmarks — full face mesh at 30fps, VIDEO mode (not snapshot)
Iris tracking — precise gaze vector, blink detection, EAR
Micro-expressions — 14 FACS action units, 7 emotion classes, 40–200ms window
Face identity — ArcFace 512-d embeddings, cosine match against memory DB
Mood signals — blink rate, attention state, expression intensity over time
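The EAR (eye aspect ratio) used for blink detection above is commonly computed from six eyelid points. This sketch takes the points as plain (x, y) tuples; which FaceMesh landmark indices supply them, the 0.2 threshold and the two-frame minimum are assumptions to tune per camera.

```python
import math

def eye_aspect_ratio(p1, p2, p3, p4, p5, p6):
    """EAR = (|p2-p6| + |p3-p5|) / (2 * |p1-p4|).

    p1 and p4 are the eye corners; p2, p3 lie on the upper lid and
    p6, p5 on the lower lid directly beneath them.
    """
    width = math.dist(p1, p4)
    if width == 0:
        return 0.0
    return (math.dist(p2, p6) + math.dist(p3, p5)) / (2.0 * width)

def count_blinks(ear_series, threshold=0.2, min_frames=2):
    """Count runs of at least min_frames consecutive frames below threshold.

    A run is counted when the eye reopens, so a series that ends with the
    eye still closed does not count that final run.
    """
    blinks, run = 0, 0
    for value in ear_series:
        if value < threshold:
            run += 1
        else:
            if run >= min_frames:
                blinks += 1
            run = 0
    return blinks
```

Because EAR is a ratio of distances it stays roughly constant as the face moves toward or away from the camera, which is why a fixed threshold is workable at all.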
The combined system
Eye tracking selects the target. Gesture confirms or acts. Micro-expression provides intent signal. Three systems — one complete hands-free UI paradigm — functional in a browser, requiring only a webcam. No hardware. No install.
AWS unlocks

Rekognition — cloud fallback for low-confidence face matches
SageMaker — train population-level expression classifiers
Kinesis — stream per-frame emotion events at scale

Health Tech · Heidi Hackathon 2025

🩺 Patient Voice — AI Medical Triage

🩺
Patient Voice
healthconnect.vip
What it does
Voice interface — patient speaks naturally
AI triage — routes to correct consultation type
Drug contraindication checks, automatic
Structured export — Google Sheets for clinician
50+ languages — Whisper multilingual ASR
Heidi Health Hackathon 2025
GP consultations, prescription requests, sick notes, specialist referrals. Not emergency triage.
AWS unlocks

Transcribe Medical — clinical ASR, medical vocab
HealthLake — structured consultation records
Comprehend Medical — NLP on clinical notes

Real Estate · 3D Marketplace · Sydney

🏠 HiddenHome — 3D Real Estate

🏠
HiddenHome 3D Viewer
hiddenhome.io/demo/viewer
hiddenhome.io browse map bidding
The pitch

Browse properties as head-tracked 3D walkthroughs. Place transparent verified bids directly to owners. No agents. No commissions.

What's live now
Head-tracked 3D viewer — MediaPipe FaceMesh
Sydney map — 5 demo properties, Leaflet dark
Bidding UI — leaderboard, $50 deposit flow
1,627 Newtown buildings — real OSM 3D data
Marble shader system — 5 procedural presets
AWS unlocks

S3 + CloudFront — 3D model and splat delivery
Lambda — bid processing, notifications
SageMaker — Gaussian splatting pipeline

All projects · live

All Live Demos

Infrastructure · Partnership · Growth

Working Together

These systems are working today — locally, privately, without infrastructure cost. Cloud infrastructure turns each one from a prototype into a product. Below is what becomes possible with the right backing, and why each one makes sense.

👁
Gaze Analytics SDK
Any website · script tag · attention heatmaps + emotion data per user. Lambda ingest, Kinesis stream, SageMaker calibration model, Personalize adaptive content. 90-day build to production.
gmux Cloud Tier
Bedrock as the managed AI backend — no user API key on paid tier. Lambda syncs session state and keybindings across machines. The developer tools SaaS play with a clear free/paid boundary.
👓
Glasses Recall API
Face memory as a service. DynamoDB replaces SQLite — recall works when the laptop is off. Transcribe Streaming replaces local Whisper. B2B: accessibility, sales, aged care, enterprise.
🎯
Gesture SDK + Accessibility
GestureEngine as a hosted npm package with cloud preset sync. AppSync keeps gesture profiles across devices. IoT Greengrass for embedded deployments. The accessibility and industrial play that needs zero hardware and runs in a browser.
What I can deliver
A working cloud-native gaze analytics platform within 90 days of infrastructure access.
fivelidz.com · github.com/fivelidz · qalarc.com · Signal/WhatsApp: +61 493 484 788