Hello, I'm
Youhana Sheriff
Senior Software Engineer—Edge-AI & LLM Systems
I build AI that runs in the real world — on-device & edge inference on constrained hardware, LLM/RAG systems, and the full-stack apps that ship them.
An electronics engineer who deploys AI on constrained hardware. Full-stack capability across mobile, web, and cloud — but edge AI is where I specialize.

5+
Years Exp.
20+
Projects
About
Edge-AI & LLM Systems Engineer
An electronics engineer building AI systems that run on constrained hardware. I specialize in edge AI, on-device inference, and LLM/RAG systems — with full-stack capability across mobile, web, and cloud.

LLM/RAG Systems Engineer
Builds LLM applications with RAG, embeddings, vector search, agent memory, model orchestration, and practical evaluation loops for real product workflows.

Edge-AI Systems Engineer
Electronics engineer deploying AI on constrained hardware: YOLO/ONNX pipelines, LoRA and MLX fine-tuning, llama.cpp, quantization, and edge-focused reasoning systems.

Full-Stack & Mobile Engineer
Ships the apps around the AI: Flutter, React Native Expo, Next.js, TypeScript, Firebase, AWS, offline-first flows, payments, GPS, and app-store delivery.

Cloud & Product Infrastructure
Designs cloud-backed product systems with AWS, GCP, Firebase, Docker, Kubernetes, Cloudflare, GitHub Actions, PostgreSQL, and GraphQL.

Open Source Package Author
Publishes packages for LLM agent memory, AWS cleanup, Flutter Stripe Connect, and type-safe Dart translation code generation.
Philosophy
How I Work
End-to-End Ownership
I prefer working in small, high-ownership teams where I can own features from schema design to polished UI—and ship them myself.
AI With Intent
I don't bolt on AI for the sake of it. I pick the right model, structure the prompt/RAG pipeline, and measure quality—because the value is in how and why you use AI.
Quality Code
I write code with long-term maintenance in mind: typed languages, good CI/CD, observability, and patterns that future-me will thank current-me for.
Portfolio
AI Engineering & Systems Work
Open-source AI systems first, then shipped AI products, then full-stack and mobile work.
AI Engineering & Open Source
Open-source systems work across edge inference, RAG, agents, and AI media tooling.
models-edge-devices
Split-architecture edge AI that pairs real-time YOLO/RF-DETR detection with a LoRA-tuned TinyLlama reasoning model for constrained NPU/BPU hardware.
pocket_ai
Offline real-time obstacle-detection assistant that turns a camera feed into spoken guidance with YOLO11, depth estimation, and spatial reasoning.
SimpleMem
Lifelong memory for LLM agents with semantic structured compression, multi-view indexing, adaptive retrieval, and vector search.
llm-rag
RAG over PDFs with document ingestion, embeddings, vector similarity search, and LLM-powered question answering.
video-composer
AI short-form video pipeline for TikTok, YouTube Shorts, and Reels using TTS, Whisper transcription, GPT-4 Vision validation, and stock-footage assembly.
AI Products Shipped to Users
AI-enabled products packaged into mobile and web experiences for real users.
Kizu
AI personal-finance app using ML Kit OCR receipt scanning, Claude for transaction extraction and categorization, and a RAG pipeline for financial insights.

Chatiko AI
AI chatbot companion — a personal virtual assistant in text form. Live on Google Play (sold to Sheriax).

Talky
Language-learning app with AI content moderation for safer generated and user-submitted content.
Also Built
Full-stack and mobile systems that show the shipping range behind the AI work.
Drawink
Local-first collaborative whiteboard with multi-board workspaces, real-time collaboration, cloud sync, and offline-first architecture.
Echify
Video-first social-commerce app with a multi-vendor marketplace, Stripe Connect payouts, and creator monetization — Mako IT Lab.
RouteX
Offline-first construction logistics app with GPS tracking, route workflows, and field-ready cross-platform mobile UX.
Nidaa & TNTJ Apps
Community and utility apps shipped across Flutter, React Native Expo, and Next.js with mobile publishing and production operations.
Expertise
Skills & Experience
AI/ML

LLMs (Claude, Gemini, OpenAI, OpenRouter, local)

RAG

Vector Search

LoRA/MLX Fine-tuning

llama.cpp
ONNX

YOLO

ML Kit

Embeddings

LLM Integration
Kimi (Moonshot AI)
GLM 5 (Z.AI)
Codex GPT
Languages
TypeScript
Python
Dart
Rust
Kotlin
JavaScript
Frameworks
Next.js
React
React Native (Expo)
Flutter
Node.js/NestJS/Fastify
Cloud/Infra
AWS
GCP
Firebase
Docker
Kubernetes
Cloudflare
GitHub Actions
Data
PostgreSQL
GraphQL
Tooling
Git
Jira
Figma
Storybook
Sentry
Auth0
Work Experience
Sr. Software Engineer
Mako IT Lab
Lead development of cross-platform mobile & web applications for multinational clients. Key projects: Cubanin (social commerce with React Native Expo, Next.js, AWS Amplify), Echify (Flutter social commerce with Stripe Connect and video editing), RouteX (construction logistics with offline-first architecture, real-time GPS tracking, and route optimization). Mentor junior developers and manage agile sprints.
Founder & Product Engineer
Sheriax Solutions
Led development of multiple products: Kizu — AI-powered financial recovery app with Flutter, Firebase, ML Kit OCR, and Claude AI; Drawink — local-first collaborative whiteboard with React, TypeScript, real-time sync; Talky — language learning app with AI content moderation. Built edge AI solutions with MLX, LoRA fine-tuning, and llama.cpp. Managed cloud infrastructure (AWS, Firebase) and CI/CD with GitHub Actions.
Software Engineer
Rescript Welltech Private Limited
Built Anyo, a stress management and wellness mobile app using Flutter. Implemented venting platform with real-time feeds, integrated AI chatbot for stress management solutions, and developed a psychologist booking platform. Focused on mental health features and user engagement.
Front-End Developer
Mywam Sdn Bhd
Developed and maintained front-end mobile and web applications. Built responsive user interfaces with modern JavaScript frameworks, handled API integrations, and ensured cross-browser compatibility.
Software Developer
Francium Tech
Developed and maintained cross-platform mobile applications using Flutter (Dart) and web applications with React.js, Next.js, and React Native. Integrated REST APIs and state management solutions. Collaborated with cross-functional teams using Git, Jira, and agile methodologies.
Front-End Engineer
Cappricio Securities
Built the company website and blog platform from scratch using HTML, CSS, and JavaScript. Delivered a professional landing page with integrated course listings for their educational institution.
Web Developer
SnowBirds
Designed and developed a calm-themed e-commerce landing page using HTML, CSS, and JavaScript to showcase product offerings with a clean, user-friendly interface.
Published Packages
NPM & Pub.dev
Tools and libraries I've shipped: LLM agent memory, Flutter plugins, DevOps utilities, translation codegen.
My Projects
@sheriax/simplemem
LLM agent memory plus vector search for TypeScript agents.
aws-nuke-all
Delete all AWS resources across regions from dev and test accounts.
flutter_stripe_connect
Stripe Connect embedded components for Flutter.
translations_code_gen
Type-safe translation code generation for Dart and Flutter JSON localization files.
Contributed To
ZeroClaw
Fast, small, and fully autonomous AI assistant infrastructure built in Rust. Contributed to core functionality and documentation.
NanoBot
Ultra-lightweight AI agent framework. Contributed to framework improvements and bug fixes.
TinyClaw
Personal autonomous AI companion. Contributed to companion features and integrations.
Writing
Latest Articles
Deep dives on AI architecture, mobile patterns, and lessons from shipping real products.
Testimonials
What People Say

“I didn't expect it to be this good since it is purely JavaScript only. However, it is satisfying and I'm happy with the outcome.”
Karthi
Cappricio Securities
Companies I've Worked With


Get in Touch
Let's Connect
Feel free to reach out for collaborations or just a friendly hello
© 2026 Youhana Sheriff. All rights reserved.












