Skip to content
View infiniV's full-sized avatar
📟
📟

Block or report infiniV

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
infiniV/README.md

Raahim Arbaz

Computer vision engineer, based in Lahore. I do some research on the side and build the tooling I end up using.

Portfolio LinkedIn Email

What I'm up to

CV founding engineer at a clinical computer vision startup. I lead the annotation team, research new architectures, and run deployment and testing on edge hardware.

Projects

Project About Stack
VoiceFlow Local voice dictation on faster-whisper. Runs on your GPU, nothing leaves your machine. Python faster-whisper Pyloid
Vision-Dissect Cracks open CV models to compare layer activations and attention maps across YOLO11, SAM, and DepthPro. PyTorch ONNX Transformers
Android-Ui-MCP MCP server for Android UI automation and testing workflows. TypeScript MCP
ultra-instinct-claude-code 176 Claude Code tips distilled from 17 repos and 500k+ stars. Tagged by difficulty, nothing to install. Research Docs

Research

Mapping Air Pollution Sources with Sequential Transformer Chaining
NeurIPS 2024 Climate Change AI Workshop. Second author.

Chained Vision Transformers with Remote CLIP to find factory and brick-kiln chimneys in South Asian satellite imagery. Filtered a 600K+ image dataset down to the ~1% that actually contained pollution sources. Paper.

LocaGraph: Learning Localized Graph Attention with Anisotropic Adaptation
NeurIPS 2025 submission. Lead author. Graph neural networks for spatial data, under review.

Pinned Loading

  1. VoiceFlow VoiceFlow Public

    Open-source voice dictation for Windows and Linux. Hold a hotkey, talk, and the transcript shows up at your cursor. Runs offline with Whisper.

    TypeScript 353 23

  2. Android-Ui-MCP Android-Ui-MCP Public

    MCP server for AI-powered UI feedback across React Native, Flutter, and native Android development.

    TypeScript 20 3

  3. Vision-Dissect Vision-Dissect Public

    Learning repository for exploring deep learning vision models

    HTML

  4. ultra-instinct-claude-code ultra-instinct-claude-code Public

    We read 17 Claude Code repos (500k+ stars) so you don't have to. 176 tips, nothing to install. Consensus-filtered, tagged by difficulty.

    TypeScript 19 3

  5. claude-usage-waybar claude-usage-waybar Public

    Waybar module for Claude Code usage tracking on Linux. Shows 5-hour and 7-day rate limits, daily token usage and spend via ccusage, and live session stats in a GTK4/libadwaita popover. Built for Ar…

    Python 10

  6. ultra-ml-intern ultra-ml-intern Public

    ultra-instinct ML engineering intern for Claude Code. Reads papers, audits datasets, ships SFT/DPO/LoRA runs to Hugging Face.

    Shell 3