Gary Short

Principal AI Engineer | AI Architect | Generative AI Systems

Ex-Microsoft AI architect and principal engineer building production-grade AI systems across LLMs, agentic workflows, computer vision, probabilistic modelling, and cloud-native platforms.

I help organisations turn AI from experimentation into operational capability.

Recent work includes:

Agentic AI systems and autonomous workflows
Enterprise RAG and LLM evaluation platforms
AI-enabled analytics systems delivering multi-million-pound savings
Computer vision systems using YOLO and multimodal pipelines
Decision intelligence platforms for high-uncertainty environments
Model optimisation, distillation, and scalable inference architectures

Current Focus

Agentic AI & LLM Systems

Designing production-grade AI systems using:

LLMs
RAG
AI agents
tool orchestration
evaluation pipelines
cloud-native inference architectures

AI Cost Optimisation

Helping organisations reduce dependency on expensive frontier-model inference through:

model distillation
workflow optimisation
hybrid architectures
targeted local inference

Decision Intelligence Under Uncertainty

Building systems that convert incomplete information and expert judgement into quantified, defensible decision outputs.

Example: 👉 https://www.darach.ai/risklens/

Selected Projects

RiskLens

Decision intelligence for high-stakes uncertainty.

RiskLens converts expert judgement into quantified probability ranges, scenario analysis, and board-ready decision outputs using:

Bayesian modelling
probabilistic simulation
expert elicitation
uncertainty modelling
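
As a rough illustration of the general idea (not RiskLens's actual method, which is not described here), a three-point expert estimate can be turned into a probability range with a small Monte Carlo simulation. The function name `simulate_range`, the triangular distribution, and the example figures are all assumptions chosen for this sketch.

```python
import random
import statistics


def simulate_range(low, mode, high, n=10_000, seed=0):
    """Turn a three-point expert estimate into a 5%/50%/95% range.

    Hypothetical helper: models the expert's (low, most likely, high)
    judgement as a triangular distribution and samples from it.
    """
    rng = random.Random(seed)  # seeded so the run is repeatable
    # Note the stdlib argument order: triangular(low, high, mode)
    samples = [rng.triangular(low, high, mode) for _ in range(n)]
    # n=20 gives 19 cut points at 5% steps: index 0 is 5%, 9 is 50%, 18 is 95%
    cuts = statistics.quantiles(samples, n=20)
    return cuts[0], cuts[9], cuts[18]


# Example: an expert judges a cost overrun of 1 to 5 (in £M), most likely 2
p5, p50, p95 = simulate_range(1.0, 2.0, 5.0)
print(f"5%: {p5:.2f}  median: {p50:.2f}  95%: {p95:.2f}")
```

A production approach would typically combine several experts, weight or calibrate their judgements, and use richer distributions; the triangular form is used here only because it needs nothing beyond the Python standard library.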

Designed for:

infrastructure
investment decisions
strategic planning
operational risk
rare-event analysis

🔗 https://www.darach.ai/risklens/

LLM Distillation & Cost Optimisation

A repository exploring practical strategies for:

reducing LLM inference costs
targeted model distillation
hybrid model architectures
evaluation workflows
post-frontier AI delivery strategies

🔗 https://github.com/garyshort/llm_distil

Insurance AI Platform

Designed and delivered Azure-based multimodal AI systems that:

converted damage imagery into supplier-specific estimates
automated insurance documentation generation
combined computer vision with LLM workflows
integrated RAG and structured document generation

Technologies: Python • Azure AI • YOLO • MLflow • Kubernetes • Azure OpenAI

Infrastructure AI & Risk Modelling

Architected AI and analytics systems for infrastructure and asset-management decision making.

Delivered:

probabilistic risk modelling
AI-powered inspection analysis
synthetic training data generation
operational optimisation systems

Estimated annual impact: ~£15M–£20M savings

Technology Stack

AI & Data

Python • PyTorch • TensorFlow • scikit-learn • OpenCV • YOLO • MLflow • RAG • Agentic AI • Azure OpenAI • Llama • Phi • GPT

Cloud & Infrastructure

Azure • Kubernetes • Docker • Azure ML • Databricks • Synapse • Data Factory • Event Hub • CI/CD

Engineering

Python • C# • TypeScript • Go • Rust • Scala • Spark • APIs • distributed systems

Background

Previously:

Cloud Solution Architect at Microsoft
Microsoft C# MVP (6 years)
Principal AI/Data Science consultant across:
- insurance
- infrastructure
- mobility
- humanitarian analytics
- enterprise AI transformation

Working Style

Fully hands-on.

I build:

production systems
prototypes
architecture
evaluation frameworks
cloud platforms
AI workflows
technical strategy

Current tooling:

Cursor
Claude Code
modern AI-native engineering workflows

Contact

Areas of Interest

Agentic AI
LLM systems engineering
AI architecture
inference optimisation
probabilistic modelling
AI governance
cloud-native AI platforms
multimodal systems
decision intelligence
synthetic data generation
