Skip to content

garyshort/gary-short-profile

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 

Repository files navigation

Gary Short

Principal AI Engineer | AI Architect | Generative AI Systems

Ex-Microsoft AI architect and principal engineer building production-grade AI systems across LLMs, agentic workflows, computer vision, probabilistic modelling, and cloud-native platforms.

I help organisations turn AI from experimentation into operational capability.

Recent work includes:

  • Agentic AI systems and autonomous workflows
  • Enterprise RAG and LLM evaluation platforms
  • AI-enabled analytics systems delivering multi-million-pound savings
  • Computer vision systems using YOLO and multimodal pipelines
  • Decision intelligence platforms for high-uncertainty environments
  • Model optimisation, distillation, and scalable inference architectures

Current Focus

Agentic AI & LLM Systems

Designing production-grade AI systems using:

  • LLMs
  • RAG
  • AI agents
  • tool orchestration
  • evaluation pipelines
  • cloud-native inference architectures

AI Cost Optimisation

Helping organisations reduce dependency on expensive frontier-model inference through:

  • model distillation
  • workflow optimisation
  • hybrid architectures
  • targeted local inference

Decision Intelligence Under Uncertainty

Building systems that convert incomplete information and expert judgement into quantified, defensible decision outputs.

Example: 👉 https://www.darach.ai/risklens/


Selected Projects

RiskLens

Decision intelligence for high-stakes uncertainty.

RiskLens converts expert judgement into quantified probability ranges, scenario analysis, and board-ready decision outputs using:

  • Bayesian modelling
  • probabilistic simulation
  • expert elicitation
  • uncertainty modelling

Designed for:

  • infrastructure
  • investment decisions
  • strategic planning
  • operational risk
  • rare-event analysis

🔗 https://www.darach.ai/risklens/


LLM Distillation & Cost Optimisation

A repository exploring practical strategies for:

  • reducing LLM inference costs
  • targeted model distillation
  • hybrid model architectures
  • evaluation workflows
  • post-frontier AI delivery strategies

🔗 https://github.com/garyshort/llm_distil


Insurance AI Platform

Designed and delivered Azure-based multimodal AI systems that:

  • converted damage imagery into supplier-specific estimates
  • automated insurance documentation generation
  • combined computer vision with LLM workflows
  • integrated RAG and structured document generation

Technologies: Python • Azure AI • YOLO • MLflow • Kubernetes • Azure OpenAI


Infrastructure AI & Risk Modelling

Architected AI and analytics systems for infrastructure and asset-management decision making.

Delivered:

  • probabilistic risk modelling
  • AI-powered inspection analysis
  • synthetic training data generation
  • operational optimisation systems

Estimated annual impact: ~£15M–£20M savings


Technology Stack

AI & Data

Python • PyTorch • TensorFlow • scikit-learn • OpenCV • YOLO • MLflow • RAG • Agentic AI • Azure OpenAI • Llama • Phi • GPT

Cloud & Infrastructure

Azure • Kubernetes • Docker • Azure ML • Databricks • Synapse • Data Factory • Event Hub • CI/CD

Engineering

Python • C# • TypeScript • Go • Rust • Scala • Spark • APIs • distributed systems


Background

Previously:

  • Cloud Solution Architect at Microsoft

  • Microsoft C# MVP (6 years)

  • Principal AI/Data Science consultant across:

    • insurance
    • infrastructure
    • mobility
    • humanitarian analytics
    • enterprise AI transformation

Working Style

Fully hands-on.

I build:

  • production systems
  • prototypes
  • architecture
  • evaluation frameworks
  • cloud platforms
  • AI workflows
  • technical strategy

Current tooling:

  • Cursor
  • Claude Code
  • modern AI-native engineering workflows

Contact

LinkedIn Calendly Email


Areas of Interest

  • Agentic AI
  • LLM systems engineering
  • AI architecture
  • inference optimisation
  • probabilistic modelling
  • AI governance
  • cloud-native AI platforms
  • multimodal systems
  • decision intelligence
  • synthetic data generation

About

My profile

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors