Skip to content

bundle-adjuster/photoevalagent

Repository files navigation

Photo Evaluation Vision Agent

A comprehensive computer vision system for analyzing and evaluating photos using state-of-the-art deep learning models.

Features

  • Facial Analysis: Detection, recognition, expression analysis, and face grouping
  • Aesthetic Scoring: Visual quality assessment and print-worthiness evaluation
  • Semantic Tagging: CLIP-based tagging and BLIP image captioning
  • Similarity Search: DINOv2 embeddings with FAISS vector search
  • Scene Understanding: Object detection and scene classification

Quick Start

# Install dependencies
pip install -r requirements.txt

# Analyze a photo
python scripts/demo_photo_eval.py path/to/photo.jpg

# Batch process
python scripts/demo_photo_eval.py path/to/photos/ --batch --group-faces

Sample Output

Full pipeline run on four freely-licensed sample images — reproduce with python scripts/generate_sample_outputs.py:

portrait pet food scene

Image Caption (BLIP) Scene (CLIP) Aesthetic Faces Top tags (CLIP)
portrait "a woman in an orange space suit with a space shuttle in the background" open sky (0.76) 62.9/100 1 bright, park, woman, sunny, bird
pet "a close up of a cat's face with a blurry background" park or garden (0.15) 51.3/100 0 cat, animal, sunny, bright, cloudy
food "a cup of coffee on a saucer with a spoon on a table" restaurant or cafe (0.55) 52.6/100 0 dark, food, sitting, standing, day
scene "a spacex falcon rocket taking off from the launch pad" open sky (0.85) 52.4/100 0 night, day, bright, person, people

The portrait yields a real face detection (confidence 1.0) plus a 512-d Facenet512 embedding and age/gender/expression attributes; non-portrait images correctly report 0 faces. See PHOTO_EVALUATION_README.md for details.

Documentation

Architecture

PhotoEvaluationAgent
├── FacialAnalysisAgent (DeepFace)
├── AestheticAnalysisAgent (CLIP + LAION aesthetic head)
├── SemanticAnalysisAgent (CLIP + BLIP)
├── SimilarityAnalysisAgent (DINOv2 + FAISS)
└── SceneDetectionAgent (DETR + CLIP zero-shot)

Requirements

License

MIT License - see LICENSE file for details

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors