EU Values Survey LLM Integration

This project implements a comprehensive system for administering the European Values Study (ZA7500) survey using Large Language Models (LLMs) across multiple languages and instruction-tuned models.

Overview

The EU Values Survey LLM Integration provides tools for:

Multilingual Survey Administration: Support for 7 languages (English, Spanish, Italian, Czech, Hungarian, Serbian, Russian)
Multi-Model LLM Support: Compatible with multiple instruction-tuned models:
- Gemma 2 27B
- Apertus (7B, 70B)
- Qwen 3 30B A3B
- EuroLLM
- Salamandra
- Minimistral-3
Jinja2-based Prompt Templates: Model-specific and language-specific prompt rendering
Response Parsing & Validation: Structured constraint enforcement for survey responses
Batch Processing: Automated workflow for processing survey questions across language-model combinations

Project Structure

.
├── query_survey_llm.py              # Main orchestration script with argument parsing
├── local_llm_questions.ipynb         # Jupyter notebook with complete workflow
├── prompts/                          # Jinja2 templates for survey prompts
│   └── survey_prompt/
│       ├── survey_prompts_final_answer.jinja2
│       ├── survey_prompts_final_answer_gemma27b.jinja2
│       ├── survey_prompts_final_answer_apertus.jinja2
│       ├── survey_prompts_final_answer_qwen3_30b.jinja2
│       ├── survey_prompts_final_answer_eurollm.jinja2
│       ├── survey_prompts_final_answer_salamandra.jinja2
│       └── survey_prompts_final_answer_minimistral3.jinja2
├── Surveys/                         # Original survey data (7 languages)
│   ├── ZA7500_q_gb.csv
│   ├── ZA7500_q_es.csv
│   ├── ZA7500_q_it.csv
│   ├── ZA7500_q_cz.csv
│   ├── ZA7500_q_hu.csv
│   ├── ZA7500_q_rs.csv
│   └── ZA7500_q_ru.csv
├── Surveys_parsed/                  # Parsed survey data (cleaned/standardized)
├── Surveys_responses/               # LLM-generated responses (CSV + JSON formats)
└── src/                             # Python scripts and notebooks
  ├── query_survey_llm.py
  ├── extract_survey_csv_from_pdf.py
  └── examples_parsing_query_survey_llm.ipynb

Installation

Local setup

git clone https://github.com/Telefonica-Scientific-Research/EUValues.git
cd EUValues
pip install -r requirements.txt

Requirements

numpy>=1.24.0
pandas>=2.0.0
requests>=2.28.0
jinja2>=3.1.0

Usage

Command Line

Basic Usage

Process survey questions for a specific language and model:

python query_survey_llm.py --language es --models apertus

With Custom Server Configuration

python query_survey_llm.py \
  --host 192.168.1.100 \
  --port 8000 \
  --languages es it en_gb \
  --models apertus gemma27b qwen3_30b \
  --csv-dir ./Surveys_parsed \
  --output-dir ./Surveys_responses \
  --timeout 60

Command Line Arguments

--host: LLM server hostname (default: 127.0.0.1)
--port: LLM server port (default: 10000)
--languages: Space-separated list of language codes (default: all 7)
--models: Space-separated list of LLM model names (default: all 6)
--csv-dir: Directory containing survey CSV files (default: ./Surveys_parsed)
--output-dir: Output directory for responses (default: ./Surveys_responses)
--timeout: Request timeout in seconds (default: 120)

Python API

from query_survey_llm import load_template, query_llm

# Load a model-specific template
template = load_template("apertus")

# Render template with survey variables
prompt = template.render(
    language="es",
    question_id="Q1",
    question_text="¿Cuál es tu opinión sobre...?",
    variable="tolerance",
    option_text="Completamente de acuerdo",
    response_scale="1-5"
)

# Query LLM
response = query_llm(prompt, host="127.0.0.1", port=10000)
print(response)

Jupyter Notebook

Open local_llm_questions.ipynb for an interactive workflow that includes:

Survey data loading and exploration
Template rendering with variable substitution
LLM querying with error handling
Response parsing and validation
Results aggregation and analysis

Prompt Template Architecture

Templates use Jinja2 with the following variables:

language: Survey language (gb, es, it, cz, hu, rs, ru)
question_id: Question identifier
question_text: Survey question in target language
variable: Research variable being measured
option_text: Response option text
response_scale: Format for allowed responses (e.g., "1-5" or "yes/no")

Example Template Structure

{% set instruction_prompt %}
{% if language == "es" %}
Responde la siguiente pregunta de una encuesta sobre valores europeos:
{% elif language == "gb" %}
Answer the following question from a survey about European values:
{% endif %}

**Pregunta**: {{ question_text }}
**Variable**: {{ variable }}
**Escala de respuesta**: {{ response_scale }}

Final answer: @@<response>@@
{% endset %}

<|im_start|>user
{{ instruction_prompt }}<|im_end|>
<|im_start|>assistant

LLM Server Integration

This project requires a compatible LLM server implementing the OpenAI /v1/chat/completions endpoint.

Recommended Setup: llama.cpp Server

# Build llama.cpp
git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp
make

# Download a quantized model
wget https://huggingface.co/models/model.gguf

# Start server
./server -m model.gguf -ngl 33 --port 10000

Docker Alternative

See Containerfile for containerized deployment options.

Response Format

Responses are saved in both CSV and JSON formats:

CSV Format

question_id,question_text,variable,language,model,response,timestamp
Q1,"¿Cuál es tu opinión...?",tolerance,es,apertus,"Completamente de acuerdo",2024-01-20 10:30:00

JSON Format

{
  "metadata": {
    "language": "es",
    "model": "apertus",
    "timestamp": "2024-01-20T10:30:00"
  },
  "responses": [
    {
      "question_id": "Q1",
      "question_text": "¿Cuál es tu opinión...?",
      "variable": "tolerance",
      "response": "Completamente de acuerdo"
    }
  ]
}

Survey Data

The project includes data from the European Values Study (ZA7500):

Coverage: 7 European countries with different languages
Questions: Survey on European values, attitudes, and beliefs
Format: CSV files with question text in native languages

Available Languages

Code	Language	File
gb	English (British)	ZA7500_q_gb.csv
es	Spanish	ZA7500_q_es.csv
it	Italian	ZA7500_q_it.csv
cz	Czech	ZA7500_q_cz.csv
hu	Hungarian	ZA7500_q_hu.csv
rs	Serbian	ZA7500_q_rs.csv
ru	Russian	ZA7500_q_ru.csv

Performance & Constraint Enforcement

The system implements multiple strategies to ensure LLM compliance with survey response constraints:

Numbered Selection with Examples (⭐⭐⭐⭐⭐ highest reliability)
- Forces selection from numbered options
- Includes correct/incorrect format examples
JSON Schema Constraints (⭐⭐⭐⭐⭐)
- Leverages LLM JSON mode when available
- Strict schema validation
Explicit List + Repetition (⭐⭐⭐⭐)
- Lists valid options multiple times
- Works well for open-source models
Few-Shot Examples (⭐⭐⭐)
- Demonstrates correct response format
- Adds token overhead

Running the Analysis Pipeline

After collecting LLM responses (see Usage), the analysis pipeline produces PCA projections and radar charts comparing LLM behaviour against human EVS responses, plus a consensus/Krippendorff α report for the human data.

The full pipeline is packaged in run_analysis.sh.

Quick start

# From the EUValues/ directory
bash run_analysis.sh

This runs both stages with all options enabled:

Stage	Script	Outputs
Human responses	`src/data_preprocessing/human_responses_analysis.py`	PCA plot, by-country radar, consensus stats
LLM responses	`src/llm_responses_analysis/llm_responses_analysis.py`	`llm_pca_projection.png`, `llm_pca_projection_by_model.png`, `llm_radar_vs_human.png`

All figures are written to src/data_preprocessing/output/.

Script options

bash run_analysis.sh [OPTIONS]

  --human-only     Run only the human responses analysis
  --llm-only       Run only the LLM responses analysis
  --no-pca         Skip PCA plots in the human analysis
  --no-consensus   Skip consensus/Krippendorff analysis
  --help           Show usage and exit

Custom virtual environment

By default the script looks for .venv/ one level above the repository root (i.e. the parent workspace directory). Override with the VENV environment variable:

VENV=/path/to/my/env bash run_analysis.sh

Running scripts individually

# Human analysis only (with all options)
python src/data_preprocessing/human_responses_analysis.py \
    --pca --by-country --consensus

# LLM analysis only
python src/llm_responses_analysis/llm_responses_analysis.py

# Restrict to specific models or countries
python src/llm_responses_analysis/llm_responses_analysis.py \
    --models gemma4 qwen3-30B-A3B \
    --countries es fr de it

Note on matplotlib backend: both scripts set matplotlib.use('Agg') internally, so no display server is required and they run safely in headless/SSH environments.

Citation

If you use this codebase in your research, please cite:

@software{euvalues2024,
  title={EU Values Survey LLM Integration},
  author={ELOQUENCE Project},
  organization={Telefonica-Scientific-Research},
  year={2024},
  url={https://github.com/Telefonica-Scientific-Research/EUValues}
}

License

This project is licensed under the MIT License - see LICENSE file for details.

Contributing

We welcome contributions! Please see CONTRIBUTING.md for guidelines.

Support

For issues, questions, or suggestions, please open an issue on GitHub Issues.

Project Context

This work is part of the ELOQUENCE Project - European Language Understanding and Question Answering in a Converged European Research Space.

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
.github		.github
Surveys		Surveys
Surveys_parsed		Surveys_parsed
Surveys_responses		Surveys_responses
Surveys_responses_parsed		Surveys_responses_parsed
demo		demo
docs		docs
figs		figs
prompts/survey_prompt		prompts/survey_prompt
src		src
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
Containerfile		Containerfile
HISTORY.md		HISTORY.md
LICENSE		LICENSE
README.md		README.md
ZA7500_q_es_math.json		ZA7500_q_es_math.json
ZA7500_q_es_math.tex		ZA7500_q_es_math.tex
ZA7500_q_es_text_tables.json		ZA7500_q_es_text_tables.json
ZA7500_q_es_text_tables.tex		ZA7500_q_es_text_tables.tex
mkdocs.yml		mkdocs.yml
requirements.txt		requirements.txt
run_analysis.sh		run_analysis.sh

Folders and files

Latest commit

History

Repository files navigation

EU Values Survey LLM Integration

Overview

Project Structure

Installation

Local setup

Requirements

Usage

Command Line

Basic Usage

With Custom Server Configuration

Command Line Arguments

Python API

Jupyter Notebook

Prompt Template Architecture

Example Template Structure

LLM Server Integration

Recommended Setup: llama.cpp Server

Docker Alternative

Response Format

CSV Format

JSON Format

Survey Data

Available Languages

Performance & Constraint Enforcement

Running the Analysis Pipeline

Quick start

Script options

Custom virtual environment

Running scripts individually

Citation

License

Contributing

Support

Project Context

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages