fix: add fail-fast YAML schema validation with actionable errors by praisonai-triage-agent[bot] · Pull Request #2128 · MervinPraison/PraisonAI

praisonai-triage-agent · 2026-06-21T08:53:40Z

Summary

Implements comprehensive YAML configuration validation with fail-fast behavior, aggregated error reporting, and a new 'praisonai validate' CLI command.

Changes

Schema Validation (New)

Pydantic models for agents/tasks/workflow configuration in config/schema.py
Required field validation (role, goal, backstory)
Type checking for all configuration values
Backward compatibility (agents to roles, topic to input, stream to streaming)

Cross-Reference Validation

Task to Agent references validated
Tool references resolved through ToolResolver
Handoff targets checked against defined roles
Workflow steps to Agent references validated

Fail-Fast Behavior

Configuration now fails immediately on validation errors (not just warnings)
Aggregated error reporting shows all problems at once
Clear, actionable error messages with context

CLI Command

praisonai validate file.yaml - validate a single file
praisonai validate check directory - validate all YAML files
praisonai validate schema - display schema documentation
--strict mode treats warnings as errors
--json output for CI integration

Testing

Validated with test files containing various error conditions:

Missing required fields
Invalid cross-references
Unknown tools
Type mismatches

All errors are caught and reported with actionable messages.

Backward Compatibility

Existing valid configurations continue to work
Unknown field detection defaults to warning (not error)
Environment variable PRAISONAI_VALIDATE_STRICT for global strict mode
Deprecation window before warnings become errors

Summary by CodeRabbit

Release Notes

New Features
- Added CLI validation commands: validate to check single YAML files, check to scan directories, and schema to view configuration documentation.
- Strict validation mode (--strict) with JSON output support (--json).
- Enhanced validation with detailed error and warning messages; quiet mode (--quiet) available.

#2125) - Add comprehensive Pydantic-based schema validation for agents/tasks/workflow - Implement fail-fast validation with aggregated error reporting - Add cross-reference validation for agents, tasks, tools, and handoffs - Create 'praisonai validate' CLI command for pre-commit/CI validation - Update agents_generator to fail on validation errors (not just warn) - Add pydantic dependency for schema validation Key improvements: - Required fields are now enforced (role, goal, backstory) - Type checking for all configuration values - Cross-references validated (tasks->agents, tools->definitions, handoffs->roles) - Unknown fields detected with suggestions - Aggregated error messages show all problems at once - --strict mode treats warnings as errors Co-authored-by: MervinPraison <MervinPraison@users.noreply.github.com>

MervinPraison · 2026-06-21T08:53:49Z

@coderabbitai review

MervinPraison · 2026-06-21T08:53:50Z

/review

qodo-code-review · 2026-06-21T08:53:54Z

Qodo reviews are paused for this user.

Troubleshooting steps vary by plan Learn more →

On a Teams plan?
Reviews resume once this user has a paid seat and their Git account is linked in Qodo.
Link Git account →

Using GitHub Enterprise Server, GitLab Self-Managed, or Bitbucket Data Center?
These require an Enterprise plan - Contact us
Contact us →

coderabbitai · 2026-06-21T08:53:55Z

✅ Action performed

Review finished.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

coderabbitai · 2026-06-21T08:54:03Z

Important

Review skipped

Bot user detected.

To trigger a single review, invoke the @coderabbitai review command.

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 0790f73a-4bb9-4d69-a60c-e403cfb2815c

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Use the checkbox below for a quick retry:

🔍 Trigger review

📝 Walkthrough

Walkthrough

Introduces a Pydantic-based YAML configuration validation system. A new config/schema.py defines typed models for agents, tasks, and workflows. config/validator.py adds ConfigValidator for schema, cross-reference, and tool resolution checks. AgentsGenerator._validate_agents_config is rewritten to use this validator with fail-fast aggregated errors. A new validate CLI command is added with validate, check, and schema subcommands.

Changes

Pydantic YAML Validation System

Layer / File(s)	Summary
Pydantic config schema models `src/praisonai/praisonai/config/schema.py`	Defines all Pydantic models and enums for the YAML config shape: `ProcessType`, `HandoffPolicy`, `ToolRetryPolicy`, `HandoffConfig`, `ApprovalConfig`, `RuntimeConfig`, `CliBackendConfig`, `AgentConfig`, `TaskConfig`, `WorkflowStep`, `WorkflowConfig`, `YAMLConfig` (with structure and cross-reference validators), and `ValidationResult` with `format_message()`.
ConfigValidator orchestration and tool resolution `src/praisonai/praisonai/config/validator.py`	Introduces `ConfigValidator` with `validate_yaml_string`, `validate_yaml_file`, and `validate_config` methods; aggregates Pydantic schema errors into field-path messages, runs cross-reference checks, resolves tool references via `ToolResolver`, emits unknown-field warnings, and promotes warnings to errors in strict mode.
config package exports and pydantic dependency `src/praisonai/praisonai/config/__init__.py`, `src/praisonai/pyproject.toml`	Populates `config/__init__.py` with re-exports of all schema types and `ConfigValidator` via `__all__`; adds `pydantic>=2.0.0` to `pyproject.toml` dependencies.
AgentsGenerator wired to ConfigValidator `src/praisonai/praisonai/agents_generator.py`	Replaces inline `known_fields`/`difflib` warning logic in `_validate_agents_config` with `ConfigValidator` construction, `PRAISONAI_VALIDATE_STRICT` env-var check, warning logging, and aggregated `ValueError` raise on invalid configs.
CLI validate/check/schema subcommands `src/praisonai/praisonai/cli/app.py`, `src/praisonai/praisonai/cli/commands/validate.py`	Registers a lazy `validate` command in the CLI; implements three Typer subcommands — `validate` (single file with `--strict`/`--quiet`/`--json`), `check` (directory scan with rich table and `--stop-on-error`), and `schema` (prints agent/task JSON schema documentation).

Sequence Diagram(s)

sequenceDiagram
  actor User
  participant CLI as praisonai validate
  participant ConfigValidator
  participant YAMLConfig as YAMLConfig (Pydantic)
  participant ToolResolver

  User->>CLI: praisonai validate agents.yaml --strict
  CLI->>ConfigValidator: validate_yaml_file("agents.yaml", strict=True)
  ConfigValidator->>ConfigValidator: yaml.safe_load(file)
  ConfigValidator->>YAMLConfig: model_validate(config_dict)
  YAMLConfig-->>ConfigValidator: validated model / ValidationError list
  ConfigValidator->>YAMLConfig: validate_cross_references()
  YAMLConfig-->>ConfigValidator: cross-reference error list
  ConfigValidator->>ToolResolver: resolve_tool(name) per referenced tool
  ToolResolver-->>ConfigValidator: resolved / unknown / optional warning
  ConfigValidator->>ConfigValidator: _check_unknown_fields() → warnings → errors (strict)
  ConfigValidator-->>CLI: ValidationResult(valid, errors, warnings)
  CLI-->>User: exit 0 (valid) or exit 1 + error list

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~25 minutes

Possibly related PRs

MervinPraison/PraisonAI#1633: Both PRs modify AgentsGenerator._validate_agents_config — the retrieved PR adds the difflib typo-suggestion inline logic that this PR replaces with ConfigValidator.
MervinPraison/PraisonAI#1713: This PR's ConfigValidator injects and uses ToolResolver; the retrieved PR changes how ToolResolver consults an injected ToolRegistry, directly affecting resolution behavior used during validation.
MervinPraison/PraisonAI#2037: Both PRs extend the lazy command registry in cli/app.py by adding different subcommands (validate vs permissions).

Suggested labels

Review effort 3/5

Suggested reviewers

MervinPraison

Poem

🐰 Hop hop, no more guessing games,
Pydantic now knows all the field names!
Bad YAML? Caught before the first call,
praisonai validate — the tool for all.
Errors aggregated, clear and bright,
The rabbit checks configs with delight! ✨

🚥 Pre-merge checks | ✅ 5

✅ Passed checks (5 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The pull request title accurately summarizes the main objective: implementing fail-fast YAML schema validation with actionable error messages, which aligns with the core changes across schema validation, validator, CLI command, and integration with the agents generator.
Linked Issues check	✅ Passed	The pull request successfully addresses all coding requirements from issue `#2125`: implements Pydantic-based schema validation [schema.py], fail-fast validation with aggregated error reporting [validator.py, agents_generator.py], cross-reference validation including task-to-agent and tool resolution, adds the 'praisonai validate' CLI command with --strict flag [validate.py, app.py], maintains backward compatibility through deprecation window via PRAISONAI_VALIDATE_STRICT, and ensures existing valid configurations continue to work.
Out of Scope Changes check	✅ Passed	All changes are directly in scope: schema validation models, validator implementation, CLI command integration, agents_generator refactoring to use ConfigValidator, config module exports, and pydantic dependency addition. No unrelated modifications detected outside the defined validation and CLI enhancement objectives.
Docstring Coverage	✅ Passed	Docstring coverage is 94.74% which is sufficient. The required threshold is 80.00%.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch claude/issue-2125-20260621-0842

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

MervinPraison · 2026-06-21T08:54:10Z

@copilot Do a thorough review of this PR. Read ALL existing reviewer comments above from Qodo, Coderabbit, and Gemini first — incorporate their findings.

Review areas:

Bloat check: Are changes minimal and focused? Any unnecessary code or scope creep?
Security: Any hardcoded secrets, unsafe eval/exec, missing input validation?
Performance: Any module-level heavy imports? Hot-path regressions?
Tests: Are tests included? Do they cover the changes adequately?
Backward compat: Any public API changes without deprecation?
Code quality: DRY violations, naming conventions, error handling?
Address reviewer feedback: If Qodo, Coderabbit, or Gemini flagged valid issues, include them in your review
Suggest specific improvements with code examples where possible

coderabbitai

Actionable comments posted: 6

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@src/praisonai/praisonai/cli/commands/validate.py`:
- Line 12: The validation command exits early when a file is missing before
checking if the --json flag is set, causing it to emit formatted text via rich
print instead of structured JSON output. Reorganize the code to ensure the file
existence pre-check occurs after the JSON flag is evaluated, or wrap the file
missing failure with conditional logic that outputs JSON when --json is enabled
and formatted text otherwise. This applies to the pre-check logic and the error
handling branches around lines 62-67 and 75-85 in addition to the initial
imports.

In `@src/praisonai/praisonai/config/schema.py`:
- Around line 330-343: The handoff target validation in the loop starting with
iterating through agents_dict is checking if targets exist in all_roles while
simultaneously populating all_roles in the same pass. This causes forward
references to be incorrectly flagged as undefined when an agent's handoff
references a role defined later in the configuration. To fix this, separate the
logic into two passes: first iterate through all agents_dict values to
completely populate the all_roles set, then in a second pass iterate again to
validate that all handoff targets in agent_config.handoff.to exist in the
now-complete all_roles set.
- Around line 193-206: The validate_step_type method checks for known step types
like 'task', 'parallel', 'loop', and 'route' but lacks validation to reject
unknown or unsupported step types, allowing invalid configurations to pass
silently. Add an else clause at the end of the conditional chain that raises a
ValueError when the step type does not match any of the supported types,
including the step name and the invalid type in the error message to provide
clear context during debugging.
- Around line 116-123: The handle_stream_alias validator in the stream field
validator only processes the stream field value but does not update the
streaming field itself, so backward-compatible configs using stream still result
in streaming being False. Replace or supplement this single field_validator with
a model_validator that runs after all fields are populated, checking if stream
is provided and streaming is at its default value, then explicitly setting
streaming to match the stream value. This ensures that providing stream: true
actually enables the streaming functionality as intended.

In `@src/praisonai/praisonai/config/validator.py`:
- Around line 211-245: The _is_known_optional_tool method in the validator.py
file is too permissive because it returns True for any tool name ending with
'Tool', even if it's not in the known_optional set. This masks typos and unknown
tools by treating them as optional dependencies instead of failing validation.
Remove the or tool_name.endswith('Tool') fallback condition from the return
statement in _is_known_optional_tool so that only tools explicitly listed in the
known_optional set are recognized as optional tools, allowing unknown tool names
to fail validation properly.
- Around line 85-103: The validate_config method does not validate that the
config parameter is a dictionary before attempting to unpack it with `**config`
into YAMLConfig. Non-mapping YAML input (None, lists, or scalar values) will
raise a TypeError that bypasses the aggregated error reporting mechanism. Add a
type check before the try block that instantiates YAMLConfig to verify that
config is a dictionary; if it is not, append an error message to the errors list
using the file_prefix format and continue with the validation, ensuring a proper
ValidationResult is always returned instead of raising an unhandled exception.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 31e6f1c1-edfc-420e-9bb3-a23136b1c4f8

📥 Commits

Reviewing files that changed from the base of the PR and between 5f39d66 and 4e281bd.

📒 Files selected for processing (7)

src/praisonai/praisonai/agents_generator.py
src/praisonai/praisonai/cli/app.py
src/praisonai/praisonai/cli/commands/validate.py
src/praisonai/praisonai/config/__init__.py
src/praisonai/praisonai/config/schema.py
src/praisonai/praisonai/config/validator.py
src/praisonai/pyproject.toml

coderabbitai · 2026-06-21T09:00:02Z

+from typing import Optional
+
+import typer
+from rich import print


⚠️ Potential issue | 🟠 Major | ⚡ Quick win

--json mode should emit JSON for missing-file failures too.

The pre-check exits before the JSON branch, so --json returns formatted text instead of structured output on a missing file. This breaks CI parsing for one of the most common failure cases.

Suggested fix

@@ -import typer -from rich import print +import typer from rich.console import Console from rich.table import Table @@ - # Check if file exists - file_path = Path(file) - if not file_path.exists(): - console.print(f"[red]✗ File not found:[/red] {file}", style="bold") - sys.exit(1) + file_path = Path(file) @@ if json_output: import json output = { "file": str(file_path), "valid": result.valid, "errors": result.errors, "warnings": result.warnings, "strict_mode": strict } - print(json.dumps(output, indent=2)) + sys.stdout.write(json.dumps(output, indent=2) + "\n") sys.exit(0 if result.valid else 1)

Also applies to: 62-67, 75-85

🧰 Tools

🪛 Ruff (0.15.17)

[error] 12-12: Import print is shadowing a Python builtin

(A004)

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@src/praisonai/praisonai/cli/commands/validate.py` at line 12, The validation command exits early when a file is missing before checking if the --json flag is set, causing it to emit formatted text via rich print instead of structured JSON output. Reorganize the code to ensure the file existence pre-check occurs after the JSON flag is evaluated, or wrap the file missing failure with conditional logic that outputs JSON when --json is enabled and formatted text otherwise. This applies to the pre-check logic and the error handling branches around lines 62-67 and 75-85 in addition to the initial imports.

coderabbitai · 2026-06-21T09:00:02Z

+    @field_validator('stream')
+    @classmethod
+    def handle_stream_alias(cls, v, info):
+        """Handle stream as alias for streaming."""
+        if v is not None and 'streaming' not in info.data:
+            return v
+        return info.data.get('streaming', False) if v is None else v
+


⚠️ Potential issue | 🟠 Major | ⚡ Quick win

stream alias does not actually enable streaming.

stream: true is stored on stream, but streaming remains its default (False), so backward-compatible configs lose intended behavior.

Suggested fix

- `@field_validator`('stream') - `@classmethod` - def handle_stream_alias(cls, v, info): - """Handle stream as alias for streaming.""" - if v is not None and 'streaming' not in info.data: - return v - return info.data.get('streaming', False) if v is None else v + `@model_validator`(mode='before') + `@classmethod` + def normalize_stream_alias(cls, data): + """Map legacy 'stream' into canonical 'streaming'.""" + if isinstance(data, dict) and 'streaming' not in data and 'stream' in data: + data['streaming'] = data['stream'] + return data

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@src/praisonai/praisonai/config/schema.py` around lines 116 - 123, The handle_stream_alias validator in the stream field validator only processes the stream field value but does not update the streaming field itself, so backward-compatible configs using stream still result in streaming being False. Replace or supplement this single field_validator with a model_validator that runs after all fields are populated, checking if stream is provided and streaming is at its default value, then explicitly setting streaming to match the stream value. This ensures that providing stream: true actually enables the streaming functionality as intended.

coderabbitai · 2026-06-21T09:00:02Z

+    def validate_step_type(self):
+        """Validate step configuration based on type."""
+        if self.type == 'task':
+            if not self.agent or not self.task:
+                raise ValueError(f"Task step '{self.name}' requires both 'agent' and 'task' fields")
+        elif self.type in ('parallel', 'loop'):
+            if not self.steps:
+                raise ValueError(f"{self.type.capitalize()} step '{self.name}' requires 'steps' field")
+            if self.type == 'loop' and self.count is None:
+                raise ValueError(f"Loop step '{self.name}' requires 'count' field")
+        elif self.type == 'route':
+            if not self.routes:
+                raise ValueError(f"Route step '{self.name}' requires 'routes' field")
+


⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Unknown workflow step types currently pass validation silently.

validate_step_type() checks known types but never rejects unsupported ones, so invalid configs bypass fail-fast validation.

Suggested fix

`@model_validator`(mode='after') def validate_step_type(self): """Validate step configuration based on type.""" + allowed = {'task', 'parallel', 'loop', 'route'} + if self.type not in allowed: + raise ValueError( + f"Step '{self.name}' has invalid type '{self.type}'. " + f"Allowed values: {', '.join(sorted(allowed))}" + ) + if self.type == 'task': if not self.agent or not self.task: raise ValueError(f"Task step '{self.name}' requires both 'agent' and 'task' fields")

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@src/praisonai/praisonai/config/schema.py` around lines 193 - 206, The validate_step_type method checks for known step types like 'task', 'parallel', 'loop', and 'route' but lacks validation to reject unknown or unsupported step types, allowing invalid configurations to pass silently. Add an else clause at the end of the conditional chain that raises a ValueError when the step type does not match any of the supported types, including the step name and the invalid type in the error message to provide clear context during debugging.

coderabbitai · 2026-06-21T09:00:02Z

+        all_roles = set()
+        for agents_dict in [self.roles, self.agents]:
+            if agents_dict:
+                for agent_config in agents_dict.values():
+                    all_roles.add(agent_config.role)
+
+                    # Check handoff targets
+                    if agent_config.handoff and isinstance(agent_config.handoff, HandoffConfig):
+                        for target in agent_config.handoff.to:
+                            if target not in all_roles:
+                                errors.append(
+                                    f"Agent '{agent_config.role}' handoff references undefined role '{target}'. "
+                                    f"Available roles: {', '.join(sorted(all_roles))}"
+                                )


⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Handoff target validation is order-dependent and rejects forward references.

all_roles is populated and validated in the same pass, so a handoff to a role defined later can be incorrectly flagged as undefined.

Suggested fix

- all_roles = set() - for agents_dict in [self.roles, self.agents]: - if agents_dict: - for agent_config in agents_dict.values(): - all_roles.add(agent_config.role) - - # Check handoff targets - if agent_config.handoff and isinstance(agent_config.handoff, HandoffConfig): - for target in agent_config.handoff.to: - if target not in all_roles: - errors.append( - f"Agent '{agent_config.role}' handoff references undefined role '{target}'. " - f"Available roles: {', '.join(sorted(all_roles))}" - ) + all_roles = set() + all_agent_configs = [] + for agents_dict in [self.roles, self.agents]: + if agents_dict: + all_agent_configs.extend(agents_dict.values()) + for agent_config in agents_dict.values(): + all_roles.add(agent_config.role) + + for agent_config in all_agent_configs: + if agent_config.handoff and isinstance(agent_config.handoff, HandoffConfig): + for target in agent_config.handoff.to: + if target not in all_roles: + errors.append( + f"Agent '{agent_config.role}' handoff references undefined role '{target}'. " + f"Available roles: {', '.join(sorted(all_roles))}" + )

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@src/praisonai/praisonai/config/schema.py` around lines 330 - 343, The handoff target validation in the loop starting with iterating through agents_dict is checking if targets exist in all_roles while simultaneously populating all_roles in the same pass. This causes forward references to be incorrectly flagged as undefined when an agent's handoff references a role defined later in the configuration. To fix this, separate the logic into two passes: first iterate through all agents_dict values to completely populate the all_roles set, then in a second pass iterate again to validate that all handoff targets in agent_config.handoff.to exist in the now-complete all_roles set.

coderabbitai · 2026-06-21T09:00:02Z

+    def validate_config(self, config: Dict[str, Any], strict: bool = False, file_path: Optional[str] = None) -> ValidationResult:
+        """Validate configuration dictionary.
+
+        Args:
+            config: Configuration dictionary
+            strict: If True, treat warnings as errors
+            file_path: Optional file path for better error messages
+
+        Returns:
+            ValidationResult with errors and warnings
+        """
+        errors = []
+        warnings = []
+        file_prefix = f"{file_path}: " if file_path else ""
+
+        # Validate against schema
+        try:
+            yaml_config = YAMLConfig(**config)
+        except ValidationError as e:


⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Non-mapping YAML can crash validation before returning ValidationResult.

YAMLConfig(**config) assumes config is a dict. Empty YAML (None) or list/scalar YAML will raise TypeError and bypass aggregated error reporting.

Suggested fix

def validate_config(self, config: Dict[str, Any], strict: bool = False, file_path: Optional[str] = None) -> ValidationResult: @@ errors = [] warnings = [] file_prefix = f"{file_path}: " if file_path else "" + + if config is None: + return ValidationResult( + valid=False, + errors=[f"{file_prefix}Configuration is empty. Expected a YAML mapping at the root."] + ) + if not isinstance(config, dict): + return ValidationResult( + valid=False, + errors=[f"{file_prefix}Invalid root type '{type(config).__name__}'. Expected a YAML mapping/object."] + ) # Validate against schema try: yaml_config = YAMLConfig(**config)

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

def validate_config(self, config: Dict[str, Any], strict: bool = False, file_path: Optional[str] = None) -> ValidationResult:

"""Validate configuration dictionary.

Args:

config: Configuration dictionary

strict: If True, treat warnings as errors

file_path: Optional file path for better error messages

Returns:

ValidationResult with errors and warnings

"""

errors = []

warnings = []

file_prefix = f"{file_path}: " if file_path else ""

# Validate against schema

try:

yaml_config = YAMLConfig(**config)

except ValidationError as e:

def validate_config(self, config: Dict[str, Any], strict: bool = False, file_path: Optional[str] = None) -> ValidationResult:

"""Validate configuration dictionary.

Args:

config: Configuration dictionary

strict: If True, treat warnings as errors

file_path: Optional file path for better error messages

Returns:

ValidationResult with errors and warnings

"""

errors = []

warnings = []

file_prefix = f"{file_path}: " if file_path else ""

if config is None:

return ValidationResult(

valid=False,

errors=[f"{file_prefix}Configuration is empty. Expected a YAML mapping at the root."]

)

if not isinstance(config, dict):

return ValidationResult(

valid=False,

errors=[f"{file_prefix}Invalid root type '{type(config).__name__}'. Expected a YAML mapping/object."]

)

# Validate against schema

try:

yaml_config = YAMLConfig(**config)

except ValidationError as e:

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@src/praisonai/praisonai/config/validator.py` around lines 85 - 103, The validate_config method does not validate that the config parameter is a dictionary before attempting to unpack it with `**config` into YAMLConfig. Non-mapping YAML input (None, lists, or scalar values) will raise a TypeError that bypasses the aggregated error reporting mechanism. Add a type check before the try block that instantiates YAMLConfig to verify that config is a dictionary; if it is not, append an error message to the errors list using the file_prefix format and continue with the validation, ensuring a proper ValidationResult is always returned instead of raising an unhandled exception.

coderabbitai · 2026-06-21T09:00:02Z

+    def _is_known_optional_tool(self, tool_name: str) -> bool:
+        """Check if a tool name is a known optional tool.
+
+        Args:
+            tool_name: Tool name to check
+
+        Returns:
+            True if it's a known optional tool
+        """
+        # List of known optional tools that require extra dependencies
+        known_optional = {
+            # Database tools
+            'PostgreSQLTool', 'MySQLTool', 'SQLiteTool', 'MongoDBTool', 'RedisTool',
+            'SurrealDBTool', 'CassandraTool', 'ElasticsearchTool',
+
+            # Web/API tools
+            'SlackTool', 'DiscordTool', 'TelegramTool', 'EmailTool',
+            'TwitterTool', 'LinkedInTool', 'GitHubTool',
+
+            # Cloud tools
+            'AWSTool', 'AzureTool', 'GCPTool', 'S3Tool',
+
+            # AI/ML tools
+            'HuggingFaceTool', 'OpenAITool', 'AnthropicTool',
+
+            # Data tools
+            'PandasTool', 'NumpyTool', 'ScipyTool',
+
+            # Other
+            'BrowserTool', 'SeleniumTool', 'PlaywrightTool',
+            'KubernetesTool', 'DockerTool', 'TerraformTool',
+        }
+
+        return tool_name in known_optional or tool_name.endswith('Tool')
+


⚠️ Potential issue | 🟠 Major | ⚡ Quick win

endswith('Tool') turns unknown tool typos into warnings instead of errors.

Any unresolved tool named like SomethingTool is treated as “optional dependency,” which undermines fail-fast unknown-tool validation.

Suggested fix

- return tool_name in known_optional or tool_name.endswith('Tool') + return tool_name in known_optional

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@src/praisonai/praisonai/config/validator.py` around lines 211 - 245, The _is_known_optional_tool method in the validator.py file is too permissive because it returns True for any tool name ending with 'Tool', even if it's not in the known_optional set. This masks typos and unknown tools by treating them as optional dependencies instead of failing validation. Remove the or tool_name.endswith('Tool') fallback condition from the return statement in _is_known_optional_tool so that only tools explicitly listed in the known_optional set are recognized as optional tools, allowing unknown tool names to fail validation properly.

MervinPraison · 2026-06-21T09:04:00Z

@claude You are the FINAL architecture reviewer. If the branch is under MervinPraison/PraisonAI (not a fork), you are able to make modifications to this branch and push directly. SCOPE: Focus ONLY on Python packages (praisonaiagents, praisonai). Do NOT modify praisonai-rust or praisonai-ts. Read ALL comments above from Gemini, Qodo, CodeRabbit, and Copilot carefully before responding.

Phase 1: Review per AGENTS.md

Protocol-driven: check heavy implementations vs core SDK
Backward compatible: ensure zero feature regressions
Performance: no hot-path regressions

Phase 2: FIX Valid Issues
4. For any VALID bugs or architectural flaws found by Gemini, CodeRabbit, Qodo, Copilot, or any other reviewer: implement the fix
5. Push all code fixes directly to THIS branch (do NOT create a new PR)
6. Comment a summary of exact files modified and what you skipped

Phase 3: Final Verdict
7. If all issues are resolved, approve the PR / close the Issue
8. If blocking issues remain, request changes / leave clear action items

praisonai-triage-agent · 2026-06-21T09:04:16Z

greptile-apps · 2026-06-21T09:05:00Z

Greptile Summary

This PR introduces Pydantic-based YAML schema validation with fail-fast error aggregation, a cross-reference checker, and a new praisonai validate CLI command. The _validate_agents_config method in AgentsGenerator is replaced with the new ConfigValidator, which raises ValueError on invalid configs.

AgentConfig has no tasks field: all nested task configs (the primary authoring pattern across every existing YAML in the repo) are silently dropped by Pydantic's extra='ignore' default, meaning the validator never checks required task fields, types, or unknown fields within agents.
llm/function_calling_llm typed Optional[str] while existing YAMLs use a dict ({model: "groq/..."}) — Pydantic raises a ValidationError, blocking all existing configs that configure LLMs inline.
ConfigValidator.__init__ unconditionally calls register_builtin_autogen_adapters(_suppress_deprecation_warning=True) on every validation run, silently suppressing deprecation guidance and repeating registration cost per-file in validate check.

Confidence Score: 2/5

Not safe to merge: the schema breaks all existing YAML files that configure LLMs with a dict, and silently skips validation for the primary nested-task pattern used across the whole repo.

Two separate issues prevent this from working correctly with the existing YAML corpus. First, llm/function_calling_llm are typed Optional[str] while every advanced YAML uses a dict — this causes a Pydantic ValidationError that propagates as a ValueError from _validate_agents_config, blocking all agent runs for configs that set LLM options inline. Second, AgentConfig has no tasks field, so the nested task dicts that are the dominant authoring pattern are silently dropped; the advertised comprehensive validation provides no coverage for task-level required fields or type errors.

src/praisonai/praisonai/config/schema.py requires the most attention: the AgentConfig field set is incomplete (missing tasks) and the llm/function_calling_llm types must accept Union[str, Dict[str, Any]].

Important Files Changed

Filename	Overview
src/praisonai/praisonai/config/schema.py	New Pydantic schema for YAML validation; AgentConfig lacks a `tasks` field so all nested task configs are silently dropped (Pydantic extra='ignore'), plus llm/function_calling_llm are typed Optional[str] while existing YAMLs use dict format
src/praisonai/praisonai/config/validator.py	New ConfigValidator wiring schema + tool + unknown-field checks; constructor unconditionally re-registers built-in autogen adapters with a suppressed deprecation warning; known_top_level missing `dependencies`
src/praisonai/praisonai/agents_generator.py	Replaces simple difflib typo-checker with ConfigValidator; raises ValueError on errors — behavior change is correct but the underlying validator has multiple issues that will surface at runtime
src/praisonai/praisonai/cli/commands/validate.py	New CLI subcommand (validate/check/schema) wired via typer; logic is straightforward but inherits all schema/validator correctness issues
src/praisonai/praisonai/cli/app.py	One-line addition registering the validate subcommand in the lazy command registry; no issues
src/praisonai/praisonai/config/init.py	New package init re-exporting all schema and validator symbols; straightforward
src/praisonai/pyproject.toml	Adds pydantic>=2.0.0 as a formal dependency; appropriate for the new Pydantic v2 models

Flowchart

%%{init: {'theme': 'neutral'}}%%
flowchart TD
    A[YAML file / string] --> B[yaml.safe_load]
    B --> C{Parse error?}
    C -- yes --> ERR1[ValidationResult valid=False
YAML syntax error]
    C -- no --> D[ConfigValidator.validate_config]
    D --> E[YAMLConfig Pydantic model]
    E --> F{Pydantic
ValidationError?}
    F -- yes --> ERR2[Collect errors
return invalid]
    F -- no --> G[validate_cross_references
task→agent, workflow steps,
handoff targets]
    G --> H[_validate_tools
resolve each tool ref]
    H --> I[_check_unknown_fields
warn on extra keys]
    I --> J{strict mode?}
    J -- yes --> K[warnings promoted
to errors]
    J -- no --> L[ValidationResult
valid / errors / warnings]
    K --> L

    subgraph Gap [Silent gap]
        E2[AgentConfig
no tasks field
Pydantic extra=ignore]
        E2 --> SKIP[Nested task configs
completely skipped]
    end
    E -.-> E2

%%{init: {'theme': 'base', 'themeVariables': {"darkMode": true, "background": "#0d1117", "primaryColor": "#21262d", "primaryTextColor": "#e6edf3", "primaryBorderColor": "#8b949e", "lineColor": "#8b949e", "textColor": "#e6edf3", "edgeLabelBackground": "#161b22", "actorBkg": "#21262d", "actorBorder": "#8b949e", "actorTextColor": "#e6edf3", "actorLineColor": "#8b949e", "signalColor": "#8b949e", "signalTextColor": "#e6edf3", "noteBkgColor": "#373320", "noteBorderColor": "#d4a72c", "noteTextColor": "#f0e6c0", "labelBoxBkgColor": "#21262d", "labelBoxBorderColor": "#8b949e", "labelTextColor": "#e6edf3", "loopTextColor": "#e6edf3", "activationBkgColor": "#30363d", "activationBorderColor": "#8b949e"}}}%%
flowchart TD
    A[YAML file / string] --> B[yaml.safe_load]
    B --> C{Parse error?}
    C -- yes --> ERR1[ValidationResult valid=False
YAML syntax error]
    C -- no --> D[ConfigValidator.validate_config]
    D --> E[YAMLConfig Pydantic model]
    E --> F{Pydantic
ValidationError?}
    F -- yes --> ERR2[Collect errors
return invalid]
    F -- no --> G[validate_cross_references
task→agent, workflow steps,
handoff targets]
    G --> H[_validate_tools
resolve each tool ref]
    H --> I[_check_unknown_fields
warn on extra keys]
    I --> J{strict mode?}
    J -- yes --> K[warnings promoted
to errors]
    J -- no --> L[ValidationResult
valid / errors / warnings]
    K --> L

    subgraph Gap [Silent gap]
        E2[AgentConfig
no tasks field
Pydantic extra=ignore]
        E2 --> SKIP[Nested task configs
completely skipped]
    end
    E -.-> E2

_{Reviews (2): Last reviewed commit: "fix: address all 6 validation issues fro..." | Re-trigger Greptile}

greptile-apps · 2026-06-21T09:05:03Z

+    llm: Optional[str] = Field(default=None, description="LLM model to use")
+    function_calling_llm: Optional[str] = Field(default=None, description="LLM for function calling")


llm/function_calling_llm typed as Optional[str] but existing YAML uses dict format

The existing YAML files (e.g., agents-advanced.yaml) use a dict structure for llm:

llm: model: "groq/llama3-70b-8192" function_calling_llm: model: "openai/gpt-5-nano"

Because these fields are typed Optional[str], Pydantic v2 will raise a ValidationError when given a dict, which propagates up as a ValueError in _validate_agents_config and crashes the agent run for all existing YAML files that use dict-style LLM config. These fields should be typed Optional[Union[str, Dict[str, Any]]] to preserve backward compatibility.

greptile-apps · 2026-06-21T09:05:05Z

+        # Known top-level fields
+        known_top_level = {
+            'name', 'description', 'framework', 'process', 'type',
+            'roles', 'agents', 'tasks', 'workflow', 'steps',
+            'input', 'topic', 'tools', 'toolsets',
+            'config', 'llm', 'models', 'providers',
+            'deploy',  # Deployment configuration
+        }


dependencies is a common top-level field in existing YAML files but is not in known_top_level

Every existing YAML file in the repo (agents.yaml, agents-advanced.yaml, tests/agents.yaml, etc.) includes dependencies: [] at the top level. This field is not in known_top_level, so _check_unknown_fields emits a warning for every valid config. In strict mode this becomes a blocking error, breaking all existing configs with --strict.

Suggested change

# Known top-level fields

known_top_level = {

'name', 'description', 'framework', 'process', 'type',

'roles', 'agents', 'tasks', 'workflow', 'steps',

'input', 'topic', 'tools', 'toolsets',

'config', 'llm', 'models', 'providers',

'deploy', # Deployment configuration

}

# Known top-level fields

known_top_level = {

'name', 'description', 'framework', 'process', 'type',

'roles', 'agents', 'tasks', 'workflow', 'steps',

'input', 'topic', 'tools', 'toolsets',

'config', 'llm', 'models', 'providers',

'deploy', # Deployment configuration

'dependencies', # Task dependency declarations

}

- Fix JSON output for missing files in validate command - Fix stream alias not enabling streaming field (use model_validator before) - Add validation to reject unknown workflow step types - Fix handoff validation order-dependency (two-pass validation) - Handle non-dict YAML input gracefully without crashing - Remove overly permissive endswith('Tool') fallback These fixes ensure fail-fast validation works correctly with proper error aggregation and backward compatibility as intended.

praisonai-triage-agent Bot mentioned this pull request Jun 21, 2026

YAML: fail-fast schema validation for agents/tasks config with actionable errors (+ praisonai validate) #2125

Open

Copilot AI requested a review from MervinPraison June 21, 2026 08:54

coderabbitai Bot reviewed Jun 21, 2026

View reviewed changes

greptile-apps Bot reviewed Jun 21, 2026

View reviewed changes

		llm: Optional[str] = Field(default=None, description="LLM model to use")
		function_calling_llm: Optional[str] = Field(default=None, description="LLM for function calling")

Uh oh!

Conversation

praisonai-triage-agent Bot commented Jun 21, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Changes

Schema Validation (New)

Cross-Reference Validation

Fail-Fast Behavior

CLI Command

Testing

Backward Compatibility

Summary by CodeRabbit

Release Notes

Uh oh!

MervinPraison commented Jun 21, 2026

Uh oh!

MervinPraison commented Jun 21, 2026

Uh oh!

qodo-code-review Bot commented Jun 21, 2026

Qodo reviews are paused for this user.

Uh oh!

coderabbitai Bot commented Jun 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

coderabbitai Bot commented Jun 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review skipped

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related PRs

Suggested labels

Suggested reviewers

Poem

Uh oh!

MervinPraison commented Jun 21, 2026

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Jun 21, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Jun 21, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Jun 21, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Jun 21, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Jun 21, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Jun 21, 2026

Choose a reason for hiding this comment

Uh oh!

MervinPraison commented Jun 21, 2026

Uh oh!

praisonai-triage-agent Bot commented Jun 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Architecture Review & Fixes ✅

Phase 1: Architecture Review ✅

Phase 2: Fixed Valid Issues ✅

Phase 3: Final Verdict ✅

Summary of Files Modified

Final Approval ✅

Uh oh!

greptile-apps Bot commented Jun 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Greptile Summary

Confidence Score: 2/5

Important Files Changed

Flowchart

Uh oh!

greptile-apps Bot Jun 21, 2026

Choose a reason for hiding this comment

praisonai-triage-agent Bot commented Jun 21, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented Jun 21, 2026 •

edited

Loading

coderabbitai Bot commented Jun 21, 2026 •

edited

Loading

praisonai-triage-agent Bot commented Jun 21, 2026 •

edited

Loading

greptile-apps Bot commented Jun 21, 2026 •

edited

Loading