Feat/quick review v2 by AntoMontagneDev · Pull Request #16 · montagne-dev/lampe

AntoMontagneDev · 2026-03-05T21:34:24Z

🔦 description

What change is being made?

Add a new list_directory_at_commit tool, integrate a hallucination-filter step into quick PR reviews, and replace brittle JSON parsing with a robust response parser across agentic and quick-review workflows, including tests and updated prompts.

Why are these changes being made?

These changes improve PR review reliability by enabling directory-orientation, reducing noise from investigation-requests, and handling LLM outputs more gracefully. They introduce new tools, prompts, and tests, which brings added maintenance and potential edge-case risks that should be monitored.

lampe-ci · 2026-03-05T21:37:03Z

+    stripped = content.strip()
+
+    # Try markdown code block with json language tag
+    match = re.search(r"```(?:json)?\s*\n(.*?)\n```", stripped, re.DOTALL)


🔦🐛

Regex-based block extraction is brittle; consider supporting more variants (extra spaces, language hints, nested blocks) or a more robust extraction strategy.

lampe-ci · 2026-03-05T21:37:05Z

+        return match.group(1).strip()
+
+    # Try generic code block
+    match = re.search(r"```\s*\n(.*?)\n```", stripped, re.DOTALL)


🔦🐛

Second code-block extraction also relies on a strict pattern; consider consolidating extraction logic or adding tests for edge cases (e.g., multiple fences, spacing).

lampe-ci · 2026-03-05T21:37:07Z

+        return None, False
+
+    # Workaround: some models insert newlines before closing quotes
+    normalized = extracted.replace('\n"', '"')


🔦🐛

Normalization step (replacing \n") is a hack. Prefer robust JSON parsing with error handling and optional normalization, to avoid edge cases with escaping.

lampe-ci · 2026-03-05T21:37:08Z

+    normalized = extracted.replace('\n"', '"')
+
+    try:
+        parser = PydanticOutputParser(output_cls=ValidationAgentResponseModel)


🔦🐛

Catch specific parsing/JSON errors instead of a broad Exception to avoid masking other bugs during parsing.

lampe-ci · 2026-03-05T21:37:10Z

+        parser = PydanticOutputParser(output_cls=ValidationAgentResponseModel)
+        parsed = parser.parse(normalized)
+        return parsed, True
+    except Exception:


🔦🐛

On failure you return (None, False) without logging; consider logging the failure and/or returning a clearer error signal.

lampe-ci · 2026-03-05T21:37:12Z

@@ -0,0 +1,54 @@
+"""Prompt for hallucination filter — mutes comments that ask the user to investigate instead of stating verified bugs."""
+
+HALLUCINATION_FILTER_SYSTEM_PROMPT = """


🔦🐛

System prompt content could be split into smaller, testable templates or loaded from config for easier maintenance.

lampe-ci · 2026-03-05T21:37:14Z

+
+For each issue whose comment is an "investigation request" (see below), call mute_issue with its issue_id and reason "investigation_request". Do NOT mute comments that state a verified bug. Prefer issuing all mute_issue calls in a single response.
+
+# Mute These (investigation requests)


🔦🐛

Hard-coded lists of investigation-phrases; consider moving to a config/module constant for easier updates and localization.

lampe-ci · 2026-03-05T21:37:16Z

+**Files Changed:**
+{files_changed}
+
+**Issues to Review (with IDs for muting):**


🔦🐛

Issues to Review uses placeholders; ensure formatting is robust if data is missing (e.g., missing issues_with_ids).

lampe-ci · 2026-03-05T21:37:18Z

+        self.verbose = verbose
+        self.logger = logging.getLogger(name=LAMPE_LOGGER_NAME)
+        self.llm = llm or LiteLLM(
+            model=MODELS.GPT_5_NANO_2025_08_07,


🔦🐛

Model choice (GPT_5_NANO_2025_08_07) should be verified for availability/licensing and whether a fallback is needed for environments without that model.

lampe-ci · 2026-03-05T21:37:20Z

+
+        # Skip if no findings to filter
+        issues_with_ids = _build_issues_with_ids(ev.agent_reviews)
+        if "_No issues to review._" in issues_with_ids:


🔦🐛

Sentinel check 'No issues to review.' relies on a specific string from a prior call; this is brittle. Consider a more explicit boolean/result from the aggregation step.

lampe-ci · 2026-03-05T21:37:21Z

+        )
+
+        try:
+            agent_ctx = WorkflowContext(self._agent)


🔦🐛

Ensure the context/agent lifecycle is correct when creating and using agent_ctx; confirm that storing/fetching 'muted_reasons' is safe across runs.

lampe-ci · 2026-03-05T21:37:23Z

+                start_event=MuteIssueStart(user_prompt=user_prompt),
+                ctx=agent_ctx,
+            )
+            muted_reasons = await agent_ctx.store.get("muted_reasons", default={})


🔦🐛

Getting 'muted_reasons' from the store uses a default of {}; ensure the store API actually returns a dict and not None.

lampe-ci · 2026-03-05T21:37:25Z

+            if self.verbose and muted_reasons:
+                self.logger.debug(f"Hallucination filter muted {len(muted_reasons)} issues")
+
+        except Exception as e:


🔦🐛

Broad exception handling (except Exception) can mask real issues; prefer catching specific known exceptions from the LL/LMM/agent layer.

lampe-ci · 2026-03-05T21:37:27Z

+def test_extract_json_from_plain_content():
+    """When no markdown block, return stripped content."""
+    content = '  {"no_issue": true, "findings": []}  '
+    assert extract_json_from_llm_content(content) == '{"no_issue": true, "findings": []}'


🔦🐛

Plain content extraction test looks fine.

lampe-ci · 2026-03-05T21:37:30Z

packages/lampe-review/tests/unit/workflows/agentic_review/test_response_parse.py (Line 23): Markdown JSON block extraction is tested; ensure isolated JSON is returned (not the surrounding text).

lampe-ci · 2026-03-05T21:37:31Z

+{"no_issue": true, "findings": []}
+```
+"""
+    result = extract_json_from_llm_content(content)


🔦🐛

Generic code-block extraction test; ensure behavior when code fences are present without language hints.

lampe-ci · 2026-03-05T21:37:33Z

+
+def test_extract_json_empty_content():
+    """Empty or whitespace returns empty string."""
+    assert extract_json_from_llm_content("") == ""


🔦🐛

Empty content test; good to cover whitespace/empty input.

lampe-ci · 2026-03-05T21:37:35Z

+def test_parse_validation_response_valid_json():
+    """Valid JSON returns parsed model and success."""
+    content = '{"no_issue": true, "findings": []}'
+    parsed, success = parse_validation_response(content)


🔦🐛

Valid JSON parse test; assumes parsed model exposes attributes (no_issue, findings). Confirm the model type aligns with tests.

lampe-ci · 2026-03-05T21:37:37Z

+        {"file_path": "src/a.py", "line_number": 42, "action": "fix",
+         "problem_summary": "Missing validation", "severity": "high", "category": "security"}
+    ]}"""
+    parsed, success = parse_validation_response(content)


🔦🐛

Test with findings checks nested fields; ensure tests align with actual model structure (dicts vs objects).

lampe-ci · 2026-03-05T21:37:39Z

+    assert parsed.findings[0]["line_number"] == 42
+
+
+def test_parse_validation_response_malformed_json_no_exception():


🔦🐛

Malformed JSON test; verify graceful failure without exceptions.

lampe-ci · 2026-03-05T21:37:41Z

+
+def test_parse_validation_response_truncated_json_no_exception():
+    """Truncated JSON returns (None, False) without raising."""
+    truncated = '{"no_issue": false, "findings": [{"file_path": "x'


🔦🐛

Truncated JSON test; ensure graceful failure path.

lampe-ci · 2026-03-05T21:37:43Z

+
+def test_parse_validation_response_empty_no_exception():
+    """Empty content returns (None, False) without raising."""
+    parsed, success = parse_validation_response("")


🔦🐛

Empty input test; covered.

lampe-ci · 2026-03-05T21:37:45Z

+
+def test_parse_validation_response_garbage_no_exception():
+    """Arbitrary garbage returns (None, False) without raising."""
+    for garbage in ["not json at all", "null", "[]", '{"x"}', "}{"]:


🔦🐛

Garbage inputs loop; ensures non-crashing behavior across varied inputs.

lampe-ci · 2026-03-05T21:37:47Z

+
+
+def test_validation_agent_parse_response_graceful_fallback():
+    """ValidationAgent._parse_response returns empty findings on malformed input (no traceback)."""


🔦🐛

Graceful fallback for ValidationAgent in a test; verify private parsing path behavior.

lampe-ci · 2026-03-05T21:37:48Z

+    """QuickReviewAgent._parse_response returns empty findings on malformed input (no traceback)."""
+    from lampe.review.workflows.quick_review.quick_review_agent import QuickReviewAgent
+
+    agent = QuickReviewAgent()


🔦🐛

Graceful fallback for QuickReviewAgent test; ensure compatibility with actual agent implementations.

Antoine added 3 commits February 28, 2026 00:41

feat: v2 improve prompt with more positive than negative

2032e56

try something

4a21560

Do something

871af95

AntoMontagneDev marked this pull request as ready for review March 5, 2026 21:34

lampe-ci Bot reviewed Mar 5, 2026

View reviewed changes

AntoMontagneDev closed this Mar 5, 2026

		@@ -0,0 +1,54 @@
		"""Prompt for hallucination filter — mutes comments that ask the user to investigate instead of stating verified bugs."""

		HALLUCINATION_FILTER_SYSTEM_PROMPT = """


		For each issue whose comment is an "investigation request" (see below), call mute_issue with its issue_id and reason "investigation_request". Do NOT mute comments that state a verified bug. Prefer issuing all mute_issue calls in a single response.

		# Mute These (investigation requests)

		assert parsed.findings[0]["line_number"] == 42


		def test_parse_validation_response_malformed_json_no_exception():



		def test_validation_agent_parse_response_graceful_fallback():
		"""ValidationAgent._parse_response returns empty findings on malformed input (no traceback)."""

Conversation

AntoMontagneDev commented Mar 5, 2026 • edited by lampe-ci Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔦 description

What change is being made?

Why are these changes being made?

Uh oh!

lampe-ci Bot Mar 5, 2026

Choose a reason for hiding this comment

🔦🐛

Uh oh!

lampe-ci Bot Mar 5, 2026

Choose a reason for hiding this comment

🔦🐛

Uh oh!

lampe-ci Bot Mar 5, 2026

Choose a reason for hiding this comment

🔦🐛

Uh oh!

lampe-ci Bot Mar 5, 2026

Choose a reason for hiding this comment

🔦🐛

Uh oh!

lampe-ci Bot Mar 5, 2026

Choose a reason for hiding this comment

🔦🐛

Uh oh!

lampe-ci Bot Mar 5, 2026

Choose a reason for hiding this comment

🔦🐛

Uh oh!

lampe-ci Bot Mar 5, 2026

Choose a reason for hiding this comment

🔦🐛

Uh oh!

lampe-ci Bot Mar 5, 2026

Choose a reason for hiding this comment

🔦🐛

Uh oh!

lampe-ci Bot Mar 5, 2026

Choose a reason for hiding this comment

🔦🐛

Uh oh!

lampe-ci Bot Mar 5, 2026

Choose a reason for hiding this comment

🔦🐛

Uh oh!

lampe-ci Bot Mar 5, 2026

Choose a reason for hiding this comment

🔦🐛

Uh oh!

lampe-ci Bot Mar 5, 2026

Choose a reason for hiding this comment

🔦🐛

Uh oh!

lampe-ci Bot Mar 5, 2026

Choose a reason for hiding this comment

🔦🐛

Uh oh!

lampe-ci Bot Mar 5, 2026

Choose a reason for hiding this comment

🔦🐛

Uh oh!

lampe-ci Bot commented Mar 5, 2026

Uh oh!

lampe-ci Bot Mar 5, 2026

Choose a reason for hiding this comment

🔦🐛

Uh oh!

lampe-ci Bot Mar 5, 2026

Choose a reason for hiding this comment

🔦🐛

Uh oh!

lampe-ci Bot Mar 5, 2026

Choose a reason for hiding this comment

🔦🐛

Uh oh!

lampe-ci Bot Mar 5, 2026

Choose a reason for hiding this comment

🔦🐛

AntoMontagneDev commented Mar 5, 2026 •

edited by lampe-ci Bot

Loading