Results

`results`

Result models for Agent-friendly JSON output.

`AuditResult`

Bases: BaseModel

Aggregated result of a project audit.

Contains all individual check results and computed summary. quality_score and grade may be passed explicitly (e.g. in tests); otherwise they are computed from checks automatically.

Source code in packages/axm-audit/src/axm_audit/models/results.py

Python
class AuditResult(BaseModel):  # type: ignore[explicit-any]  # pydantic synthesizes __init__(**data: Any)
    """Aggregated result of a project audit.

    Contains all individual check results and computed summary.
    ``quality_score`` and ``grade`` may be passed explicitly (e.g. in
    tests); otherwise they are computed from checks automatically.
    """

    project_path: str | None = Field(
        default=None, description="Path to the audited project"
    )
    checks: list[CheckResult] = Field(default_factory=list)

    @computed_field  # type: ignore[prop-decorator]
    @property
    def success(self) -> bool:
        """True if all checks passed."""
        return all(c.passed for c in self.checks)

    @computed_field  # type: ignore[prop-decorator]
    @property
    def total(self) -> int:
        """Total number of checks."""
        return len(self.checks)

    @computed_field  # type: ignore[prop-decorator]
    @property
    def failed(self) -> int:
        """Number of failed checks."""
        return sum(1 for c in self.checks if not c.passed)

    @computed_field  # type: ignore[prop-decorator]
    @property
    def quality_score(self) -> float | None:
        """Weighted average across 9 code-quality categories.

        Categories and weights:
            Linting (15%), Type Safety (15%), Complexity (15%),
            Testing (10%), Test Quality (10%), Security (10%),
            Dependencies (10%), Architecture (10%), Practices (5%).

        Structure and tooling emit findings but are NOT scored
        (structure is handled by axm-init; tooling is informational).
        Returns None if no scored checks are present.
        """
        category_scores = collect_category_scores(self.checks)
        if not category_scores:
            return None

        # Weighted average: avg each category, then weight.
        # Normalize by sum of present weights so filtered audits
        # (e.g. category="lint") are not penalized for missing categories.
        total = sum(
            (sum(scores) / len(scores)) * _CATEGORY_WEIGHTS[cat]
            for cat, scores in category_scores.items()
        )
        weight_sum = sum(_CATEGORY_WEIGHTS[cat] for cat in category_scores)
        if weight_sum <= 0:
            return None
        return round(total / weight_sum, 1)

    @computed_field  # type: ignore[prop-decorator]
    @property
    def crashed_rules(self) -> list[str]:
        """rule_ids whose check raised, in encounter order.

        Crashed rules contribute ``score=0`` to their scored category (they
        are never silently dropped); this field makes a degraded audit
        traceable rather than silent.
        """
        return [c.rule_id for c in self.checks if _is_crashed(c)]

    @computed_field  # type: ignore[prop-decorator]
    @property
    def grade(self) -> str | None:
        """Letter grade derived from quality_score.

        A >= 90, B >= 80, C >= 70, D >= 60, F < 60.
        Returns None if quality_score is None.
        """
        score = self.quality_score
        if score is None:
            return None
        _thresholds = [
            (_GRADE_A, "A"),
            (_GRADE_B, "B"),
            (_GRADE_C, "C"),
            (_GRADE_D, "D"),
        ]
        return next((g for t, g in _thresholds if score >= t), "F")

    model_config = ConfigDict(extra="forbid")

`crashed_rules` `property`

rule_ids whose check raised, in encounter order.

Crashed rules contribute score=0 to their scored category (they are never silently dropped); this field makes a degraded audit traceable rather than silent.

`failed` `property`

Number of failed checks.

`grade` `property`

Letter grade derived from quality_score.

A >= 90, B >= 80, C >= 70, D >= 60, F < 60. Returns None if quality_score is None.

`quality_score` `property`

Weighted average across 9 code-quality categories.

Categories and weights

Linting (15%), Type Safety (15%), Complexity (15%), Testing (10%), Test Quality (10%), Security (10%), Dependencies (10%), Architecture (10%), Practices (5%).

Structure and tooling emit findings but are NOT scored (structure is handled by axm-init; tooling is informational). Returns None if no scored checks are present.

`success` `property`

True if all checks passed.

`total` `property`

Total number of checks.

`CheckResult`

Bases: BaseModel

Result of a single compliance check.

Designed for machine parsing by AI Agents.

Source code in packages/axm-audit/src/axm_audit/models/results.py

Python
class CheckResult(BaseModel):  # type: ignore[explicit-any]  # pydantic synthesizes __init__(**data: Any)
    """Result of a single compliance check.

    Designed for machine parsing by AI Agents.
    """

    rule_id: str = Field(..., description="Unique identifier for the rule")
    passed: bool = Field(..., description="Whether the check passed")
    message: str = Field(..., description="Human-readable result message")
    severity: Severity = Field(default=Severity.ERROR, description="Severity level")
    details: dict[str, object] | None = Field(
        default=None, description="Structured data (cycles, metrics)"
    )
    text: str | None = Field(
        default=None, description="Pre-rendered detail text for display"
    )
    fix_hint: str | None = Field(default=None, description="Actionable fix suggestion")
    category: str | None = Field(
        default=None, description="Scoring category (injected by auditor)"
    )
    metadata: dict[str, object] = Field(
        default_factory=dict,
        description="Rule-specific structured payload (clusters, verdicts, ...)",
    )
    score: int | None = Field(
        default=None,
        ge=0,
        le=100,
        description="Numeric score in [0, 100] for scored categories",
    )

    model_config = ConfigDict(extra="forbid")

`Severity`

Bases: StrEnum

Severity level for check results.

Source code in packages/axm-audit/src/axm_audit/models/results.py

Python
class Severity(StrEnum):
    """Severity level for check results."""

    ERROR = "error"  # Blocks audit pass
    WARNING = "warning"  # Non-blocking issue
    INFO = "info"  # Informational only

`collect_category_scores(checks)`

Group valid scores by category, filtering out unusable checks.

Reads the typed CheckResult.score field. When a check belongs to a scored category but exposes score is None, a warning is emitted naming the rule_id so that silently-dropped scores remain observable.

Source code in packages/axm-audit/src/axm_audit/models/results.py

Python
def collect_category_scores(
    checks: Sequence[CheckResult],
) -> dict[str, list[float]]:
    """Group valid scores by category, filtering out unusable checks.

    Reads the typed ``CheckResult.score`` field. When a check belongs to a
    scored category but exposes ``score is None``, a warning is emitted naming
    the rule_id so that silently-dropped scores remain observable.
    """
    category_scores: dict[str, list[float]] = {}
    for check in checks:
        cat = check.category
        if not cat or cat not in _CATEGORY_WEIGHTS:
            continue
        if check.score is None:
            _logger.warning(
                "rule %s in scored category %r returned score=None; dropped",
                check.rule_id,
                cat,
            )
            continue
        category_scores.setdefault(cat, []).append(float(check.score))
    return category_scores

`crash_check_result(rule_id, category, message, *, details=None)`

Build the canonical CheckResult for a rule whose check raised.

A crash in a scored category must penalize that category, not vanish from the composite. The crash is encoded explicitly as score=0 plus a crash marker in metadata (the discriminant is crash-vs-absent, not None-vs-not-None), so quality_score averages the 0 in and AuditResult.crashed_rules can surface the rule for traceability.

Source code in packages/axm-audit/src/axm_audit/models/results.py

Python
def crash_check_result(
    rule_id: str,
    category: str | None,
    message: str,
    *,
    details: dict[str, object] | None = None,
) -> CheckResult:
    """Build the canonical ``CheckResult`` for a rule whose check raised.

    A crash in a scored category must **penalize** that category, not vanish
    from the composite. The crash is encoded explicitly as ``score=0`` plus a
    crash marker in ``metadata`` (the discriminant is crash-vs-absent, not
    ``None``-vs-not-``None``), so ``quality_score`` averages the 0 in and
    ``AuditResult.crashed_rules`` can surface the rule for traceability.
    """
    return CheckResult(
        rule_id=rule_id,
        passed=False,
        message=message,
        severity=Severity.ERROR,
        fix_hint="Check rule configuration and dependencies",
        category=category,
        details=details,
        metadata={_CRASH_MARKER_KEY: True},
        score=0,
    )

`format_categories_help()`

Return a multi-line bullet list of categories with one-line summaries.

Source of truth for both the AuditTool MCP description and the CLI --category help text.

Source code in packages/axm-audit/src/axm_audit/models/results.py

Python
def format_categories_help() -> str:
    """Return a multi-line bullet list of categories with one-line summaries.

    Source of truth for both the ``AuditTool`` MCP description and the
    CLI ``--category`` help text.
    """
    all_cats = SCORED_CATEGORIES | EXTRA_NONSCORED_CATEGORIES
    lines = [f"- {cat}: {CATEGORY_SUMMARIES[cat]}" for cat in sorted(all_cats)]
    return "\n".join(lines)

Results

results

AuditResult

crashed_rules property

failed property

grade property

quality_score property

success property

total property

CheckResult

Severity

collect_category_scores(checks)

crash_check_result(rule_id, category, message, *, details=None)

format_categories_help()

`results`

`AuditResult`

`crashed_rules` `property`

`failed` `property`

`grade` `property`

`quality_score` `property`

`success` `property`

`total` `property`

`CheckResult`

`Severity`

`collect_category_scores(checks)`

`crash_check_result(rule_id, category, message, *, details=None)`

`format_categories_help()`