Architecture Overview

Jnkn's architecture in one page.

System Diagram

graph TB
    subgraph Input
        F[Source Files]
        C[Config]
    end

    subgraph "Parsing Layer"
        PE[Parser Engine]
        PP[Python Parser]
        PT[Terraform Parser]
        PK[K8s Parser]
    end

    subgraph "Graph Layer"
        G[(Dependency<br/>Graph)]
        S[Storage<br/>SQLite]
    end

    subgraph "Stitching Layer"
        SE[Stitching Engine]
        M[Matchers]
        R[Rules]
        CC[Confidence<br/>Calculator]
    end

    subgraph "Analysis Layer"
        BR[Blast Radius]
        EX[Explainer]
        DF[Differ]
    end

    subgraph Output
        CLI[CLI]
        JSON[JSON]
        SARIF[SARIF]
    end

    F --> PE
    C --> PE
    PE --> PP & PT & PK
    PP & PT & PK --> G
    G <--> S
    G --> SE
    SE --> M & R
    M --> CC
    R --> CC
    CC --> G
    G --> BR & EX & DF
    BR & EX & DF --> CLI & JSON & SARIF

Components

Parsing Layer

Responsibility: Convert source files into nodes and edges.

Parser Engine — Orchestrates file discovery and parser dispatch
Language Parsers — Extract patterns from specific languages
Extractors — Detect specific code patterns (e.g., os.getenv)

Graph Layer

Responsibility: Store and query the dependency graph.

Dependency Graph — In-memory NetworkX graph for traversal
Storage — SQLite for persistence and incremental updates

Stitching Layer

Responsibility: Create cross-domain edges.

Stitching Engine — Evaluates all node pairs against rules
Matchers — Compute similarity between artifact names
Rules — Define which node types can be linked
Confidence Calculator — Score match quality

Analysis Layer

Responsibility: Answer questions about the graph.

Blast Radius — BFS traversal to find downstream impact
Explainer — Show why matches were made/rejected
Differ — Compare graphs across git refs

Data Flow

Scan

Files → Parsers → Nodes/Edges → Graph → Stitching → Cross-Domain Edges → Storage

Query

User Query → Graph Load → Traversal → Results → Output Format

Key Design Principles

Incremental by default — Only re-scan changed files
Explainable — Every match can be explained
Configurable — Thresholds and rules are tunable
Extensible — Add new parsers/rules without core changes
Fast — SQLite + in-memory graph for speed

Technology Choices

Component	Technology	Rationale
Parsing	Tree-sitter	Language-agnostic, accurate AST
Graph	NetworkX	Rich algorithms, Python-native
Storage	SQLite	Zero-config, portable, fast
CLI	Click	Standard for Python CLIs
Config	YAML	Human-readable, version-controllable