BLR23:50:36
···18:20:36
00:00
X0.0000
Y0.0000
← Back to projects
In Development • Personal

ScholarOS - Research as Structured Execution

Five deterministic MCP services over a DAG orchestrator. Every claim bound to source evidence. No hallucinated synthesis in governance outputs.

5,479 Chunks processed
180 Claims extracted
76 Contradictions detected
100% Determinism rate

The Problem With Research Copilots

Generic AI tools applied to academic research produce fluent text with no evidence traceability. Hallucinated synthesis is structurally indistinguishable from grounded synthesis - you cannot tell from the output alone whether a claim was derived from the literature or confabulated. Every research workflow - literature review, contradiction detection, hypothesis validation, evidence extraction, proposal drafting - requires a separate manual process with no shared execution model.

Hypothesis stress-testing is the worst case: the same model that generated the hypothesis is asked to critique it. No adversarial challenge, no convergence gate, no provenance on the resulting claim. The output looks like analysis but carries none of the guarantees.

The Architecture

ScholarOS is a structured research execution platform - five capabilities delivered as a DAG-executed MCP workflow, not a chatbox. Every output is bound to source evidence. Nine services process research artifacts with rule-based, schema-defined, reproducible logic. No service imports another service - all data flows through the orchestrator via MCP tool invocations, eliminating hidden state.

The five capabilities: Literature Mapping (HDBSCAN clustering, LLM cluster labeling, paper ranking); Contradiction & Consensus (claim extraction, metric normalization, polarity/value divergence detection, Belief Engine confidence assignment); Hypothesis & Critique (bounded Hypothesis/Critic loop, max 5 iterations, grounded to source claim identifiers); Multimodal Evidence Extraction (tables, figures, metrics from PDFs to structured output); Proposal Assistant (validated hypotheses to Markdown/LaTeX with citation assembly).

Data layer: Chroma (vector), SQLite (metadata), Redis (session). Local inference: Ollama qwen2.5:32b, sentence-transformers all-MiniLM-L6-v2.

The Pipeline

The MCP Orchestrator executes workflows as DAGs with pause/resume, session management, and full trace logging. Only hypothesis critique is agentic - bounded to five iterations with required grounding to source claim identifiers. All other stages are deterministic with provenance tracked through typed artifacts.

100% determinism rate means identical inputs produce identical outputs. No stochastic processes in the deterministic pipeline. Nine independently testable services with no global state. Agent reasoning is explicitly bounded: convergence detection terminates the hypothesis loop before it runs unconstrained.

Outputs & Current State

Five output artifact types: ClusterMap (JSON), Contradiction Report (JSON), Validated Hypotheses (JSON), Research Proposals (Markdown • LaTeX), Extracted Evidence (CSV • JSON). Fully local and self-hostable - no external API dependency for any deterministic pipeline stage.

March 2026 validation run: 5,479 chunks processed, 180 claims extracted, 76 contradictions detected. Each claim is bound to source evidence. Contradiction detection marks where consensus breaks - keeping outputs falsifiable and useful beyond individual sessions.

↗ View on GitHub