Skip to content

mediaite-ghostink

Deterministic scrape → extract → analyze → report — combining statistical stylometry, embedding drift, and optional probability and AI-baseline tracks.

A deterministic four-stage forensic pipeline over a WordPress newsroom corpus. It produces statistical and documentary signals — change-points in stylometric features, embedding drift, convergence to an “AI baseline” — that reviewers can inspect, reproduce, and audit. The outputs are signals, not legal findings or definitive attribution of authorship or tool use.

This project quantifies stylistic and structural shifts in published text. It does not identify the author of a piece, prove the use of a specific tool, or render a legal verdict. Reviewers should pair every reported signal with domain context, editorial records, and corroborating evidence before drawing conclusions.