Methodology

Standards before scale.

Campus Evidence Lab tracks public-source records related to campus civil-rights matters, identity-based conflict, public enforcement activity, and institutional responses. The dataset is documentation infrastructure, not a verdict system.

Current Scope

The current dataset covers U.S. campus civil-rights records across religion, race, ethnicity, national origin, sex, pregnancy, disability, and athletic-equity categories. Community labels are assigned only when the public source supports them, and they describe what is documented in the source record rather than an independent project finding.

Inclusion Criteria

Minimum Record Fields

A publishable event record must identify the school, date or date precision, location, category, affected community label, summary, description, source IDs, source types, verification status, confidence label, update date, and record hash. If a field cannot be supported by public source material, the record should remain unpublished or use an explicit limited-value field instead of inference.

Accepted Source Types

Source Hierarchy

Official records and public datasets receive the strongest evidentiary weight. Reliable journalism and campus newspaper reporting can support inclusion when they identify the school, date, event type, and source basis. Advocacy or nonprofit reports require clear methodology and public source support before they can be used.

Deduplication Logic

Records should not be duplicated because the same public matter appears in multiple sources. A second source should usually be attached to the existing record unless it documents a distinct date, public action, legal step, institutional response, or source-backed update. Structured public datasets may use scoped deduplication keys such as school, year, geography scope, offense, and bias-code so aggregate cells remain auditable without being merged incorrectly.

Event Category Definitions

Affected Community Definitions

Affected communities are assigned only when supported by public source material. The field identifies the community named in the source record; it does not imply that Campus Evidence Lab independently determined motive, legal liability, or institutional fault. Multiple communities may be listed when a public source identifies more than one affected group.

Exclusion Criteria

Verification Status

Verification describes the kind of public support behind a record. It does not determine legal truth and does not measure severity.

Confidence

Confidence describes source support, not moral severity.

Review Workflow

Correction Process

Corrections must identify the record ID, the disputed field, and a public source supporting the change. Accepted corrections update the changelog and produce a new record hash. Rejected corrections should preserve a short rationale in review notes before the project has a formal public review queue.

Reviewer Standard

A reviewer should be able to reproduce the record from public material without private context. Review checks include source availability, source type, date precision, school identity, category choice, affected community label, legal-status wording, privacy risk, duplicate risk, neutral language, and whether confidence describes source support instead of severity.

No Ranking System

Campus Evidence Lab does not rank schools by hate, safety, severity, or moral standing. Record counts reflect public-source availability, source discovery, reporting practices, school size, jurisdiction, institutional transparency, and reviewer capacity. If future coverage signals describe documentation density or source diversity, they should remain documentation signals rather than comparative school judgments.

Privacy Limits

The MVP does not collect private testimony or sensitive evidence. Public records should avoid naming private individuals unless a name is necessary to understand the public record and already appears in source material from a reliable public source.

AI Use

AI may assist with extraction, summarization, duplicate detection, and brief drafting. AI does not publish records, determine legal meaning, or assign final classification without human review. Human review is required before a record appears in the public dataset.

Integrity

Each event receives a deterministic record hash. Each dataset snapshot receives a snapshot hash. Hashes are used to make silent retroactive edits easier to detect.

The current manifest is published at data/snapshot-manifest.json. The current archived manifest is linked from the data downloads page. Weekly briefs reference the event dataset hash used for their record set.

Limitations

The dataset is not a complete census of campus civil-rights incidents. It reflects public-source availability, reviewer capacity, source discoverability, and the current scope of the MVP. Records should not be used as rankings, severity scores, prevalence measures, or proof of legal liability.

Versioning and Audit Policy

Public changes regenerate record hashes, dataset hashes, snapshot manifests, the snapshot index, CSV exports, source audit metadata, and the public changelog. Archived snapshot manifests are listed in data/snapshot-index.json. Before launch or major publication cycles, maintainers should run the advisory live source audit and manually review any failures.