Methods Library

40+ detection methods. Every one cited.

Search, filter and inspect every detector in the IPLYR forensic stack. Each method links to its paper and to a /methods slug page with implementation, test count, and adversarial-robustness notes.


Min-K%++

Daubert

Zhang et al. · ICLR 2025

Calibrated membership-inference attack against pretraining data. Compares per-token log-likelihood with neighborhood entropy to achieve state-of-the-art AUROC across the LLaMA, Pythia, and GPT-NeoX families. Decouples token informativeness from membership signal, producing a clean p-value under a permutation null.
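
The scoring step can be sketched as follows. This is a minimal illustration, not IPLYR's implementation. It follows the published Min-K%++ formulation: each token's log-probability is standardized by the mean and standard deviation of log-probabilities under the model's next-token distribution (the mean equals the negative entropy), and the lowest k% of standardized scores are averaged. The function name, the input layout, and the default k are assumptions made for the example.

```python
import math

def min_k_pp_score(log_probs, token_ids, k=0.2):
    """Average of the lowest k fraction of standardized token scores.

    log_probs: one list per position holding finite log-probabilities
               over the whole vocabulary (hypothetical input layout).
    token_ids: index of the observed token at each position.
    """
    scores = []
    for dist, tok in zip(log_probs, token_ids):
        probs = [math.exp(lp) for lp in dist]
        # Expected log-probability under the next-token distribution
        # (this is the negative entropy of that distribution).
        mu = sum(p * lp for p, lp in zip(probs, dist))
        var = sum(p * (lp - mu) ** 2 for p, lp in zip(probs, dist))
        sigma = math.sqrt(var)
        scores.append((dist[tok] - mu) / (sigma + 1e-12))
    n = max(1, int(len(scores) * k))
    lowest = sorted(scores)[:n]
    return sum(lowest) / n
```

Higher scores suggest the text is more likely to have been seen in training. Turning the score into a p-value still needs a reference distribution, such as the permutation null mentioned above.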

Duarte et al. · arXiv 2024

Detection of copyrighted content via paraphrase-distinguishing probes. Forces the model to choose between original passages and high-quality paraphrases; preference for the original is statistically attributable to memorization.

text

35 tests
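
The statistical step behind a forced-choice probe can be sketched as an exact one-sided binomial test. The example asks how likely it is that the model would pick the original passage at least this often if it were guessing among the options. The function name and the assumption that every question has the same number of options are illustrative, not taken from the method's code.

```python
import math

def binomial_tail(successes, trials, chance):
    """P(X >= successes) for X ~ Binomial(trials, chance).

    chance is the guessing rate, e.g. 1 / number_of_options
    (assumed equal for every question in this sketch).
    """
    return sum(
        math.comb(trials, i) * chance ** i * (1 - chance) ** (trials - i)
        for i in range(successes, trials + 1)
    )

# Hypothetical run: 40 questions with 4 options each,
# and the model picks the original passage 22 times.
p_value = binomial_tail(22, 40, 0.25)
```

A small p-value indicates a preference for the original that guessing alone is unlikely to produce. It does not rule out other explanations, such as paraphrases that are systematically easier to reject.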

nv-recall (Stanford Algorithm 1)

Daubert

Stanford NLP · Stanford 2024

Non-verbatim recall measurement. Replaces exact-match with semantic-equivalence scoring under a calibrated neighborhood, recovering memorization signal that survives stylistic rewriting and translation.

Cooper et al. · arXiv 2024

Probabilistic discoverable extraction with confidence bounds. Estimates the probability that a target string is recoverable from a model under a budgeted prompt distribution, with formal lower-bound guarantees.
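
The core arithmetic can be sketched as follows, assuming independent samples and a known per-sample probability of emitting the target string. If one sample reproduces the target with probability p, then at least one of n samples does so with probability 1 - (1 - p)^n. All function names are hypothetical, and the confidence bounds mentioned above are not reproduced here.

```python
import math

def sequence_probability(token_log_probs):
    """Probability of emitting the whole target, given the per-token
    log-probabilities of the target under the sampling scheme."""
    return math.exp(sum(token_log_probs))

def extraction_probability(p_single, n_samples):
    """Chance that at least one of n independent samples is the target."""
    return 1.0 - (1.0 - p_single) ** n_samples

def samples_needed(p_single, target):
    """Smallest n whose extraction probability reaches `target`."""
    if not (0.0 < p_single < 1.0 and 0.0 < target < 1.0):
        raise ValueError("probabilities must lie strictly between 0 and 1")
    return math.ceil(math.log(1.0 - target) / math.log(1.0 - p_single))
```

For example, a string with a one-in-a-thousand per-sample probability needs `samples_needed(0.001, 0.5)` samples before extraction becomes more likely than not.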

Park et al. · ICML 2023

Training-data attribution at scale via random-projection of gradients. Returns a per-training-example influence score on a target query.

Park et al. (extension) · ICML 2024

Scalable TRAK variant for billion-parameter diffusion models. Uses distilled gradient surrogates and locality-sensitive hashing for tractable attribution across catalog-scale training indices.

image

41 tests

LoRA Probes

IPLYR Internal · Internal 2025

Low-rank adapter probes that fingerprint memorization-prone parameter subspaces. Surfaces circuits where training-data residue concentrates.

IPLYR + Speech-MIA literature · ICASSP-adjacent 2024

Speech and music MIA combining acoustic loss-curvature, mel-spectrogram divergence, and waveform-level NN-distance.

audio

29 tests

CSD style fingerprinting

Daubert

Somepalli et al. · NeurIPS 2023

Contrastive style descriptors capturing artist-specific visual fingerprints invariant to subject matter. Validated for Getty / artist-style cases.

image

22 tests

MelodySim

Music-MIR consortium · ISMIR 2024

Melodic similarity across pitch-class profiles and rhythmic envelopes for music memorization claims.

audio

16 tests

STALL video forensics

Video-Forensics WG · CVPR-W 2024

Spatiotemporal action-level latent localization for short-form video memorization detection.

Merge-forensics literature · arXiv 2024

Forensic detection of model merges and lineage via centered kernel alignment between candidate parent models and a target.

text · image

33 tests
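
Linear centered kernel alignment, the simplest CKA variant, can be sketched as below. Which layers are compared, how activations are collected, and whether a linear or kernel variant is used in practice are not stated above, so the input layout here is an assumption: each matrix holds one row per probe input and one column per feature.

```python
import math

def _center(m):
    """Subtract each column's mean."""
    cols = len(m[0])
    means = [sum(row[j] for row in m) / len(m) for j in range(cols)]
    return [[row[j] - means[j] for j in range(cols)] for row in m]

def _cross_fro_sq(a, b):
    """Squared Frobenius norm of a^T b."""
    total = 0.0
    for i in range(len(a[0])):
        for j in range(len(b[0])):
            s = sum(ra[i] * rb[j] for ra, rb in zip(a, b))
            total += s * s
    return total

def linear_cka(x, y):
    """Linear CKA between activation matrices that share the same rows.

    Returns a value in [0, 1]; undefined (division by zero) if either
    matrix is constant across rows.
    """
    xc, yc = _center(x), _center(y)
    num = _cross_fro_sq(xc, yc)
    den = math.sqrt(_cross_fro_sq(xc, xc) * _cross_fro_sq(yc, yc))
    return num / den
```

Values near 1 for a candidate parent and near 0 for unrelated models would be consistent with shared lineage. A threshold still has to be calibrated against models with known ancestry.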

Multi-teacher MIA ensemble

Daubert

IPLYR + statistical combination literature · SaTML 2025

Fisher / Stouffer / Simes / harmonic-mean / Cauchy combiners over independent MIA detectors. Patentable ensemble with a top novelty rating.

text · image · audio

87 tests
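
The five combiners named above can be sketched with the standard library alone. These are textbook definitions, not IPLYR's ensemble. They assume every p-value lies strictly between 0 and 1, and the Fisher and Stouffer forms additionally assume the detectors are independent. The raw harmonic mean is shown without the calibration step a valid test would need.

```python
import math
from statistics import NormalDist

def fisher(pvals):
    # -2 * sum(log p) follows chi-square with 2k degrees of freedom;
    # for even degrees the survival function has a closed form.
    half = -sum(math.log(p) for p in pvals)
    term, total = 1.0, 1.0
    for i in range(1, len(pvals)):
        term *= half / i
        total += term
    return math.exp(-half) * total

def stouffer(pvals):
    nd = NormalDist()
    z = sum(nd.inv_cdf(1.0 - p) for p in pvals) / math.sqrt(len(pvals))
    return 1.0 - nd.cdf(z)

def simes(pvals):
    k = len(pvals)
    return min(1.0, min(k * p / (i + 1) for i, p in enumerate(sorted(pvals))))

def cauchy(pvals):
    t = sum(math.tan((0.5 - p) * math.pi) for p in pvals) / len(pvals)
    return 0.5 - math.atan(t) / math.pi

def harmonic_mean(pvals):
    # Raw harmonic mean p-value; uncalibrated in this sketch.
    return len(pvals) / sum(1.0 / p for p in pvals)
```

The combiners differ mainly in how they treat one very small p-value among several unremarkable ones, so reporting more than one of them makes the sensitivity of a conclusion visible.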

MCID concept inference

Daubert

Dubiński et al. · CVPR 2025

Collective dataset inference at the concept level for image distributions.

image

24 tests

RAG two-stage detector

IPLYR Internal · Internal 2025

Two-stage detector that disentangles RAG-retrieval recall from parametric memorization.

text

21 tests

Concept memorization (MMD / KS / energy)

Daubert

Statistical literature · Various 2023

Two-sample tests (MMD, Kolmogorov–Smirnov, energy distance) for concept-level recall in generative models.

text · image

31 tests
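
Two of the named statistics can be sketched for one-dimensional samples, together with a permutation p-value. Real use would operate on embedding vectors and a kernel or distance chosen for the modality. The scalar inputs, function names, and permutation count here are simplifying assumptions.

```python
import random

def ks_statistic(a, b):
    """Largest gap between the two empirical CDFs."""
    a, b = sorted(a), sorted(b)
    i = j = 0
    d = 0.0
    while i < len(a) and j < len(b):
        x = min(a[i], b[j])
        while i < len(a) and a[i] <= x:
            i += 1
        while j < len(b) and b[j] <= x:
            j += 1
        d = max(d, abs(i / len(a) - j / len(b)))
    return d

def energy_distance(a, b):
    """2 E|X-Y| - E|X-X'| - E|Y-Y'| over all pairs (V-statistic form)."""
    def mean_abs(u, v):
        return sum(abs(x - y) for x in u for y in v) / (len(u) * len(v))
    return 2 * mean_abs(a, b) - mean_abs(a, a) - mean_abs(b, b)

def permutation_pvalue(a, b, stat, n_perm=999, seed=0):
    """Share of random relabelings whose statistic reaches the observed one."""
    rng = random.Random(seed)
    observed = stat(a, b)
    pooled = list(a) + list(b)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)
        if stat(pooled[:len(a)], pooled[len(a):]) >= observed:
            hits += 1
    return (hits + 1) / (n_perm + 1)
```

Here `a` might hold similarity scores of model outputs to a protected concept and `b` the same scores for a reference model. A small p-value then indicates that the two distributions differ.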

Textual Inversion forensics

Daubert

IPLYR Internal (patentable) · Internal 2025

Diffusion-model attribution via inverted embedding probes. Recovers a pseudo-token that best activates the target style; the statistical significance of its activation versus distractors yields an attribution score. Patentable invention, top novelty rating.

image

19 tests

SSCD + DINOv2 image attribution

Daubert

Pizzi et al. + Oquab et al. · ICCV 2023

Self-supervised copy detection embeddings combined with DINOv2 features for image-level provenance and near-duplicate detection.

image

28 tests

MUSE 6-axis unlearning verifier

Daubert

MUSE benchmark consortium · NeurIPS 2024

Six-axis verifier for unlearning and deletion claims: verbatim memorization, knowledge memorization, privacy leakage, utility preservation, scalability, and sustainability.

Privacy auditing literature · USENIX Security 2024

Tight empirical lower bound on the differential-privacy epsilon of a deployed model via membership-inference adversary.

text · image

17 tests
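
The link between a membership-inference adversary and epsilon can be sketched from the standard hypothesis-testing view of differential privacy. Under (epsilon, delta)-DP, any membership test with false-positive rate FPR and false-negative rate FNR must satisfy FPR + e^epsilon * FNR >= 1 - delta and FNR + e^epsilon * FPR >= 1 - delta. Solving for epsilon gives the point estimate below. The function name is hypothetical. A statistically valid audit would replace the observed rates with upper confidence limits, which this sketch omits.

```python
import math

def epsilon_lower_bound(fpr, fnr, delta=0.0):
    """Point-estimate lower bound on epsilon from attack error rates.

    Error rates of exactly zero are skipped rather than reported as an
    infinite bound; use upper confidence limits for real audits.
    """
    candidates = [0.0]
    if fpr > 0 and 1 - delta - fnr > 0:
        candidates.append(math.log((1 - delta - fnr) / fpr))
    if fnr > 0 and 1 - delta - fpr > 0:
        candidates.append(math.log((1 - delta - fpr) / fnr))
    return max(candidates)
```

An attack no better than chance, with both error rates at 0.5, yields a bound of 0, while an attack with 10% error on both sides implies epsilon of at least ln 9.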

Known limits — disclosed

· 8th Circuit case-law coverage is currently 0 cases; flagged as a gap for the next dataset expansion.
· Audio and video modules (MelodySim, STALL) are Tier 2–3; full Daubert-tier hardening is in progress.
· Reports are evidentiary inputs, not legal conclusions. Daubert-tier classification reflects the current SaTML 2025 framework; ultimate admissibility is the court's determination.