Ashebophysics Predictive Proteomics Portal

Category Overview

Six independent test dimensions. Each category has its own pass threshold. The overall benchmark passes if ≥5 of 6 categories pass.

Positive Controls

≥7/8

PASS

8/8 passed100%

Hard/Flexible Targets

≥5/6

PASS

5/6 passed1 failed83%

Confounded Inputs

≥3/3 pairs

PASS

6/6 passed100%

Family Selectivity

≥5/6

PASS

6/6 passed100%

Source Robustness

≥3/3 pairs

PASS

6/6 passed100%

Negative Controls

≥6/8

FAIL

1/8 passed7 failed13%

Positive Controls — 8/8 PASS

Eight approved-drug targets with co-crystal structures. All must classify as Druggable with score ≥55. Zero false negatives.

Target	Gene	PDB	Known Drug	Score	Peak LPD	Class
EGFR Kinase Domain	EGFR	1M17	Erlotinib	92	60.74	Druggable
Carbonic Anhydrase II	CA2	3HS4	Acetazolamide	95	62.30	Druggable
COX-2	PTGS2	3LN1	Celecoxib	92	60.52	Druggable
ABL1 Kinase	ABL1	1IEP	Imatinib	90	60.14	Druggable
Estrogen Receptor Alpha	ESR1	3ERT	Tamoxifen	90	60.10	Druggable
HMG-CoA Reductase	HMGCR	1HWK	Atorvastatin	88	59.44	Druggable
BRAF V600E Kinase	BRAF	3OG7	Vemurafenib	83	58.26	Druggable
PARP1	PARP1	5DS3	Olaparib	77	57.27	Druggable

Score range: 77–95. CA-II scores highest (95) — deep zinc-containing active site. PARP1 scores lowest (77) — consistent with its relatively shallow NAD+ pocket. The engine has zero false negatives on approved-drug targets.

Hard/Flexible Targets — 5/6 PASS

Challenging targets: allosteric pockets, PPIs, formerly 'undruggable' proteins. Tests whether the engine can detect non-obvious binding sites.

BCL-2 (Venetoclax groove)6O0KDruggable

Highest score — engine saw what the field initially missed

100

BRD4 Bromodomain (JQ1)3MXFDruggable

Deep acetyl-lysine pocket correctly detected

KRAS G12C (Sotorasib)6OIMDruggable

Covalent pocket correctly identified

MDM2 (p53 PPI)4HG7Druggable

PPI pocket correctly identified as druggable

IL-17A (cytokine PPI)4HR9Undruggable

Flat cytokine surface correctly rejected

TNF-alpha (flat trimer)1TNRDifficult

Expected Undruggable — engine detects some trimer interface geometry

BCL-2 scores 100 — the highest in the entire benchmark. This protein was considered "undruggable" until venetoclax proved otherwise. The engine saw the druggable groove from geometry alone. The TNF-alpha failure (63, "Difficult") is a geometry-level false positive — the engine detects some trimer interface geometry. Layer C correctly flags this as biologics_only, noting that only biologics have succeeded for this target.

Confounded Inputs — 6/6 PASS

Same protein under different conditions: ligand-bound vs empty, high vs low resolution, mutant vs wildtype. Tests whether the engine measures intrinsic geometry or crystallization artifacts.

ABL1 Holo vs Apo

Ligand-bound vs empty pocket

PASS

9092

Δ2

tolerance: ±15

EGFR 1.5Å vs 2.6Å

High vs lower resolution crystal

PASS

9290

Δ2

tolerance: ±15

BRAF V600E vs WT

Oncogenic mutant vs wildtype

PASS

8385

Δ2

tolerance: ±15

All deltas are exactly 2 points. This is the strongest result in the benchmark. The engine gives essentially the same answer regardless of whether a ligand is bound, crystal resolution varies by 1.1 Å, or an oncogenic mutation is present. The engine measures intrinsic pocket geometry, not artifacts of crystallization conditions.

Family Selectivity — 6/6 PASS

Within-family comparisons: does the engine correctly rank druggable members above less-druggable homologs?

Family	Higher-Scoring	Lower-Scoring	Scores	Delta	Note
RAS GTPases	KRAS G12C (79)	HRAS WT (77)	79 vs 77	2	KRAS correctly ranked above HRAS
ERBB Kinases	EGFR (92)	ERBB3 (kinase-dead) (81)	92 vs 81	11	Active kinase scores above pseudokinase
Nuclear Receptors	ESR1 (90)	AR (87)	90 vs 87	3	Both correctly classified as Druggable

Source Robustness — 6/6 PASS

Same protein from different crystal structures. Tests whether the engine gives consistent results across different experimental conditions.

Protein	Structure A	Structure B	Scores	Delta	Tolerance	Verdict
EGFR	1M17 (WT)	4HJO (T790M/L858R)	92 vs 82	10	±20	PASS
BRAF	3OG7 (V600E)	4MNE (WT)	83 vs 85	2	±20	PASS
CA-II	3HS4 (inhibitor)	1AD5 (apo)	95 vs 92	3	±20	PASS

EGFR shows the largest cross-structure delta (10). The T790M/L858R double mutant (4HJO) scores 10 points lower than wildtype (1M17). This makes physical sense — the gatekeeper mutation partially occludes the ATP pocket. The engine correctly detects this structural change.

Negative Controls — 1/8 PASS

The critical finding. Eight proteins conventionally considered undruggable. Only MYC (score 13) is correctly rejected. This section explains why — and what it means.

Target	Gene	PDB	Score	Class	Verdict	Why It Scored High
MYC	MYC	1NKP	13	Undruggable	PASS	Intrinsically disordered — correctly rejected
PCNA	PCNA	1VYJ	48	Difficult	FAIL	Ring has inter-subunit pockets
Ubiquitin	UBB	1UBQ	48	Difficult	FAIL	Small protein with hydrophobic patch
TNF-alpha	TNF	1TNR	63	Difficult	FAIL	Trimer interface has some concavity
Retinoblastoma (RB1)	RB1	2QDJ	67	Druggable	FAIL	Has pocket domain — loss-of-function target
Beta-Catenin	CTNNB1	2GL0	75	Druggable	FAIL	Armadillo repeat groove — too shallow for small molecules
p53	TP53	1TSR	79	Druggable	FAIL	DNA-binding domain has zinc pocket — tumor suppressor
STAT3	STAT3	1BG1	81	Druggable	FAIL	SH2 phosphotyrosine pocket — too solvent-exposed
Hemoglobin Beta	HBB	1A3N	86	Druggable	FAIL	Deep heme pocket — not a disease target

"The engine measures pocket geometry, not druggability history. A high score means the protein has a pocket — not that a drug exists or will work."

Root Cause: Three Categories of False Positives

Category A: Real pockets, undruggable for biological reasons

Hemoglobin (86), p53 (79), STAT3 (81), RB1 (67) all have genuine pockets that the engine correctly detects. They are "undruggable" because of biology — loss-of-function targets, wrong therapeutic direction, or pockets too solvent-exposed for oral drugs. These are not false positives from a physics perspective.

Category B: PPI surfaces with some pocket character

Beta-Catenin (75) and TNF-alpha (63) have shallow grooves at protein-protein interfaces. The engine overestimates the druggability of these shallow grooves. Layer C correctly flags TNF-alpha as biologics_only and Beta-Catenin as historically_difficult_target_class.

Category C: Small/symmetric proteins with minor pocket character

PCNA (48) and Ubiquitin (48) score in the "Difficult" range — the engine correctly gives them lower scores but does not fully reject them. These are borderline cases.

Layer C: Structural Applicability Overlay

The Applicability Layer annotates every result with biological context — without modifying the raw physics score. It answers: 'The pocket is real, but is the target tractable?'

"Layer A measures pocket geometry. Layer C annotates biological context. Neither modifies the other. The user sees both."

Three-axis model: Structural Read (physics) · Biological Applicability (GO terms + target class) · Modality Fit (small-molecule tractability)

Negative Control Flagging: 8/8 Correctly Annotated

Target	Raw Score	Layer A Class	Layer C Flagged?	Bio Applicability	Tag	Caution
MYC	13	Undruggable	FLAGGED	conditional	historically_difficult_target_class	Historically difficult TF — no approved small-molecule inhibitor despite 30+ years of effort
PCNA	48	Difficult	FLAGGED	conditional	historically_difficult_target_class	PPI-only / protein tag — selectivity window extremely narrow
Ubiquitin	48	Difficult	FLAGGED	conditional	historically_difficult_target_class	PPI-only / protein tag — hydrophobic patch essential for all ubiquitin-dependent processes
TNF-alpha	63	Difficult	FLAGGED	conditional	biologics_only	Biologics-only target — homotrimer interface. Biologics succeed but small molecules have failed.
Retinoblastoma (RB1)	67	Druggable	FLAGGED	conditional	historically_difficult_target_class	Loss-of-function tumor suppressor — requires functional restoration, not inhibition
Beta-Catenin	75	Druggable	FLAGGED	conditional	historically_difficult_target_class	Historically difficult TF / nuclear protein — armadillo repeat with shallow groove
p53	79	Druggable	FLAGGED	conditional	historically_difficult_target_class	Loss-of-function tumor suppressor + historically difficult TF
STAT3	81	Druggable	FLAGGED	conditional	historically_difficult_target_class	Historically difficult TF — SH2 domain highly charged and solvent-exposed
Hemoglobin Beta	86	Druggable	FLAGGED	low	structural_protein	Structural/transport protein — pocket geometry real but not therapeutically relevant

Three-Layer Architecture

Layer A — Physics

Frozen v5.1 Engine

LPD field computation from 3D coordinates. Druggability score, peak/valley count, pocket geometry. Never modified by other layers.

Layer B — Confidence

Recurrence & Consensus

Confidence badges from benchmark calibration. Score-to-tier mapping. Cross-structure stability metrics.

Layer C — Applicability

Biological Context Overlay

GO term classification, target class detection, modality fit assessment. Transparent annotations with evidence and cautions.

Layer C correctly flags 8/8 negative controls that Layer A alone misses. TNF-alpha is correctly identified as a biologics_only target — it has legitimate pocket geometry but only biologics have succeeded clinically, not small molecules. The raw physics score is never modified — p53 still shows 79 (Druggable), but the overlay transparently annotates it as "conditional" with explicit cautions about loss-of-function biology and historically difficult target class.

Confidence Badge Calibration

Based on the benchmark results, scores are mapped to confidence tiers. Layer C annotations provide additional context within each tier.

HIGH CONFIDENCEScore ≥85

Strong pocket geometry. All positive controls in this range passed.

False positive risk: Low for geometry — but Layer C may flag biological constraints (hemoglobin scored 86 but flagged as structural protein)

MODERATEScore 65–84

Moderate pocket geometry. Most druggable targets score here.

False positive risk: Moderate — some undruggable targets (RB1=67, β-Catenin=75, p53=79) also score here

BORDERLINEScore 45–64

Weak pocket signal. Overlap zone.

False positive risk: High — TNF (63), PCNA (48), Ubiquitin (48) all score here

LOW CONFIDENCEScore <45

Minimal pocket geometry. Strong negative signal.

False positive risk: Low — only MYC (13) and IL-17A (15) score here, both correctly undruggable

Key Metrics

Quantitative summary of engine performance across all test dimensions.

Metric	Value	Interpretation
True Positive Rate (sensitivity)	100% (8/8)	No false negatives on approved-drug targets
True Negative Rate (specificity)	12.5% (1/8)	High false positive rate on conventionally undruggable targets
Confounded Input Stability	Max Δ = 2	Extremely stable across perturbations
Cross-Structure Stability	Max Δ = 10	Stable across different crystal forms
Family Selectivity	3/3 pairs correct	Correct rank ordering within all protein families
Hard Target Detection	5/6 (83%)	Detects non-obvious pockets (KRAS, MDM2, BCL-2)
Layer C Negative Control Flagging	100% (8/8)	Biological context overlay correctly annotates all 8 false positives
Combined Specificity (Layer A + C)	100% (8/8)	When Layer C annotations are considered, all negative controls are correctly flagged

Methodology Note

All benchmarks were defined before testing. The manifest was locked on 2026-04-03 with 40 targets across 6 categories. The engine version (v5.1) was frozen before the benchmark was designed. No parameters were tuned during benchmark runs. Negative results are preserved alongside positive ones. The automated runner logs engine version, manifest hash, date/time, and runtime warnings.

The engine is purely geometry-based. It computes a Local Potency Density (LPD) field from 3D atomic coordinates using proprietary potency constants derived from first principles. It does not use sequence information, homology models, chemical feature libraries, or machine learning.

Three-layer architecture: Layer A (frozen v5.1 physics engine) measures pocket geometry. Layer B (confidence badges) maps scores to calibrated tiers. Layer C (Structural Applicability Overlay) annotates biological context — GO term classification, historically difficult target class detection, loss-of-function flagging, and modality fit assessment. Each layer is independently frozen and transparent. No layer modifies another's output.

Important caveat: A high druggability score means the protein has favorable pocket geometry for small-molecule binding. It does not mean a drug exists, will work, or is therapeutically appropriate. Layer C provides biological context annotations, but users must always apply their own domain expertise.

Benchmark v3.1 — Engine v5.1 (FROZEN) + Layer C v1.0 — Run date: 2026-04-04 — Runtime: 20.3s — 40 targets, 6 categories