Benchmark
Results
SQuAD with LLMLingua-2 compression.
BASELINE F1: 78.4
01
420
AVG TOKENS (IN)02
210
AVG TOKENS (OUT)03
78.4
F1 BASELINE04
76.1
F1 @ RATIO=0.5Preset Results3 PRESETS
| PRESET | TOKENS IN | TOKENS OUT | REDUCTION | F1 SCORE | F1 DROP |
|---|---|---|---|---|---|
| AGGRESSIVE (0.3) | 420 | 147 | ~65% | 73.4 | 5.0 PT |
| BALANCED (0.5) | 420 | 210 | ~50% | 76.1 | 2.3 PT |
| LIGHT (0.7) | 420 | 294 | ~30% | 77.6 | <1 PT |
METHODOLOGY: SQUAD · LLMLINGUA-2 · BASELINE F1: 78.4 · AVG LATENCY ~85MS