Benchmark
Results

SQuAD with LLMLingua-2 compression.

BASELINE F1: 78.4
01
420
AVG TOKENS (IN)
02
210
AVG TOKENS (OUT)
03
78.4
F1 BASELINE
04
76.1
F1 @ RATIO=0.5
Preset Results3 PRESETS
PRESETTOKENS INTOKENS OUTREDUCTIONF1 SCOREF1 DROP
AGGRESSIVE (0.3)420147~65%73.45.0 PT
BALANCED (0.5)420210~50%76.12.3 PT
LIGHT (0.7)420294~30%77.6<1 PT
METHODOLOGY: SQUAD · LLMLINGUA-2 · BASELINE F1: 78.4 · AVG LATENCY ~85MS