1. Travel Extractor
Runs Pack's real streaming extractor over Gmail-shaped messages and calendar events, then emits profile JSON for trips, cancellations, stale evidence, loyalty, preferences, and costs.
Benchmark pack-deeperbench-v0
Pack DeeperBench measures the complete workflow: extracting a household travel history from realistic email and calendar data, planning from a short human prompt, and selecting flights and hotels from large deterministic inventories.
Each phase is scored separately and then rolled into an end-to-end case score. Pack runs through its native code path; external systems run against the same tool protocol and result schemas.
Runs Pack's real streaming extractor over Gmail-shaped messages and calendar events, then emits profile JSON for trips, cancellations, stale evidence, loyalty, preferences, and costs.
Runs Pack's real planner on human-written requests with extracted family context, obligations, public-event timing, prior travel, and red-herring private context.
Executes deterministic flight and hotel search from Pack planner outlines, then scores seat fit, price, stops, refundability, room capacity, location, and preference match.
Correct trips, travelers, cancellations, changes, stale bookings, loyalty, and preference evidence.
Correct dates, travelers, public timing, calendar constraints, active context, and trip outline structure.
Whether selected flights and hotels actually match family seats, fare flexibility, price, rooms, and location preferences.
Evidence quality for inbox, calendar, public-event, planner, and search decisions.
Wall-clock time by extractor, planner, search, and full case execution.
Model calls, Pack phase costs, and local/AWS runner costs reported separately.
Book two weeks in Japan for @family in June 2027. Start with Tokyo and keep our usual two-room family split.
cd PackServer npm run bench:travel-context:pack-extractor-hartwell -- --email-task-concurrency 2 --out-dir tmp/hartwell-pack-real-extractor-full npm run bench:travel-context:pack-phased-hartwell -- --extraction-dir tmp/hartwell-pack-real-extractor-full --limit 100 --flight-count 1000000 --hotel-count 1000000Release repository coming next