Concepts
- Duration Estimate: Mean or typical time required to complete a task.
- Range Estimate: Time bounds under known conditions.
- Preference-Aware Adjustment: User-corrected duration refined through feedback.
- Task-Instance Memory: Each observed duration stored as a discrete trace.
Test Suite Overview
Chronologue supports structured testing across domain-specific timing datasets:- Cooking Tasks – Vary by temperature, ingredient, and equipment.
- Cleaning Tasks – Vary by square footage and mess level.
- Exercise Tasks – Vary by capacity, intensity, and environment.
- Travel Tasks – Vary by stops, mode, and traffic conditions.
Feedback Logging and User Correction
Duration estimation improves with user input. Logging outcomes helps refine future prompts. Define per-user profile:“Last time you used 14 minutes for salmon and it was overcooked. Setting timer to 12 min.”
Conditional and Multi-Step Timing Chains
Support for conditional cooking logic:If skin-on → roast 1 lb salmon at 450°F for 10–12 minThis allows structured timing chains:
If skinless → check doneness at 9 min
- Initial estimate
- Mid-task check
- Final adjustment
- Timer execution
Integrating with Chronologue Systems
Theduration/
module integrates across the Chronologue architecture:
- .ics Generator – Computes
DTSTART
,DURATION
, andDTEND
- Planner (MCP) – Allocates time blocks based on learned durations
- Memory Module – Stores task-instance traces for retrieval
- Feedback Logger – Updates model and user profiles from evaluations
Training Preference-Adaptive Agents
Agents adapt through repeated use and feedback. They can:- Parse recipe content into steps
- Adjust estimates using historical memory
- Reference internal sources (e.g., USDA guidelines, Bon Appétit)
- Suggest timers with contextual notes
suggest_timer_duration(ingredient, weight, temp, past_outcomes)
log_cooking_outcome(task, result, feedback)
Duration as a Core Abstraction
Reasons to isolateduration/
into its own module:
Modularity
- Prevents logic duplication
- Keeps estimation separate from storage or UI
- Used across
.ics
generation, planning, memory replay, agent prompts
- Add support for:
- User-specific classifiers
- Conditional adjustments
- Duration uncertainty and confidence scores
9. Model Output Schema
Prompt: Estimate how long it takes to grill a 1.5 lb skirt steak at 425°F.For each doneness level (rare, medium-rare, medium, well-done), provide:
- Mean cook time in minutes
- A 2-value range
- Recommended internal temperature (°F)
- A brief description of technique or doneness