Classify Intent (LLM)¶

The Classify Intent (LLM) node maps a user message to a discrete intent label using a small language model's first-token logprob over a closed set of letter candidates (A, B, C, …). Each letter maps positionally to an entry in intents_ids. On success it attaches an IntentionComponent — the same routing-only component produced by Classify Intent — so downstream IntentToInstruction wiring is unchanged.

NodeType: ClassifyIntentLLM.

How it works¶

Projects the dialog via ClassifyIntentLLMProjectionStrategy:
Stable system message (Mustache template + pre-rendered label list).
Up to history_turns past user / assistant pairs for coreference.
Current user message (latest HumanMessage).
Prefills the classifier model (Sync()); reuses KV-cache prefix across turns when the prompt tail is append-only.
Reads logits at the final position and softmaxes over the single-token ids for A, B, C, …
If top_prob ≥ threshold and (top − second) ≥ margin, attaches IntentionComponent{intents_ids[top]} and returns found.

No text is generated; no sampling.

Parameters¶

Param	Type	Default	Range	Structural	Description
`model_name`	Optional[str]	inherit model default	—	✓	Model catalog name. Structural because the token-ID mapping is built from the model's vocabulary at construction.
`intents_ids`	Optional[str]	inherit model default	—	✓	Comma-separated intent IDs (e.g. "greet,farewell,other"). Structural because label-to-token-ID mapping is built at construction.
`intents_prompt`	Optional[str]	inherit model default	—	✓	Comma-separated intent prompt labels / descriptions. Structural because the projection scaffolding and label-token mapping are built at construction.
`system_prompt`	Optional[str] (multiline)	inherit model default	—	—	Prepended before the user turn during classification projection.
`history_turns`	int	4	0.0 – 64.0	—	Number of dialog history turns included in the classification context.
`threshold`	float	0.5	0.0 – 1.0	—	First-token probability threshold for the winning label.
`margin`	float	0.0	0.0 – 1.0	—	Minimum margin between top and second label probabilities.
`notify_client`	bool	False	—	—	When true, fire OnNodeEvent("intent_classified", …) on the found path.
`not_found_intent`	Optional[str]	inherit model default	—	—	Intent ID returned when no label clears the threshold/margin.

Exits¶

Each exit is a structural string field on the node's params; its value names the target node (empty = END).

Exit	Param field	Description
`found`	`found_exit`	Exit taken when classification produced a label above threshold/margin. Empty string = END.
`not_found`	`not_found_exit`	Exit taken when no label cleared the threshold/margin. Empty string = END.

Exit routes¶

Route	Fires when
`found`	Valid message, top probability passes `threshold` and `margin`, and winning intent is not the sink.
`not_found`	Empty message, failed thresholds, internal scoring error, or winning intent equals `not_found_intent` (`sink_intent`).

Diagnostics¶

When diagnostics are enabled on the agent, TurnComplete.debug_info includes these keys from the classify node:

Key	Description
`query`	User message text classified.
`top_prob`	Softmax probability of the winning letter.
`second_prob`	Softmax probability of the runner-up letter.
`margin_result`	`top_prob − second_prob`.
`result.intent`	Attached intent label (empty on `not_found`).
`probs_json`	Full probability vector over labels.
`intents_ids_json`	JSON array of `intents_ids` in label order.
`predicted_letter`	Winning letter (`A`, `B`, …).
`not_found_reason`	Why `not_found` fired: `empty_message`, `score_size_mismatch`, `score_failed`, `below_threshold`, `below_margin`, or `sink_intent`. Empty on `found`.

Example¶

PythonC++

from tryll_client.graph import GraphDescription, ClassifyIntentLLMParams

graph = (
    GraphDescription()
    .add_node("classify", ClassifyIntentLLMParams(
        model_name="Llama 3.2 1B Instruct (Q4_K_M)",
        intents_ids="confess_margaret_saw,when_saw_body,other",
        intents_prompt=(
            "inspector confronts butler with Mrs Hollis witness statement,"
            "inspector asks when butler first saw the body,"
            "anything else"
        ),
        history_turns=4,
        threshold=0.5,
        notify_client=True,
        found_exit="pick_instruction",
        not_found_exit="retrieve_world",
    ))
    # ... rest of graph nodes
    .set_start_node("classify")
    .set_default_model_name("My Local Model")
)

using namespace Tryll::Client;
using namespace Tryll::NodeParams;

ClassifyIntentLLMParamsT cp;
cp.model_name     = "Llama 3.2 1B Instruct (Q4_K_M)";
cp.intents_ids    = "confess_margaret_saw,when_saw_body,other";
cp.intents_prompt =
    "inspector confronts butler with Mrs Hollis witness statement,"
    "inspector asks when butler first saw the body,"
    "anything else";
cp.history_turns  = 4;
cp.threshold      = 0.5f;
cp.notify_client  = true;
cp.found_exit     = "pick_instruction";
cp.not_found_exit = "retrieve_world";

GraphDescription graph;
graph.AddClassifyIntentLLM("classify", std::move(cp));
// ... rest of graph nodes

When to use this node¶

	Classify Intent (embedding)	Classify Intent (LLM)
Requires KB	Yes — labelled examples needed	No
Inference cost	Embedding only (fast)	One LM forward pass
Intent set	Fixed at storage creation	Defined inline as strings
Best for	Large, pre-labelled KB; low latency	Small label sets; no pre-built KB

See tryll_test_chat agent butler2 (IntentLLMWorkflow) for a full graph alongside retrieval-based butler.

Classify Intent — embedding retrieval alternative
Intent to Instruction
Research: docs/research/retrieve-and-intent-classification/intent-classification-slm-logprob.md (engine design)
Prompt guide: docs/research/retrieve-and-intent-classification/intent-classification-logprob-prompt-best-practices.md