Skip to main content

Smart Probe: Technical Reference

How probing logic, content controls, and scoring behave at runtime

This article explains how Smart Probe works at a technical level: what the AI evaluates, what governs its behavior, and what it can and cannot generate. It is intended for researchers who want to understand the system's logic before designing studies.

For setup instructions, see the Smart Probe Text Card authoring guide.

1. What Smart Probe Does and Does Not Do

Smart Probe is designed to probe deeper on what a respondent has already said. It does not originate new lines of inquiry outside the scope the researcher has defined.

What Smart Probe will do

• Ask follow-up questions that build on the respondent's own answer.

• Stay within the topic established by the anchor question.

• Apply conversational goals to shape the direction of follow-ups.

• Enforce avoidance instructions at every probe turn.

• Stop probing once the response meets the thoughtfulness threshold or the probe limit is reached.

• Generate a summary of the full conversational thread.

What Smart Probe will not do

• Introduce new or unrelated topics.

• Exceed the researcher-configured maximum number of probes.

• Ignore avoidance instructions.

• Override display logic or thoughtfulness thresholds.

• Generate probes if the AI is unavailable, the survey continues with the initial open-ended question only.

2. Inputs That Shape Probe Behaviour

The follow-up generation engine draws on five inputs simultaneously at runtime.

Input

How it shapes probe behaviour

Anchor question

The core topic question set by the researcher. It defines the topical scope for the entire probing thread.

Respondent's answer(s)

The primary driver of what the follow-up asks. The AI identifies what was said and what could be explored further within scope.

Conversational goals

Researcher written instructions describing what the probe should aim to uncover.

Avoidance instructions

Hard constraints specifying topics or question styles to exclude. Applied at every probe turn.

Full conversation history

The complete thread up to the current point. Used to prevent redundancy, the AI will not re-ask what has already been covered.

3. Topic Containment

Smart Probe enforces topic containment through two mechanisms.

Anchor-based scoping

The anchor question sets the topical frame for the entire conversation. The AI uses it as the reference point for what is and is not on-topic. Smart Probe cannot introduce new or unrelated topics.

Avoidance instructions

Avoidance instructions specify topics or question styles the AI must not use. These constraints apply at every probe in the thread, not just the first.

Example

• Anchor question: 'How has your experience with the product changed over the past six months?'

• Avoidance instruction: 'Do not ask about pricing or competitors.'

• A respondent mentions a competitor unprompted.

• Result: Smart Probe will not follow up on the competitor reference. It probes only within the experience-change topic.

What avoidance instructions do not do

Avoidance instructions govern what Smart Probe asks — they do not prevent respondents from writing whatever they choose in their answers. If a respondent volunteers information outside the configured scope, that content will appear in the response data. The instructions shape probe behavior only.

4. Thoughtfulness Scoring

The thoughtfulness score determines whether a follow-up question is generated at all. It evaluates the quality and completeness of the respondent's answer.

Scoring dimensions

Each response is scored across 10 dimensions, each on a 0–10 scale:

Dimension

What it evaluates

Depth of Insight

Does the response go beyond surface-level statements?

Relevance

Does it stay on-topic with the anchor question?

Specificity

Does it include concrete details or examples?

Clarity

Is it logically structured and easy to follow?

Originality

Does it reflect an authentic, personal viewpoint?

Critical Thinking

Does it show reasoning or weigh trade-offs?

Emotional Engagement

Is there genuine feeling or personal context present?

Breadth

Does it cover more than one angle or perspective?

Supporting Evidence

Is there justification or rationale provided?

Constructiveness

Does it offer suggestions or forward-looking ideas?

Score calculation

The 10 dimension scores are averaged into a final aggregated score. Two rules apply on top of the average:

• If the relevance dimension scores zero, the final score is set to zero regardless of all other scores.

• Short responses that are genuinely high quality receive an upward adjustment, so concise but substantive answers are not unnecessarily probed.

Scores are internal only

Thoughtfulness scores are stored as metadata and never shown to respondents. They are available to researchers in exports and reports.

5. Probe Generation Logic

Redundancy prevention

Before generating each follow-up, the engine reads the full conversation history. It will not re-ask a question already covered in the thread.

Threshold re-evaluation

After each follow-up response, the scoring engine re-evaluates the new answer against the threshold. Probing stops as soon as the threshold is met, even if the maximum probe count has not been reached.

Hard limits on AI autonomy

The following constraints are enforced at a system level and cannot be overridden by model judgment:

• Cannot exceed the researcher-configured maximum probe count.

• Cannot introduce topics not present in the anchor question or respondent answers.

• Cannot ignore or partially apply avoidance instructions.

• Cannot modify display logic or thoughtfulness thresholds at runtime.

6. Versioning and Auditability

Every Smart Probe operation logs the AI model version and prompt version used — for question generation, thoughtfulness scoring, and summarization — at the respondent level.

Version behavior

• All respondents in a study are evaluated by the same model and prompt version.

• If a scoring framework is updated after a study launches, the existing study continues on its original version. The new version applies only to new studies.

• This document is updated when a new scoring version becomes available.

Current model configuration

AI Function

GPT Model

Prompt Version

Key Purpose

Generate Question

GPT-4.1 Mini

1.0.0

Generates context-aware follow-up questions based on respondent answers and researcher-defined goals.

Thoughtfulness Scoring

GPT-4.1 Mini

2.0.0

Evaluates response depth and completeness using the 10-dimension rubric.

Summarize Messages

GPT-4o

1.0.0

Produces a concise synthesis of the full probing thread for reporting.

7. Language Support

• Smart Probe can be used in any language, but it has been primarily trained and tested in English. Performance is most reliable in English-language studies.

• In other languages, thoughtfulness scoring, follow-up question quality, and summary accuracy may be less consistent.

• For non-English deployments, test the full probe flow in a test chat before deploying to respondents.

Did this answer your question?