Monitoring Overview

The monitoring suite is a powerful tool that helps you track, classify, and address issues within your client or tenant environment. This guide explains how to effectively use the classification system, evaluate responses, and leverage monitoring data to improve performance.

Classification System

Adding Classifications

Classifications can be added in multiple ways:

From Chat Interface:
- Click the tag icon beneath any chat response
- Select existing classifications or create new ones directly from the conversation
- New classifications are automatically applied to the current interaction
From Query Browser:
- Open the diagnostics panel by selecting an interaction in the query browser
- Expand the classification section
- Either select an existing classification or click "Add Classification"
From Monitoring Page:
- Navigate to any classification and click "Edit Classification"
- Modify name and criteria directly from the monitoring interface
Automatic Evaluation:
- Classifications are applied automatically when the "Does response answer query" evaluation runs
- This can be scheduled to run daily or triggered manually

Managing Classifications

Editing Classifications:

Click into any classification from the monitoring page to modify its name and evaluation criteria
Changes are immediately reflected across all monitoring views
Classification criteria are preserved even when auto-assignment is disabled

Active/Inactive States:

Toggle auto-assignment on/off for any classification
Disabled classifications appear grayed out with a "disabled" tag
Useful for pausing evaluations on legacy issues or freezing classification groups in time
Re-enable by editing the classification and turning auto-assign back on

Unevaluated Questions:

System automatically tracks questions that haven't been processed through evaluations
"Unevaluated" classification appears when questions exist that haven't been run through daily or manual evaluations
Cannot be edited as it's automatically maintained by the system
Helps identify gaps in evaluation coverage

Writing Effective Classification Criteria

When creating classification criteria, leverage the content available to the evaluation model:

Reference relevant tags:
- query: Use when classifying based on what the user asked
- response: Use when classifying based on how the system responded
- message history: Use when classifying based on the conversation context
- System prompt elements like data values and skill descriptions
Examples of effective criteria:
- "When the message history reflects that the user asked several questions, but the assistant response does not include any direct answers"
- "When the user requests information in a language different from the system's configured language"
- "When a specific brand mentioned in message history is automatically appended as a parameter"

Legacy Admin Tag Migration

Legacy admin tags are automatically converted to the new classification system
Migrated tags maintain backward compatibility
Important: Tagged interactions older than 28 days require adjusting the date filter in classification detail pages to be visible

Setting Up Daily Evaluations

Configuration Best Practices

Set the correct UTC time:
- Ensure hour and minutes are set to UTC time
- Use Google to confirm the current UTC time if needed
Configure question threshold appropriately:
- Set to the average number of queries per day (with some buffer)
- Setting to zero will prevent automatic evaluations
- Example: If you average 80 queries per day, set threshold to 100
Exclude internal domains:
- Add internal domains like "answerocket" to the exclusion list
- This prevents evaluations on internal test questions

Using the Monitoring Dashboard

Navigating the Monitor Tab

Access the monitoring suite from Skill Studio by clicking the "Monitor" tab
Select the relevant assistant from the dropdown at the top
View key statistics for the last 28 days:
- Pass rate
- Total questions
- Active users
- New active users

Understanding Classifications Table

The monitoring dashboard displays classifications in a table format:

Successful Questions:
- Queries that passed evaluation
- Can still have classifications attached
Issue Classifications:
- Each classification shows:
  - Title and criteria
  - Number of occurrences
  - Number of users impacted
  - Most recent occurrence date
  - Percentage of total classified issues
- Use these metrics to prioritize issue resolution
Unevaluated Questions:
- System-maintained classification for questions not yet processed through evaluations
- Prompts setup of automated evaluation schedules or manual evaluation runs

Analyzing Classification Details

When you click on a classification:

Navigate through examples using arrow keys
View all query browser data for each instance
Reorganize table columns by dragging
Apply filters by date or question type
Share the classification link in tickets for troubleshooting

Question Browser Integration

View All Questions

Access all questions for the current assistant directly from the monitoring page
Click "View All Questions" to see comprehensive question history
Default 28-day filter can be adjusted for historical analysis

Multi-Select and Bulk Operations

Selecting Multiple Questions:

Use checkboxes on the left side of questions in "View All Questions" view
Select questions from classification detail pages
"Add [X] questions" button appears when questions are selected

Bulk Actions:

Add selected questions to existing collections
Create new collections from selected questions
Useful for building test suites after identifying and fixing issues

Groups and Filtering

Groups Column:

New filtering capability by user groups in Query Browser
Filter by specific groups (admins, UAT training, etc.)
Shows all groups a user belongs to, not just the filtered group
Group information included in all data exports

Enhanced Persistence:

Questions and responses preserved even after editing or deletion
Full conversation history maintained for troubleshooting
Multiple versions of edited questions show complete interaction progression

Creating Test Collections from Monitoring

Workflow for Issue Resolution

Identify Issues: Use monitoring dashboard to spot problematic classifications
Analyze Questions: Click "View All Questions" to see comprehensive scope
Select Related Questions: Use multi-select to choose questions affected by the same issue
Create Validation Collection: Add selected questions to a new collection for post-fix testing
Schedule Evaluations: Run tests on the collection to validate fixes

Best Practices for Collections

Use descriptive names that indicate purpose (e.g., "Authentication Issues - March 2024")
Group questions by issue type for focused testing
Create collections before and after fixes for comparison
Leverage groups filtering to create user-segment-specific test suites

Important Notes

Test runs are excluded: Interactions from test runs won't appear in classifications to prevent clutter
28-day default filtering: Monitoring views default to last 28 days but can be adjusted for historical data
Prioritize by impact: Focus on issues affecting multiple users first
Use classification links: When creating JIRA tickets, include the classification link rather than individual question URLs
Regular review: Periodically review classifications to identify patterns and recurring issues
System prompt updates: Enhanced context handling applies to newly created assistants or existing assistants with updated system prompts

Best Practices Summary

Create specific, targeted classifications that reference the evaluation context
Configure daily evaluations with appropriate thresholds and exclusions
Use the new chat interface for immediate classification and feedback
Leverage multi-select to efficiently organize questions into test collections
Take advantage of groups filtering for user-segment analysis
Prioritize issues based on occurrence count, user impact, and recency
Use classification links in troubleshooting tickets
Regularly review and refine your classification system
Adjust date filters when working with historical data or migrated admin tags

Note: This guide reflects the monitoring suite functionality as of version 2507. Future updates may enhance these capabilities.