Natural Language Inference (NLI) is a task where you determine the logical relationship between two sentences: a premise and a hypothesis.
The three possible relationships are entailment, neutral, and contradiction.
As an annotator, you'll do two things for each example: mark key token spans with the labels described below, and rate the example's difficulty along several dimensions.
Press 1-9 to choose what you're marking, use j/k + Space to select tokens, then press Enter or click "Save & Next".

Text is split using WordPiece tokenization (the same as ModernBERT). Sometimes words are split into subword tokens, shown connected with a small dash. For example, "running" might become "run" + "##ning".
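If you want to see this splitting for yourself, the sketch below uses the Hugging Face transformers tokenizer API; bert-base-uncased is chosen only as a convenient WordPiece tokenizer for illustration and may not match the tokenizer this tool actually runs.

    # A minimal sketch of WordPiece subword splitting, assuming the Hugging Face
    # "transformers" package is installed. bert-base-uncased is used purely as a
    # stand-in WordPiece tokenizer for illustration.
    from transformers import AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")

    for word in ["running", "annotator", "unbelievably"]:
        print(word, "->", tokenizer.tokenize(word))
    # Pieces that continue a word are printed with a "##" prefix.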
Mental test: "If I deleted this word, would the relationship still hold?"
Definition: Tokens that establish the logical bridge between premise and hypothesis.
What to Mark:
P: A [man] plays [guitar] → H: A [person] plays [music]
P: The cat is [sleeping] → H: The cat is [resting]
P: [Three] dogs → H: [Some] dogs
Do NOT Mark: determiners (a, the) unless they change meaning; punctuation; background details irrelevant to the inference.
Definition: Tokens that cannot both be true simultaneously.
What to Mark:
P: [open] → H: [closed]
P: [All] students → H: [No] students
P: [standing] → H: [sitting]
P: [John] won → H: [Mary] won (if same event)
Do NOT Mark: tokens that differ but aren't contradictory; background tokens consistent between both sentences.
Definition: Tokens introducing information not addressed by the other sentence. This is the trickiest category: neutral means "could be true or false given the premise."
What to Mark:
P: A woman is walking → H: A woman is walking to [the store]
P: Two men are talking → H: Two [friends] are talking
P: A person is eating → H: A person is eating [breakfast]
Decision test: "Does the premise give evidence for or against this token?" If neither, mark it as a neutral span.
Use these labels to mark tokens that contribute to different types of difficulty:
reasoning: Tokens requiring logical inference, deduction, or multi-step reasoning to understand.
creativity: Tokens requiring imaginative or non-literal interpretation (metaphors, analogies, figurative language).
domain_knowledge: Tokens requiring specialized knowledge (scientific, technical, cultural, etc.).
contextual: Tokens that depend on implicit context not explicitly stated in the text.
constraints: Tokens representing conditions, limitations, or requirements that must be tracked.
ambiguity: Tokens that are ambiguous or could be interpreted multiple ways.
Use these labels to mark tokens that are key evidence for the NLI relationship:
entailment: Tokens that support or prove the hypothesis follows from the premise.
neutral: Tokens that introduce information not determinable from the premise.
contradiction: Tokens that conflict or are inconsistent between premise and hypothesis.
Click "+ New Label" to create custom labels for patterns you want to track. You can rename any label by clicking on its name.
Rate each example on a scale of 0-10 for the six dimensions: reasoning, creativity, domain_knowledge, contextual, constraints, and ambiguity.
Press 1 to select "reasoning", use j/k to navigate and Space to mark. Set the difficulty with Shift+1, then type 7. Hit Enter to save. Full keyboard annotation, no mouse required!
For programmatic access to the annotation data, the API is fully documented with interactive endpoints:
Swagger UI (/docs) and ReDoc (/redoc)
GET /api/next - Get the next unlabeled example
GET /api/example/{dataset}/{id} - Get a specific example
POST /api/label - Submit annotations
GET /api/stats - Get annotation statistics
GET /api/export - Export all annotations as JSONL
The API uses session-based authentication via cookies. Log in through the web interface or use the /api/auth/login endpoint.
If the server is running with ANONYMOUS_MODE=1, no authentication is required.
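As a rough illustration of a client, the sketch below logs in, fetches the next example, and submits a label using the Python requests library. The base URL and all payload field names are assumptions, so check the Swagger UI at /docs for the authoritative request and response schemas.

    # Rough sketch of programmatic access using the "requests" library. The base
    # URL, login payload fields, and label payload shape are assumptions; consult
    # the Swagger UI at /docs for the real schemas.
    import requests

    BASE = "http://localhost:8000"   # assumed host and port
    session = requests.Session()     # keeps the session cookie across calls

    # Skip the login step if the server runs with ANONYMOUS_MODE=1.
    session.post(f"{BASE}/api/auth/login",
                 json={"username": "alice", "password": "secret"})  # assumed fields

    # Fetch the next unlabeled example and submit an annotation for it.
    example = session.get(f"{BASE}/api/next").json()
    session.post(f"{BASE}/api/label", json={
        "dataset": example.get("dataset"),   # illustrative field names only
        "id": example.get("id"),
        "spans": [],                         # token spans you marked
        "difficulty": {},                    # 0-10 ratings per dimension
    })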
Exported data is in JSONL format, with each line containing one annotation record.
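As an illustration, a downloaded export could be read like this; the field names in the sketch are assumptions, not the tool's documented schema.

    # Sketch of reading the export (one JSON object per line). The field names
    # below are assumptions made for illustration; inspect your own export to
    # see the actual keys.
    import json

    with open("annotations.jsonl", encoding="utf-8") as f:
        for line in f:
            record = json.loads(line)
            print(record.get("dataset"), record.get("id"),
                  len(record.get("spans", [])), record.get("difficulty"))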
Welcome! Before you start annotating, please complete a short training to familiarize yourself with the labeling process.
You'll annotate 5 gold-standard examples and receive feedback on each one.
Your training accuracy is shown as you go. Once the training is complete, you can start annotating real examples.
For any dimension rated above 5, click it to select it, then mark the tokens that justify that rating.
When you flag an example, you'll be asked: "What's unusual about this example?"
Examples with low agreement scores that may need review
Review individual annotator quality. Select a user to see their annotations compared against consensus.
Select a user to begin review.
Gold examples are used to train new annotators. Each example needs:
To add a gold example, annotate any example in the Quality Review tab and click "Mark as Gold".
Review flagged examples - both auto-flagged (low agreement) and manually flagged by annotators.