Interactive Tutorial

Model Lab Demo

An animated walkthrough of every step you take on the Model Lab page. Controls on the left let you step through manually or let it auto-play.

Overview

Labeled Candidates

Positive Labels

Negative Labels

Uncertain Labels

Candidate Rows

Reviewed Items

Hostnames

Saved Models

Feature Families

💡

Labeled Candidates is the number that drives model quality. You need at least 20 balanced labels (mix of Positive and Negative) before training produces a reliable model. Aim for 50+.

Train Variants

Training uses human-reviewed candidate rows only. Leave the job filter blank to use all labeled candidates stored, or enter comma-separated Job IDs to scope training to a specific batch.

Job IDs Filter

Training Algorithm

Dataset Export ↓ Download Candidate Dataset CSV

keyword-aware

Keyword-Aware Model

Uses text keywords (e.g. "comments", "reply", "posted by") combined with structural HTML features. Best accuracy for UGC detection.

Feature Count: 42 🔤 Keywords: included Algorithm: Logistic Regression

↓ Download Variant Dataset

structure-only

Structure-Only Model

Uses only DOM shape signals — element depth, sibling counts, ARIA roles. No text keywords. Useful as a baseline to measure keyword contribution.

Feature Count: 28 🏗 Keywords: excluded Algorithm: Logistic Regression

↓ Download Variant Dataset

Trained Models

Use Runtime JSON when the browser extension needs a compact deployable inference bundle. Click Use to select a model for scoring jobs or the site-group probe.

Artifact	Variant	Created	Precision	Recall	F1	Top-1

Score Existing Job

Use a trained model against a completed job. This scores every stored candidate, re-ranks each page, and shows whether the top candidate is confident or needs manual review.

Model

Job ID

Recent Jobs

badffc6c-8464-4675-af85-1f4e3a762a31

running

227/332

89 detected

cc919aa8-d163-43c4-9fa9-ccdf799aab28

completed_with_errors

77/79

34 detected

Site Group Probe

Paste or upload URLs or hostnames. The probe matches them against pages you have already scanned, scores with the selected model, and groups results by hostname.

Model

Optional Job IDs Filter

Upload CSV Or Text File

URLs Or Hostnames

Model Lab
Demo Tour

Demo Complete!

Model Lab Demo

Overview

Train Variants

Keyword-Aware Model

Structure-Only Model

Keyword-Aware Model

Most Influential Families

Top Positive Weights

Top Negative Weights

Trained Models

Score Existing Job

Site Group Probe

Model LabDemo Tour

Demo Complete!

Model Lab Demo

Overview

Train Variants

Keyword-Aware Model

Structure-Only Model

Keyword-Aware Model

Most Influential Families

Top Positive Weights

Top Negative Weights

Trained Models

Score Existing Job

Site Group Probe

Model Lab
Demo Tour