Skip to content

Autoresearch

Autoresearch automatically finds the best SQL variation for a metric by testing against ground truth data.

How it works

  1. You provide a metric and its ground truth SQL (the "correct" answer)
  2. OnlyMetrix generates up to 30 SQL variations with different approaches
  3. Each variation is scored against the ground truth using precision, recall, and F1
  4. The best variation wins and can be promoted to your metric catalog

When to use it

  • You have a complex metric and aren't sure which SQL approach is most accurate
  • You want to validate that your metric SQL matches a known-good baseline
  • You're migrating metrics from another tool and want to verify accuracy

Running Autoresearch

Navigate to Autoresearch in the sidebar, select a metric, and click Run. The process runs in the background — you'll see results when it completes.

Understanding results

Results show each variation with:

  • F1 score — balanced measure of precision and recall
  • Precision — how many returned results are correct
  • Recall — how many correct results were found
  • SQL — the exact query used

Three-way classification

Metrics are classified based on their Autoresearch results:

ClassificationMeaning
StructuredHigh F1, compiled to IR, supports all analysis primitives
Opaque-UpgradeableLow F1 but could be improved with better SQL
Opaque-IntentionalIntentionally kept as raw SQL (complex custom logic)

MIT Licensed (SDK) | Proprietary (Server)