hyperparam

The missing UI for dataset exploration and curation.

Data quality is the most important factor in machine learning success. Hyperparam brings exploration and analysis of massive text datasets to the browser. Drop in a dataset (parquet, etc) to get started. Look At Your Data. 👀

Hyperparam is still in development. But today you can:

  • Drop a parquet file on this page
  • Explore the dataset in the browser
  • Use the CLI tool to open any dataset on your computer
  • Open datasets from S3 and other cloud storage

All of Wikipedia in the Hyperparam Viewer Demo:

This is a demo of what you can do with parquet files being read directly in the browser. A parquet file with all of the English Wikipedia articles is stored in S3 and rows are retrieved on demand using hyparquet.

Highly scalable dataset tool

The first step in data science is to be deeply familiar with your training data.
But where do you even start? Most data tools cannot handle the scale of modern data interactively. Using modern data formats like parquet, Hyperparam can load and explore datasets with billions of rows directly in the browser.

Model assisted data curation

Welcome to the era of model-assisted data exploration and curation.
Using models to reflect back on their own training data can help you find the best quality data, in order to build the best quality models.

Local-first

A new type of app that moves everything to the browser.
Hyperparam is a local-first app that can run entirely in the browser.
Drop a parquet file on this page, or install the Hyperparam CLI tool:

npx hyperparam

Open-source

Everyone benefits from open source software and open data standards.
Hyperparam is open-source first. Code available on GitHub.

wiki_en.parquet
Drop files to upload