The missing UI for ML data

Data quality is the most important factor in machine learning success. Hyperparam brings exploration and analysis of massive text datasets to the browser. Look At Your Data. 👀

Drop in a dataset (parquet, etc) to get started.

All of Wikipedia in the Hyperparam Viewer

This is a demo of what you can do with parquet files being read directly in the browser. A parquet file with all of the English Wikipedia articles is stored in S3 and rows are retrieved on demand using hyparquet.

The tool for LLM datasets

The first step in data science is to look at your data. But where do you even start?

Most data tools cannot handle the scale of modern data interactively. Using modern data formats like parquet, Hyperparam can load and explore datasets with billions of rows directly in the browser.

col1col2col3
loremipsumdolor
sitametconsectetur
adipiscingelitsed
doeiusmodtempor
incididuntutlabore
etdoloremagna
aliquautenim
 

Welcome to the era of model-assisted
data exploration and curation

Using models to reflect back on their own training data can help you find the best quality data, in order to build the best quality models.

Local-first design

Hyperparam is a new type of data app that can run entirely in the browser. Use our CLI for quick access to local files:

npx hyperparam

Open-source

Everyone benefits from open source software and open data standards.
Hyperparam is open-source first. Code available on GitHub.

wiki.en.parquet
Drop files