The missing UI for ML data
Data quality is the most important factor in machine learning success. Hyperparam brings exploration and analysis of massive text datasets to the browser. Look At Your Data. 👀
Drop in a dataset (parquet, etc.) to get started.
All of Wikipedia in the Hyperparam Viewer
This demo shows what is possible when parquet files are read directly in the browser. A parquet file containing every English Wikipedia article is stored in S3, and rows are retrieved on demand using hyparquet.
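To make "retrieved on demand" concrete, here is a minimal sketch of the underlying idea: a parquet file ends with a 4-byte little-endian metadata length followed by the magic bytes `PAR1`, so a client can fetch just the last 8 bytes with an HTTP range request and learn where the metadata lives without downloading the whole file. The names `parseFooterTail` and `fetchFooterRange` are hypothetical helpers written for illustration. This is not hyparquet's API, and it assumes the file size is already known and that the server honors `Range` headers.

```typescript
// Byte range [start, end) inside the remote file.
interface ByteRange {
  start: number;
  end: number;
}

// Hypothetical helper: given the last 8 bytes of a parquet file, work out
// where the footer metadata sits. Layout of the tail:
//   bytes 0..3  metadata length (uint32, little-endian)
//   bytes 4..7  magic "PAR1"
function parseFooterTail(tail: Uint8Array, fileSize: number): ByteRange {
  if (tail.length !== 8) throw new Error("expected exactly 8 tail bytes");
  // "PAR1" in ASCII is 0x50 0x41 0x52 0x31
  const magic = [0x50, 0x41, 0x52, 0x31];
  for (let i = 0; i < 4; i++) {
    if (tail[4 + i] !== magic[i]) throw new Error("not a parquet file");
  }
  const view = new DataView(tail.buffer, tail.byteOffset, tail.byteLength);
  const metadataLength = view.getUint32(0, true);
  const end = fileSize - 8;
  const start = end - metadataLength;
  // The file also starts with a 4-byte "PAR1" header, so metadata cannot begin before byte 4.
  if (start < 4) throw new Error("metadata length exceeds file size");
  return { start, end };
}

// Hypothetical helper: fetch only the tail of a remote file and locate its metadata.
// Assumes the caller already knows fileSize (for example from a HEAD request).
async function fetchFooterRange(url: string, fileSize: number): Promise<ByteRange> {
  const res = await fetch(url, {
    headers: { Range: `bytes=${fileSize - 8}-${fileSize - 1}` },
  });
  // 206 Partial Content means the server honored the range request.
  if (res.status !== 206) throw new Error(`range request not honored: ${res.status}`);
  const tail = new Uint8Array(await res.arrayBuffer());
  return parseFooterTail(tail, fileSize);
}
```

The metadata found this way describes where each row group and column chunk is stored, which is what lets a reader then request only the byte ranges for the rows currently being viewed.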
The tool for LLM datasets
The first step in data science is to look at your data. But where do you even start?
Most data tools cannot handle the scale of modern data interactively. Using modern data formats like parquet, Hyperparam can load and explore datasets with billions of rows directly in the browser.
Welcome to the era of model-assisted data exploration and curation
Using models to reflect on their own training data can help you find the highest-quality data and, in turn, build the highest-quality models.
Local-first design
Hyperparam is a new type of data app that can run entirely in the browser. Use our CLI for quick access to local files:
npx hyperparam
Open-source
Everyone benefits from open source software and open data standards.
Hyperparam is open-source first. The code is available on GitHub.