Get Embeddings
Get all embedding data for 500k (or otherwise requested) data points.
This takes a random sample of the dataset and exports the result as a CSV to reduce request size and response latency, since results can be large. The export contains:
- id
- gold (index)
- prediction (index)
- data quality
- text_sample
- x coordinate
- y coordinate
- any metadata columns explicitly requested
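As a rough illustration of consuming the export, the sketch below parses a small CSV with Python's standard `csv` module and lists rows where the gold and prediction indices disagree. The header names (`id`, `gold`, `prediction`, `data_quality`, `text_sample`, `x`, `y`) and the sample rows are assumptions made for this example; check the header of an actual export before relying on them.

```python
import csv
import io

# Hypothetical CSV payload; the real header names and values may differ.
SAMPLE_CSV = """id,gold,prediction,data_quality,text_sample,x,y
0,1,1,0.12,first example,0.5,-1.25
1,0,2,0.87,second example,2.0,3.5
"""

NUMERIC_INT = {"id", "gold", "prediction"}
NUMERIC_FLOAT = {"x", "y"}


def parse_embeddings_csv(text):
    """Parse exported rows, converting index and coordinate columns to numbers."""
    rows = []
    for record in csv.DictReader(io.StringIO(text)):
        row = {}
        for key, value in record.items():
            if key in NUMERIC_INT:
                row[key] = int(value)
            elif key in NUMERIC_FLOAT:
                row[key] = float(value)
            else:
                # Text sample, data quality, and metadata stay as strings.
                row[key] = value
        rows.append(row)
    return rows


rows = parse_embeddings_csv(SAMPLE_CSV)
# IDs of rows where the gold and predicted label indices differ.
disagreements = [r["id"] for r in rows if r["gold"] != r["prediction"]]
```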
Set data_view=true to return data embeddings instead of model embeddings. If data embeddings are not available, an exception is thrown.
Set use_seed=false to get a different random batch of embeddings on every response. This is useful for batching, but use_seed should remain true if you are requesting a single set of embeddings. If you are returning all embeddings, this parameter has no effect.
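The two flags travel in different places: data_view is listed among the query parameters, while use_seed is a body field. A minimal sketch of preparing several batch requests follows. The helper name `build_batch_requests` is hypothetical, and using num_samples as the batch size is an assumption based on its name, not something this page confirms.

```python
from urllib.parse import urlencode


def build_batch_requests(num_batches, batch_size, data_view=False):
    """Return (query_string, body) pairs for repeated random batches."""
    # data_view is a query parameter; send the boolean as a lowercase string.
    query = urlencode({"data_view": str(data_view).lower()})
    requests = []
    for _ in range(num_batches):
        body = {
            # Assumption: num_samples controls how many points are sampled.
            "num_samples": batch_size,
            # use_seed=False asks for a different random sample per response.
            "use_seed": False,
        }
        requests.append((query, body))
    return requests


batches = build_batch_requests(num_batches=3, batch_size=1000, data_view=True)
```

Because each response is an independent random sample, separate batches may contain overlapping points; deduplicating on id afterwards is one way to handle that.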
POST /projects/{project_id}/runs/{run_id}/split/{split}/embeddings
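A minimal sketch of assembling this request with Python's standard library is shown below. It only builds the request object and does not send it. `BASE_URL` is a placeholder, since this page does not state the API host, and the helper name `build_embeddings_request` is hypothetical. The `Galileo-API-Key` header and the allowed split values come from this page.

```python
import json
import urllib.request
from urllib.parse import quote

# Placeholder host (an assumption); substitute the API host for your deployment.
BASE_URL = "https://api.example.com"
VALID_SPLITS = {"training", "validation", "test", "inference"}


def build_embeddings_request(project_id, run_id, split, api_key, body=None):
    """Build (but do not send) the POST request for the embeddings endpoint."""
    if split not in VALID_SPLITS:
        raise ValueError(f"split must be one of {sorted(VALID_SPLITS)}")
    url = (
        f"{BASE_URL}/projects/{quote(project_id, safe='')}"
        f"/runs/{quote(run_id, safe='')}/split/{split}/embeddings"
    )
    # The body is sent as application/json; an empty object keeps all defaults.
    data = json.dumps(body or {}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=data,
        method="POST",
        headers={
            "Galileo-API-Key": api_key,
            "Content-Type": "application/json",
        },
    )


request = build_embeddings_request("my-project", "my-run", "training", "secret-key")
# To send it: urllib.request.urlopen(request).read()
```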
Authorizations
- Galileo-API-Key (string, header, required)
Path Parameters
- project_id (string, required)
- run_id (string, required)
- split (enum<string>, required). Available options: training, validation, test, inference
Query Parameters
- inference_name (string, default: (empty))
- data_view (boolean, default: false)
- scope (string | null, default: (empty))
Body
application/json
- task (string | null)
- filter_params (object)
- compare_to (enum<string> | null). Available options: training, validation, test, inference
- map_threshold (number, default: 0.5)
- meta_cols (string[] | null)
- num_samples (integer | null)
- text_sample_length (integer | null, default: 500)
- include_text_sample (boolean, default: false)
- file_type (enum<string>). Available options: csv, json, parquet, arrow, zip
- only_meta_cols (boolean, default: false)
- use_seed (boolean, default: true)
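One way to keep request bodies consistent with the fields above is a small dataclass that carries the documented defaults and omits anything left unset. This is a sketch, not part of any official client. The class name `EmbeddingsBody` and the `source` metadata column are hypothetical, and only a subset of the body fields is modeled.

```python
from dataclasses import asdict, dataclass
from typing import List, Optional

FILE_TYPES = {"csv", "json", "parquet", "arrow", "zip"}


@dataclass
class EmbeddingsBody:
    """A subset of the body fields, using the defaults from the listing."""

    num_samples: Optional[int] = None
    meta_cols: Optional[List[str]] = None
    map_threshold: float = 0.5
    text_sample_length: Optional[int] = 500
    include_text_sample: bool = False
    only_meta_cols: bool = False
    use_seed: bool = True
    # No default is stated for file_type, so it is omitted unless chosen.
    file_type: Optional[str] = None

    def to_payload(self):
        """Return a JSON-serializable dict, omitting fields left as None."""
        if self.file_type is not None and self.file_type not in FILE_TYPES:
            raise ValueError(f"file_type must be one of {sorted(FILE_TYPES)}")
        return {k: v for k, v in asdict(self).items() if v is not None}


payload = EmbeddingsBody(
    num_samples=10_000, meta_cols=["source"], file_type="parquet"
).to_payload()
```

Omitting unset fields leaves the server-side defaults in charge of anything the caller did not choose explicitly.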