Picsellia
  • Picsellia
  • Getting started
    • Start using Picsellia
    • Create, annotate, review a Dataset
    • Create a new Dataset Version with merged labels
    • Train a custom Object Detection model
    • Train a custom Classification model
    • Deploy model in production (Tensorflow only)
    • Feedback loop - Send predictions from models to Datalake or Datasets
  • Data Management
    • Upload assets to your Lake
    • Edit Tags for Pictures
    • Create a Dataset
    • Add data to a Dataset
    • Create a new Dataset version
    • Configure your Labels
    • Import annotation from other Dataset version
  • Experiment Tracking
    • Initialize an experiment
    • Checkout an experiment
    • Log your results to Picsell.ia
    • Store your files to Picsell.ia
    • Evaluate your models
    • Retrieve and update your assets
    • Publish your model
    • Namespace
  • Hyperparameter tuning
    • Overview
    • Install Picsell CLI
    • Config
    • Launch your Hyperparameters tuning
  • Models
    • Model HUB
  • References
    • API Reference
    • Python SDK Reference
    • Python Training Reference
  • Organization
  • Website
Powered by GitBook
On this page
  • With the Platform
  • Search for pictures
  • With Python SDK

Was this helpful?

  1. Data Management

Create a Dataset

PreviousEdit Tags for PicturesNextAdd data to a Dataset

Last updated 3 years ago

Was this helpful?

With the Platform

Creating a Dataset is really simple, once on your "Datalake" page

Search for pictures

To perform operations, your need to select assets on the table, the right checkbox select all the visible assets, the left one select all filtered assets.

You can search pictures on your lake with our Data Query Language, basically you can search pictures in your lake with :

  • Tags

  • Width

  • Height

  • Source

  • Filename

  • Annotations

  • Dataset

For example, let's seach for pictures that are tagged penfun with a Picture width > 400 px and that have at least one annotation.

Once you have filtered your pictures, you can either select all filtered pictures or a subset.

You can now click on create dataset !

Then enter a Name and description for your Dataset, please note that description is optional :)

With Python SDK

pip install picsellia

First make sure that you have Picsellia Python package installed

then you will need to initialize the Client with your API Token, available in you profile page.

from picsellia.client import Client
clt = Client(api_token="your token")

you can now search for some assets on your lake with the datalake.fetch() method:

pictures = clt.datalake.picture.fetch(quantity=1, tags=['tag1'])

You can use Client.datalake.pictures.status() to vizualize the fetched assets

then you can create your dataset

clt.datalake.dataset.create(name='dataset2', 
                            description='this is a test',
                            pictures=pictures)

You can find a complete reference to the SDK .

here