# Create a Dataset

### With the Platform

Creating a Dataset is really simple, once on your "Datalake" page

![](/files/-Me-sWK3pmh4WiOIlatm)

### Search for pictures

{% hint style="info" %}
To perform operations, your need to select assets on the table, the right checkbox select all the visible assets, the left one select all filtered assets.
{% endhint %}

You can search pictures on your lake with our Data Query Language, basically you can search pictures in your lake with :

* Tags
* Width
* Height
* Source
* Filename
* Annotations
* Dataset

For example, let's seach for pictures that are tagged `penfun` with a Picture width > 400 px and that have at least one annotation.

![](/files/-McA4v7NTEQSjzag7xSb)

Once you have filtered your pictures, you can either select all filtered pictures or a subset.

![](/files/-McA5cC6fts6yAtE5reB)

You can now click on create dataset !

Then enter a Name and description for your Dataset, please note that description is optional :)&#x20;

![](/files/-Me-taEhjBh8pS53gcyQ)

### With Python SDK

```python
pip install picsellia
```

First make sure that you have Picsellia Python package installed&#x20;

then you will need to initialize the Client with your API Token, available in you profile page.

```python
from picsellia.client import Client
clt = Client(api_token="your token")
```

you can now search for some assets on your lake with the datalake.fetch() method:

```python
pictures = clt.datalake.picture.fetch(quantity=1, tags=['tag1'])
```

You can use Client.datalake.pictures.status() to vizualize the fetched assets &#x20;

then you can create your dataset

```python
clt.datalake.dataset.create(name='dataset2', 
                            description='this is a test',
                            pictures=pictures)

```

You can find a complete reference to the SDK [here](/picsellia/references-1/python-sdk-reference-1.md#push_dataset).


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://picsellia.gitbook.io/picsellia/data-management/create-a-dataset.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
