# Create, annotate, review a Dataset

Here are the steps you will go through:

* **Upload your images to the Datalake**
* **Create a new dataset version with tags**
* **Set the labels for your dataset**
* **Annotate your images**
* **Review your annotations**

## Upload images

This journey starts in the [Datalake](https://picsellia.gitbook.io/picsellia/data-management/upload-assets-to-your-lake#with-the-platform).

![](https://2100239547-files.gitbook.io/~/files/v0/b/gitbook-legacy-files/o/assets%2F-M78aoA9Gcxf25O1fNjm%2F-Me-C4jQERgSYmpd01nm%2F-Me-CHGQNmgvq221JUNE%2Fimage.png?alt=media\&token=62e8d16d-477a-440d-ba53-3d58dca2af65)

Click on the green `Upload data` button.

![](https://2100239547-files.gitbook.io/~/files/v0/b/gitbook-legacy-files/o/assets%2F-M78aoA9Gcxf25O1fNjm%2F-MbqowzDMLbwu8mqARK8%2F-MbuKEpWU5CQiWvnx9WO%2Fimage.png?alt=media\&token=7fd76a70-17b9-4fc2-aad3-dcaceb538fc0)

You should see a modal with an input field and a button. In the input field, enter some tags so that you can search for your images later, then click on **Browse files** to open your local disk and select some images.

![](https://2100239547-files.gitbook.io/~/files/v0/b/gitbook-legacy-files/o/assets%2F-M78aoA9Gcxf25O1fNjm%2F-Me-C4jQERgSYmpd01nm%2F-Me-CVLMMkaf3SRHF_5U%2Fimage.png?alt=media\&token=eb603333-8d87-4542-b9be-38161e07a102)

Now click on **Upload** and go grab a cup of coffee while your files are uploading to your Datalake.

At the top of the page, you will see a text input that allows you to search through your Datalake using our Data Query Language (DQL).

![](https://2100239547-files.gitbook.io/~/files/v0/b/gitbook-legacy-files/o/assets%2F-M78aoA9Gcxf25O1fNjm%2F-MbqowzDMLbwu8mqARK8%2F-MbuN5kTch625A9yZ8-D%2Fimage.png?alt=media\&token=c6ef611f-1c0d-4fd9-ab9d-e45e5ee3a1cc)

For example, if you want to find the images we just uploaded, you can search for the tag `new_tag` that we applied during upload, like so:

![](https://2100239547-files.gitbook.io/~/files/v0/b/gitbook-legacy-files/o/assets%2F-M78aoA9Gcxf25O1fNjm%2F-MbqowzDMLbwu8mqARK8%2F-MbuODOk6AU9sHSLDeJM%2Fimage.png?alt=media\&token=85832e03-a6dd-4c11-8ff6-74b8c851c836)

We can see that it shows only the 21 images we just uploaded!

![](https://2100239547-files.gitbook.io/~/files/v0/b/gitbook-legacy-files/o/assets%2F-M78aoA9Gcxf25O1fNjm%2F-Me-C4jQERgSYmpd01nm%2F-Me-CuVKBypVAinR2JDI%2Fimage.png?alt=media\&token=8a45ccf5-8474-4b37-91c1-b4cb06618625)
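Conceptually, a tag search like this one is a filter over the tags attached to each asset. The snippet below is a minimal sketch of that idea in plain Python. It is not DQL syntax and does not use the Picsellia SDK; the `Asset` class, the `filter_by_tag` helper, and the file names are hypothetical and exist only for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Asset:
    """Hypothetical stand-in for an image stored in a datalake."""
    filename: str
    tags: set = field(default_factory=set)


def filter_by_tag(assets, tag):
    """Return only the assets that carry the given tag, in their original order."""
    return [asset for asset in assets if tag in asset.tags]


# Made-up datalake content: two freshly uploaded images and an older one.
datalake = [
    Asset("cat_001.jpg", {"new_tag"}),
    Asset("cat_002.jpg", {"new_tag", "outdoor"}),
    Asset("old_scan.jpg", {"archive"}),
]

selected = filter_by_tag(datalake, "new_tag")
print([asset.filename for asset in selected])  # ['cat_001.jpg', 'cat_002.jpg']
```

This is why tagging at upload time is useful: the tag is the only thing you need to get the same set of images back later.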

Now that your images are filtered with your tag, you can click on the `Select All` button to select them:

![](https://2100239547-files.gitbook.io/~/files/v0/b/gitbook-legacy-files/o/assets%2F-M78aoA9Gcxf25O1fNjm%2F-MbudoITqNOWOshu1iwf%2F-MbuecmFcYGFXD6WjyHg%2Fimage.png?alt=media\&token=a8b61858-0fa9-4345-8df5-c059d950c085)

You now have access to several options:

![](https://2100239547-files.gitbook.io/~/files/v0/b/gitbook-legacy-files/o/assets%2F-M78aoA9Gcxf25O1fNjm%2F-MbudoITqNOWOshu1iwf%2F-MbuexVy7cm2ZZCsNkj9%2Fimage.png?alt=media\&token=93a230dd-1ac2-4067-8b77-199aeae0d593)

## Create a new Dataset

Now that we have selected some assets, we want to create a dataset that contains only them. To do this, click on the green **create dataset** button.

![](https://2100239547-files.gitbook.io/~/files/v0/b/gitbook-legacy-files/o/assets%2F-M78aoA9Gcxf25O1fNjm%2F-MbudoITqNOWOshu1iwf%2F-Mbuf7RUo3tmS6mvD7cV%2Fimage.png?alt=media\&token=af707b40-4dc2-4a2c-967a-bc074ddad5a4)

A new modal should appear that allows you to choose a name for your dataset and also write a short description.

![](https://2100239547-files.gitbook.io/~/files/v0/b/gitbook-legacy-files/o/assets%2F-M78aoA9Gcxf25O1fNjm%2F-Me-C4jQERgSYmpd01nm%2F-Me-D9FjO6dGT0GceDKJ%2Fimage.png?alt=media\&token=9928d1c5-6693-4b03-8cd4-4c0394d8bf9b)

When you have entered the desired information, please click on the green **Create** button.

{% hint style="info" %}
You should see a green message at the top of the screen saying that your dataset has been created successfully.
{% endhint %}

You are still in your Datalake. Click on the **All Datasets** link in the sidebar to see your brand new dataset.

![](https://2100239547-files.gitbook.io/~/files/v0/b/gitbook-legacy-files/o/assets%2F-M78aoA9Gcxf25O1fNjm%2F-MbudoITqNOWOshu1iwf%2F-MbufPKE4JoPBspFAUL8%2Fimage.png?alt=media\&token=fd046589-395a-4195-b1bd-e9c06d5d8a3a)

In your dataset list, click on the card corresponding to your new dataset.

![](https://2100239547-files.gitbook.io/~/files/v0/b/gitbook-legacy-files/o/assets%2F-M78aoA9Gcxf25O1fNjm%2F-Me-C4jQERgSYmpd01nm%2F-Me-DGs247C7T6Q10MQw%2Fimage.png?alt=media\&token=60bfba1f-7109-43eb-ab92-7df810c2534f)

Now that we have created the dataset, let's get to the next step.

## Set the labels for your dataset

Before annotating, we have to choose what this dataset will be used for and set up the labels. To do this, click on the `settings` tab, then click on `labels`.

You are now on the page to configure your labels. First, you need to select the type of dataset you want.

![](https://2100239547-files.gitbook.io/~/files/v0/b/gitbook-legacy-files/o/assets%2F-M78aoA9Gcxf25O1fNjm%2F-MSE7hbyknAwJy6m7a8d%2F-MSEHpGUEYrdOOJYuZka%2Fimage.png?alt=media\&token=28c2b1f6-2ff4-4463-aa11-6197d28842c3)

This restricts the tools available for annotation. For example, if you select **Object detection**, you will be able to annotate with bounding boxes only.

For this tutorial, let's choose **Object detection** as the type of our dataset.

You can now enter your labels. We will create three labels here, named **cat**, **bird**, and **person**.

![](https://2100239547-files.gitbook.io/~/files/v0/b/gitbook-legacy-files/o/assets%2F-M78aoA9Gcxf25O1fNjm%2F-Mc9or2xW9yQvwl7Oi-Z%2F-Mc9ptcLYqKmm2hxUIAv%2Fimage.png?alt=media\&token=d3373805-f3ae-4a93-93f9-df5daeed5c70)

Click on **Create Labels** to finish the setup. You will be redirected to your dataset, and you should now see the labels you just created displayed in the upper-middle card.
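If you later handle annotations in your own scripts, it can help to keep the same label set at hand and reject anything outside it. The following is a minimal sketch in plain Python, assuming the three labels from this tutorial. The `LABELS` constant and the `check_label` helper are hypothetical names and are not part of the platform or its SDK.

```python
# Labels configured in this tutorial for an object detection dataset.
LABELS = ("cat", "bird", "person")


def check_label(label, allowed=LABELS):
    """Return the label unchanged if it is configured, otherwise raise ValueError."""
    if label not in allowed:
        raise ValueError(f"unknown label {label!r}; expected one of {allowed}")
    return label


print(check_label("bird"))  # bird

try:
    check_label("dog")
except ValueError as error:
    print(error)
```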

## Annotate your images

Now that you have set up labels for this dataset, you should see green buttons in the last column of the table listing your images.

![](https://2100239547-files.gitbook.io/~/files/v0/b/gitbook-legacy-files/o/assets%2F-M78aoA9Gcxf25O1fNjm%2F-Me-C4jQERgSYmpd01nm%2F-Me-E6ZXHLxQY-Ng92gO%2Fimage.png?alt=media\&token=377cdfab-5d69-488a-be5f-4e5c8d0a4d7e)

Click on **Annotate** in the first row to start annotating your dataset.

To start drawing bounding boxes, click on the desired label in the upper-right of the interface.

Now **click and drag**: you will see your bounding box appear on the image. Release the left mouse button to finish the shape.

![](https://2100239547-files.gitbook.io/~/files/v0/b/gitbook-legacy-files/o/assets%2F-M78aoA9Gcxf25O1fNjm%2F-MW59V5EPtRJYrPw2fcY%2F-MW5Cq4hT18icOyks9SO%2Fimage.png?alt=media\&token=1cf095eb-2e59-4a54-afc2-11eb4ad37c68)

Congratulations! You've just annotated your first object 🎉
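To make the click-and-drag gesture concrete, here is a minimal sketch of how two mouse positions can be turned into a bounding box. It assumes pixel coordinates and an `(x_min, y_min, width, height)` layout, which is a common convention but not necessarily the one the platform uses internally. The `box_from_drag` function is a hypothetical helper.

```python
def box_from_drag(start, end):
    """Convert a drag from `start` to `end`, both (x, y) pixel tuples,
    into a box (x_min, y_min, width, height), whatever the drag direction."""
    (x0, y0), (x1, y1) = start, end
    x_min, y_min = min(x0, x1), min(y0, y1)
    return (x_min, y_min, abs(x1 - x0), abs(y1 - y0))


# Dragging from the top-right corner to the bottom-left corner
# gives the same box as dragging the other way around.
print(box_from_drag((120, 80), (40, 200)))  # (40, 80, 80, 120)
print(box_from_drag((40, 200), (120, 80)))  # (40, 80, 80, 120)
```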

Repeat the operation with as many shapes as you need. Once you are finished, click on the **Save** button on the right of the interface to validate your annotations.

![](https://2100239547-files.gitbook.io/~/files/v0/b/gitbook-legacy-files/o/assets%2F-M78aoA9Gcxf25O1fNjm%2F-MSE7hbyknAwJy6m7a8d%2F-MSEL5MWxXwpBoSFHcRm%2Fimage.png?alt=media\&token=b933c7d7-582f-45a0-bbb1-b061eceef6dc)

You will be automatically moved to the next image. Now that you know how to annotate one image and save your work, you can do it on as many images as you want.

Once you have annotated enough images, click on the **back to dataset** button in the upper-left of the screen.

## Review your annotations (optional)

You should now be on your dataset page. The table has changed and contains more information than before. Let's look at the first row.

![](https://2100239547-files.gitbook.io/~/files/v0/b/gitbook-legacy-files/o/assets%2F-M78aoA9Gcxf25O1fNjm%2F-Mc9or2xW9yQvwl7Oi-Z%2F-Mc9r72QDUZE1O38UWso%2Fimage.png?alt=media\&token=d2816c41-29e1-4c43-9ce6-61efdcd076a0)

Let's describe the new columns quickly:

* Instances: the objects annotated in the image.
* Annotations: the list of people who have annotated the image.

Now imagine that one of your colleagues has annotated all the images (yes, you can thank them). You want to check that their annotations are accurate so that they don't mess with your algorithm later.

This is what we call the review process. To start it, click on your profile picture thumbnail in the **Annotations** column of the first row.

![](https://2100239547-files.gitbook.io/~/files/v0/b/gitbook-legacy-files/o/assets%2F-M78aoA9Gcxf25O1fNjm%2F-Mc9or2xW9yQvwl7Oi-Z%2F-Mc9rmLvuXeZCVTL5Gmj%2Fimage.png?alt=media\&token=8a4e271c-3629-4051-8d3c-6b2cccae1258)

You are back in the interface used for annotation, but if you pay attention, you will see that some elements have changed.

On the left side, you should see something like this:

![](https://2100239547-files.gitbook.io/~/files/v0/b/gitbook-legacy-files/o/assets%2F-M78aoA9Gcxf25O1fNjm%2F-Mc9or2xW9yQvwl7Oi-Z%2F-Mc9rp32qWAO33jYpA1g%2Fimage.png?alt=media\&token=2a6e900a-eaff-4a43-8060-2948c977f37f)

This means that you are looking at an annotation that **you** made on **2021-06-14**. If someone else had made it, you would see their profile picture and username.

Now you will want to review all the annotated images and either **accept** or **reject** the annotations, so that they are flagged and you (or your annotator colleague) know which annotations to fix.

To do this, look at the bottom-right part of the screen. You should see two buttons:

![](https://2100239547-files.gitbook.io/~/files/v0/b/gitbook-legacy-files/o/assets%2F-M78aoA9Gcxf25O1fNjm%2F-MW59V5EPtRJYrPw2fcY%2F-MW5DgMz2aKtwsJGrExu%2Fimage.png?alt=media\&token=f90a7b4a-9349-47e7-bac9-ec23b8b5a51f)

If you think the annotation is OK, click on **accepted**. Otherwise, click on **rejected** so that you know to pay more attention to this image later.

{% hint style="info" %}
Once you click on **accepted** or **rejected**, you don't need to click on the Save button to validate. You can move to another image or leave the interface.
{% endhint %}
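The point of flagging is to end up with a short list of images that still need work. The sketch below illustrates that idea in plain Python, assuming each image ends up with one review status. The `ReviewStatus` enum (including its `PENDING` value), the `images_to_fix` helper, and the file names are hypothetical and do not come from the platform.

```python
from enum import Enum


class ReviewStatus(Enum):
    PENDING = "pending"    # assumed state for images not reviewed yet
    ACCEPTED = "accepted"
    REJECTED = "rejected"


def images_to_fix(reviews):
    """Return, in alphabetical order, the images whose annotation was rejected."""
    return sorted(
        name for name, status in reviews.items()
        if status is ReviewStatus.REJECTED
    )


# Made-up review results for three images.
reviews = {
    "img_003.jpg": ReviewStatus.REJECTED,
    "img_001.jpg": ReviewStatus.ACCEPTED,
    "img_002.jpg": ReviewStatus.REJECTED,
}

print(images_to_fix(reviews))  # ['img_002.jpg', 'img_003.jpg']
```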

When you have finished reviewing your images (if you chose to review them), you can go back to your dataset.

![](https://2100239547-files.gitbook.io/~/files/v0/b/gitbook-legacy-files/o/assets%2F-M78aoA9Gcxf25O1fNjm%2F-Mc9or2xW9yQvwl7Oi-Z%2F-Mc9rwMWpXjBgbDB11WZ%2Fimage.png?alt=media\&token=4982acc1-7e18-4a6f-98d7-488a3ddff130)

## Conclusion

Tadam, you now have a pixel-perfect annotated dataset ready for training.

If you want to use it as ground truth for your next training, follow the next tutorial, which guides you through all the steps from project creation to a trained model.
