[PictoBloxExtension]

Pose Classifier (ML)

Extension Description

Create ML models to classify poses into different classes.

Available in: Block Coding, Python Coding
Mode: Stage Mode
WiFi Required: No
Compatible Hardware in Block Coding: evive, Quarky, Arduino Uno, Arduino Mega, Arduino Nano, ESP32, T-Watch, Boffin, micro:bit, TECbits, LEGO EV3, LEGO Boost, LEGO WeDo 2.0, Go DFA, None
Compatible Hardware in Python: Quarky, None
Object Declaration in Python: Not Applicable
Extension Catergory: ML Environment

Introduction

Pose Classifier is the extension of the ML Environment is used for classifying different body poses into different classes.

The model works by analyzing your body position with the help of 17 data points.

Tutorial on using Image Classifier in Block Coding

Tutorial on using Image Classifier in Python Coding

Pose Classifier Workflow

Alert: The Machine Learning Environment for model creation is available in the only desktop version of PictoBlox for Windows, macOS, or Linux. It is not available in Web, Android, and iOS versions.

Follow the steps below:

Open PictoBlox and create a new file.
Select the coding environment as appropriate Coding Environment.
Select the “Open ML Environment” option under the “Files” tab to access the ML Environment.
You’ll be greeted with the following screen.
Click on “Create New Project“.
A window will open. Type in a project name of your choice and select the “Pose Classifier” extension. Click the “Create Project” button to open the Pose Classifier window.
You shall see the Pose Classifier workflow with two classes already made for you. Your environment is all set. Now it’s time to upload the data.

Class in Pose Classifier

Class is the category in which the Machine Learning model classifies the poses. Similar poses are put in one class.

There are 2 things that you have to provide in a class:

Class Name: It’s the name to which the class will be referred as.
Pose Data: This data can either be taken from the webcam or by uploading from local storage.

Note: You can add more classes to the projects using the Add Class button.

Adding Data to Class

You can perform the following operations to manipulate the data into a class.

Naming the Class: You can rename the class by clicking on the edit button.
Adding Data to the Class: You can add the data using the Webcam or by Uploading the files from the local folder.
1. Webcam:
  
  Note: You can edit the capture setting in the camera with the following. Hold to Record allows you to capture images with pose till the time button is pressed. Whereas when it is off you can set the start delay and duration of the sample collection.
  
  If you want to change your camera feed, you can do it from the webcam selector in the top right corner.
2. Upload Files: You can also add bulk images from the local system.
3. Upload Class from Folder: You can upload bulk classes with the images available in the appropriate folder structure. PictoBlox imports the class with the class name as the folder name and data from the image files inside the folder. This is helpful if you have to import multiple classes.
Deleting individual samples:
Delete all samples:
Enable or Disable Class: This option tells the model whether to consider the current class for the ML model or not. If disabled, the class will not appear in the ML model trained.
Delete Class: This option deletes the full class.

Note: You must add at least 20 samples to each of your classes for your model to train. More samples will lead to better results.

Training the Model

After data is added, it’s fit to be used in model training. In order to do this, we have to train the model. By training the model, we extract meaningful information from the pose, and that in turn updates the weights. Once these weights are saved, we can use our model to make predictions on data previously unseen.

However, before training the model, there are a few hyperparameters that you should be aware of. Click on the “Advanced” tab to view them.

Note: These hyperparameters can affect the accuracy of your model to a great extent. Experiment with them to find what works best for your data.

There are three hyperparameters you can play along with here:

Epochs– The total number of times your data will be fed through the training model. Therefore, in 10 epochs, the dataset will be fed through the training model 10 times. Increasing the number of epochs can often lead to better performance.
Batch Size– The size of the set of samples that will be used in one step. For example, if you have 160 data samples in your dataset, and you have a batch size of 16, each epoch will be completed in 160/16=10 steps. You’ll rarely need to alter this hyperparameter.
Learning Rate– It dictates the speed at which your model updates the weights after iterating through a step. Even small changes in this parameter can have a huge impact on the model performance. The usual range lies between 0.001 and 0.0001.

Note: Hover your mouse over the question mark next to the hyperparameters to see their description.

It’s a good idea to train a numeric classification model for a high number of epochs. The model can be trained in both JavaScript and Python. In order to choose between the two, click on the switch on top of the Training panel.

Alert: Dependencies must be downloaded to train the model in Python, JavaScript will be chosen by default.

The accuracy of the model should increase over time. The x-axis of the graph shows the epochs, and the y-axis represents the accuracy at the corresponding epoch. Remember, the higher the reading in the accuracy graph, the better the model. The x-axis of the graph shows the epochs, and the y-axis represents the corresponding accuracy. The range of the accuracy is 0 to 1.

Testing the Model

To test the model, simply enter the input values in the “Testing” panel and click on the “Predict” button.

The model will return the probability of the input belonging to the classes.

Export in Block Coding

Click on the “Export Model” button on the top right of the Testing box, and PictoBlox will load your model into the Block Coding Environment if you have opened the ML Environment in the Block Coding.

Export in Python Coding

Alert: For the model to work in Python Coding Environment the model is need to be trained in Python.

Click on the “Export Model” button on the top right of the Testing box, and PictoBlox will load your model into the Python Coding Environment if you have opened the ML Environment in Python Coding.

The following code appears in the Python Editor of the selected sprite.

####################imports####################
# Do not change

import numpy as np
import tensorflow as tf
import time

# Do not change
####################imports####################

#Following are the model and video capture configurations
# Do not change

model = tf.keras.models.load_model("num_model.h5",
                                   custom_objects=None,
                                   compile=True,
                                   options=None)
pose = Posenet()  # Initializing Posenet
pose.enablebox()  # Enabling video capture box
pose.video("on", 0)  # Taking video input
class_list = ['Goddess', 'Plank', 'Tree', 'Warrior']  # List of all the classes

# Do not change
###############################################

#This is the while loop block, computations happen here
# Do not change

while True:
  pose.analysecamera()  # Using Posenet to analyse pose
  coordinate_xy = []

  # for loop to iterate through 17 points of recognition
  for i in range(17):
    if (pose.x(i, 1) != "NULL" or pose.y(i, 1) != "NULL"):
      coordinate_xy.append(int(240 + float(pose.x(i, 1))))
      coordinate_xy.append(int(180 - float(pose.y(i, 1))))
    else:
      coordinate_xy.append(0)
      coordinate_xy.append(0)

  coordinate_xy_tensor = tf.expand_dims(
      coordinate_xy, 0)  # Expanding the dimension of the coordinate list
  predict = model.predict(
      coordinate_xy_tensor)  # Making an initial prediction using the model
  predict_index = np.argmax(predict[0],
                            axis=0)  # Generating index out of the prediction
  predicted_class = class_list[
      predict_index]  # Tallying the index with class list
  print(predicted_class)

Note: You can edit the code to add custom code according to your requirement.

PictoBlox Blocks

Controls the camera functionality, allowing users to turn the camera on or off and switch to other camera.

Starts the script whenever you press a specified button of the wizbot.

Calibrates wizbot sensors, ensuring accurate readings for line following

Switches the state of the wizbot to Grid mode.

The block takes the motor port, the direction of rotation (forward or reverse) and speed of rotation (between 0 to 100 %) as input from the user and rotates the motor accordingly.

evive has two tactile switches; this block checks if either of them is pressed. The switch whose state you want to check can be chosen from the drop-down menu on this block. It returns “true” if the switch is pressed and “false” if the switch is not pressed.

The block compares the latest string message in the terminal with the data input by the user in the block. If the data matches, it returns the true, else it returns false.

The block reports either the temperature or humidity (selected from the dropdown menu) from DHT sensor connected to the digital pin selected from the drop-down menu.

The block is used to draw characters and symbols on evive TFT Display. The matrix size for the block is 20 horizontally and 16 vertically.

This block defines the PWM pins to which each of the servos is connected.

This block defines the PWM pins to which all the four servos of legs(2 servos of legs + 2 servos of feet) are connected.

The blocks turn their sprite the specified amount of degrees clockwise. This changes the direction the sprite is facing.

The block gives its sprite a speech bubble with the specified text — the speech bubble stays until another speech or thought block is activated, or the stop sign is pressed.

The block will play the specified sound, with no pause to its script.

Blocks held inside this block will loop a given amount of times, before allowing the script to continue. If a decimal is put in, the number is rounded up.

Scripts placed underneath this block will activate when the specified key is pressed.

The block checks whether its sprite is touching a specified color. If it is, the block returns “true”.

The block checks if the first value is equal to the other value. If the values are equal, the block returns true; if not, false. This block is not case-sensitive.

The block will change the specified variable by a given amount.

The block enables or disables the automatic display of the box on the human pose or hand detection on the stage. This is useful when you want to see if the detection is happening or not.

The recognize () in image after () seconds block starts the camera and takes an image after the specified time and analyzes it. It then saves the image features in PictoBlox.

The function enables or disables the automatic display of the box on face detection on the stage.

This block is used to analyze the image received as input from the camera, for the handwritten and printed text.

When the block is executed, the recognition window will open and you will get a specified time during which PictoBlox will record whatever you say. Once recorded, the speech will be converted to the text of the language you spoke in and saved locally.

The block opens the recognition window and shows the machine learning analysis on the camera feed. Very good for visualization of the model in PictoBlox.

The block trains the NLP model with the data added with add () as () block.

The block enables or disables the automatic display of the box on object detection on the stage. This is useful when you want to see if the object detection happens during the analysis or not.

The block causes the text in the Text to Speech extension to be spoken using the pronunciation of the given language but does not translate the text.

The block returns the PictoBlox language of the current user. This block can be used with the translate () to () block, to translate to or from the end user’s set language.

All articles loaded

No more articles to load

Pose Classifier (ML)

Introduction

Tutorial on using Image Classifier in Block Coding

Tutorial on using Image Classifier in Python Coding

Pose Classifier Workflow

Class in Pose Classifier

Adding Data to Class

Training the Model

Testing the Model

Export in Block Coding

Export in Python Coding

PictoBlox Blocks

Block Coding Examples

Company

Community

Get in Touch

Follow Us

Dabble App - One App. Infinite Control.