ElephantRobotics

ElephantRobotics

What is the 4-axis robotic arm?
In the era of Industry 4.0, where information technology is being used to promote industrial change, robotic arms are essential in industry transformation. Automated robotic arms can reduce staff labor and increase productivity using automation technology combined with artificial intelligence, voice, and vision recognition. Robotic arms are now very relevant to our lives. Most robotic arms are built like human hands to perform more tasks such as grasping, pressing, and placing. The axes of a robotic arm represent degrees of freedom and independent movement, and most robotic arms have between two and seven axes. Here I will show you a four-axis palletizing robotic arm that is suitable for introductory learning.
What is the palletizing robotic arm?
Palletizing means neatly stacking items. Palletizing robotic arms grip, transfer, and stack items according to a fixed process.
https://www.youtube.com/watch?v=oXiIPEDNTF8
Which kind of robotic arm is more suitable? A 4-axis robotic arm? Or a 6-axis robotic arm?
Let's look at the table.
0_1668389561632_cec34f98-6f64-44b1-a890-4ce1fc7a3bd9-image.png
The 4-axis palletizing robotic arm can only move horizontally up and down, backward and forwards, left and right, with the end fixed towards the bottom. This is a significant limitation in terms of application and is mainly used in high-speed pick-and-place scenarios. Six-axis robotic arms are suitable for a wide range of designs and can move without dead space to reach any position within the field. We will mainly look at the four-axis palletizing robotic arm.
A video was made about the movement of two types of robotic arms.
https://www.youtube.com/watch?v=EuAIix7_D8g
myPalletizer 260 M5Stack
The myPalletizer robotic arm shown in the video, with M5Stack-ESP32 as the central control, is a fully wrapped lightweight 4-axis palletizing robotic arm with an overall finless design, small and compact, and easy to carry. The weight of myPalletizer is 960g, the payload is 250g, and the working radius is 260mm. I think it is designed for individual makers and educational use. With the multiple extension interfaces, we can learn machine vision with the AI Kit.
0_1668389663629_11ade7a8-3110-4a71-8b85-7ac5798806ee-image.png
Why would we recommend this arm as an introductory 4-axis palletizing robotic arm?
There are many four-axis (4DOF) robotic arms in industry, the mainstream being represented by palletizing robotic arms. Compared to 6-axis robotic arms, myPalletizer has a more straightforward structure, fewer joints, less stretching, faster reaction times, and faster-operating efficiency and is better to use than 6-axis robotic arms. It would be quite an excellent choice with palletizing robotic arms. Let's take a look at the myPalletizer 260-M5Stack parameter.
0_1668389714850_2b2d03cc-67ab-4208-a147-f302fe9f007f-image.png
The suitability of a robotic arm for learning requires several conditions.

The robotic arm must support multiple functions.
If this robotic arm has a mainstream structure, there will be many models of industrial robotic arms to provide a reference value.
Supporting documentation for the robotic arm is available and provides the user with basic operating instructions.

What can we learn with myPalletizer 260?
Robotics
When programming the robotic arm, we will learn about forward and inverse kinematics, DH model kinematics, Cartesian coordinate systems, motors and servos, motion mechanics, programming, machine vision, etc. Here is a brief introduction to what DH model kinematics is.
First, let's talk about forward kinematics and inverse kinematics.
Forward kinematics:
Determine the position and pose of the end effector given the values of the robot joint variables.
Inverse kinematics:
The values of the robot joint variables are determined according to the given position and attitude of the end effector.
DH Model Kinematics:
Mainly by constraining the position of the joint coordinate system, the transformation between the joint coordinate system and the coordinate system is disassembled into 4 steps, each step has only one variable/constant, thus reducing the difficulty of solving the inverse kinematics of the manipulator.
0_1668389815645_e4d19904-aa08-44e2-b9d0-046378832cc8-image.png
With a robotic arm, we can learn more about robotic armics.
Open Source Information
Elephant Robotics provides relevant information about myPalletizer in Gitbook. There are basic operation tutorials in mainstream programming languages, such as programming in python language, and a series of detailed introductions from the installation of the environment to the control of the robotic arm, providing beginners with a quick way to build and use the robotic arm.
0_1668389923892_6949dbf2-bc4d-4b7f-9f24-7afdcdc6aee6-image.png
Programming support
We can program the myPalletizer in Python, C++, C#, JavaScript, Arduino, and ROS, giving the user more options to control the myPalletizer.
0_1668392644283_b69bb958-19b9-481c-8ac1-962c1237cce6-image.png
More open source code on GitHub.
Artificial Intelligence Kit
We also provide an artificial intelligence kit, a robotic arm is not capable of human work, and we also need a pair of eyes (cameras) to recognize, the combination of the two can replace manual work. A camera just displays the picture it shoots, we need to program it to realize the method of color and object recognition. We used OpenCV and python to recognize and grab the color of wood blocks and recognize and grab objects.
Let's see how it works.
0_1668392579355_174964e2-63cd-4cb4-97dd-71a2fc8a0422-image.png
0_1668392676262_a28dbf31-2bb7-4d43-8c0d-484b56a5c8a2-image.png
The Artificial Intelligence Kit is designed to give us a better understanding of machine vision and machine learning. OpenCV is a powerful machine vision algorithm. If you want to learn more about the code, you can look up the project on GitHub.

Summary
myPalletizer is an excellent robotic arm for those just starting! I hope this article will help you choose your own robotic arm. If you still want to know more, feel free to comment below. If you enjoyed this article, please give us your support, and like us, your like is our motivation to update!

ElephantRobotics

Long time no see, I'm back.

I'll give a report on the recent progress of the facial recognition and tracking project. For those who are new, let me briefly introduce what I am working on. I am using a desktop six-axis robotic arm with a camera mounted on the end for facial recognition and tracking. The project consists of two modules: one for facial recognition, and the other for controlling the movement of the robotic arm. I've previously discussed how the basic movement of the robotic arm is controlled and how facial recognition is implemented, so I won't go into those details again. This report will focus on how the movement control module was completed."

Equipment

mechArm 270M5Stack, camera
alt text

Details of the equipment can be found in the previous article.

Motion control module

Next, I'll introduce the movement control module.

In the control module, the common input for movement control is the absolute position in Cartesian space. To obtain the absolute position, a camera and arm calibration algorithm, involving several unknown parameters, is needed. However, we skipped this step and chose to use relative displacement for movement control. This required designing a sampling movement mechanism to ensure that the face's offset is completely obtained in one control cycle and the tracking is implemented.

Therefore, to quickly present the entire function, I did not choose to use the hand-eye calibration algorithm to handle the relationship between the camera and arm. Because the workload of hand-eye calibration is quite large.

The code below shows how to obtain parameters from the information obtained by the facial recognition algorithm.

Code:

_, img = cap.read()
# Converted to grey scale
gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
# Detecting faces
faces = face_cascade.detectMultiScale(gray, 1.1, 4)
# Drawing the outline
for (x, y, w, h) in faces:
if w > 200 or w < 80:
#Limit the recognition width to between 80 and 200 pixels
continue
cv2.rectangle(img, (x, y), (x+w, y+h), (255, 0, 0), 3)
center_x = (x+w-x)//2+x
center_y = (y+h-y)//2+y
size_face = w

The obtained variables, center_x, center_y, and size_face, are used to calculate the position. Below is the code for the algorithm that processes the data to control the movement.

run_num = 20    
#Control cycle of 20 frames
if save_state == False:
# Save a start point (save_x, save_y)
save_x = center_x
save_y = center_y
save_z = size_face
origin_angles = mc.get_angles()
print("origin point = ", save_x, save_y, origin_angles)
time.sleep(2);
current_coords = mc.get_coords()
save_state = TRUE
else:
if run_count > run_num: # Limit the control period to 20 frames
run_count = 0
# Recording relative offsets
error_x = center_x - save_x
error_y = center_y - save_y
error_z = size_face - save_z
# Pixel differences are converted into actual offsets, which can be scaled and oriented
trace_1 = -error_x * 0.15
trace_z = -error_y * 0.5
trace_x = -error_z * 2.0
# x/z axis offset, note that this is open loop control
current_coords[2] += trace_z
current_coords[0] += trace_x
#Restricting the Cartesian space x\z range
if current_coords[0] < 70:
current_coords[0] = 70
if current_coords[0] > 150:
current_coords[0] = 150
if current_coords[2] < 220:
current_coords[2] = 220
if current_coords[2] > 280:
current_coords[2] = 280
# Inverse kinematic solutions
x = current_coords[0]
z = current_coords[2]
# print(x, z)
L1 = 100;
L3 = 96.5194;
x = x - 56.5;
z = z - 114;
cos_af = (L1*L1 + L3*L3 - (x*x + z*z))/(2*L1*L3);
cos_beta = (L1*L1 - L3*L3 + (x*x + z*z))/(2*L1*math.sqrt((x*x + z*z)));
reset = False
# The solution is only applicable to some poses, so there may be no solution
if abs(cos_af) > 1:
reset = True
if reset == True:
current_coords[2] -= trace_z
current_coords[0] -= trace_x
print("err = ",cos_af)
continue
af = math.acos(cos_af);
beta = math.acos(cos_beta);
theta2 = -(beta + math.atan(z/x) - math.pi/2);
theta3 = math.pi/2 - (af - math.atan(10/96));
theta5 = -theta3 - theta2;
cof = 57.295 #Curvature to angle
move_juge = False
# Limits the distance travelled, where trace_1 joint is in ° and trace_x/z is in mm
if abs(trace_1) > 1 and abs(trace_1) < 15:
move_juge = True
if abs(trace_z) > 10 and abs(trace_z) < 50:
move_juge = True
if abs(trace_x) > 25 and abs(trace_x) < 80:
move_juge = True
if (move_juge == True):
print("trace = ", trace_1, trace_z, trace_x)
origin_angles[0] += trace_1
origin_angles[1] = theta2*cof
origin_angles[2] = theta3*cof
origin_angles[4] = theta5*cof
mc.send_angles(origin_angles, 70)
else:
#Due to the open-loop control, if no displacement occurs the current coordinate value needs to be restored
current_coords[2] -= trace_z
current_coords[0] -= trace_x
else:
# 10 frames set aside for updating the camera coordinates at the end of the motion
if run_count < 10:
save_x = center_x
save_y = center_y
save_z = size_face
run_count += 1

In the algorithm module, after obtaining the relative displacement, how to move the arm? To ensure the movement effect, we did not directly use the coordinate movement interface provided by Mecharm, but instead added the inverse kinematics part in python. For the specific posture, we calculated the inverse solution of the robotic arm and transformed the coordinate movement into angle movement to avoid singular points and other factors that affect the Cartesian space movement. Combining the code of the facial recognition part, the entire project is completed.

Let's look at the results together.
https://youtu.be/dNdqrkggr9c

Normally, facial recognition has high computational requirements. Its algorithm mechanism repeatedly calculates adjacent pixels to increase recognition accuracy. We use MechArm 270-Pi, which uses a Raspberry Pi 4B as the processor for facial recognition. The computing power of the Raspberry Pi is 400MHZ. Due to the insufficient computing power of the Raspberry Pi, we simplified the process and changed the recognition mechanism to only a few times of fuzzy recognition. In our application, the background needs to be simpler."

Summary
The facial recognition and robotic arm tracking project is completed.

Key information about the project:

● In the case of low computing power, set a simple usage scenario to achieve smooth results

● Replace complex hand-eye calibration algorithms with relative position movement and use a sampling movement mechanism to ensure that the face's offset is completely obtained in one control cycle and the tracking is implemented.

● In python, added the inverse kinematics part, calculated the inverse solution of the robotic arm for specific postures, and converted the coordinate movement into angle movement to avoid singular points and other factors that affect the Cartesian space movement.

Some shortcomings of the project:

● There are certain requirements for the usage scenario, and a clean background is needed to run successfully (by fixing the scene, many parameters were simplified)

● As mentioned earlier, the computing power of the Raspberry Pi is insufficient, using other control boards, such as Jetson Nano (600MHZ) or high-performance image processing computers, would run smoother.

● Also, in the movement control module, because we did not do hand-eye calibration, only relative displacement can be used. The control is divided into "sampling stage" and "movement stage". Currently, it is preferable to require the lens to be stationary during sampling, but it is difficult to ensure that the lens is stationary, resulting in deviation in the coordinates when the lens is also moving during sampling.

Finally, I would like to specially thank Elephant Robotics for their help during the development of the project, which made it possible to complete it. The MechArm used in this project is a centrally symmetrical structured robotic arm with limitations in its joint movement. If the program is applied to a more flexible myCobot, the situation may be different.

If you have any questions about the project, please leave me a message below.

ElephantRobotics

Introduction

AI Kit (Artificial Intelligence) is mainly designed to provide a set of kits suitable for beginners and professionals to learn and apply artificial intelligence. It includes robotic arms(myCobot280-M5Stack,mechArm270-M5Stack,myPalletizer260-M5Stack) and related software, hardware, sensors, and other devices, as well as supporting tutorials and development tools. The AI Kit aims to help users better understand and apply artificial intelligence technology and provide them with opportunities for practice and innovation. The latest upgrade will further enhance the functionality and performance of AI Kit 2023, making it more suitable for various scenarios and needs, including education, scientific research, manufacturing, and more.
alt text

Product Description

AI Kit is an entry-level artificial intelligence kit that combines visual, positioning, grabbing, and automatic sorting modules in one. The kit is based on the Python programming language and enables control of robotic arms through software development. With the ROS robot operating system in the Ubuntu system, a real 1:1 scene simulation model is established, allowing for quick learning of fundamental artificial intelligence knowledge, inspiring innovative thinking, and promoting open-source creative culture. This open-source kit has transparent designs and algorithms that can be easily used for specialized training platforms, robotics education, robotics laboratories, or individual learning and use.
alt text
Why upgrade AI Kit 2023?
The answer to why we upgraded AI Kit 2023 is multifaceted. First, we collected extensive feedback from our users and incorporated their suggestions into the new release. The upgraded version enhances the functionality and performance of the AI Kit, making it more suitable for various scenarios and industries such as education, research, and manufacturing. The following are some of the reasons for this.

● Even with detailed installation instructions, installation environment setup for the AI Kit can still be challenging due to various reasons, causing inconvenience to users.

● The first generation of the AI Kit only has two recognition algorithms: color recognition and feature point recognition. We aim to provide a more diverse range of recognition algorithms.

● Due to the abundance of parts and complex device setups, the installation process of the AI Kit can be time-consuming and require a lot of adjustment.

Based on the above 3 points, we have begun optimizing and upgrading the AI Kit.

What aspects have been upgraded in AI Kit 2023?
Let’s take a look at a rough comparison table of the upgrades. alt text
The additions to the functionality can be divided into two main areas of improvement.
One is the software upgrades, and the other is the hardware upgrades.
Let’s start by looking at the hardware upgrades.

Hardware upgrades

alt text
The AI Kit 2023 has been upgraded in several aspects, as shown in the comparison table. The updated AI Kit has a clean and minimalist style with multiple hardware upgrades, including:

list itemAcrylic board: upgraded in hardness and material
list itemCamera: upgraded to higher resolution and added a lighting lamp
list item External material of the camera: upgraded from plastic to metal
list item Suction pump: adjusted to suitable power (not too strong or weak) and upgraded interface (old models require an additional power supply interface)
list item Arm base: strengthened the fixing of the arm to make the arm movement more stable
list itemBucket/parts box: smaller in size for easier carrying and installation
Here is a video of unboxing the AI Kit 2023.
video
The overall impression is still very good, let’s take a look at the software upgrades that have been made.

Software upgrades

● Optimization of environment setup: In the previous version of the AI Kit, it needed to run on the ROS development environment. Based on user feedback that installing Linux, ROS, and other environments was difficult, we have loaded the program directly onto the Python environment. Compared to setting up Python and ROS environments, the former can be easily achieved.

● Upgrade of program UI: The previous version had a one-click start UI interface, which did not provide users with much information (similar to simple operations such as booting up). In the AI Kit 2023 program, a brand new UI interface has been designed, which can give users a refreshing feeling in terms of both aesthetics and functionality. It not only provides users with convenient operation, but also helps users to have a clearer understanding of the operation of the entire program.
alt text
From the figure, we can see the features of connecting the robotic arm, opening the camera, selecting recognition algorithms, and automatic startup. These designs can help users better understand the AI Kit.

● Breakthroughs in recognition algorithms: In addition to the original color recognition and feature point recognition algorithms, the AI Kit has been expanded to include five recognition algorithms, which are color recognition, shape recognition, ArUco code recognition, feature point recognition, and YOLOv5 recognition. The first four recognition algorithms are based on the OpenCV open-source software library. YOLOv5 (You Only Look Once version 5) is a recent popular recognition algorithm and a target detection algorithm that has undergone extensive training.
alt text
The expansion of recognition algorithms is also intended to provide users with their own creative direction. Users can add other recognition algorithms to the existing AI Kit 2023.

Summary

The upgrade of the AI Kit 2023 has been a great success, thanks to extensive user feedback and product planning. This upgrade provides users with a better learning and practical experience, helping them to master AI technology more easily. The new AI Kit also introduces many new features and improvements, such as more accurate algorithms, more stable performance, and a more user-friendly interface. In summary, the upgrade of the AI Kit 2023 is a very successful improvement that will bring better learning and practical experiences and a wider range of application scenarios to more users.

In the future, we will continue to adhere to the principle of putting users first, continuously collect and listen to user feedback and needs, and further improve and optimize the AI Kit 2023 to better meet user needs and application scenarios. We believe that with continuous effort and innovation, the AI Kit 2023 will become an even better AI Kit, providing better learning and practical experiences for users and promoting the development and application of AI technology.

ElephantRobotics

Introduction

As a developer, I am currently involved in an interesting project to combine a SLAM (Simultaneous Localization and Mapping) car, myAGV, with a small six-axis robotic arm, myCobot 280 M5Stack, for research on logistics automation in education and scientific fields.

myAGV is a small car that can perform mapping and navigation and uses Raspberry Pi 4B as the controller. It can locate and move indoors and outdoors. MyCobot280 is a small collaborative robotic arm with six degrees of freedom that can accomplish various tasks in limited space.

My project goal is to integrate these two devices to achieve automated logistics transportation and placement. We plan to use open-source software and existing algorithms to achieve autonomous navigation, localization, mapping, object grasping, and placement functions. Through documenting the process in this article, we aim to share our journey in developing this project.

The equipment that I am using includes:

myAGV, a SLAM car that is capable of mapping and navigation.
alt text

myCobot280 M5Stack, a six-axis collaborative robotic arm with a complete API interface that can be controlled via Python. alt text

An adaptive gripper that can be mounted as an end effector with MyCobot280, which is capable of grasping objects. alt text

Development environment:

Ubuntu 18.04, Python 3.0+, ROS1.

Note: myAGV is controlled by Raspberry Pi 4B, and all environment configurations are based on the configurations provided on the Raspberry Pi.

Project

The picture below shows the general flow of this project. alt text

I split the function into one, a small part to be implemented independently and finally integrated together.

myAGV

Firstly, I am working on the functions of myAGV, to perform mapping and automated navigation. I am implementing these functions based on the information provided in the official Gitbook.

I am using the gmapping algorithm to perform mapping. Gmapping, also known as grid-based mapping, is a well-established algorithm for generating 2D maps of indoor environments. It works by building a grid map of the environment using laser range finder data, which can be obtained from the sensors mounted on myAGV. alt text

It's worth noting that I have tried myAGV in various scenarios, and the mapping performance is good when the environment is relatively clean. However, when the surrounding area is complex, the mapping results may not be as good. I will try to improve it by modifying the hardware or software in the future.

The picture below shows myAGV performing automatic navigation. alt text

During automatic navigation, myAGV still experiences deviations. Implementing navigation functionality is quite complex because the navigation conditions are quite strict. It is necessary to adjust the actual position of myAGV after enabling navigation and turn in place to determine if the position is correct. There are still many areas for improvement in navigation functionality, such as automatically locating the position of the small car on the map after enabling navigation, among other aspects.

myCobot 280

After handling the myAGV, the next step is to control the myCobot movement.

Here, I use Python to control myCobot 280. Python is an easy-to-use programming language, and myCobot's Python API is also quite comprehensive. Below, I will briefly introduce several methods in pymycobot.

time.sleep()
Function: Pause for a few seconds (the robotic arm needs a certain amount of time to complete its movement).
send_angles([angle_list], speed)
Function: Send the angle of each joint and the speed of operation to the robot arm.
set_gripper_value(value, speed)
Function: Controls the opening and closing of the jaws, 0 is closed 100 is open, 0 to 100 adjustable

Wrote a simple program to grab objects, see GIF demo.
alt text
Establishing communication
After dealing with the small functions, the next step is to establish communication between myCobot and myAGV.

The controller of myAGV is a Raspberry Pi, which is a micro-computer (with Ubuntu 18.04 system) that can be programmed on it.
MyCobot 280 M5Stack needs to be controlled by commands sent from a computer.

Based on the above conditions, there are two ways to establish communication between them:

Serial communication: directly connect them using a TypeC-USB data cable (the simplest and most direct method).
Wireless connection: myCobot supports WIFI control, and commands can be sent by entering the corresponding IP address (more complicated and communication is not stable).
Here, I choose to use serial communication and directly connect them with a data cable.

Here I recommend a software called VNC Viewer, which is a cross-platform remote control software. I use VNC to remotely control myAGV, which is very convenient because I don't have to carry a monitor around.

If you have any better remote control software, you can leave a comment below to recommend it to me.

Let's see how the overall operation works. alt text

Summary

In this project, only simple SLAM-related algorithms are used. The navigation algorithm needs to be further optimized to achieve more accurate navigation. As for the usage of myCobot, it is a relatively mature robotic arm with a convenient interface, and the end effectors provided by the Elephant Robotics can meet the requirements without the need to build a gripper for the project.

There are still many aspects of the project that need to be optimized, and I will continue to develop it in the future. Thank you for watching, and if you have any interest or questions, please feel free to leave a comment below.

ElephantRobotics

alt text
Introduction
Do you think this display is innovative and magical? Actually, this is a technology called holographic projection. Holographic technology has become a part of our daily lives, with applications covering multiple fields. In the entertainment industry, holographic technology is used in movie theaters, game arcades, and theme parks. Through holographic projection technology, viewers can enjoy more realistic visual effects, further enhancing their entertainment experience. In the medical field, holographic technology is widely used in medical diagnosis and surgery. By presenting high-resolution 3D images, doctors can observe the condition more accurately, improving the effectiveness of diagnosis and surgery. In the education field, holographic technology is used to create teaching materials and science exhibitions, helping students better understand and master knowledge. In addition, holographic technology is also applied in engineering manufacturing, safety monitoring, virtual reality and other fields, bringing more convenience and innovation to our lives. It is foreseeable that with the continuous development of technology and the continuous expansion of application scenarios, holographic technology will play a more important role in our future lives.

alt text
(Images from the internet)

The main content of this article is to describe how to use myCobot320 M5Stack 2022 and DSee-65X holographic projection equipment to achieve naked-eye 3D display.

This project is jointly developed by Elephant Robotics and DSeeLab Hologram.

DSee-65X holographic equipment:
We take a brief look at how the holographic influence is generated.The holographic screen is a display device that uses the technical principle of persistence of vision (POV) (after-image of moving objects) to achieve 3D holographic visual enhancement effect, air suspension, holographic stereo display effect by rotating imaging with ultra-high density LED light,break the limitation and boring of traditional flat display, real-time synchronization and interactive development can also be carried out, leading the new trend of commercial holographic display industry.

DSee-65X is a product of DSee Lab Hologram, a company that specializes in holographic technology.

DSee-65X: high resolution, high brightness, supports various content formats, WiFi connection, APP operation, cloud remote cluster control, unlimited splicing for large screen display, 30,000 hours of continuous operation.

Here is a video introduction of DSee-65X.
https://youtu.be/UDXlNgjwQ8c
alt text

myCobot 320 M5Stack 2022

myCobot 320 M5Stack is an upgraded version of the myCobot 280 product, mainly suitable for makers and researchers, and can be customized according to user needs through secondary development. It has three major advantages of usability, safety, and economy, with a sophisticated all-in-one design. The myCobot 320 weighs 3kg, has a payload of 1kg, a working radius of 350mm, and is relatively compact but powerful. It is easy to operate, can collaborate with humans, and work safely. The myCobot 320 2022 is equipped with a variety of interfaces and can quickly adapt to various usage scenarios.
alt text

Here is a video presentation of the myCobot 320 M5Stack 2022
https://youtu.be/B14BS6I-uS4

Introduction of the two devices complete, next step is to combine the holographic device with the robotic arm to work together. The operation of this project is very simple and can be divided into two steps:

Install the DSee-65X at the end of myCobot 320.
Control myCobot 320 to perform a beautiful trajectory to display the holographic image.

Project

Installation

DSee-65X and myCobot320 M5Stack 2022 are products from two different companies. When we received them, we found that we couldn't directly install the holographic device on the end of myCobot320. Therefore, we needed to modify the holographic device.

This is the structure at the end of myCobot320
alt text

This is the DSee-65X

alt text
According to the provided information, we added a board as a bridge between them for adaptation.

The maximum load of myCobot320 can reach up to 1kg, so this modification is completely feasible for it.

Controlling Robotics Arm
Our goal is to design a trajectory for the myCobot 320 robotic arm that ensures an unobstructed view of the hologram display.
alt text
The myCobot 320 has a rich interface and supports Python, C++, C#, JavaScript, Arduino, ROS, and more. Next, we will program it. Here we use a very easy-to-learn method. The method is to use myBlockly software for programming. myBlockly is a graphical programming software that allows code to be written by drag and drop.
alt text

The code in the picture is a graphic code for the trajectory of the myCobot 320.

myBlockly's underlying code is written in Python, so we can also directly use Python code to control the robotic arm. The following is an example of Python code:

import time
from pymycobot.mycobot import MyCobot

mc = MyCobot('/dev/ttyUSB0')
mc.set_speed(60)

# move to a home position
mc.send_angles([0, -90, 90, 0, 0, 0], 80)
time.sleep(1)

# move to a new position
mc.send_angles([0, -90, 90, 0, 0, 30], 80)
time.sleep(1)

# move to another position
mc.send_angles([0, -90, 90, 0, 30, 30], 80)
time.sleep(1)

# move to a final position
mc.send_angles([0, -90, 90, 0, 30, 0], 80)
time.sleep(1)

mc.release_all_servos()

Briefly explain how to use the DSee-65X.

DSee-65X has its own dedicated LAN. By connecting your computer to the same LAN, you can launch the software to make the holographic device work.
alt text

Summary

The whole process seems to be just a display of holographic imaging device with the robotic arm serving as a support. However, we can imagine more possibilities by using holographic projection technology to project 3D models or images into space and then capturing users' movements or gestures with sensors or cameras to control the robotic arm. For example, in manufacturing or logistics industries, combining robotic arms with holographic technology can achieve more efficient production and logistics operations. In the medical field, using robotic arms and holographic technology can achieve more precise surgery and treatment. In short, combining robotic arms and holographic technology can bring more intelligent and precise control and operation methods for various application scenarios, improving production efficiency and work quality.

These are all areas that require creative minds like yours to put in effort and develop! Please feel free to leave your ideas in the comments below and let's discuss together how to create more interesting projects.

ElephantRobotics

This article is primarily about introducing 3 robotic arms that are compatible with AI Kit. What are the differences between them?
If you have a robotic arm, what would you use it for? Simple control of the robotic arm to move it around? Repeat a certain trajectory? Or allow it to work in the industry to replace humans? With the advancement of technology, robots are frequently appearing around us, replacing us in dangerous jobs and serving humanity. Let's take a look at how robotic arms work in an industrial setting.
alt text

Introduction

what is AI Kit？

The AI Kit is an entry-level artificial intelligence Kit that integrates vision, positioning, grasping, and automatic sorting modules. Based on the Linux system and built-in ROS with a 1:1 simulation model, the AI Kit supports the control of the robotic arm through the development of software, allowing for a quick introduction to the basics of artificial intelligence.

alt text
Currently, AI Kit can achieve color and image recognition, automatic postioning and sorting. This Kit is very helpful for users who are new to robotic arms and machine vision, as it allows you to quickly understand how artificial intelligence projects are built and learn more about how machine vision works with robotic arms.

Next, let's briefly introduce the 3 robotic arms that are compatible with the AI Kit.
The AI Kit can be adapted for use with myPalletizer 260 M5Stack, myCobot 280 M5Stack and mechArm 270 M5Stack.All three robotic arms are equipped with the M5Stack-Basic and the ESP32-ATOM.

Robotic Arms

myPalletizer 260

myPalletizer260 is a lightweight 4-axis robotic arm, it is compact and easy to carry. The myPalletizer weighs 960g, has a 250g payload, and has a working radius of 260mm. It is explicitly designed for makers and educators and has rich expansion interfaces.
alt text

mechArm 270

mechArm 270 is a small 6-axis robotic arm with a center-symmetrical structure (like an industrial structure). The mechArm 270 weighs 1kg with a payload of 250g, and has a working radius of 270mm. As the most compact collaborative robot, mechArm is small but powerful.
alt text
myCobot 280
myCobot 280 is the smallest and lightest 6-axis collaborative robotic arm (UR structure) in the world, which can be customized according to user needs. The myCobot has a self-weight of 850g, an effective load of 250g, and an effective working radius of 280mm. It is small but powerful and can be used with various end effectors to adapt to various application scenarios, as well as support the development of software on multiple platforms to meet the needs of various scenarios, such as scientific research and education, smart home, and business pre R&D.
alt text
Let's watch a video to see how AI Kit works with these 3 robotic arms.
https://youtu.be/kgJeSbo9XE0

Project Description

The video shows the color recognition and intelligent sorting function, as well as the image recognition and intelligent sorting function. Let's briefly introduce how AI Kit is implemented (using the example of the color recognition and intelligent sorting function).

This artificial intelligence project mainly uses two modules:

●Vision processing module

●Computation module (handles the conversion between eye to hand)

Vision processing module

OpenCV (Open Source Computer Vision) is an open-source computer vision library used to develop computer vision applications. OpenCV includes a large number of functions and algorithms for image processing, video analysis, deep learning based object detection and recognition, and more.

We use OpenCV to process images. The video from the camera is processed to obtain information from the video such as color, image, and the plane coordinates (x, y) in the video. The obtained information is then passed to the processor for further processing.
Here is part of the code to process the image (colour recognition)

# detect cube color
def color_detect(self, img):
# set the arrangement of color'HSV
x = y = 0
gs_img = cv2.GaussianBlur(img, (3, 3), 0) # Gaussian blur
# transfrom the img to model of gray
hsv = cv2.cvtColor(gs_img, cv2.COLOR_BGR2HSV)
for mycolor, item in self.HSV.items():
redLower = np.array(item[0])
redUpper = np.array(item[1])
# wipe off all color expect color in range
mask = cv2.inRange(hsv, item[0], item[1])
# a etching operation on a picture to remove edge roughness
erosion = cv2.erode(mask, np.ones((1, 1), np.uint8), iterations=2)
# the image for expansion operation, its role is to deepen the color depth in the picture
dilation = cv2.dilate(erosion, np.ones(
(1, 1), np.uint8), iterations=2)
# adds pixels to the image
target = cv2.bitwise_and(img, img, mask=dilation)
# the filtered image is transformed into a binary image and placed in binary
ret, binary = cv2.threshold(dilation, 127, 255, cv2.THRESH_BINARY)
# get the contour coordinates of the image, where contours is the coordinate value, here only the contour is detected
contours, hierarchy = cv2.findContours(
dilation, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)
if len(contours) > 0:
# do something about misidentification
boxes = [
box
for box in [cv2.boundingRect(c) for c in contours]
if min(img.shape[0], img.shape[1]) / 10
< min(box[2], box[3])
< min(img.shape[0], img.shape[1]) / 1
]
if boxes:
for box in boxes:
x, y, w, h = box
# find the largest object that fits the requirements
c = max(contours, key=cv2.contourArea)
# get the lower left and upper right points of the positioning object
x, y, w, h = cv2.boundingRect(c)
# locate the target by drawing rectangle
cv2.rectangle(img, (x, y), (x+w, y+h), (153, 153, 0), 2)
# calculate the rectangle center
x, y = (x*2+w)/2, (y*2+h)/2
# calculate the real coordinates of mycobot relative to the target
if mycolor == "red":
self.color = 0
elif mycolor == "green":
self.color = 1
elif mycolor == "cyan" or mycolor == "blue":
self.color = 2
else:
self.color = 3
if abs(x) + abs(y) > 0:
return x, y
else:
return None

Just obtaining image information is not enough, we must process the obtained data and pass it on to the robotic arm to execute commands. This is where the computation module comes in.

Computation module

NumPy (Numerical Python) is an open-source Python library mainly used for mathematical calculations. NumPy provides many functions and algorithms for scientific calculations, including matrix operations, linear algebra, random number generation, Fourier transform, and more. We need to process the coordinates on the image and convert them to real coordinates, a specialized term called eye to hand. We use Python and the NumPy computation library to calculate our coordinates and send them to the robotic arm to perform sorting.

Here is part of the code for the computation.

while cv2.waitKey(1) < 0:
# read camera
_, frame = cap.read()
# deal img
frame = detect.transform_frame(frame)
if _init_ > 0:
_init_ -= 1
continue
# calculate the parameters of camera clipping
if init_num < 20:
if detect.get_calculate_params(frame) is None:
cv2.imshow("figure", frame)
continue
else:
x1, x2, y1, y2 = detect.get_calculate_params(frame)
detect.draw_marker(frame, x1, y1)
detect.draw_marker(frame, x2, y2)
detect.sum_x1 += x1
detect.sum_x2 += x2
detect.sum_y1 += y1
detect.sum_y2 += y2
init_num += 1
continue
elif init_num == 20:
detect.set_cut_params(
(detect.sum_x1)/20.0,
(detect.sum_y1)/20.0,
(detect.sum_x2)/20.0,
(detect.sum_y2)/20.0,
)
detect.sum_x1 = detect.sum_x2 = detect.sum_y1 = detect.sum_y2 = 0
init_num += 1
continue
# calculate params of the coords between cube and mycobot
if nparams < 10:
if detect.get_calculate_params(frame) is None:
cv2.imshow("figure", frame)
continue
else:
x1, x2, y1, y2 = detect.get_calculate_params(frame)
detect.draw_marker(frame, x1, y1)
detect.draw_marker(frame, x2, y2)
detect.sum_x1 += x1
detect.sum_x2 += x2
detect.sum_y1 += y1
detect.sum_y2 += y2
nparams += 1
continue
elif nparams == 10:
nparams += 1
# calculate and set params of calculating real coord between cube and mycobot
detect.set_params(
(detect.sum_x1+detect.sum_x2)/20.0,
(detect.sum_y1+detect.sum_y2)/20.0,
abs(detect.sum_x1-detect.sum_x2)/10.0 +
abs(detect.sum_y1-detect.sum_y2)/10.0
)
print ("ok")
continue
# get detect result
detect_result = detect.color_detect(frame)
if detect_result is None:
cv2.imshow("figure", frame)
continue
else:
x, y = detect_result
# calculate real coord between cube and mycobot
real_x, real_y = detect.get_position(x, y)
if num == 20:
detect.pub_marker(real_sx/20.0/1000.0, real_sy/20.0/1000.0)
detect.decide_move(real_sx/20.0, real_sy/20.0, detect.color)
num = real_sx = real_sy = 0
else:
num += 1
real_sy += real_y
real_sx += real_x

The AI Kit project is open source and can be found on GitHub.

Difference

After comparing the video, content, and code of the program, it appears that the 3 robotic arms have the same framework and only need minor modifications to the data to run successfully.

There are roughly two main differences between these 3 robotic arms.

One is comparing the 4- and 6-axis robotic arms in terms of their practical differences in use (comparing myPalletizer to mechArm/myCobot).

Let's look at a comparison between a 4-axis robotic arm and a 6-axis robotic arm.
alt text
From the video, we can see that both the 4-axis and 6-axis robotic arms have a sufficient range of motion in the AI Kit's work area. The main difference between them is that myPalletizer has a simple and quick start process with only 4 joints in motion, allowing it to efficiently and steadily perform tasks, while myCobot requires 6 joints, two more than myPalletizer, resulting in more calculations in the program and a longer start time (in small scenarios).

In summary, when the scene is fixed, we can consider the working range of the robotic arm as the first priority when choosing a robotic arm. Among the robotic arms that meet the working range, efficiency and stability will be necessary conditions. If there is an industrial scene similar to our AI Kit, a 4-axis robotic arm will be the first choice. Of course, a 6-axis robotic arm can operate in a larger space and can perform more complex movements. They can rotate in space, while a 4-axis robotic arm cannot do this. Therefore, 6-axis robotic arms are generally more suitable for industrial applications that require precise operation and complex movement.

alt text
The second thing is that both are 6-axis robotic arms, and their main difference is the structure. mechArm is a centralized symmetrical structure robotic arm, and myCobot is a UR structure collaborative robotic arm. We can compare the differences between these two structures in actual application scenarios.

Here are the specifications of the two robotic arms.
alt text
The difference in structure between these two leads to a difference in their range of motion. Taking mechArm as an example, the centrally symmetrical structure of the robotic arm is composed of 3 pairs of opposing joints, with the movement direction of each pair of joints being opposite. This type of robotic arm has good balance and can offset the torque between joints, keeping the arm stable.
alt text
Shown in the video, mechArm is also relatively stable in operation.

You may now question, is myCobot not useful then? Of course not, the UR structure robot arm is more flexible and can achieve a larger range of motion, suitable for larger application scenarios. myCobot's more important point is that it is a collaborative robot arm, it has good human-robot interaction ability and can collaborate with humans for work. 6-axis collaborative robot arms are usually used in logistics and assembly work on production lines, as well as in medical, research, and education fields.

Summary

As stated at the beginning, the difference between these 3 robotic arms included in the AI Kit is essentially how to choose a suitable robotic arm to use. If you are choosing a robotic arm for a specific application, you will need to take into consideration factors such as the working radius of the arm, the environment in which it will be used, and the load capacity of the arm.

If you are looking to learn about robotic arm technology, you can choose a mainstream robotic arm currently available on the market to learn from. MyPalletizer is designed based on a palletizing robotic arm, mainly used for palletizing and handling goods on pallets. mechArm is designed based on a mainstream industrial robotic arm, which has a special structure that keeps the arm stable during operation. myCobot is designed based on a collaborative robotic arm, which is a popular arm structure in recent years, capable of working with humans and providing human strength and precision.

That's all for this post, if you like this post, please leave us a comment and a like!

We have published an article detailing the differences between mechArm and myCobot.Please click on the link if you are interested in learning more.

ElephantRobotics

Program： Map creation and automatic navigation with myAGV
Equipment：
1 myAGV：
myAGV is an autonomous navigation smart vehicle from Elephant Robotics. It uses the competition-level Mecanum wheels and a full wrap design with a metal frame. There are two built-in slam algorithms to meet the learning of mapping and navigation directions.

2 PC：
A normal computer.

The autonomous robot positioning and navigation technology include two parts: position&map creation (SLAM), and path planning&motion control. SLAM only completes the robot's positioning and map creation.

The main solution for implementing localization and navigation technology is SLAM, path planning, and motion control.

The process that a robot describes the environment and recognizes the environment mainly depends on the map. It uses environmental maps to describe its current environmental information and adopts different forms of map descriptions with the algorithm and sensor differences used.
We use the gmapping algorithm in the SLAM algorithm. The gmapping algorithm builds the map based on the raster map.
Gmapping：
Gmapping is a SLAM algorithm based on 2D LiDAR using RBPF (Rao-Blackwellized Particle Filters) algorithm to complete 2D raster map construction.
Advantages:
gmapping can build indoor environment maps in real-time, with less computation in small scenes and higher map accuracy, and requires less LIDAR scanning frequency.
Disadvantages:
As the environment grows larger, the memory and computation required to build the map become huge, so gmapping is unsuitable for large scene composition. An intuitive feeling is that for a range of 200x200 meters, if the raster resolution is 5cm and each raster occupies one byte of memory, then each particle carries 16M of RAM for the map, and if it is 100 particles, it is 1.6G of RAM.
Raster Map：
The most common way for robots to describe the environment is Grid Map or Occupancy Map, which divides the environment into a series of grids, where each grid is given a possible value indicating the probability that the grid will be occupied.

Start the project:
Because myAGV has a built-in Raspberry Pi computer, controlling myAGV requires leaving the keyboard and mouse, and monitor, so a computer is needed to control myAGV's computer. We use the VNC remote control.
VNC:
VNC (Virtual Network Console) stands for Virtual Network Console. It is an excellent remote control tool software developed by the famous AT&T's European research labs.VNC is a free open source software based on UNIX and Linux operating systems with powerful remote control capabilities, efficient and practical, and its performance is comparable to any remote control software in Windows and MAC.

Place the myAGV on a horizontal surface.

The launch file will start the odometer and IMU sensor of myAGV.
Enter the command in the terminal

roslaunch myagv_odometry myagv_active.launch

0_1664506614777_建图1.png
After turning on the odometer and IMU sensors, we then turn on the radar and gmapping algorithms to start the map creation.
Enter the command in the terminal:

roslaunch myagv_navigation myagv_slam_laser.launch

0_1664506673304_建图2.png
This is the page we just started, then we move myAGV and we can draw the map out. To control myAGV movement, Elephant Robotics gives us keyboard control.

Saved the map.
Enter the command in the terminal

rosrun map_server map_saver

0_1664506764839_建图3.png
The next step is to make myAGV able to navigate automatically on the map, where myAGV can avoid obstacles to its destination (navigation) automatically by clicking on it.
First, we load the map and modify the path into our startup file.
Path planning + motion control:
Movement planning is a big concept. For example, the movement of robotic arms, the flight of vehicles, and the path planning of myAGVs we are talking about here, all are in motion planning.
Let's talk about motion planning for these types of wheeled robots. The basic capability required here is path planning, that is, the ability to perform what is generally called target point navigation after completing SLAM. In short, it means planning a path from point A to point B, and then make the robot move over it.

1. Global Planning.
To achieve this process, motion planning has to implement at least two levels of modules, and one is called global planning, which is a bit like our car navigator. It needs to pre-plan a route on the map and also the current robot's position. This is provided by our SLAM system. The industry typically uses an algorithm called A* to implement this process, which is a heuristic search algorithm that is excellent. It is mostly used in games, such as real-time strategy games like Starcraft and Warcraft, which use this algorithm to calculate the movement trajectory of units.

2. Partial Planning
Of course, just planning the path is not enough. There are many unexpected situations in reality. For example, a small child is in the way, so the original path needs to be adjusted. Of course, sometimes, this adjustment does not require a recalculation of the global path, and the robot may be able to make a slight detour. In this case, we need another level of planning called local planning. It may not know where the robot will end up, but it is particularly good at getting around the obstacles in front of it.
Next, we start the program and open the saved maps and the autopilot function.
Enter the command in the terminal

roslaunch myagv_navigation navigation_active.launch

Use keyboard control to make myAGV rotate in place for positioning. After the positioning is completed and the point cloud converges, proceed to the next navigation step.
0_1664507046384_建图4.jpg
Click "2D Nav Goal" on the top, click the point on the map you want to reach, and myAGV will set off towards the target point. You can also see a planned path of myAGV between the starting point and the target point in RVIZ, and myAGV will move along the route to the target point.
0_1664507107724_建图5.jpg
This is the end of the project
Summary:
The navigation demonstrated at present is only a relatively basic situation. There are many mobile robots on the market, such as sweeping robots. It needs to plan paths according to different environments, which is more complicated. For the problem of path planning in different environments, there is a unique called space coverage, which also proposes a lot of algorithms and theories.
There is still a lot of work to be done to navigate the SLAM algorithm in the future.
If you have good ideas to express welcome to discuss with us in the comments below ~
About Elephantrobotics:
Home
GitHub
Gitbook for myAGV

ElephantRobotics

It looks great.Looking forward to your subsequent and more exciting projects about mechArm.

ElephantRobotics

Background：

When it comes to the robotic arm, the first reaction of most people is the industrial robotic arm is doing the work of the assembly line in the factory, but it is not. The robotic arm is like an accurate arm. The robotic arm can perform a number of works in daily life. For example, it can latte art, playing chess with human beings, restaurant ushers, massage, ultrasound, drawing, etc. The robotic arm has long been integrated into our daily life and will appear in various forms in the future.

Recently, I’ve seen a series of videos on Youtube and Twitter about writing machines and machine drawing, so I thought I’d use the myCobot Pro 600 around me to try it out and see if I could draw with a robotic arm.

What is myCobot Pro 600？

myCobot Pro 600 is a robotic arm developed for education and commercial use with a Raspberry Pi microprocessor and embedded RoboFlow visual programming primitives from Elephant Robotics.myCobot Pro 600 uses an industrial grade servo, which is comparable to an industrial robotic arm. It is very good in this aspect of stability and can be a painter.
（What is RobotFlow is described below.）
0_1665741802296_111.1.jpg

Plan：

The first step is to get the outline of a photo, transform it to get the Cartesian coordinates of the outline, transfer it to the robotic arm to execute along the path, and then the robotic arm will be able to draw.
Projects

Key points:

1 Get the path/contour map of the image and convert it to Cartesian coordinates.

2 The recognition of the pen up point and down point positions in the contour of the image.

Projects:

1 Inkscape
Choose software that can draw graphics and can convert images into outlines/paths. Here we recommend Inkscape, which allows us to develop plugins on its software.

Inkscape is a free and open source vector graphics editing software, and can fully comply with and support XML, SVG and CSS, and other open standard formats. It also has cross-platform support for multiple operating systems, windows, macOS, Linux, UNIX, etc.
Inkscape

2 Unicorn
Unicorn is a lightweight multi-platform, multi-architecture CPU emulator framework.

We used unicorn simulation to write a myCobot Cartesian coordinate transformation to combine the path/contour of the graph to simulate and finally generate the NGC file.
Contents of an NGC file

Elephant Robotics)
G21 G94 G64 G40 (metric ftw)
G90 (absolute mode)
G92.2
G4 P1 (wait 1s)
G38.3F1000X38.18 Y-156.72 Z-60.01 A-121.90 B-31.55 C99.32
G4 P1 (wait 1s)
G01F1000X-505.03 Y-177.45 Z61.41 A67.37 B60.32 C-132.00
G4 P1 (wait 1s)
G92 X0 Y0 Z0 A0 B0 C0
G4 P1 (wait 1s)
G01 X0 Y0 Z20.00 A0 B0 C0
G4 P1 (wait 1s)

G0 Z0 (pen down)
G4 P1 (wait 1ms)
G0 Z10 (pen up)
G4 P1 (wait 1ms)

(Polyline consisting of 97 segments.)
G1 X145.82 Y229.44 F2000.00
G0 Z0.00 (pen down)
G4 P1 (wait 1ms)
G1 X151.53 Y225.73 F2000.00
G1 X157.58 Y220.95 F2000.00
...

(unicorn is an open source project, if you are interested， please refer to github for more)

3 RobotFlow
RobotFlow is an operating system for collaborative robotic arms developed specifically for Elephant Robotics with a user-friendly UI. It allows users to realize its functions through simple operations even if they do not know the underlying principles well. Finally, the NGC file generated from the image, transferred to RobotFlow and run, can realize the robotic arm drawing.
myCobot pro 600 painting
0_1665741537235_myCobot pro 600 painting.png

Process

The project went as planned, but some problems arose.
video：
https://www.youtube.com/shorts/W4xHpRlfpOs

Problems:

1 Error in the NGC file when the outline/path of the image is generated.

2 When drawing, some places were not drawn. The reason for this is attributed to the end unit, as the trajectory of the run is fixed, too much drop in the run to some places will cause the pen to be crushed.

3 Some coordinates of myCobot exceed the limit and cannot be reached, so the program cannot proceed.

Solutions:

1 Select the picture as much as possible, the outline is a relatively clear picture, so that the generated NGC file will be able to achieve the drawing.

2 Make the end device flexible so that it will not be damaged even when the pen is pressed.

3 When setting the initial position, try to set the initial position with a control range from the limit.

Show video：
https://www.youtube.com/watch?v=QNvH5mAz4wU

Summary:

For some related information (plugins, code) in the above article, you can leave a comment below and I will share it with you. If you have good ideas to express welcome to discuss with us in the comments below ~

More information：
Home | Elephantrobotics
Gitbook | Elephantrobotics
GitHub | Open-Source

ElephantRobotics

Background

With the development of modern industry and the advancement of science and technology, people's requirements for industry, medical care, and service levels continue to increase. Single-arm robots cannot meet the requirements. Dual-arm robots should be used to meet the needs of complexity, intelligence, and flexibility of tasks. And live. The dual-arm robot is not a simple combination of two robotic arms. In addition to their respective control goals, they also need to satisfy mutual coordinated control and adaptability to the environment. This high complexity makes the operation of dual-arm robots more demanding—advanced integrated systems, high-level planning and reasoning, and adjustable control methods.
Dual-arm collaborative robots are the inevitable trend in the future of robotics.

Introduction

myBuddy 280 is Elephant Robotics' first dual-arm collaborative robot, powered by Raspberry Pi, and is a service robot - a dual-arm 13-axis humanoid collaborative robot. myBuddy 280 has a single arm with a working radius of 280mm and a maximum payload of 250g. It has a 7" interactive display and 2-megapixel HD cameras. It can be adapted to the needs of different applications.
alt text

Functions

Excellent algorithm control

Dual-arm robots have more apparent advantages over single-armed robots. A dual-arm robot can operate a single-armed double simultaneously, with higher total power, or it can reach two different positions simultaneously for separate operations, or even multiple robots can physically achieve object transfer. The trajectory of a robotic arm is ultimately single and requires human optimization to design algorithms for optimal trajectory calculation. This approach is quite complex to implement because of several factors, such as redundant kinematics, collision avoidance, unclear possibilities for performing tasks, complex objective functions, etc.
With superior algorithms, myBuddy 280 can respond to commands as fast as 30ms, and with anti-collision detection, it can work safely with people.

A more complete secondary development environment

Ultra-complete python control interface

■ Provides 100+ control interfaces for secondary application development or self-interference algorithm research.
■ Open interfaces for joint angle, speed control, and robot coordinate control make management more accessible and user-friendly.
■ Supports separate controls for left and right arm and waist, allowing more control at your fingertips.
■ Programming examples are provided to enable rapid deployment of scenario applications

       Sends a single joint angle to the robot arm.
       send_angle(id, joint, angle, speed)
       id - 1/2/3 (left arm/right arm/waist)
       joint - 1~ 6 (Corresponding to each joint)
       angle - (-180 ~ 180)Different angles have different limits, please check the product parameters for details
       speed – 1 ~ 100 (The higher the value, the faster the arm is moving)
       # Get the angle of a single joint
       get_angle(id, joint_id)
       id - 1/2/3 (left arm/right arm/waist)
       joint_id - 1~7 (7 is grapper)
       # Sending the arcs of all joints of the specified robot arm to the arm
       send_radians(id, radians, speed)
       id – 1/2（left arm/right arm）。
       radians – The radian values are stored as a list
       （List[float]），The length of the list is 6
       speed - 0 ~ 100(The higher the value, the faster the arm is moving)
       # There are many more functions, here is an example of their use
       from pymycobot.mybuddy import MyBuddy
       import time
        #MyBuddy('port',baud)
        mc = MyBuddy("/dev/ttyACM0",115200)
        # Send angles to the six joints of the left arm
       mc.send_angles(1, [0, 0, 0, 0, 0, 0], 50)
       time.sleep(3)
       # Send the angle to the first joint of the right arm
       mc.send_angle(2, 1, 90, 50)
       time.sleep(2)

code on GitHub

ROS robot control system support

■ With RVIZ, RVIZ can display images, models, paths, and other information, complete with visual rendering, making it easier for developers to understand the meaning of the data.

■ With MoveIt, among other things, motion planning, collision detection, kinematics, 3D perception, and manipulation control. When users develop paths and encounter different situations that require constraints, the functions of MoveIt can be helpful.
Self-developed software support
■ myBlockly: myBlockly is visual modular programming software that belongs to the graphic programming language. Like Scratch, it is an excellent software for getting started with myBuddy 280 quickly.

■myStudio: myStudio is a one-stop platform for the use of robotic arms. It offers firmware updates, driver installation, and tutorials on how to use the robot arm.
Configuration
■With 13 high-performance brushless DC servos, a seven-inch interactive display can be used for image display and touch control.
■Two built-in 2-megapixel and OpenCV compiled environments for rapid deployment of machine vision development.
■The LEGO end unit interface allows users to use 3D-printed accessories for various scenarios.

Summary

Dual-arm collaborative robots will dominate the future robotics landscape, and you could be designing more creative projects with myBuddy 280! Please leave your comments below and share them with us to start the journey of dual-arm collaborative robots!
Learn more about us:
Home | Elephant Robotics
GitHub | Elephant Robotics
Shop | Elephant Robotics

ElephantRobotics

we enable the AI to learn from provided texts, allowing it to automatically write code and execute tasks to control myCobot.

What's AI Agent？

An AI Agent is an intelligent entity capable of perceiving its environment, making decisions, and executing actions. Its functionality is based on large language model (LLM). However, unlike directly conversing with an LLM, an AI Agent can independently think, utilize tools, and gradually complete given tasks. Depending on the developer's skill level, it can accomplish various specialized tasks.

In this case study, we will build a simple single-agent system based on the DeepSeek large language model. The execution logic is straightforward and consists of the following steps: Definition + Observation + Thinking + Action + Memory.

Since an offline LLM lacks internet retrieval capabilities, it requires data input for learning. This ensures that whenever we activate the Agent, it is already prepared to assume its designated role and has sufficient knowledge to answer our queries. During usage, it will record user-approved responses, store them in a database, and continue learning from them.

myCobot 280 M5 Stack

The myCobot 280 series, created by Elephant Robotcs, represent a line of 6 DOF collaborative robot arms designed primarily for personal DIY projects, education and research applications. The myCobot 280 , equipped with a M5 Stack as its control board, it has full Python API for easily control, which is designed to be user-friendly and easy for beginners to learn and use.

Project Setup

1. Provide Knowledge Input

To enable the Agent to function effectively, we need to create a knowledge base and input relevant information.

We save the following information as separate DOCX files:

● Introduction to myCobot

● Technical details of 6 DOF collaborative robotic arms

● Usage instructions for the pymycobot API function library

(These resources can be found on myCobot’s GitBook.)

For example :

Save the reference text as '.docx' documents.

2. Load the DeepSeek Model

Before that, you need to purchase your own API_key from the official website of DeepSeek.

Then, we need to load the knowledge into the DeepSeek model via code.

import os
from docx import Document
 
from openai import OpenAI
 
def extract_text_from_word(doc_path):
    """Extract From Word (.docx) """
    doc = Document(doc_path)
    return "\n".join([para.text for para in doc.paragraphs])
 
 
def load_local_documents(directory):
    """Read Word """
    texts = []
    for filename in os.listdir(directory):
        if filename.endswith(".docx"):
            file_path = os.path.join(directory, filename)
            text = extract_text_from_word(file_path)
            texts.append(text)
    return texts
 
word_documents = load_local_documents("E:\MyCode\Agent_Deepseek\RobotData")
context = "\n".join(word_documents)  # Merge all text
client = OpenAI(
    api_key="xxxx {Your API}",
    base_url="https://api.deepseek.com"
)
 
query = ""
 
completion = client.chat.completions.create(
    model="deepseek-chat",
    temperature=0.6,
    messages=[
        {"role": "system", "content": "You are  mainly researching Python tasks for collaborative robotic arms. You are familiar with and proficient in the Python language, and can utilize the 'pymycobot' robotic API interface to provide a complete Python code that can be used."},
        {"role": "user", "content": f" Word Reference：\n{context}\n\n：{query}"}
    ]
)
 
print(completion.choices[0].message.content)

After executing the code, the DeepSeek model will generate a complete myCobot example script based on our input.

3. Output Formatting

LLM-generated output is typically presented as a continuous text stream, which cannot be directly executed as a program.

To allow the Agent to achieve our desired result as automatically saving AI-generated code as a '.py' file and executing it to control the robot, we must format the output properly.

We can use the method of regular expressions to extract Python code from the response of DeepSeek model and save it as a file.

# Use a regular expression to match Python code blocks
code_pattern = r"python(.*?)"  # Match Python code blocks
matches = re.findall(code_pattern, message_content,re.DOTALL)
 
# If code blocks are found, extract them
if matches:
    python_code = "\n".join(matches).strip()
else:
    # If no markdown code block is found, try to match plain Python code
    python_code = message_content.strip()
 
# Specify the Python file path
file_path = "generated_script.py"
 
# Write the extracted Python code to a file
with open(file_path, "w", encoding="utf-8") as f:
    f.write(python_code)
 
print(f"Python code has been saved to {file_path}")

4. Execute the Script Automatically

To enable automatic execution, we need to call the system terminal and run the generated script.

def execute_command(command):
    """Executes a shell command and returns stdout and stderr."""
    try:
        result = subprocess.run(command, shell=True, capture_output=True, text=True)
        return result.stdout, result.stderr
    except Exception as e:
        return None, str(e)
 
def run_command():
    """Runs the specific command and prints the output."""
    command = "conda activate base && python generated_script.py "
    print(f"\nRunning command: {command}")
 
    stdout, stderr = execute_command(command)
 
    if stdout:
        print(f"\nOutput:\n{stdout}")
    if stderr:
        print(f"\nError:\n{stderr}")
 
# run command
run_command()

5. Test with Robot

By adding a while True: loop to the execution process, we can continuously run tasks.

At this point, we have successfully built a simple AI Agent to control the myCobot robot. We can now connect the robot and test its functionality.

Code

import os
from docx import Document
from openai import OpenAI
import subprocess
import re
 
def extract_text_from_word(doc_path):
    """Extract From Word (.docx) """
    doc = Document(doc_path)
    return "\n".join([para.text for para in doc.paragraphs])
 
 
def load_local_documents(directory):
    """Read Word """
    texts = []
    for filename in os.listdir(directory):
        if filename.endswith(".docx"):
            file_path = os.path.join(directory, filename)
            text = extract_text_from_word(file_path)
            texts.append(text)
    return texts
 
def execute_command(command):
    """Executes a shell command and returns stdout and stderr."""
    try:
        result = subprocess.run(command, shell=True, capture_output=True, text=True)
        return result.stdout, result.stderr
    except Exception as e:
        return None, str(e)
 
def run_command():
    """Runs the specific command and prints the output."""
    command = "conda activate base &&  E: && cd E:\MyCode\Agent_Deepseek\RobotData && python generated_script.py"
    #print(f"\nRunning command: {command}")
 
 
    stdout, stderr = execute_command(command)
 
    if stdout:
        print(f"\nOutput:\n{stdout}")
    if stderr:
        print(f"\nError:\n{stderr}")
 
 
word_documents = load_local_documents("E:\MyCode\Agent_Deepseek\RobotData")
context = "\n".join(word_documents)  # Merge all text
 
client = OpenAI(
    api_key="xxxx {Your API}",
    base_url="https://api.deepseek.com"
)
 
while True:
    # Get user input
    query = input("\n input command ( 'exit' )：")
    if query.lower() == "exit":
        print("exit")
        break
 
    completion = client.chat.completions.create(
        model="deepseek-chat",
        temperature=0.6,
        messages=[
            {"role": "system", "content": "You are  mainly researching Python tasks for collaborative robotic arms. You are familiar with and proficient in the Python language, and can utilize the 'pymycobot' robotic API interface to provide a complete Python code that can be used."},
            {"role": "user", "content": f"reference text：\n{context}\n\n：{query}，generate Python Script"}
        ]
    )
 
    # Extract the generated Python code
    message_content = completion.choices[0].message.content
    code_pattern = r"```python(.*?)```"  # Extract code between ```python ... ```
    matches = re.findall(code_pattern, message_content, re.DOTALL)
 
    if matches:
        python_code = "\n".join(matches).strip()
    else:
        python_code = message_content.strip()
 
    # Save the extracted Python code to a file
    file_path = "E:\\MyCode\\Agent_Deepseek\\RobotData\\generated_script.py"
    with open(file_path, "w", encoding="utf-8") as f:
        f.write(python_code)
 
    print(f" running..... ")
 
    # Run the generated script
    run_command()

Summary

By building a simple AI agent to control the 6-axis cobot myCobot 280 Pi, we have learned how to create a basic LLM-driven robotics application.

Since this version of agent task does not include additional vision models, speech models, or sensors, it can only perform simple actions. However, if developers integrate vision, speech processing, and sensors, the AI Agent can autonomously complete more complex tasks.

ElephantRobotics

Special thanks to M5stack basic for a very suitable device embedded in a robotic arm!

Introduction

In recent years, many projects similar to Stanford University’s Alopha robot project have emerged, primarily focusing on learning by mimicking human motion trajectories to achieve human-like artificial intelligence. The Alopha robot, through advanced algorithms and sensor technology, can precisely replicate human actions, collecting data and learning from it to enhance its performance in various tasks. This imitation learning approach not only enables robots to excel in industrial automation but also shows significant potential in educational and research fields. Against this backdrop, Elephant Robotics has introduced the myArm M & C series robotic arms, further advancing humanoid robot technology.

Innovative Solutions for the Future

With the rapid advancement of robotics technology, Elephant Robotics has introduced the groundbreaking myArm 650 M & C series robotic arms, offering unprecedented flexibility and precision for educational, research, and industrial applications. These high-performance robotic arms are not only powerful but also flexible and user-friendly, making them suitable for a variety of applications and providing ideal solutions. Let’s delve into the myArm 650 M & C series products!

Products

myArm C650

0_1716194676691_3ed1d8aa-ac81-404a-abbd-1803921d0b23-image.png
The myArm C650 is a versatile 6-degree-of-freedom robotic motion information collection device, equipped with a fingertip controller and two intelligent buttons. The “C” stands for Controller. It can output end coordinates or joint angles at 50 Hz. Designed for education, research, and industrial data collection, its flexibility and highly modular design make it suitable for various complex operations and tasks. It can be extended to a 7-degree-of-freedom data collector.
0_1716194697038_61be2f8b-05ce-480e-961c-8c5c40150726-image.png
myArm M750

0_1716194745224_68c1ce0f-cde7-4d3e-a3a0-25c759183095-image.png
The myArm M750 is a general-purpose intelligent six-degree-of-freedom robotic arm with a reach of 750 mm and a rated load capacity of 500 g, with a maximum capacity of up to 1 kg. It comes with a 1-degree-of-freedom parallel end effector (gripper). The “M” stands for Master controller. It is suitable for applications requiring complex motion control, precise positioning, and high programmability.
0_1716194766688_db267f91-b177-41f0-b5b5-27e17ef2397d-image.png

Product Features

● High-Speed Data Output: Both the myArm C650 and M750 can output end coordinates or joint angles at a frequency of 50 Hz, providing real-time motion data to ensure smooth operation.

● Flexibility and Modular Design: The robotic arms feature a highly modular design, allowing users to customize and expand according to their needs, making them suitable for a wide range of applications.

● Robust Load Capacity: The myArm M750 has a reach of 750 mm and a load capacity of 500 g, with a maximum capacity of up to 1 kg, making it suitable for complex industrial tasks.

● Programming and Control: Supporting Python and ROS, the arms offer powerful programming capabilities and flexible control options, making them ideal for education and industrial automation fields.

● Strong Compatibility: The arms support various sensors and end effectors, including cameras and IMUs, further enhancing data collection and processing capabilities.

AI Learning and Training Project

Recently, many projects similar to Stanford University’s Alopha robot project have emerged. These projects mainly involve learning by mimicking human motion trajectories. By recording trajectory data and conducting extensive training, the Alopha robot can autonomously perform certain household chores, such as cooking, washing dishes, and folding clothes.
https://youtu.be/HaaZ8ss-HP4
The key elements of the entire project are the full-body remote control system, human demonstration learning, and collaborative training of datasets. To this end, we have also launched a corresponding humanoid composite suite, equipped with two sets of myArm M & C robotic arms and the Elephant Robotics mobile platform for mobility.
0_1716194801911_c6096d4c-32f3-4892-9e7c-f08ca065e6b1-image.png
Now, let me demonstrate how to operate using a set of myArm M & C.
https://youtu.be/-sLegg0wSUQ
The myArm M650 is essentially a 6+1-axis robotic arm: 6 degrees of freedom for the arm itself and 1 for the additional end gripper controller. To ensure better data synchronization, we have also designed the myArm C to be of the same type but different sizes.
https://youtu.be/AlKLbohyA4E
You can see that the synchronous remote control operation is quite smooth. This is mainly due to the myArm's ability to output terminal coordinates and joint angles at a rate of 50 Hz, providing users with real-time motion data. Let's try grasping some parts to experience its effectiveness.
https://youtu.be/DMZSXuJ_lA0
To achieve functionality similar to the Alopha robot, we have completed two parts: the remote control system and human demonstration learning. The most crucial part remaining is the collaborative training of datasets.

Collaborative Training of Datasets:
Collaborative training (co-training) is a machine learning technique particularly suitable for multi-task learning and ensemble learning. The core idea is to use multiple different datasets or data perspectives to jointly train a model, thereby improving the model's performance on specific tasks.
Here is a detailed introduction about the dataset:
https://www.inceptivemind.com/meet-mobile-aloha-your-housekeeping-robot/36176/
The entire Aloha project is open source, so we can find how their team conducted the training online. The specific steps are as follows:

1.Initial Model Training: First, train the initial model using an existing static dataset. This model can perform dual-hand static tasks.

2.Mobile Operation Data Collection: Then, collect demonstration data of the robot performing tasks in a mobile environment using the full-body remote control system.

3.Joint Training: Combine static and dynamic datasets for training. By integrating data from both scenarios, the model can learn a broader range of task
characteristics, improving its generalization ability and success rate on new tasks.

The data involved includes tasks such as grasping and placing objects, tidying up the table, moving speed, opening and closing cabinet doors, camera images, LiDAR data, and IMU data.
To meet the needs of most projects, the myArm M & C series offers significant performance advantages, including high-frequency data transmission at a rate of 50 frames per second. It also has specialized software to collect relevant data, such as the height of objects, the height of the table, and the speed of movement.
Additionally, the myArm is equipped with multiple sensors, such as cameras and IMUs, with the ultimate goal of acquiring precise datasets for training.

Application Scenarios

The project mentioned above is just one example. Our robotic arm can do much more and can be used in conjunction with quadruped robots for tasks like intelligent garbage collection.

0_1716194885106_ab2d7c90-37ea-4cc8-b006-984709a3d42e-image.png
Standalone Robotic Arm Applications:

Medical and Rehabilitation
● Surgery Simulation: In medical training, the myArm M & C can simulate surgical procedures, helping surgeons practice and train before actual operations.
● Rehabilitation Assistance: It can assist in the rehabilitation of patients by facilitating hand and arm recovery exercises, with programmable different rehabilitation movement trajectories.
0_1716194934339_9424ed1f-042d-40eb-997e-a72b007a5e63-image.png
Education and Training
● Robotics Programming Courses: In schools and training institutions, the myArm M & C can be used as a teaching tool, helping students learn robotic programming and control. Through programming tasks, students can master how to control the robot's movements and perform complex operations.
● Laboratory Experiments: Used in university and research institution laboratories for the study of robotic arm kinematics and dynamics. Students can conduct precise motion control experiments and analyze the robot's performance.
0_1716194991417_37eb8ccc-a02b-4c88-8a91-726bb3d0ff2b-image.png
Combined Applications:

Integration with Mobile Robots
● Automated Warehouse System: Combine the myArm M & C with mobile robots (such as Elephant Robotics mobile platforms) to create an automated warehouse system. The mobile robots handle the transportation of items, while the myArm handles the picking and placing, achieving efficient item management and sorting.
● Logistics and Transportation: In logistics centers, the combination of the myArm and mobile platforms can automate the sorting and transportation of packages, improving logistics efficiency.
● Sentinel Patrol: Paired with quadruped robots, it can perform patrol inspections in parks and clean up trash when detected.

Summary

The myArm M & C series robotic arms, with their high flexibility, precision, and modular design, demonstrate broad application potential in education, research, industry, and home environments. When used independently, these robotic arms excel in tasks such as precision assembly, quality inspection, educational training, and medical rehabilitation, showcasing their exceptional performance in complex operations. When combined with other robots, such as mobile robots or other robotic arms, the myArm M & C can build efficient automated systems, enabling more complex task allocation and execution. Additionally, these robotic arms can be applied in artistic creation and smart home setups, further expanding their use cases and creative possibilities.

ElephantRobotics

Introduction

Have you ever encountered a situation like this: when you're ready to shoot a video with your smartphone or camera, you've already envisioned the perfect shot in your mind, but due to practical limitations, you can't capture the angle you desire? This situation can be frustrating. For example, if you want to shoot from the perspective of an ant on the ground, the lens needs to be on the same level as the ant, which is not only difficult in practice but often hard to achieve.

Although there are many stabilizing devices on the market, such as gimbal stabilizers, to assist in achieving steady and multi-angle shooting effects, in this article, I will explore a unique solution: mounting the smartphone on the end of a robotic arm to shoot, aiming to capture those special viewpoints that traditional methods struggle to grasp. This attempt is not only designed to overcome the physical limitations encountered during the shooting process but also hopes to innovate technologically to expand our imagination and practice of photography angles.
https://www.youtube.com/watch?v=xXq9_Nu9djs
Equipment
myCobot 320 M5stack
The myCobot 320, a collaborative robot arm with six degrees of freedom, has become a highlight in its field thanks to its unique design and high-precision servo motors. This robotic arm has a maximum working radius of 350mm and a maximum end load capacity of 1000g, making it suitable for a wide range of application scenarios. The myCobot 320 not only supports flexible visual development applications but also provides in-depth analysis of mechanical motion principles, offering users 12 standard 24V industrial IO interfaces to meet various development needs.
0_1712828807002_d37047f6-4a08-44d7-83cf-004386a82534-image.png
Its openness is extremely high, compatible with most mainstream operating systems and programming languages, including Python and ROS, offering developers a great deal of flexibility and freedom. Whether in education, research and development, or industrial applications, the myCobot 320 provides strong support, making innovation and application development more convenient and efficient.

myCobot Pro Phone Holder

It can be mounted on the end of the myCobot 320 and myCobot Pro 630 robotic arms, capable of securely holding a smartphone in place.
0_1712828822428_8669db63-2b0c-4195-87c5-dd2d31eed82e-image.png
0_1712828826021_60971631-9d33-400c-ab15-01161b1d1879-image.png

Initial Attempt

Installing the Phone Holder
0_1712828839009_d8bd02fe-863b-4171-ae1b-c2b9592f44be-image.png
https://youtu.be/QvcSwkRnooY

Shooting Video

Programming joint control for the robotic arm, let's see how it performs.
https://youtu.be/b-FiXHTFsrw
Although the robotic arm's movement trajectory shown in the video appears simple, in reality, adjusting these trajectory points still requires a considerable amount of time, and the results may not always be ideal. Therefore, I am considering whether there is a better solution, such as setting several motion modes and pre-planning the robotic arm's shooting path. This would not only make effective use of the robotic arm to assist in shooting but also provide a more efficient deployment method.

Programming Challenge

Development Needs Analysis
The equipment to be used is as follows:
0_1712829023682_3c3440ba-7ddd-4feb-af49-edca1a0bf561-image.png
Requirements Optimization:
The goal is to design a series of innovative video shooting methods for static objects. The initial plan adopts three approaches:

Using the smartphone fixed at the end of the robotic arm to achieve 360° panoramic video shooting of the object.
Creating a visual effect that smoothly advances from a distance to close to the object, simulating a "zoom-in" effect.
Achieving rapid rotation and movement at the end of the robotic arm to capture dynamic scenes.

To precisely control the shooting process, the plan is to utilize OpenCV machine vision algorithms and the AVFoundation iOS framework, controlling the exact movements of the robotic arm with Python scripts. We will identify the dimensions of the object through the smartphone camera, thereby calculating the ideal distance between the end of the robotic arm and the object. Based on this distance, we will design a corresponding robotic arm movement algorithm to ensure the best visual effects during the shooting process.
0_1712829043327_48693360-4537-4a64-8b99-81b3b8a1a9ac-image.png

YOLOv5 Vision Algorithm

To save time, we will not train the machine vision algorithm ourselves to recognize specific objects. Instead, we will directly use the YOLOv5 library, which has been trained and optimized by other developers, to achieve accurate detection of target objects.

import cv2
import torch
from pathlib import Path
from models.experimental import attempt_load
from utils.general import non_max_suppression, scale_coords
from utils.torch_utils import select_device, time_synchronized

def detect_apples(img_path):
    device = select_device('')
    weights = 'yolov5s.pt'
    model = attempt_load(weights, map_location=device)
    img0 = cv2.imread(img_path)  # BGR
    img = img0[:, :, ::-1]  # RGB
    img = torch.from_numpy(img).to(device)
    img = img.float()  # uint8 to fp16/32
    img /= 255.0  # 0 - 255 to 0.0 - 1.0
    if img.ndimension() == 3:
        img = img.unsqueeze(0)

    # Inference
    t1 = time_synchronized()
    pred = model(img)[0]
    # Apply NMS
    pred = non_max_suppression(pred, 0.4, 0.5, classes=None, agnostic=False)
    t2 = time_synchronized()
    print(f'Inference time: {(t2 - t1):.3f}s')

    # Process detections
    for i, det in enumerate(pred):  # detections per image
        gn = torch.tensor(img0.shape)[[1, 0, 1, 0]]  # normalization gain whwh
        if len(det):
            det[:, :4] = scale_coords(img.shape[2:], det[:, :4], img0.shape).round()
            for *xyxy, conf, cls in reversed(det):
                label = f'{model.names[int(cls)]} {conf:.2f}'
                plot_one_box(xyxy, img0, label=label, color=(255, 0, 0))
    return img0

def plot_one_box(xyxy, img, color=None, label=None, line_thickness=None):
    # Plots one bounding box on image img
    tl = (
        line_thickness or round(0.002 * max(img.shape[0:2])) + 1
    )  # line/font thickness
    color = color or [random.randint(0, 255) for _ in range(3)]
    c1, c2 = (int(xyxy[0]), int(xyxy[1])), (int(xyxy[2]), int(xyxy[3]))
    cv2.rectangle(img, c1, c2, color, thickness=tl, lineType=cv2.LINE_AA)
    if label:
        tf = max(tl - 1, 1)  # font thickness
        t_size = cv2.getTextSize(label, 0, fontScale=tl / 3, thickness=tf)[0]
        c2 = c1[0] + t_size[0], c1[1] - t_size[1] - 3
        cv2.rectangle(
            img, c1, c2, color, -1, cv2.LINE_AA
        )  # filled
        cv2.putText(
            img,
            label,
            (c1[0], c1[1] - 2),
            0,
            tl / 3,
            [225, 255, 255],
            thickness=tf,
            lineType=cv2.LINE_AA,
        )
    return img

if __name__ == '__main__':
    img_path = 'apple.jpg'  
    result_img = detect_apples(img_path)
    cv2.imshow('Result', result_img)
    cv2.waitKey(0)
    cv2.destroyAllWindows()

0_1712829076653_2d0ae8c5-3a45-425e-bc32-19dee707a30a-image.png
Robotic Arm Motion Control Algorithm
Methods for controlling the robotic arm:

This method controls the movement of the robotic arm's joints.

mc.send_angles([angle_list],speed)

# This method uses coordinates to control the robotic arm's movement in space.
mc.send_coords([coords_list],speed,mode)
example:
mc.send_angles([0,0,0,0,0,0],100)
time.sleep(2)
mc.send_coords([(-3.6), 30.32, (-45.79), (-46.84), 97.38, 0.35],100,1)

After obtaining the dimensions of the object, define a reasonable position for the end of the robotic arm relative to the object.

def calculate_angles_for_distance(distance):
    #Calculate the joint angles of the robotic arm based on the ideal distance.
    # The calculations here need to be adjusted based on actual conditions and physical parameters
    return [0, -10, distance * 0.1, 0, 30, 0]  

def calculate_adjusted_angles(action_angles):
    # Calculate new angles based on the coordinate adjustments that may be needed after a specific action
    # This is just an example, and the specific logic should be adjusted as needed.
    return [angle * 1.1 for angle in action_angles]

Then select the motion control corresponding to the chosen mode.

#For 360° Panoramic Shooting:
# Define the specific shooting mode
def shoot_mode_360(ideal_distance):
    print("excuse 360")
    # First, move to the ideal shooting position.
    move_to_ideal_position(ideal_distance)
    # Handle the ideal distance.
    ideal_ratio =  ratio
    # Perform the specific actions for 360° panoramic shooting.
    mc.send_angles([0, 0, 0, 0, 0, 0], speed=15)
    time.sleep(1)
    mc.send_coords([angle * ratio for angle in angles_list]
，15)
    time.sleep(1)
    mc.send_coords([angle * ratio for angle in angles_list]
，15)
    time.sleep(1)
    mc.send_coords([angle * ratio for angle in angles_list]
，15)

Invoking the Smartphone Camera

In the development process, as I attempted to invoke the smartphone camera interface to realize the automatic shooting function, I encountered a series of challenges. As my first deep exploration into the AVFoundation iOS framework, with the goal of activating and controlling the smartphone's camera, I found that I had not yet fully succeeded in implementing this function. The current difficulties mainly focus on how to accurately call the camera for video shooting and how to compensate for possible image stretching during the shooting process through software adjustments, which requires precise control over the movement of the robotic arm. These issues mark the direction of my subsequent research and require me to continue delving into the use of the AVFoundation framework, especially its specific methods for controlling the camera, and explore how to integrate these controls into the adjustment of the robotic arm's movement to ensure the video quality meets expectations.

Summary

As this project record comes to an end, I realize that although there are many areas for improvement, attempting to make two independent devices work together under different frameworks has been a valuable experience for me. Indeed, the entire project has not yet reached the ideal state in my mind. However, I believe the direction explored by this project has great potential, considering that there are already professional photography robotic arms on the market capable of producing impressive results, which reinforces my belief in the potential value of the project.
The application of robotic arms has already permeated our daily lives, playing an increasingly important role whether in industrial production, daily services, or in the field of artistic creation. With the continuous advancement and popularization of artificial intelligence technology, the combination of AI and robotics will undoubtedly become an important trend in the future development of technology. I have great expectations for the future development of robotics technology and believe that they will display amazing capabilities and creativity in more fields in the future.
If you are interested in my project, or have any ideas and suggestions, I warmly welcome your communication. Your feedback will provide valuable help for me to continue improving and perfecting this project.

ElephantRobotics

As the control core of the embedded robotic arm, M5stack-basic is very useful and helps a lot！

ElephantRobotics

Introduction

Since the release of ChatGPT by OpenAI, the world has been rapidly moving towards a trend of integrating AI technology more broadly into robotic devices. Mechanical arms, as an important part of automation and intelligent technology, are increasingly being applied in fields such as manufacturing, healthcare, and the service industry. With the advancement of AI technology, mechanical arms are not only capable of performing complex operational tasks but also capable of more intuitive interactions through natural language processing technology, greatly enhancing flexibility and user-friendliness.
For example, a Microsoft AI research center is studying how to control robotic devices using natural language. Therefore, I am interested in undertaking a similar project, which allows users to control mechanical arms using natural language. This can significantly lower the barrier to robot programming, enabling non-professionals to easily operate and experiment.
paper link:https://www.microsoft.com/en-us/research/uploads/prod/2023/02/ChatGPT___Robotics.pdf
This project is divided into two parts, this article primarily discusses the design and construction process of the entire artificial intelligence system. The next article will address the difficulties encountered during the development of the project, how to solve them, and the potential expandable functionalities of this project.

Project Background and Motivation

Imagine one day, if you could command a mechanical arm to "help me tidy up the desk and throw the trash into the trash bin," and the mechanical arm starts to obey the command, cleaning up the trash on your desk. What a joyous thing that would be.

Therefore, for the preparation work, we need a small mechanical arm (mainly because large mechanical arms are too expensive), a computer capable of accessing the internet, and a passionate heart! This project is primarily inspired by Microsoft's research that transformed the use of robots.

Technical Overview

The project is compiled in a Python environment. Let’s introduce some of the software technologies that this project will utilize:

ChatGPT: (The core technology crucial to the entire project)

https://openai.com/chatgpt
ChatGPT is an artificial intelligence technology based on the GPT (Generative Pre-trained Transformer) model architecture. GPT is a deep learning-based natural language processing model that achieves language understanding and generation tasks through large-scale unsupervised pre-training and supervised fine-tuning. In other words, you can think of it as chatting with a person with vast knowledge. You can preset some roles for it, such as "You are a doctor," and then you can discuss medical knowledge with it. However, please note that not all generated conversations are correct and require self-judgment.
0_1711940498854_cc8752d6-4576-4e18-ac81-3c7db8e585d3-image.png

Speech Recognition: (An essential module for processing natural language)

We use a speech recognition service by Google, Speech-to-text, which allows developers to convert speech into text. It also supports multiple languages and dialects, including but not limited to English, Spanish, French, German, Chinese, etc., meeting the needs of global users.
You can try it online:https://cloud.google.com/speech-to-text?hl=en#features

pymycobot: (Control module for the mycobot 280 mechanical arm)

Github:https://github.com/elephantrobotics/pymycobot
pymycobot is a control module developed by Elephant Robotics specifically for the my series product mechanical arms. The development of this module significantly lowers the barrier to programming and controlling mechanical arms. pymycobot provides a multitude of control interfaces for the mechanical arm, such as joint control, coordinate control, and control of the mechanical arm's gripper, etc., which is quite friendly for beginners in mechanical arm programming.

These technologies collaborate to achieve the functionality of controlling a mechanical arm through natural language.

Next, let's introduce the hardware device:

mycobot 280 M5Stack

The mycobot 280 M5Stack is a collaborative robot with 6 degrees of freedom, developed in collaboration between Elephant Robotics and M5Stack. It features a compact and exquisite design, with an all-encompassing body that uses high-precision servo motors without any external cables. The mycobot weighs only 850g, and the mechanical arm's end can carry a maximum load of 250g. It has a maximum working radius of 280mm, and its repeatability in positioning accuracy can achieve an error margin within 0.5mm. 0_1711940560000_e51e9eb7-d9ad-4e4f-be6d-eeb9bc9df5c6-image.png

Design Concept and Implementation Process:

You might want to take a look at a recent video published by OpenAI, where a person chats with a robot that processes natural language and generates corresponding actions.
https://www.youtube.com/watch?v=Sq1QZB5baNw&pp=ygUVY2hhdGdwdCBjb250cm9sIHJvYm90
There are also some other mechanical arms that have integrated similar scenarios.
https://www.youtube.com/watch?v=IGsYgSdrT4Y
What I aim to do is create a smaller version of this! It involves communicating with a mechanical arm through natural language, after which the mechanical arm executes the corresponding instructions. 0_1711940582287_a478d12f-1417-4a19-a9dc-5dfc1c535ad0-image.png
Next, I will explain the process of the project.

Speech-to-Text Functionality:

Why use the speech-to-text functionality? Those who have used ChatGPT know about its built-in voice chat feature. However, to integrate it with PCs and mechanical arms, we cannot use ChatGPT's web version but need to implement it locally on a computer using ChatGPT's API interface.
The ChatGPT API can only receive inputs in "text" form, so speech-to-text can convert our speech into text and input it into the computer.


def speech_to_text():
    recognizer = sr.Recognizer()

    with sr.Microphone() as source:
        print("start speaking...")
        audio = recognizer.listen(source)

        try:
            # text = recognizer.recognize_google(audio, language='zh-CN')
            text = recognizer.recognize_google(audio, language='en-US')
            print("you said: " + text)
            return text
        except sr.UnknownValueError:
            print("Google Speech Recognition could not understand audio")
            return None
        except sr.RequestError as e:
            print("Could not request results from Google Speech Recognition service; {0}".format(e))
            return None

0_1711940638605_9bf3e3af-5df3-4ce1-b27e-060612c039e3-image.png

Calling the ChatGPT API & Pre-training:

Once the text form of the speech is obtained, we can call the API to chat with ChatGPT locally. Below is the method provided by OpenAI for calling the ChatGPT API.

def generate_control_code(prompt):
    openai.api_key = ''
    prompt = f"{pre_training}The command the user wants to execute is：'{prompt}'."

    try:
        response = openai.Completion.create(
            engine="gpt-3.5-turbo",
            prompt=prompt,
            temperature=0.5,
            max_tokens=100,
            top_p=1.0,
            frequency_penalty=0.0,
            presence_penalty=0.0
        )
        code = response.choices[0].text.strip()
        return code
    except Exception as e:
        print(f"error: {e}")
        return ""

You need to apply for your own API_KEY from the official website (which requires payment).
Pay attention to the prompt; this will be the pre-training that needs to be mentioned.

    prompt = f"{pre_training}The command the user wants to execute is：'{prompt}'."

To get accurate responses, you need to let ChatGPT know in advance what it needs to do and what you want it to do according to our ideas. We will first test it with the web version because setting up the API is relatively complex.
Below is my prompt (specific to this project); I only want it to output the code for the mechanical arm to execute, so this is what I did.

Generate Python code that matches the following requirements: 
Use an instance of the MyCobotController class robot to perform a specific action. The instance already contains methods such as move_to_zero() to return to the initial position, grab_position() to move to the grab position, and plus_x_coords(value), plus_y_coords(value), plus_z_coords(value) to move specific distances on the X, Y, and Z axes. 
You don’t need to output other textual content, just output the code directly, for example, the robot arm returns to the origin. robot.move_to_zero()

Here is what I said:
I want the robot arm to return to the origin, and then go to the position to be grabbed to perform grabbing.

0_1711940684644_f58b0746-b833-443a-8bb7-6c186599c820-image.png
Here you can see that it successfully meets my basic requirements, but it outputs the code with comments, which will affect our results later. So, modifications are still needed (to output only the code without comments). 0_1711940695984_a40675c8-002e-403e-b414-7371e1b1ab18-image.png

Building a New Mechanical Arm API

Why construct a new API when pymycobot already provides one? Indeed, pymycobot offers a comprehensive and extensive API. However, if the voice command given is complex, ChatGPT might generate more complex code, which could lead to errors. I built a new API for the mechanical arm based on some current testing requirements I wanted to explore.

class MyCobotController:
    def __init__(self, port, baud):
         self.mc = MyCobot(port, baud)
         self.speed = 80
         self.mode =0
         self.coords = []

    def grab_position(self):
        # self.mc.send_angles([4.83, 13.97, (-99.31), (-1.75), 4.39, (-0.26)], 80)
        self.mc.send_coords([149.2, (-48.3), 201.7, (-176.98), 4.55, (-84.66)], 80, 0)
        time.sleep(2)

    def move_to_zero(self):
        self.mc.send_angles([0,0,0,0,0,0],70)
        time.sleep(2)

    def gripper_open(self):
        self.mc.set_gripper_state(0,80,1)
        time.sleep(2)

The goal is to quickly set up the entire project and then enrich its content later on. There's a reason for this approach. For instance, if you want the mechanical arm to move to a point and grab something, using pymycobot's method might look something like this:

robot.send_angles([0,0,0,0,0,0],80)
time.sleep(2)
#open gripper
robot.set_grippr_value(1,80,1)
time.sleep(1)
#clos grippr
robot.set_grippr_value(0,80,1)
time.sleep(1)

In such cases, you would need to output many lines of code, which could lead to errors in more complex situations. By rebuilding a method, we only need to call one method to execute the action, solving the problem with just two lines of code.

class Newmycobot():
    def grab_action(self):
        self.send_angles([0,0,0,0,0,0],80)
        time.sleep(2)
        #open gripper
        self.set_grippr_value(1,80,1)
        time.sleep(1)
        #clos grippr
        self.set_grippr_value(0,80,1)
        time.sleep(1)
    
robot = Newmycobot()
robot.grab_action()

5. Preliminary Results and Demonstration

Let's start with a quick debugging session using the web version of ChatGPT to put it into practice.
0_1711940754460_2811c332-2205-4dfb-8d5d-044c1d07cace-image.png Copy the generated code and run it.
0_1711940763673_9b9500d7-607c-45c1-a43d-af0e76fad50b-image.png
You can see that the simple test is successful.

6. Summary

This record ends here, and the project is not yet completed. In the near future, I will continue to refine this project. In the next article, I will complete the entire project and share how some of the problems that arose during the development process were solved. If you like this article, feel free to leave your thoughts in the comments below.

ElephantRobotics

This article is edited and reprinted with authorization from the author, kimizuka.
Original article link: https://blog.kimizuka.org/entry/2024/02/01/141808
alt text

Introduction

Translate AirPods through the iPhone app to Express, then through python-shell to pymycobot, and finally synchronize with myCobot to synchronize the rotation angle of AirPods with the posture of myCobot 🤖.
0_1708939783546_503ad5a1-9d02-4904-abc0-e02b369867ae-image.png
Although it's unclear if there's a demand, I will provide a rough outline of the source code.

Project Structure

This project primarily employs technologies such as headphone-motion, web servers, the express framework of node.js, python-shell, and pymycobot. Here is a brief introduction to these technologies:
1.Headphone-Motion: Headphone Motion is a project that utilizes specific technology to track and utilize user head movements. Although the specifics may depend on the implementation method and platform used (such as iOS), it mainly demonstrates how to capture head movement data by connecting to the device's headphones (especially those smart headphones with built-in sensors). A more intuitive effect of this can be seen in the Headphone Motion Unity Plugin made by GitHub user anastasiadeana. It is capable of real-time tracking of the user's head movements, including tilting and rotating, which is a core technology of this project.
alt text
2.Web Server: There are many types of servers, which provide data, services, or applications to other applications or devices. Servers perform tasks such as processing data requests, hosting websites, storing information, running business applications, and so on. In this project, the web server mainly handles receiving head movement data from the iOS application and passing these data to the script controlling the mycobot robot arm.

3.Express-Node.js: Express is a fast, open, minimalistic web application framework for Node.js. It is designed for building web applications and APIs. It allows developers to set up middleware to respond to HTTP requests in a very fast and simple way, making the development of web applications quicker and easier.
https://github.com/expressjs/express

4.Pymycobot-Python: pymycobot is a Python library specifically designed for controlling the myCobot robot arm. This library provides a series of functions and interfaces that allow developers to communicate and control the myCobot robot arm directly through Python scripts. Using pymycobot, developers can write code to control the robot arm's movements, adjust its posture, execute preset action sequences, etc., making it widely applicable in education, research, automation, and more.

iOS App

This application is based on an app I created previously, utilizing react-native-headphone-motion to access the sensors in AirPods through a React Native iOS application.

It's an interesting project where, when you are detected bending your head down for a long time while wearing AirPods, a lamenting sound will remind you.
https://twitter.com/i/status/1745588902981931443
https://blog.kimizuka.org/entry/2024/01/16/105208

One point to note is the change I made: I added a process to send POST requests to the web server in onDeviceMotionUpdates. Additionally, to avoid overburdening the server with POST requests on every update, I set it up to send a request at a minimum interval of 500ms.

App.tsx

useEffect(() => {
  const delay = 500;
  const handleDeviceMotionUpdates = onDeviceMotionUpdates((data) => {
     // If the time since the last request is less than 500ms, return
    if (Date.now() - lastUpdateTimeRef.current < delay) {
      return;
    }

    // Post sensor values to the Web server
    axios.post(String(process.env.API_URL), { 
      pitch: data.attitude.pitchDeg || 0,
      roll: data.attitude.rollDeg || 0,
      yaw: data.attitude.yawDeg || 0
    }).then(() => {
      lastUpdateTimeRef.current = Date.now();
    }).catch((err) => {
      console.error(err);
      lastUpdateTimeRef.current = Date.now();
    });

    setPitch(data.attitude.pitch);
    setPitchDeg(data.attitude.pitchDeg);
    setRoll(data.attitude.roll);
    setRollDeg(data.attitude.rollDeg);
    setYaw(data.attitude.yaw);
    setYawDeg(data.attitude.yawDeg);
    setGravityX(data.gravity.x);
    setGravityY(data.gravity.y);
    setGravityZ(data.gravity.z);
    setRotationRateX(data.rotationRate.x);
    setRotationRateY(data.rotationRate.y);
    setRotationRateZ(data.rotationRate.z);
    setUserAccelerationX(data.userAcceleration.x);
    setUserAccelerationY(data.userAcceleration.y);
    setUserAccelerationZ(data.userAcceleration.z);
  });

  return () => {
    handleDeviceMotionUpdates.remove();
  };
}, []);

In the POST request, I used axios, which can send asynchronous HTTP requests to REST endpoints and handle responses.
Therefore, it's also necessary to add module imports.

import axios from 'axios';

Code

import axios from 'axios'; // Added to simplify the POST request
import React, {
  useEffect,
  useRef, // Added to maintain a 500ms interval
  useState,
} from 'react';
import {Button, SafeAreaView, StyleSheet, Text} from 'react-native';
import {
  requestPermission,
  onDeviceMotionUpdates,
  startListenDeviceMotionUpdates,
  stopDeviceMotionUpdates,
} from 'react-native-headphone-motion';

const API_URL = 'http://localhost:3000'; // Fill in the URL to POST

export default function App() {
  const lastUpdateTimeRef = useRef<number>(0); // Added to keep track of the last update time
  const [pitch, setPitch] = useState(0);
  const [pitchDeg, setPitchDeg] = useState(0);
  const [roll, setRoll] = useState(0);
  const [rollDeg, setRollDeg] = useState(0);
  const [yaw, setYaw] = useState(0);
  const [yawDeg, setYawDeg] = useState(0);
  const [gravityX, setGravityX] = useState(0);
  const [gravityY, setGravityY] = useState(0);
  const [gravityZ, setGravityZ] = useState(0);
  const [rotationRateX, setRotationRateX] = useState(0);
  const [rotationRateY, setRotationRateY] = useState(0);
  const [rotationRateZ, setRotationRateZ] = useState(0);
  const [userAccelerationX, setUserAccelerationX] = useState(0);
  const [userAccelerationY, setUserAccelerationY] = useState(0);
  const [userAccelerationZ, setUserAccelerationZ] = useState(0);

  useEffect(() => {
    const delay = 500; // Store the update interval in a variable
    const handleDeviceMotionUpdates = onDeviceMotionUpdates(data => {
      if (Date.now() - lastUpdateTimeRef.current < delay) {
        // Return if the update interval is not met
        return;
      }

      // Post sensor values to the Web server
      // Update lastUpdateTimeRef regardless of success or failure
      // For some reason, await was not used
      axios
        .post(String(API_URL), {
          pitch: data.attitude.pitchDeg || 0,
          roll: data.attitude.rollDeg || 0,
          yaw: data.attitude.yawDeg || 0,
        })
        .then(() => {
          lastUpdateTimeRef.current = Date.now();
        })
        .catch(err => {
          console.error(err);
          lastUpdateTimeRef.current = Date.now();
        });

      setPitch(data.attitude.pitch);
      setPitchDeg(data.attitude.pitchDeg);
      setRoll(data.attitude.roll);
      setRollDeg(data.attitude.rollDeg);
      setYaw(data.attitude.yaw);
      setYawDeg(data.attitude.yawDeg);
      setGravityX(data.gravity.x);
      setGravityY(data.gravity.y);
      setGravityZ(data.gravity.z);
      setRotationRateX(data.rotationRate.x);
      setRotationRateY(data.rotationRate.y);
      setRotationRateZ(data.rotationRate.z);
      setUserAccelerationX(data.userAcceleration.x);
      setUserAccelerationY(data.userAcceleration.y);
      setUserAccelerationZ(data.userAcceleration.z);
    });

    return () => {
      handleDeviceMotionUpdates.remove();
    };
  }, []);

  return (
    <SafeAreaView style={styles.container}>
      <Button
        title={'requestPermission'}
        onPress={async () => {
          await requestPermission();
        }}
      />
      <Button
        title={'startListenDeviceMotionUpdates'}
        onPress={async () => {
          await startListenDeviceMotionUpdates();
        }}
      />
      <Button
        title={'stopDeviceMotionUpdates'}
        onPress={async () => {
          await stopDeviceMotionUpdates();
        }}
      />
      <Text>{lastUpdateTimeRef.current}</Text>
      <Text>{`pitch: ${pitch}`}</Text>
      <Text>{`pitchDeg: ${pitchDeg}`}</Text>
      <Text>{`roll: ${roll}`}</Text>
      <Text>{`rollDeg: ${rollDeg}`}</Text>
      <Text>{`yaw: ${yaw}`}</Text>
      <Text>{`yawDeg: ${yawDeg}`}</Text>
      <Text>{`gravityX: ${gravityX}`}</Text>
      <Text>{`gravityY: ${gravityY}`}</Text>
      <Text>{`gravityZ: ${gravityZ}`}</Text>
      <Text>{`rotationRateX: ${rotationRateX}`}</Text>
      <Text>{`rotationRateY: ${rotationRateY}`}</Text>
      <Text>{`rotationRateZ: ${rotationRateZ}`}</Text>
      <Text>{`userAccelerationX: ${userAccelerationX}`}</Text>
      <Text>{`userAccelerationY: ${userAccelerationY}`}</Text>
      <Text>{`userAccelerationZ: ${userAccelerationZ}`}</Text>
    </SafeAreaView>
  );
}

const styles = StyleSheet.create({
  container: {
    flex: 1,
    alignItems: 'center',
    justifyContent: 'center',
    backgroundColor: 'white',
  },
});

Actually, it would be more convenient to specify the API_URL directly in the application, but I implemented it this way considering speed.

Web Server

I set up a local server on my Mac. To operate myCobot, I went through several setup steps, mainly to adapt to Mac computers. These steps included installing drivers for the robot arm, updating the firmware of mycobot 280, and other operations, all detailed in this article.
https://blog.kimizuka.org/entry/2021/08/10/131812
I think creating a web server with Python would be smoother, but based on my skill set, using Node.js is the fastest method. So, I plan to quickly set up the server using Express. Communication with myCobot is conducted through Python, so for this part, I decided to use python-shell to implement it.

require('dotenv').config(); // Used to pass the port of myCobot from outside
const express = require('express');
const { PythonShell } = require('python-shell'); // Used for communicating with myCobot
const app = express();
const http = require('http').Server(app);

const duration = 100; // If the delay (500ms) set on the application side is too small, it will cause problems

app.use(express.json());
app.post('/', (req, res) => {
  try {
    const angles = [0, 0, 0, 0, 0, 0];

    // For myCobot's joint information, refer to page 13 of https://www.elephantrobotics.com/wp-content/uploads/2021/03/myCobot-User-Mannul-EN-V20210318.pdf
    // The array stores the 6 joints in order from the bottom up
    // Each joint has a definite range of motion; ensure not to exceed this range    
    angles[0] = Math.max(-90, Math.min(req.body.yaw || 0, 90)); // J1
    angles[3] = Math.max(-90, Math.min(req.body.pitch || 0, 90)); // J4
    angles[5] = Math.max(-175, Math.min(req.body.roll || 0, 175)); // J6

    // myCobot connected via USB receives instructions from Python
    PythonShell.runString(
      `from pymycobot.mycobot import MyCobot; MyCobot('${ process.env.MY_COBOT_PORT }').send_angles([${ angles }], ${ duration })`,
      null,
      (err) => err && console.error(err)
    );
  } catch (err) {
    console.error(err);
  }
  res.send(200);
});

try {
  const angles = [0, 0, 0, 0, 0, 0];

  // Resets posture at startup
  PythonShell.runString(
    `from pymycobot.mycobot import MyCobot; MyCobot('${ process.env.MY_COBOT_PORT }').send_angles([${ angles }], ${ duration })`,
    null,
    (err) => err && console.error(err)
  );
} catch(err) {
  console.error(err);
}

http.listen(3000, '0.0.0.0');

Because it's necessary to execute pymycobot through PythonShell, the pymycobot directory must be placed at the same level as app.js.
https://github.com/elephantrobotics/pymycobot
Once everything is prepared and the PC is connected to myCobot,
you can start the web server and pass the pitch, roll, and yaw values received through POST requests to myCobot.
Although this time the sensor values of AirPods are sent from an iPhone application via POST, the source of the POST could be from anywhere. Therefore, I think setting up such a server could be potentially useful in the future.

source code:
https://github.com/kimizuka/mycobot-express/tree/example/airpods

Summary

This project showcases the integration of headphone motion detection with robotic control, illustrating the potential of combining human movement data with robotics. By capturing head motion through smart headphones and translating it into commands for a robotic arm, it enhances human-robot interaction and introduces innovative applications in automation and assistive technologies. This approach opens up new avenues for intuitive control mechanisms in robotics, making technology more accessible and interactive.
Finally, thanks again to Kimizuka for sharing this case with us.Feel free to explore further or share your insights with us.

ElephantRobotics

Introduction

I am a freelancer specializing in machine learning and robotics technology. My passion began during a course in artificial intelligence in college, which inspired me to explore new methods of human-machine interaction. In particular, for the operation of robotic arms, I have always wanted to simplify their complexity to make them more intuitive and easier to use.
The inspiration for this project stems from my love for innovative technology and the pursuit of improving the ways humans interact with machines. My goal is to develop a gesture-based robotic arm control system that allows non-professionals to operate it with ease. For this purpose, I chose Google's MediaPipe library for gesture recognition and used mycobot 320 m5 as the experimental platform.

Technical Overview

Google MediaPipe

MediaPipe is an open-source cross-platform framework developed by Google, specifically designed for building various perception pipelines. This framework offers a wealth of tools and pre-built modules, enabling developers to easily build and deploy complex machine learning models and algorithms, especially in the field of image and video analysis.
0_1705903584530_e88e975d-66a4-4fb1-ae66-a17af9a38989-image.png
A notable feature of MediaPipe is its support for real-time gesture and facial recognition. It can efficiently process video streams and identify and track human gestures and facial features in real-time. This capability makes it incredibly useful in interactive applications, augmented reality (AR), virtual reality (VR), and robotics.

You can try the gesture recognition online feature without needing to install anything.
MediaPipe Studio
Its easy-to-use API and comprehensive documentation make it easier to integrate this framework, making it very suitable for use in the fields of machine learning and computer vision.

pymycobot

pymycobot is a Python API for serial communication and control of the mycobot robotic arm. This library is designed to facilitate developers in controlling the mycobot robotic arm using the Python language. It offers a series of functions and commands that allow users to control the movements and behavior of the robotic arm through programming. For example, users can use the library to get the angles of the robotic arm, send angle commands to control the movement of the arm, or get and send the coordinates of the robotic arm.
The only standard for using this library is that it must be used with the mycobot series of robotic arms, which are specifically adapted for the mycobot.

Product Introduction

myCobot 320 M5stack

The myCobot 320 M5 is a six-axis collaborative robotic arm developed by Elephant Robotics for users. It has a working radius of 350mm and a maximum load capacity of 1000g. The robotic arm is suitable for an open ROS simulation development environment and includes forward and inverse kinematics algorithms. It supports multiple programming languages, including Python, C++, Arduino, C#, and JavaScript, and is compatible with Android, Windows, Mac OSX, and Linux platforms. The versatility of the myCobot 320 M5 makes it suitable for a variety of development and integration applications. 0_1705903642794_fa0820e5-486e-4975-aa08-f37d18f38009-image.png

2D Camera

A 2D camera that can be mounted on the end of the mycobot320, communicating via a USB data cable. It can present the view seen from the end of the robotic arm.
0_1705903657612_05668d04-0b5d-4509-b48a-e9b735cb2e45-image.png

Development Process

Project Architecture

I have divided this project primarily into three functional modules:
Gesture Recognition: This module is mainly used for the recognition of gestures, capable of returning information about what the gesture is, such as a thumbs-up, etc.
Robotic Arm Control: This main function is used for setting the motion control of the robotic arm, including coordinate control, angle control, and so on.
Program Logic: This is used to handle the logic of the program's operation, setting confirmation times for gestures, resetting recognition times, etc. These will be detailed further in subsequent sections. 0_1705903676298_f030fd21-5f03-4406-b5c2-0cc418f9517e-image.png

Compilation Environment

Operating System: Windows 11
Programming Language: Python 3.9+
Libraries:opencv,pymycobot,mediapipe,time

Gesture Recognition

To perform gesture recognition, we first need to obtain a camera image. Here, we use the OpenCV library to access the camera feed.


import cv2

# Get camera stream, default camera - 0, external cameras in order - 1, 2, 3
cap = cv2.VideoCapture(1)

# Continuously acquire camera footage
while cap.isOpened():
    #Get the current image screen
    ret, frame = cap.read()
    # Convert BGR image to RGB
    rgb_frame = cv2.cvtColor(frame, cv2.COLOR_BGR2RGB)    
    # display screen on computer
    cv2.imshow('gesture control',frame)
    # Press the 'q' key to exit to avoid an infinite loop
    if cv2.waitKey(1) & 0xFF == ord('q'):
        break

0_1705908925777_f6808bb2-e2f7-4c63-9a1a-314c498aa323-image.png
With this, the image capture from the camera is successful. Next, we use MediaPipe for gesture recognition.

import mediapipe as mp

# Initialize the MediaPipe Hands module
mp_hands = mp.solutions.hands
hands = mp_hands.Hands()
mp_draw = mp.solutions.drawing_utils


# Process the image and detect hands
result = hands.process(rgb_frame)

if result.multi_hand_landmarks:
    for hand_landmarks in result.multi_hand_landmarks:
        mp_draw.draw_landmarks(frame, hand_landmarks, mp_hands.HAND_CONNECTIONS)

0_1705908950612_048861da-f25e-4b6b-8a24-045e6addff45-image.png
The output after recognizing a gesture is precise in identifying each joint on the hand and names each joint point. MediaPipe Hands provides 21 key points (landmarks) for the hand, collectively depicting the structure of the hand, including the wrist and the joints of each finger. Taking the thumb as an example, there are four joints, which from bottom to top are CMC, MCP, IP, TIP.

cmc: Carpometacarpal Joint
mcp:Metacarpophalangeal Joint
ip:Interphalangeal Joint
tip:tip

Having these landmarks alone is not enough; we need to set a method to recognize specific gestures. For example, if we want to recognize a thumbs-up gesture, we analyze that during a thumbs-up, the tip of the thumb is at the highest point above the entire palm. This makes it much easier. As long as we determine that the tip of the thumb is higher than the tips of all other fingers in the image, then the gesture is identified as a thumbs-up. (Other methods of analysis can also be used.)
Generally, we can obtain three attributes for each joint: X, Y, Z, representing the position of that joint in the image.

# Get the attributes of the thumb tip
thump_tip = hand_landmarks.landmark[mp.hands.HandLandmark.THUMB_TIP]

# Get the height of the thumb tip
thump_tip.y

# Determine thumbs up gesture
def is_thump_up(hand_landmarks):
    thumb_tip = hand_landmarks.landmark[mp_hands.HandLandmark.THUMB_TIP]
    index_tip = hand_landmarks.landmark[mp_hands.HandLandmark.INDEX_FINGER_TIP]
    # Determine which joint is higher.
    if thumb_tip.y < index_tip.y:
        return True
    
    return False

If you want other gestures, you can also set a special identification method based on the characteristics of the hand shape. At this point, gesture recognition is completed.

Robotic Arm Motion Control

Initially, my idea was that when the camera recognizes a gesture, it would send a control command to the robotic arm. Let's start with a simple action, setting the robotic arm to perform a nodding motion.
The pymycobot library offers many functions that are very convenient for controlling the robotic arm.

from pymycobot.mycobot import Mycobot
import time
# connect robot arm
mc = Mycobot(port,baud)

#Control the movement of the robotic arm using angles
mc.send_angles([angles_list],speed)

#Control the movement of the robotic arm using coordinates
mc.send_coords([coords_list],speed,mode)

# Nodding action
def ThumpUpAction(self):
        self.mc.send_angles([0.96, 86.22, -98.26, 10.54, 86.92, -2.37], 60)
        time.sleep(1.5)
        for count in range(3):
            self.mc.send_angles([0.79, 2.46, (-8.17), 4.3, 88.94, 0.26], 70)
            time.sleep(1)
            self.mc.send_angles([(-3.6), 30.32, (-45.79), (-46.84), 97.38, 0.35], 70)
            time.sleep(1)
        self.mc.send_angles([0.79, 2.46, (-8.17), 4.3, 88.94, 0.26], 70)
        time.sleep(1)
        self.mc.send_angles([0.96, 86.22, -98.26, 10.54, 86.92, -2.37], 60)

To enhance the readability and modifiability of the overall code, it's beneficial to create a robotic arm class for easy calling and modification.

class RobotArmController:

        def __init__(self,port):
        self.mc = MyCobot(port, 115200)
        self.init_pose = [0.96, 86.22, -98.26, 10.54, 86.92, -2.37]
        self.coords = [-40, -92.5, 392.7, -92.19, -1.91, -94.14]
        self.speed = 60
        self.mode = 0
        
        def ThumpUpAction(self):
            ...
            
        def OtherAction(self):
            ...

Program Logic Processing

During debugging, some issues arose. When recognizing gestures, continuous recognition meant that if a gesture was recognized 10 times in 1 second, 10 commands would be sent to the robotic arm. This was not what I initially envisioned.
Therefore, logical adjustments were needed. Here's how I addressed it:

# Set a 2-second timeframe to confirm the gesture. Only when a thumbs-up gesture is maintained for 2 seconds, the command to control the robotic arm is issued, using a control variable approach.

#init
#Variable to detect whether gesture exists
gesture_detected = False
#Variable that determines the timing after the gesture appears
gesture_start_time = None
# Set the variable 2s after the gesture appears
gesture_confirmation_time = 2

# When a specific gesture appears, gesture_start_time begins to count. During this period, continuous checks are made. If 2 seconds have passed, the gesture is confirmed, and then the corresponding robotic arm movement for that gesture is executed.

current_time = time.time()
if current_gesture:
    if not gesture_detected:
        gesture_detected = True
        gesture_start_time = current_time
    elif current_time - gesture_start_time > gesture_confirmation_time and not action_triggered:
        if current_gesture == "thumb_up":
            robotic arm action()

However, this is still not sufficient, as the hand maintaining the gesture for over 2 seconds would continue sending commands to the robotic arm. Here, we need to set a cooldown period to allow sufficient time for the robotic arm to complete its movement.

action_triggered = False
cooldown_start_time = None
cooldown_period = 2

# process gesture
                current_time = time.time()
                if current_gesture:
                    if not gesture_detected:
                        gesture_detected = True
                        gesture_start_time = current_time
                    elif current_time - gesture_start_time > gesture_confirmation_time and not action_triggered:
                        #Perform corresponding actions based on gestures
                        if current_gesture == "thumb_up":
                            print('good good')
                            mc.thum_up()
                        elif current_gesture == "palm_open":
                            print('forward')
                            mc.increment_x_and_send()
                        # You can add more gestures and corresponding action judgments
                        action_triggered = True
                        cooldown_start_time = current_time
                else:
                    gesture_detected = False
                    gesture_start_time = None
                    if action_triggered and current_time - cooldown_start_time > cooldown_period:
                        print('can continue')
                        action_triggered = False
                        cooldown_start_time = None

Video

https://youtu.be/9vOPKO_IG9M
Summary
This project demonstrates a method of using gesture recognition to control the myCobot 320, creating a new form of human-machine interaction. Although currently only a limited number of gestures and corresponding robotic arm movements have been implemented, it lays the groundwork for broader applications of robotic arms in the future. The innovative attempt to combine gestures with robotic arm control has not only improved my programming skills but also enhanced my problem-solving abilities, providing valuable experience for future related projects.

ElephantRobotics

This article on the mylangrobot project was created by neka-nat, and the author Shirokuma has authorized its editing and reproduction. A big shout out at neka-nat:)
https://twitter.com/neka_nat

alt text

Introduction

This project demonstrates a simple pick-and-place operation using a robotic arm, employing GPT-4V and myCobot. The demonstration uses a method named SoM (Object Detection Method) to generate robotic movements through natural language. In simpler terms, the machine receives natural language instructions, locates the target, and then commands the robotic arm to perform the grabbing task. The highlight of this project is the combination of GPT-4V's image processing and the SoM object detection algorithm, enabling the robotic arm to move through natural language interaction with the machine.

Software

SoM

Set of Mark (SoM) is a method designed to enhance the visual understanding capabilities of large language models. After an image is processed with SoM, a series of markers are added to it. These markers can be recognized and processed by language models, aiding in more accurate identification and understanding of objects and content within the image. This enhancement allows language models to conduct more precise analysis and descriptions of elements in images, thereby improving their performance in visual tasks.
alt text

GPT-4V

The commonly discussed GPT is a large predictive model with which we can have conversational interactions. The new iteration, GPT-4V, is a multimodal large language model that not only processes textual information but now also handles image information. It is capable of image understanding, image generation, and image description. This large model combines the powerful natural language processing abilities of GPT-4 with current image analysis technologies, offering more efficient and accurate integrated capabilities in vision and language.
Below is an example provided by OpenAI:

User:
"These are frames from a video that I want to upload. Generate a compelling description that I can upload along with the video."

alt text

#GPT-4V Image description processing
GPT4:
"🐺 Survival of the Fittest: An Epic Tale in the Snow ❄️ - Witness the intense drama of nature as a pack of wolves face off against mighty bison in a harsh winter landscape. This raw footage captures the essence of the wild where every creature fights for survival. With each frame, experience the tension, the strategy, and the sheer force exerted in this life-or-death struggle. See nature's true colors in this gripping encounter on the snowy plains. 🦬"

Remember to respect wildlife and nature. This video may contain scenes that some viewers might find intense or distressing, but they depict natural animal behaviors important for ecological studies and understanding the reality of life in the wilderness.

Openai-whisper

https://openai.com/research/whisper
An automatic speech recognition (ASR) system developed by OpenAI. It utilizes advanced deep learning techniques to transcribe, translate, and understand human speech. Whisper is characterized by its ability to support multiple languages and its exceptional performance in recognizing voices with various accents and in noisy environments. Additionally, it can handle different audio qualities, making it suitable for various applications such as transcribing meeting records, automatically generating subtitles, and assisting in translations.
alt text

pymycobot

https://github.com/elephantrobotics/pymycobot/
pymycobot is a Python library for the mycobot robot series. MyCobot is a compact, multifunctional collaborative robotic arm suitable for education, research, and lightweight industrial applications. The PyMyCobot library provides a simple programming interface, enabling developers to control and program MyCobot robots for operations such as movement, grabbing, and sensing. This library supports multiple operating systems and development environments, facilitating its integration into various projects, especially in robotics and automation. By using Python, a widely-used programming language, pymycobot makes operating and experimenting with MyCobot robots more accessible and flexible.

Hardware

myCobot 280M5

The myCobot 280 M5 is a desktop-level compact six-axis collaborative robot produced by Elephant Robotics. Designed for compactness, it is suitable for education, research, and light industrial applications. The myCobot 280 M5 supports various programming and control methods, compatible with different operating systems and programming languages, including:
●Main and auxiliary control chips: ESP32
●Supports Bluetooth (2.4G/5G) and wireless (2.4G 3D Antenna)
●Multiple input and output ports
●Supports free movement, joint movement, Cartesian movement, trajectory recording, and wireless control
●Compatible operating systems: Windows, Linux, MAC
●Supported programming languages: Python, C++, C#, JavaScript
●Supported programming platforms and tools: RoboFlow, myblockly, Mind+, UiFlow, Arduino, mystudio
●Supported communication protocols: Serial port control protocol, TCP/IP, MODBUS
These features make the myCobot 280 M5 a versatile, user-friendly robot solution suitable for a variety of application scenarios.
alt text

myCobot Vertical Suction Pump V2.0

Operates on the principle of vacuum adhesion, providing 3.3V IO control, and can be extensively used in the development of various embedded devices.
alt text

Camera

Standard USB and LEGO interfaces. The USB interface can be used with various PC devices, and the LEGO interface can be conveniently fixed. It is applicable to machine vision, image recognition, and other applications.
alt text
mylangrobot Software Analysis
The specific workflow of the project described at the beginning is as follows:

Audio Input: Record audio instructions first.
Audio Processing: Use "openai-whisper" to process the audio and convert it into text.
Language Model Interaction: Use the GPT-4 model to process the converted text instructions and understand the user's commands.
Image Processing: Use GPT-4V and the enhanced image capability of SoM to process images and find the target mentioned in the instructions.
Robotic Arm Control: Control the robotic arm to grab the identified target.

Audio Processing
This function utilizes speech_recognition to capture audio data from the microphone, enabling the computer to recognize it.
Libraries used:

import io
import os
from enum import Enum
from typing import Protocol

import openai
import speech_recognition as sr
from pydub import AudioSegment
from pydub.playback import play

Define interfaces, capture user input, and provide output to the user.

class Interface(Protocol):
    def input(self, prefix: str = "") -> str:
        return prefix + self._input_impl()

    def _input_impl(self) -> str:
        ...

    def output(self, message: str) -> None:
        ...

Initialize the microphone for audio input and output.

class Audio(Interface):
    def __init__(self):
        self.r = sr.Recognizer()
        self.mic = sr.Microphone()
        # openai-whisper API key
        self.client = openai.OpenAI(api_key=os.environ.get("OPENAI_API_KEY"))

Convert the input audio into text format for output.

 def _input_impl(self) -> str:
        print("Please tell me your command.")
        with self.mic as source:
            self.r.adjust_for_ambient_noise(source)
            audio = self.r.listen(source)

        try:
            return self.r.recognize_whisper(audio, language="japanese")

        except sr.UnknownValueError:
            print("could not understand audio")
        except sr.RequestError as e:
            print("Could not request results from Google Speech Recognition service; {0}".format(e))

The final return 'r' is the text format of the audio, which can be used for interaction with the GPT-4 model.

Image Processing and GPT-4 Language Interaction

When transmitting text to the GPT-4 model for interaction, images are sent along, so image processing and interaction are discussed together.
Libraries used for image processing:

import cv2
import numpy as np
import supervision as sv
import torch
from segment_anything import SamAutomaticMaskGenerator, sam_model_registry

from .utils import download_sam_model_to_cache

Primarily uses the SamAutomaticMaskGenerator feature to mark and draw markers on detected targets.

#Convert image to RGB format
        image_rgb = cv2.cvtColor(image, cv2.COLOR_BGR2RGB)
#Image processing, target detection and marker rendering
        sam_result = self.mask_generator.generate(image_rgb)
        detections = sv.Detections.from_sam(sam_result=sam_result)
        height, width, _ = image.shape
        image_area = height * width

        min_area_mask = (detections.area / image_area) > self.MIN_AREA_PERCENTAGE
        max_area_mask = (detections.area / image_area) < self.MAX_AREA_PERCENTAGE
        detections = detections[min_area_mask & max_area_mask]
        
        
        #Returns the result of the image and detected information
        labels = [str(i) for i in range(len(detections))]
        annotated_image = mask_annotator.annotate(scene=image_rgb.copy(), detections=detections)
        annotated_image = label_annotator.annotate(scene=annotated_image, detections=detections, labels=labels)
        return annotated_image, detections

alt text

This results in the following effect.
Note: The below function requires obtaining the GPT-4 API-Key for usage.
The resulting image is passed to the GPT-4 model, which requires some processing before use. Through GPT-4V, the image can be processed to return information about the image content and corresponding object information.

def prepare_inputs(message: str, image: np.ndarray) -> dict:
    # # Path to your image
    # image_path = "temp.jpg"
    # # Getting the base64 string
    base64_image = encode_image_from_cv2(image)

    payload = {
        "model": "gpt-4-vision-preview",
        "messages": [
            {"role": "system", "content": [metaprompt]},
            {
                "role": "user",
                "content": [
                    {
                        "type": "text",
                        "text": message,
                    },
                    {"type": "image_url", "image_url": {"url": f"data:image/jpeg;base64,{base64_image}"}},
                ],
            },
        ],
        "max_tokens": 800,
    }

    return payload


def request_gpt4v(message: str, image: np.ndarray) -> str:
    payload = prepare_inputs(message, image)
    response = requests.post("https://api.openai.com/v1/chat/completions", headers=headers, json=payload)
    res = response.json()["choices"][0]["message"]["content"]
    return res

Robotic Arm Control and Overall Integration
After image processing and GPT-4V model processing, the interpreted instructions generate target position information. This position information is passed to the robotic arm control system, which moves to the corresponding location to perform the grabbing action.
Key methods involved:
Move to the target object.

    def move_to_object(self, object_no: int, speed: Optional[float] = None) -> None:
        object_no = self._check_and_correct_object_no(object_no)
        print("[MyCobotController] Move to Object No. {}".format(object_no))
        detection = (
            np.array([-self._detections[object_no][0], -self._detections[object_no][1]]) + self.capture_coord.pos[:2]
        )
        print("[MyCobotController] Object pos:", detection[0], detection[1])
        self.move_to_xy(detection[0], detection[1], speed)

grab action
  def grab(self, speed: Optional[float] = None) -> None:
        print("[MyCobotController] Grab to Object")
        current_pos = self.current_coords().pos
        self.move_to_z(self.object_height + self.end_effector_height, speed)
        self._mycobot.set_basic_output(self._suction_pin, 0)
        time.sleep(2)
        self.move_to_z(current_pos[2], speed)

drop action
    def move_to_place(self, place_name: str, speed: Optional[float] = None) -> None:
        print("[MyCobotController] Move to Place {}".format(place_name))
        self._current_position = self.positions[place_name]
        self._mycobot.sync_send_angles(
            np.array(self._current_position) + self.calc_gravity_compensation(self._current_position),
            speed or self._default_speed,
            self._command_timeout,
        )
        print("Current coords: {}".format(self.current_coords()))

After each function is implemented, coordinate the entire process, streamline the workflow logic, and complete the task.
The specific code can be viewed in the operator.py file.

Example

Below is an example test to observe the project's outcome. The content involves a voice input saying "pick up the chocolate," and the robotic arm executes the task.
https://youtu.be/Eda1m7DnIhQ

Summary

This project demonstrates how to leverage advanced artificial intelligence and robotics technologies to accomplish complex automation tasks. By integrating voice recognition, natural language processing, image analysis, and precise robotic arm control, the project has successfully created a robotic system capable of understanding and executing spoken instructions. This not only enhances the naturalness and efficiency of robot-human interaction but also opens up new possibilities for robotic technology in various practical applications, such as automated manufacturing, logistics, assistive robots, and more.
Finally, thanks again to Shirokuma for sharing this case with us. If you have better examples, feel free to contact us!

ElephantRobotics

Introduction

Today, I am going to present a robotic arm model that I have independently designed and implemented. The core feature of this model is to achieve real-time gesture tracking - just a gentle drag with your hand, and the robotic arm can immediately follow your movements.

The reason why I wanted to create such a model is that in some dangerous environments, we can use robotic arms to replace manual work, thereby avoiding threats to human life.

You might ask, why not directly use remote keyboard control, joystick control, or APP control, but choose to manually drag? I believe that only manual operation can satisfy our need for precision to the greatest extent. Therefore, I decided to start trying to make this model and have initially completed the entire demo.

I hope that through this demo, I can demonstrate the infinite possibilities of robotic arms to everyone, and at the same time, I hope to inspire everyone's infinite longing for future technology.

Robotic Arm

The mechArm 270 is a 6 DOF robotic arm, with a compact structure design that can fit into a backpack for easy transportation. Importantly, it has many open control APIs, which allows you to quickly start controlling the robotic arm using Python. There are no complicated operations, and it even supports graphical programming, which allows people who are not very familiar with code to quickly get started with controlling the robotic arm.
alt text
The mechArm is a desktop robotic arm with a structure that mimics industrial designs. Its maximum working radius is 270mm, it can carry a load of 250g, and its repeat positioning accuracy is controlled within ±0.5mm.

Project

After introducing the basic equipment, let's start with the record of how I created this demo.

Environment:
Operating system: Windows 11

Programming language: Python 3.9+

Python libraries: pymycobot, time

pymycobot is an open-source library for Elephant Robotics, specifically designed to control the robotic arm of Elephant Robotics. Here is an example of a simple control code.

Code：

#Main methods used

#Create objects to communicate with the robotic arm.
MyCobot(serial,baud)
# angles control robot,
send_angles([list_angles],speed)
# coords control robot
send_coords([list_coords],speed,mode)

Example:

import time
from pymycobot.mycobot import MyCobot

# create a object
mc = MyCobot("com7",115200)

# angles control
mc.send_angles([0,0,0,0,0,0],100)
time.sleep(1)
mc.send_angles([90,90,90,90,90,90],100)
time.sleep(1)

I briefly introduced how to use python to control mechArm. Isn’t it easy?

Problem Analysis

Before starting the project, it's important to set up a framework and understand the specific problems we need to solve. For this, I made a flowchart of the project. In the following, I will refer to the manually controlled robotic arm as R1, and the following motion robotic arm as R2.
alt text
Control Robotic Arm: As mentioned above, the robotic arm can be controlled using the methods provided by the pymycobot library.

Motion Control Methods: The R1 robotic arm can be dragged by hand, returning the current angle information of the robotic arm at all times. The R2 robotic arm controls based on the angle information received from R1.

Communication Between Robotic Arms: This step is quite important in the entire project. Once established, the robotic arm can easily implement information transmission.

Next, I will mainly explain the Motion Control Methods and Communication Between Robotic Arms.

Motion Control methods

Get real-time angle information

pymycobot provides the "get_angles()" method to obtain the angle information of the current robot arm.

# Can obtain the current angle information of the robotic arm in real time
get_angles()

# example
print("real-time angles:",mc.get_anlges())

result: real-time angles:[0,0,0,0,0,0]

# Continuously obtain the current angle
while True:
    angels = mc.get_angles()
    print(angles) 
    time.sleep(0.1) #Go to the next step every 0.1s

Set the Refresh Mode for the Robotic Arm

The refresh mode of the robotic arm mainly falls into two categories: interpolation mode and non-interpolation mode. These refer to the ways in which the end effector of the robotic arm is controlled during motion trajectory planning. If no mode is set, the robotic arm may not be able to perform the expected motion correctly, which may lead to the following consequences:

Unsmooth motion
Inaccurate motion
Discontinuous motion

Interpolation Mode: The interpolation mode can realize smooth and continuous trajectory planning, ensuring that the position and posture of the end effector of the robotic arm transition smoothly during the motion process.

Non-Interpolation Mode: The non-interpolation mode means that the robotic arm only focuses on specific target points during the motion process, without performing interpolation calculations. Under the non-interpolation mode, the position and posture of the robotic arm will jump directly between key points, without undergoing a smooth transition.

When multiple robotic arms use the interpolation mode for motion at the same time, there may be situations of waiting or queuing. Therefore, we choose to use the non-interpolation mode.

#Set refresh mode
set_fresh_mode(1/0) 
1：no interpolation
0：interpolation

mc.set_fresh_mode(1)

Our code that integrates the previous ones is as follows.

Code:

import time
from pymycobot.mycobot import MyCobot

mc = MyCobot("COM7", 115200)    #release arm
mb = MyCobot("COM11", 115200)   #move arm

mb.set_fresh_mode(1)    #no interpolation
time.sleep(1)
mc.release_all_servos() #release robot
time.sleep(1)
speed = 100
while True:
    angles = mc.get_angles()    #get release arm angles
    mb.send_angles(angles, speed)   #send angles to move arm
    time.sleep(0.1)
## Communication between robotic arms：
Our solution is to connect two robotic arms to the same PC and connect them through a serial port.

# build connection
from pymycobot.mycobot import MyCobot

mc = MyCobot("COM7", 115200)   
mb = MyCobot("COM11", 115200)

By using the most basic USB data cables for connection, we have two serial port numbers for the robotic arms on our computer, and we can send instructions to them separately.
0_1699432439569_微信图片_20231108163344.png
https://www.youtube.com/watch?v=NByjgoqc2O4

Summary

From the content, it can be seen that although we can achieve about 70-80% synchronization, there are other factors that can cause significant delays. The reasons for the delays could be various, such as the speed of data processing and transmission, the reaction speed of the robotic arm, software optimization, hardware performance, etc. All of these are potential factors that can cause delays.

In addition, there is a significant limitation in that their communication is connected through serial ports. If the distance is a bit further, this method cannot be used, and its practicality is not strong. In the future, I will try to use wireless connections such as Bluetooth and WiFi to control the robotic arm.

ElephantRobotics

Introduction

Robotic arms are essential tools in modern industry and research fields, with widespread applications in manufacturing, healthcare, agriculture, education, and more. These robotic arms can not only perform precise operations but also work in hazardous environments inaccessible to humans, greatly enhancing work efficiency and safety.However, traditional control methods of robotic arms often require specialized knowledge and complex programming, which to some extent limits their popularization and application. Therefore, the method of controlling robotic arms via mobile apps emerged. The advantage of mobile app control is that it allows non-professionals to conveniently operate robotic arms. Moreover, through the mobile app, users can remotely control the robotic arm, greatly improving the convenience and flexibility of operation.This article aims to delve into the topic of how to use a mobile app to control robotic arms.

Product

myCobot 320

The myCobot 320 is a 6-axis collaborative robotic arm designed for user-oriented programming and development. It has built-in forward and inverse kinematics algorithms, offers an open ROS simulation development environment, and comes equipped with 12 standard 24V industrial IO interfaces, which can be expanded for PLC control programming. The working radius of the myCobot 320 can reach up to 350mm, and the end can bear a maximum load of 1kg.It offers precise positioning repeatability within a range of 0.5mm.
alt text
myCobot Controller
The "myCobot Controller" is an application launched by Elephant Robotics specifically for controlling the myCobot series of robotic arms. This app employs Bluetooth technology, enabling wireless connection between the user's smartphone or tablet and the myCobot robotic arm.
alt text
The working principle of the myCobot Controller APP is mainly based on Bluetooth communication technology. After successfully pairing with the myCobot robotic arm via Bluetooth, the app can control various functions of the robotic arm. This method is not only easy to operate, but it also allows for real-time, wireless control, greatly enhancing the flexibility and convenience of operation.

Steps for APP to control myCobot
Next, we will introduce how to use this function while operating.

Install APP

Currently, the myCobot Controller only supports Android system phones, and IOS system users will need to wait for some time. Clicking connect can directly lead to the download location.Currently, there are two ways to install the app. The first one is to directly search for "myCobot controller" on the Google Play Store for download.The second one is to download the APK directly from the official website to the mobile phone for installation. (You can install it directly after a normal download).
alt text

Connect myCobot

First, we power up the myCobot and select "Transponder".
alt text
After entering, we choose the Bluetooth mode, "Bluetooth".

After successful operation, the interface will appear as shown in the following image. The name of the Bluetooth is: mycobot320m5ble, and its MAC address is: 08:3a:f2:42:f0:26 (Each machine is different, which is convenient for distinguishing when two machines are on at the same time). Upon accessing this page, we simply need to wait, as the subsequent steps can be performed on a mobile phone.
alt text
Open the downloaded APP, click the Bluetooth icon to connect to myCobot.

Make sure the name of the Bluetooth corresponds to the name of the robotic arm, "mycobot320m5ble". If you are uncertain, you can verify the MAC address due to its uniqueness. If the display is as shown in the following image, the connection is successful.
alt text

Robotic arm motion control

The APP controls the robotic arm using forward and inverse kinematics, mainly in two ways: one is the control of the robotic arm's angle, and the other is the control of the robotic arm's coordinates.

Joint Control

Joint control is a control method of the forward kinematics algorithm. Forward kinematics refers to the process of calculating the position and posture of the robot's end effector when the parameters (Joint) of each joint of the robot are known. In other words, the spatial coordinates of the end of the robotic arm can be calculated by rotating the angles.
alt text
The Joint (1-6) on both sides can be increased or decreased to control the corresponding myCobot joint angles. The six parameters in the middle TCL part represent the Cartesian coordinate values of the robotic arm's end at this moment.

Coords Control

Coordinate control is a control method of the inverse kinematics algorithm. Inverse kinematics refers to the process of calculating the parameters (angles) of each joint of the robot when the coordinate position of the robot's end effector is known. This allows the robot to be moved by adjusting the coordinates of the robot's end, thereby calculating the parameters of the robot's joint angles under the current state.
alt text
The buttons on the left are easy to understand, corresponding to the position of the robotic arm's end in space, moving linearly up, down, left, and right. In other words, the end of the robotic arm moves forward, backward, left, right, up, and down in space, mainly in straight line movements. In Cartesian coordinates, 'Rz, Ry, Rx' are usually used to represent rotations around the z-axis, y-axis, and x-axis. These rotations typically follow the right-hand rule, that is, if your right hand's four fingers curl in the direction of rotation, then the direction your thumb points is the direction of the rotation axis.

Generally speaking:

● "rz": rotation around the z-axis. This will cause the end of the robotic arm to rotate within the x-y plane, that is, it will change the yaw angle of the end effector.

● "ry": rotation around the y-axis. This will cause the end of the robotic arm to rotate within the z-x plane, that is, it will change the pitch angle of the end effector.

● "rx": rotation around the x-axis. This will cause the end of the robotic arm to rotate within the y-z plane, that is, it will change the roll angle of the end effector.

In simpler terms, it is the robotic arm rotating around the x, y, z axes.

Operation video

https://youtu.be/8JBhk55pz8I

Summary

Overall, the mobile app control provides a more intuitive and easier-to-understand operating method, allowing non-professionals to quickly get started. In addition, we can remotely control the robotic arm, which makes it possible to operate the robotic arm in some complex or dangerous environments, thus expanding the application scenarios of the robotic arm.

Currently, the functionality of the app can still be expanded. What features do you think need to be added to the app? Feel free to comment below, let's discuss the optimization of the app's functionality together! If you like this article, likes and comments are the greatest support for us!

ElephantRobotics