WO2020228489A1 - 基于视觉的触觉测量方法、装置、芯片及存储介质 - Google Patents
基于视觉的触觉测量方法、装置、芯片及存储介质 Download PDFInfo
- Publication number
- WO2020228489A1 WO2020228489A1 PCT/CN2020/085608 CN2020085608W WO2020228489A1 WO 2020228489 A1 WO2020228489 A1 WO 2020228489A1 CN 2020085608 W CN2020085608 W CN 2020085608W WO 2020228489 A1 WO2020228489 A1 WO 2020228489A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- sample
- neural network
- curvature
- feedforward neural
- estimation model
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Images
Classifications
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B25—HAND TOOLS; PORTABLE POWER-DRIVEN TOOLS; MANIPULATORS
- B25J—MANIPULATORS; CHAMBERS PROVIDED WITH MANIPULATION DEVICES
- B25J13/00—Controls for manipulators
- B25J13/08—Controls for manipulators by means of sensing devices, e.g. viewing or touching devices
- B25J13/081—Touching devices, e.g. pressure-sensitive
- B25J13/084—Tactile sensors
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/016—Input arrangements with force or tactile feedback as computer generated output to the user
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01B—MEASURING LENGTH, THICKNESS OR SIMILAR LINEAR DIMENSIONS; MEASURING ANGLES; MEASURING AREAS; MEASURING IRREGULARITIES OF SURFACES OR CONTOURS
- G01B5/00—Measuring arrangements characterised by the use of mechanical techniques
- G01B5/20—Measuring arrangements characterised by the use of mechanical techniques for measuring contours or curvatures
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/22—Matching criteria, e.g. proximity measures
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/044—Recurrent networks, e.g. Hopfield networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/048—Activation functions
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0499—Feedforward networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/084—Backpropagation, e.g. using gradient descent
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/09—Supervised learning
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/40—Analysis of texture
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/60—Analysis of geometric attributes
- G06T7/64—Analysis of geometric attributes of convexity or concavity
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/20—Image preprocessing
- G06V10/22—Image preprocessing by selection of a specific region containing or referencing a pattern; Locating or processing of specific regions to guide the detection or recognition
- G06V10/225—Image preprocessing by selection of a specific region containing or referencing a pattern; Locating or processing of specific regions to guide the detection or recognition based on a marking or identifier characterising the area
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/74—Image or video pattern matching; Proximity measures in feature spaces
- G06V10/75—Organisation of the matching processes, e.g. simultaneous or sequential comparisons of image or video features; Coarse-fine approaches, e.g. multi-scale approaches; using context analysis; Selection of dictionaries
- G06V10/758—Involving statistics of pixels or of feature values, e.g. histogram matching
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/764—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/77—Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
- G06V10/774—Generating sets of training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/77—Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
- G06V10/776—Validation; Performance evaluation
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/82—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20084—Artificial neural networks [ANN]
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/10—Terrestrial scenes
Definitions
- This application relates to the field of human-computer interaction, and in particular to a vision-based tactile measurement method, device, chip and storage medium.
- the tactile sensor is a sensor used to imitate the function of the tactile sense, which can measure the tactile sense of the contacting object, such as the contact position and contact force.
- tactile sensors are mostly used in the robotics field.
- a tactile sensor is provided.
- the tactile sensor is provided with a semicircular flexible sensing surface.
- the inner surface of the flexible sensing surface is provided with a plurality of marking points arranged in an array and an image arranged toward the inner surface.
- Sensing components After the outer surface of the flexible sensing surface is in contact with the object, the flexible sensing surface will be deformed, causing multiple marking points on the inner surface to change positions due to the deformation.
- the image sensing component collects the inner surface image of the flexible sensing surface, and transmits the inner surface image to the chip.
- a convolutional neural network (CNN) is set in the chip, and the inner surface image is processed through the convolutional neural network to obtain the analysis result of the contact force.
- the training process of the above-mentioned convolutional neural network is relatively complicated, and as many as 20,000 training samples are required to achieve better training results.
- Various embodiments of the present application provide a vision-based tactile measurement method, device, chip, and storage medium, a feedforward neural network training method, device, computer equipment and storage medium, and a tactile sensor system , And a robotic system.
- a vision-based tactile measurement method is executed by a chip, the chip is connected to a tactile sensor, the tactile sensor includes a sensing surface and an image sensing component, the sensing surface is provided with a marking pattern;
- the method includes: acquiring an image sequence collected by the image sensing component on the sensing surface, the image of the image sequence includes the mark pattern; and according to the image sequence in the adjacent images in the image sequence Mark patterns, calculate the difference features of the mark patterns; and call a feedforward neural network to process the difference characteristics of the mark patterns to obtain a tactile measurement result; wherein the number of hidden layers in the feedforward neural network is less than a threshold.
- a method for training a feedforward neural network which is executed by a computer device, and the method includes:
- the training samples including a sample image sequence and sample tactile results, the sample image sequence being an image sequence collected by an image sensing component in a tactile sensor, the tactile sensor including a sensing surface and an image sensing component,
- the sensing surface is provided with a marking pattern, and the images of the image sequence include the marking pattern; according to the position of the marking pattern in adjacent images in the sample image sequence, a sample of the marking pattern is calculated Difference feature; call the feedforward neural network to process the sample difference feature of the mark pattern to obtain the predicted haptic result; the number of hidden layers in the feedforward neural network is less than the threshold; compare the predicted haptic result and the sample Error calculation is performed on the haptic result to obtain an error loss; and the feedforward neural network is trained according to the error loss through an error back propagation algorithm to obtain a trained feedforward neural network.
- a vision-based tactile measurement device the device is applied to a chip, the chip is connected to a tactile sensor, the tactile sensor includes a sensing surface and an image sensing component, the sensing surface is provided with a marking pattern;
- the device includes: a first acquisition module, configured to acquire an image sequence collected by the image sensing component on the sensing surface, the image of the image sequence includes the marking pattern; a first calculation module, Calculating the difference characteristics of the marking patterns according to the marking patterns in the adjacent images in the image sequence; and a feedforward neural network for processing the difference characteristics of the marking patterns to obtain a tactile measurement result; Wherein, the number of hidden layers in the feedforward neural network is less than a threshold.
- a training device for a feedforward neural network comprising:
- the second acquisition module is used to acquire training samples, the training samples include a sample image sequence and sample tactile results, the sample image sequence is an image sequence collected by an image sensing component in a tactile sensor, and the tactile sensor includes a flexible sensor.
- a sensing surface and an image sensing component disposed toward the inner surface of the flexible sensing surface, the flexible sensing surface is provided with a marking pattern, and the images of the image sequence include the position of the marking pattern;
- a second calculation The module is used to calculate the sample difference feature of the marking pattern according to the position of the marking pattern in the adjacent images in the sample image sequence;
- the feedforward neural network model is used to calculate the sample difference of the marking pattern
- the feature is processed to obtain a predicted haptic result, and the number of hidden layers in the feedforward neural network is less than a threshold; an error calculation module is used to perform error calculation on the predicted haptic result and the sample haptic result to obtain an error loss;
- a training module which is used to train the feedforward neural network according to the error loss through an
- a tactile sensor system comprising: a tactile sensor and a chip, the tactile sensor comprising a sensing surface and an image sensing component, the sensing surface is provided with a marking pattern, the image sensing component and the chip Connected; the chip includes at least one of a programmable logic circuit and program instructions, and when the chip is running, the chip is used to perform the visual-based tactile measurement method as described in the above aspect.
- a computer device includes a memory and a processor.
- the memory stores computer readable instructions.
- the processor executes the computer readable instructions, the steps of the above-mentioned feedforward neural network training method are realized.
- a computer-readable storage medium that stores computer-readable instructions that, when executed by a processor, implement the steps of the above-mentioned vision-based tactile measurement method, or the above-mentioned feedforward neural network training method step.
- a robot system comprising: a chip and a tactile sensor, the tactile sensor is arranged at least one of a fingertip part and a skin part, the tactile sensor includes a sensing surface and an image sensing component, the The sensing surface is provided with a marking pattern, and the image sensing component is connected to the chip; the chip includes at least one of a programmable logic circuit and a program instruction, and when the chip is running, the chip is used to execute the above The vision-based tactile measurement method described in the aspect.
- FIG. 1 is a schematic structural diagram of a tactile sensor in related technology provided by an exemplary embodiment of the present application
- Fig. 2 is a schematic structural diagram of a touch sensor system provided by an exemplary embodiment of the present application
- Fig. 3 is a schematic diagram of a flexible sensing surface provided by an exemplary embodiment of the present application.
- Fig. 4 is a flowchart of a method for using a touch sensor provided by an exemplary embodiment of the present application
- FIG. 5 is a schematic diagram of displacement recording of marking points provided by an exemplary embodiment of the present application.
- Fig. 6 is a schematic structural diagram of a feedforward neural network provided by an exemplary embodiment of the present application.
- FIG. 7 is a flow chart of a method for using a feedforward neural network provided by an exemplary embodiment of the present application.
- FIG. 8 is a schematic diagram of each model structure in a feedforward neural network provided by an exemplary embodiment of the present application.
- Fig. 9 is a flowchart of a method for using a position estimation model provided by an exemplary embodiment of the present application.
- Fig. 10 is a flowchart of a method for using a contact force estimation model provided by another exemplary embodiment of the present application.
- Fig. 11 is a flowchart of a method for using a surface classification model and a curvature estimation model provided by an exemplary embodiment of the present application;
- FIG. 12 is a flowchart of a method for using a surface classification model provided by an exemplary embodiment of the present application.
- FIG. 13 is a flowchart of a method for using a curvature estimation model of a spherical surface provided by an exemplary embodiment of the present application;
- FIG. 14 is a flowchart of a method for using a curvature estimation model of a cylindrical surface provided by an exemplary embodiment of the present application.
- FIG. 15 is a flowchart of a method for calculating the displacement of marker points on an image array provided by an exemplary embodiment of the present application
- Fig. 16 is a flowchart of a visual-based tactile measurement training method provided by an exemplary embodiment of the present application.
- FIG. 17 is a flowchart of a method for training a position estimation model provided by an exemplary embodiment of the present application.
- FIG. 18 is a flowchart of a training method of a contact force estimation model provided by an exemplary embodiment of the present application.
- Fig. 19 is a flowchart of a training method representing a surface classification model provided by an exemplary embodiment of the present application.
- FIG. 20 is a flowchart of a training method of a spherical estimation model provided by an exemplary embodiment of the present application.
- FIG. 21 is a flowchart of a method for training a cylindrical surface estimation model provided by another exemplary embodiment of the present application.
- FIG. 22 is a schematic structural diagram of models other than the surface classification model provided by an exemplary embodiment of the present application.
- FIG. 23 is a schematic diagram of a model structure of a surface classification model provided by an exemplary embodiment of the present application.
- FIG. 24 is a schematic diagram of a sample curved surface for training a feedforward neural network provided by an exemplary embodiment of the present application.
- FIG. 25 is a table of the number of used training sample surfaces provided by an exemplary embodiment of the present application.
- FIG. 26 is a schematic diagram of a geometric model of a tactile sensor provided by an exemplary embodiment of the present application.
- FIG. 27 is a schematic diagram of training results of a position estimation model provided by an exemplary embodiment of the present application.
- FIG. 28 is a schematic diagram of training results of a contact force estimation model provided by an exemplary embodiment of the present application.
- FIG. 29 is a confusion matrix representing the accuracy of training results of a surface classification model provided by an exemplary embodiment of the present application.
- FIG. 30 is a schematic diagram of training results of a spherical estimation model provided by an exemplary embodiment of the present application.
- FIG. 31 is a schematic diagram of training results of a cylindrical surface estimation model provided by an exemplary embodiment of the present application.
- Fig. 32 is a flowchart of a method of using the touch sensor system provided by an exemplary embodiment of the present application.
- Fig. 33 is a block diagram of a vision-based tactile measurement device provided by an exemplary embodiment of the present application.
- Fig. 34 is a block diagram of a training device for a feedforward neural network provided by another exemplary embodiment of the present application.
- Fig. 35 is a block diagram of a computer device provided by an exemplary embodiment of the present application.
- Feedforward Neural Network an artificial neural network with a unidirectional structure.
- the feedforward neural network includes at least two neural network layers. Among them, each neural network layer contains several neurons, and each neuron is arranged hierarchically. There is no interconnection between neurons in the same layer, and the transmission of information between layers is only carried out in one direction.
- Logistic (Sigmoid) function It is an "S"-shaped function used to describe the growth trend as roughly exponential growth at the initial stage; then as it becomes saturated, the increase slows down; finally, it increases when it reaches maturity The process of stopping.
- Softmax Normalized exponent
- Hidden layer a neural network layer structure used to input or analyze data.
- Output layer A neural network layer structure used to output results.
- a tactile sensor is provided in the related art, as shown in FIG. 1.
- the tactile sensor includes: a silicone sensor surface 11, marking points 12 arranged on the inner surface of the silicone sensor surface 11, a model front end 13 of the tactile sensor made by 3D printing technology, and the model front end 13 is used to fix the silicone sensor surface 11.
- LED light emitting diodes
- This technology uses a probabilistic model based on Bayesian theorem to distinguish the position of the contact point, the radius of curvature and the direction of the contact edge, and the convolutional neural network model (CNN) algorithm is used in the latest research.
- CNN convolutional neural network model
- a feedforward neural network is used to provide a vision-based tactile measurement solution.
- This method uses the displacement of the marked points on two consecutive images in the image array as feature values and inputs them to the feedforward neural network to obtain the position of the contact point, the magnitude and/or direction of the contact force, and the radius of curvature of the contact surface.
- CNN convolutional neural network model
- this method simplifies the input feature value (only the displacement and/or deformation of the marker point is required, not the entire image).
- the training sample Significantly reduce, improve the training efficiency of the neural network, so as to meet the requirements of simplifying the method of using tactile sensors, and without a large number of samples training, the feedforward neural network can achieve the same effect (or better) demand.
- the vision-based tactile measurement method is applied to the chip 117, which can be any one of a CPU, a GPU, a neural network chip, a microprocessor, or an FPGA circuit. It is not limited, the chip 117 is connected to a tactile sensor, and the tactile sensor includes a sensing surface 111 and an image sensing component 115, and the sensing surface 111 is provided with a marking pattern 112.
- the sensing surface 111 is a flexible sensing surface that can be deformed when in contact with other objects.
- the image sensing component 115 may be a camera. The camera can be arranged facing the inner surface of the sensing surface 111.
- Fig. 2 shows a schematic structural diagram of a tactile sensor system 300 provided by an exemplary embodiment of the present application.
- the tactile sensor includes a sensing surface 111, a base 113, a column 114, and an image sensing component 115 disposed toward the inner surface of the sensing surface 111, and a bottom plate 116 is used to place the tactile sensor.
- FIG. 3 shows a schematic diagram of the inner surface of a sensing surface 111 provided by an exemplary embodiment of the present application, and the inner surface of the sensing surface 111 is provided with a marking pattern 112.
- the shape of the sensing surface 111 is not limited.
- the sensing surface 111 can be any one of rectangular, hexagonal, circular, elliptical, hemispherical, or planar shapes.
- a hemispherical flexible sensing surface 111 is used as an example for description, as shown in FIG. 3.
- the marking pattern is implemented by using at least two marking points, or by using a grid, or by using both marking points and a grid.
- a grid is a pattern with intersecting grid lines, and the intersecting grid lines form grid points.
- marking points 112 are provided on the inner surface (or inside) of the sensing surface 111, and the marking points 112 may be arranged in an array or non-array arrangement, for example, a 4 ⁇ A rectangular array of 4 or 6 ⁇ 6, or a circular non-array arrangement.
- the distance between adjacent marking points 112 may be equal or unequal. When the distance between adjacent marking points 112 is equal, the displacement of the marking point 112 changes uniformly.
- the marking points 112 can be centrally arranged on the inner surface of the sensing surface 111, for example, a 4 ⁇ 4 rectangular array of marking points 112 are provided on the sensing surface 111, or they can be arranged along the edge of the sensing surface 111 .
- the color of the marking point can be any color. In this application, the black marking point 112 is selected to distinguish it from the white sensing surface 111, which can better indicate the displacement of the marking point 112.
- the array of black marking points 112 is centrally arranged on the inner surface of the sensing surface 111, and the distance between the marking points 112 on the edge of the marking point array and each edge on the sensing surface 111 is equal to each other. The distances between adjacent marking points 112 are equal.
- a rectangular array with a circular sensing surface 111 and a 6 ⁇ 6 marking point 112 is used as an example for description, as shown in FIG. 3.
- Fig. 4 shows a flowchart of a vision-based tactile measurement method provided by an exemplary embodiment of the present application. The method may be executed by the chip 117 in Fig. 1, and the method includes:
- Step 201 Obtain an image sequence collected by the image sensing component on the inner surface, and the image of the image sequence includes a marking pattern.
- the sensing surface 111 of the touch sensor When the touched object contacts the sensing surface 111 of the touch sensor, the sensing surface 111 is deformed, and the image sensing component continuously photographs the inner surface of the sensing surface 111 at a certain frequency and transmits images to the chip, thus forming an image array.
- the aforementioned frequency can be set according to the displacement of the marking point 112, such as 30 frames/sec or 60 frames/sec.
- the tactile sensor in the high-frequency shooting state of the image sensor component 115, the tactile sensor can also detect the slippage of the contacted object, and can even detect a large sudden force.
- Step 202 Calculate the difference characteristics of the marked images according to the marked images in the adjacent images in the image sequence.
- adjacent images in the image array are two adjacent images.
- the difference feature of the marking pattern includes at least one of the displacement and deformation of the marking point, such as including the displacement and deformation of the marking point.
- the difference feature of the marking pattern includes at least one of the displacement of the grid points in the grid and the deformation of the grid lines, such as the displacement of the grid points in the grid, or the grid The displacement of the grid points and the deformation of the grid lines.
- the above chip marks the same mark according to the two closest mark points 112 in two adjacent images, and can track the movement of each mark point, that is, calculate the displacement of the mark point 112 As a difference feature, as shown in Figure 5.
- Step 203 Invoke the feedforward neural network to process the difference characteristics of the marking pattern, and obtain the tactile measurement result.
- the feedforward neural network can be one or more neural network models, and each neural network model corresponds to a different function.
- the number of hidden layers in the feedforward neural network is less than the threshold. In some embodiments, the threshold is 2.
- the hidden layer and output layer of each model in the feedforward neural network are called, and the difference characteristics of the marking pattern are processed according to the models that realize different functions in the feedforward neural network, and the tactile measurement results are obtained.
- the number of neural network models, hidden layers, and output layers with different functions can be designed according to measurement needs.
- the number of hidden layers and output layers is 1 respectively for description.
- the method provided by the embodiments of the present application uses the difference feature of the mark pattern as the input feature, compared with the use of an image as the input feature in the related technology, can reduce the number of input features, thereby reducing the amount of calculation; at the same time, By using a feedforward neural network with hidden layers less than a threshold for feature extraction and prediction, compared with a CNN network with a larger number of layers, it can use less calculation to predict similar or better tactile measurement results .
- Fig. 6 shows a schematic structural diagram of a feedforward neural network 200 provided by an exemplary embodiment.
- the feedforward neural network 200 is provided with a hidden layer 202 and an output layer 203.
- the method for the above-mentioned feedforward neural network 200 to process the difference features of the marking pattern 112 to obtain the measurement result includes the following steps, as shown in FIG. 7:
- step 301 the hidden layer 201 in the feedforward neural network 200 is called to perform feature extraction on the difference features of the mark pattern 112 to obtain a feature representation.
- step 302 the output layer 202 in the feedforward neural network is called to process the feature representation to obtain the tactile measurement result.
- n hidden neurons in the hidden layer 201 There are n hidden neurons in the hidden layer 201, and n is an integer.
- the hidden layer 201 is constructed based on Sigmoid hidden neurons
- the aforementioned output layer 202 is constructed based on Softmax output neurons or linear output neurons.
- the number of neurons in the hidden layer can be any integer greater than zero, and the number of input feature values can also be any integer greater than zero.
- the above-mentioned neurons can be designed according to different functions. .
- the marking pattern 112 is implemented by using 36 marking points. Taking 100 hidden neurons in the hidden layer 201 as an example, 72 features are input to the input layer 201 of the feedforward neural network 200. The characteristic value is the displacement of 36 mark points 112 (x1, y1, x2, y2,..., x36, y36).
- the structure of the tactile sensor involved in this application is relatively simple, and at the same time, the method of inputting the displacement of the marker point on the continuous image array as the feature value into the neural network model is adopted. And design a feedforward neural network with a simple hidden layer to measure the touched objects, which simplifies the measurement process and reduces the number of training samples.
- the hidden layer in the structure of the feedforward neural network transmits information in a unidirectional manner, and the hidden layer contains at least one.
- the three-dimensional information of the contact force includes the magnitude and/or direction of the contact force
- the above-mentioned feedforward neural network includes: a position estimation model for estimating the position of the contact point, a contact force estimation model for estimating the three-dimensional information of the contact force (the magnitude and/or direction of the contact force), and a surface for classifying the contact surface
- a position estimation model for estimating the position of the contact point
- a contact force estimation model for estimating the three-dimensional information of the contact force (the magnitude and/or direction of the contact force)
- a surface for classifying the contact surface The classification model and the curvature estimation model used to estimate the local radius of curvature of the contact surface are shown in Figure 8.
- the tactile measurement result includes the contact position
- the feedforward neural network includes: a position estimation model
- the position estimation model includes a first hidden layer and a first output
- the measurement method of layer and contact position is shown in Figure 9, including:
- Step 401 Invoke the first hidden layer in the position estimation model to perform feature extraction on the difference features of the marking pattern to obtain a feature representation of the contact position;
- the first hidden layer is used for feature extraction of the displacement of the input marking points.
- the first hidden layer is based on Sigmoid Constructed by hidden neurons, the above-mentioned feature representation of the contact position is a representation of the feature representation corresponding to the contact position in the form of a vector. It has been explained in step 301 of using the feedforward neural network described above, and will not be repeated here.
- Step 402 Invoke the first output layer in the position estimation model to process the contact position feature representation to obtain the contact position.
- step 302 of using the feedforward neural network described above has been described in step 302 of using the feedforward neural network described above, and will not be repeated here.
- the number of hidden layers and output layers in the above position estimation model are both integers greater than zero, and the above neurons can be selected according to the realization of different functions.
- This application uses the first hidden layer and the first output The number of layers is 1, and the neurons are respectively selected based on Sigmoid hidden neurons and linear output neurons as examples, as shown in Figure 22.
- the feature value is input to the feedforward neural network 200, and the Sigmoid hidden neuron in the first hidden layer in the position estimation model is called to process the above feature value to obtain the feature representation of the contact position; the feature representation of the contact position will be used as The input value is input to the first output layer, and the linear output neuron in the first output layer will perform feature extraction on the feature representation to obtain and output the three-dimensional coordinates of the contact position in space.
- the specific three-dimensional coordinate system refer to the following to obtain the sample contact position Time coordinate system (refer to Figure 26).
- the tactile measurement result includes the three-dimensional information of the contact force.
- the feedforward neural network 200 includes a contact force estimation model and a contact force estimation model. Including: the second hidden layer and the second output layer, the method of measuring the three-dimensional information of the contact force is shown in Figure 10, including:
- Step 501 Invoke the second hidden layer in the contact force estimation model to perform feature extraction on the difference features of the marking pattern to obtain a contact force feature representation.
- the second hidden layer inputs the displacement of the marking point as the characteristic value to obtain the contact force characteristic representation, and the contact force characteristic representation As input to the second output layer.
- Step 502 Invoke the second output layer in the contact force estimation model to process the contact force feature representation to obtain three-dimensional information of the contact force, the three-dimensional information including size and/or direction.
- step 402 of using the feedforward neural network described above has been described in step 402 of using the feedforward neural network described above, and will not be repeated here.
- the numbers of the second hidden layer and the second output layer in the above-mentioned contact force estimation model are both integers greater than zero, and the above-mentioned neurons can be selected according to the realization of different functions.
- the number of the layer and the second output layer are 1 respectively, and the neurons are respectively selected based on Sigmoid hidden neurons and linear output neurons as examples, as shown in Figure 22.
- the difference feature is input to the feedforward neural network 200, and the Sigmoid hidden neuron in the second hidden layer in the contact force estimation model is called to process the above feature values to obtain the feature representation of the contact position; the feature representation of the contact position will be As the input value is input to the second output layer, the linear output neuron in the second output layer will predict the feature representation and obtain the three-dimensional information of the contact force in space, that is, the magnitude and/or direction of the contact force and output .
- size and/or direction includes: size only; or, direction only; or, size and direction.
- the tactile measurement result includes the local radius of curvature of the contact surface
- the feedforward neural network includes: a surface classification model and at least two curvature estimation models.
- the at least two curvature estimation models include: a spherical curvature estimation model and a cylindrical curvature estimation model.
- Step 601 Invoke the surface classification model to perform surface recognition on the displacement of the marking point, and obtain the surface type of the contact surface.
- the surface classification model is a neural network model used to predict the surface type of the contacted object.
- the surface type includes at least one of a spherical surface, a flat surface, and a cylindrical surface.
- Step 602 Call the target curvature estimation model of the at least two curvature estimation models according to the surface type to predict the curvature of the contact surface, and obtain the local radius of curvature of the contact surface.
- the chip calls the relevant curvature estimation model to estimate the curvature according to the type of the measured contact surface.
- the surface classification model includes a third hidden layer and a third output layer, and step 601 includes the following sub-steps, as shown in FIG. 12, including:
- Step 601a Invoke the third hidden layer in the surface classification model to perform surface recognition on the displacement of the marked point, and obtain a surface type feature representation.
- Step 601b Invoke the third output layer in the surface classification model to process the surface type feature representation to obtain the surface type of the contact surface.
- the marked point displacement is output as the surface type.
- the surface type includes any one of plane, spherical or cylindrical surface.
- the spherical estimation model includes a fourth hidden layer and a fourth output layer.
- step 602 includes:
- Step 602a calling the fourth hidden layer in the spherical surface estimation model to perform the first curvature prediction on the spherical surface to obtain the spherical curvature prediction feature representation.
- Step 602b call the fourth output layer in the spherical surface estimation model to process the spherical surface curvature prediction feature representation to obtain the local radius of curvature of the spherical surface.
- step 602 includes:
- Step 6021 Invoke the fifth hidden layer in the cylindrical surface estimation model to perform a second curvature prediction on the cylindrical surface to obtain a characteristic representation of the cylindrical surface curvature prediction.
- Step 6022 Call the fifth output layer in the cylindrical surface estimation model to process the cylindrical surface curvature prediction feature representation to obtain the local radius of curvature of the cylindrical surface.
- the surface type of the contact surface can be, but is not limited to, a spherical surface, a cylindrical surface, or a flat surface. In some embodiments, the contact surface is a spherical surface as an example.
- the hidden neurons and output neurons of the surface classification model can be It is set according to different realization functions, and the specific structure of the surface classification model of the present application is described in detail below (refer to FIG. 23).
- the displacement of the marked point is input as the feature value to the above surface classification model.
- the third hidden layer in the surface classification model performs surface recognition on the feature value to obtain the spherical type feature representation; the spherical type The feature representation is input to the third output layer as an input value, and the contact surface type is spherical; the chip calls the fourth hidden layer to predict the spherical curvature radius according to the contact surface type to be spherical, and obtains the spherical curvature prediction feature representation; the spherical curvature feature Indicates that the input is input to the fourth output layer, and the fourth output layer is called to process the spherical curvature prediction feature representation to obtain the local curvature radius of the spherical surface and output it.
- the movement displacement of the marked point is input as a feature value to the above surface classification model, and the third hidden layer in the surface classification model performs surface recognition on the feature value to obtain a cylindrical surface type feature representation;
- the characteristic representation of the cylindrical surface type is input as the input value to the third output layer, and the contact surface type is obtained as a cylindrical surface;
- the chip uses the fifth hidden layer to predict the radius of curvature of the cylindrical surface according to the contact surface type to obtain the cylindrical surface curvature.
- Predicted feature representation input the cylindrical curvature feature representation to the fifth output layer, call the fifth output layer to process the cylindrical curvature prediction feature representation, and obtain the local radius of curvature of the cylindrical surface and output it.
- the above-mentioned local radius of curvature is continuous, not an intermittent interval.
- the difference feature of the mark pattern is calculated according to the mark pattern in the adjacent images in the image sequence, as shown in FIG. 15, including:
- Step 202a in the adjacent i-th frame image and i+1-th frame image in the image sequence, determine that the two closest marking patterns are the same marking pattern;
- Step 202b Calculate the difference feature of the marking pattern according to the position (or position and deformation) of the marking pattern in the i-th frame image and the i+1-th frame image.
- i is assumed to be 1, and in the adjacent first frame image and second frame image in the image sequence, the two closest marker points are determined to be the same marker point; The position in the 1 frame image and the second frame image, calculate the displacement of the marker point, as shown in Figure 15. Among them, the value of i is an integer.
- FIG. 16 shows a flowchart of a method for training a feedforward neural network provided by an exemplary embodiment. According to an embodiment of the present application, the foregoing method is shown in FIG. 16, and includes:
- Step 1601 Obtain training samples.
- the training samples include a sample image sequence and sample tactile results.
- the sample image sequence is an image sequence collected by an image sensing component in the tactile sensor.
- Step 1602 Calculate the sample difference characteristics of the mark pattern according to the mark pattern in the adjacent images in the sample image sequence.
- the sample difference feature of the marking pattern is calculated; or, according to the position and deformation (such as size) of the marking pattern in adjacent images in the sample image sequence ) To calculate the sample difference characteristics of the marking pattern.
- the sample difference feature of the marking pattern includes: displacement of the marking point, or displacement and deformation of the marking point:
- the sample difference characteristics of the marking pattern include: the displacement of the grid points in the grid, or the displacement of the grid points in the grid and the deformation of the grid lines.
- step 1603 the feedforward neural network is called to process the sample difference characteristics of the mark pattern to obtain the predicted tactile result.
- the number of hidden layers in the feedforward neural network is less than the threshold.
- Step 1604 Perform error calculation on the predicted haptic result and the sample haptic result to obtain the error loss.
- Step 1605 Train the feedforward neural network according to the error loss through the error back propagation algorithm to obtain the trained feedforward neural network.
- the feedforward neural network used in this method is consistent with the above-mentioned neural network model.
- the feedforward neural network is trained.
- the specific structure of the feedforward neural network will not be repeated here.
- the hidden layer and the output layer are provided in the feed-forward neural network; step 1603, that is, the step of calling the feed-forward neural network to process the sample difference features of the mark pattern to obtain the predicted tactile result , Including: calling the hidden layer in the feedforward neural network to perform feature extraction on the sample difference features of the mark pattern to obtain a feature representation; and calling the output layer in the feedforward neural network to process the feature representation to obtain a predicted tactile result.
- the hidden layer is provided with n hidden neurons, where n is an integer; the hidden layer is constructed based on the logistic function Sigmoid hidden neurons; and the output layer is based on the normalized exponential function Softmax output Constructed by neurons or linear output neurons.
- the feedforward neural network includes a position estimation model for estimating the contact position.
- the position estimation model includes a first hidden layer and a first output layer.
- the training method is shown in FIG. 17 and includes:
- Step 1701 Obtain a first training sample.
- the first training sample includes a first sample image sequence and sample contact positions.
- the sample contact position is a position represented by coordinates in the form of three-dimensional coordinates.
- Step 1702 Calculate the sample difference feature of the mark pattern according to the mark pattern in the adjacent images in the sample image sequence.
- Step 1703 Call the position estimation model to process the sample difference characteristics of the marking pattern to obtain the predicted contact position.
- Step 1704 Perform error calculation on the predicted contact position and the sample contact position to obtain the first error loss.
- Step 1705 Train the position estimation model according to the first error loss through the Levenberg-Marquardt backpropagation algorithm to obtain the trained position estimation model.
- the coordinates (x 1 , y 1 , z 1 ), the displacement of the marker points in the obtained image array is input to the first hidden layer and the first output layer, and the first output layer obtains the predicted coordinates of the sample contact position (x 1 ',y 1 ',z 1 '), the coordinates (x 1 ,y 1 ,z 1 ) and predicted coordinates (x 1 ',y 1 ',z 1 ) of the sample contact position are determined by the Levenberg-Marquardt backpropagation algorithm ') Perform processing to obtain the first error loss, train the position estimation model according to the first error loss, and obtain the trained position estimation model.
- the feedforward neural network includes a contact force estimation model for estimating a three-dimensional contact force.
- the contact force estimation model includes a second hidden layer and a second output layer.
- the training method is shown in FIG. 18 and includes:
- Step 1801 Obtain a second training sample.
- the second training sample includes a second sample image sequence and sample three-dimensional information.
- the sample three-dimensional information is calibrated based on data collected by a torque sensor arranged at the tail of the tactile sensor.
- the three-dimensional information includes size And/or direction.
- Step 1802 Calculate the sample difference feature of the mark pattern according to the mark pattern in the adjacent images in the sample image sequence.
- Step 1803 Invoke the contact force estimation model to process the sample difference characteristics of the marking pattern to obtain predicted three-dimensional information.
- Step 1804 Perform error calculation on the predicted 3D information and the sample 3D information to obtain the second error loss.
- Step 1805 Train the contact force estimation model according to the second error loss through the Levenberg-Marquardt backpropagation algorithm to obtain the trained contact force estimation model.
- the second training sample is the actual three-dimensional information of the displacement of the marker points in the image array and the sample contact force as an example, and the displacement of the marker points in the image array and the sample three-dimensional information (f x , f y , f z ), the displacement of the marker points in the acquired image array is input to the second hidden layer and the second output layer, and the second output layer obtains the predicted three-dimensional information of the sample contact force (f x ', f y ',f z '), the sample 3D information (f x , f y , f z ) and the predicted 3D information (f x ', f y ', f z ') are processed through the Levenberg-Marquardt backpropagation algorithm to obtain The second error loss is to train the position estimation model according to the second error loss to obtain the trained position estimation model.
- the feedforward neural network includes a surface classification model for classifying contact surfaces, and the surface classification model includes a third hidden layer and a third output layer.
- the training method is shown in FIG. 19 and includes:
- Step 1901 Obtain a third training sample.
- the third training sample includes a third sample image sequence and a sample surface type.
- Step 1902 Calculate the sample difference feature of the mark pattern according to the mark pattern in the adjacent images in the sample image sequence.
- Step 1903 Invoke the surface classification model to process the sample difference features of the marking pattern to obtain the predicted surface type.
- Step 1904 Perform error calculation on the predicted surface type and the sample surface type to obtain a third error loss.
- step 1905 the surface classification model is trained according to the third error loss through the scaled conjugate gradient back propagation algorithm to obtain the trained surface classification model.
- the third training sample is the displacement of the marker points in the image array and the sample surface type as an example for illustration, the marker point displacement and the sample surface type (S 1 ) in the image array are acquired, and the acquired image The marker point displacement in the array is input to the third hidden layer and the third output layer.
- the third output layer obtains the predicted surface type (S 1 ') of the contact surface, and the sample surface type (S 1 ) and the predicted surface type (S 1 ′) are processed to obtain the third error loss, and the surface classification model is trained according to the third error loss to obtain the trained surface classification model.
- the structure of the above-mentioned surface classification model is shown in FIG. 23, and the third hidden layer can be provided with one or two.
- the third hidden layer is provided with one layer as an example.
- the surface classification model includes a third hidden layer and a third output layer.
- the hidden layer is constructed based on Sigmoid hidden neurons.
- the third output layer is constructed based on Softmax hidden neurons.
- the Sigmoid hidden neurons are suitable for classifying objects.
- And Softmax hides neurons to produce different output results corresponding to different shapes of the contact surface.
- the curvature estimation model includes: a spherical estimation model.
- the spherical estimation model includes a fourth hidden layer and a fourth output layer.
- the training method is shown in FIG. 20 and includes:
- Step 2010 Obtain a fourth training sample, where the fourth training sample includes the fourth sample image sequence and the local radius of curvature of the sample sphere;
- Step 2020 Calculate the sample difference characteristics of the marker points according to the marker patterns in the adjacent images in the sample image sequence.
- step 2030 the spherical estimation model is called to process the sample difference characteristics of the marking pattern to obtain the predicted radius of curvature.
- Step 2040 Perform error calculation on the predicted radius of curvature and the local radius of curvature of the sample spherical surface to obtain a fourth error loss.
- step 2050 the sphere estimation model is trained by the Levenberg-Marquardt back propagation algorithm according to the fourth error loss to obtain the trained sphere estimation model.
- the displacement of the marker points in the image array and the local radius of curvature of the sample sphere input the displacement of the marker points in the acquired image array to the fourth hidden layer, and the fourth output layer obtains the predicted radius of curvature (R 1 ') of the sample spherical surface, which is reversed by Levenberg-Marquardt
- the propagation algorithm processes the local radius of curvature (R 1 ) of the sample sphere and the predicted radius of curvature (R 1 ') of the sample sphere to obtain the fourth error loss.
- the sphere estimation model is trained to obtain the trained Spherical estimation model.
- the curvature estimation model includes: a cylindrical surface estimation model, and the cylindrical surface estimation model includes a fifth hidden layer and a fifth output layer.
- the training method is shown in FIG. 21 and includes:
- Step 2101 Obtain a fifth training sample, where the fifth training sample includes the fifth sample image sequence and the local radius of curvature of the sample cylindrical surface;
- Step 2102 Calculate the difference characteristics of the marking patterns according to the marking patterns in the adjacent images in the sample image sequence.
- Step 2103 Call the cylinder estimation model to process the difference features of the marking pattern to obtain the predicted radius of curvature.
- Step 2104 Perform error calculation on the predicted radius of curvature and the local radius of curvature of the sample cylindrical surface to obtain the fifth error loss.
- Step 2105 Train the cylinder estimation model according to the fifth error loss through the Levenberg-Marquardt backpropagation algorithm to obtain the trained cylinder estimation model.
- the displacement of the marker points in the image array and the local radius of curvature of the sample cylindrical surface (R 2 ) is input to the fifth hidden layer, and the fifth output layer obtains the predicted radius of curvature (R 2 ') of the sample cylindrical surface, and passes the Levenberg-Marquardt Levenberg-Marquardt reaction
- the direction propagation algorithm processes the local radius of curvature (R 2 ) of the sample cylindrical surface and the predicted radius of curvature (R 2 ') of the sample cylindrical surface to obtain the fifth error loss, and train the cylindrical surface estimation model according to the fifth error loss. Obtain the trained cylindrical estimation model.
- the contacted surface is not limited to a spherical surface and a cylindrical surface.
- the type of contact surface involved in the training sample and the number of use of each contact surface training sample are shown in Figure 24 and Figure 25, respectively.
- This application selects contact surfaces with multiple shapes including cones, cylinders, triangular pyramids, triangular prisms, planes, etc., to train the surface classification model. Because a center with a contact force greater than 5.5N will cause a larger contact area Deformation, so for the tip surface (triangular pyramid in Figure 24), the application collects a contact force of less than 5.5N (as shown in the table in Figure 25).
- the internal structure of the feedforward neural network 200 used when the feedforward neural network is not trained is the same, and the functions implemented by the internal structure are the same.
- the structure of the feedforward neural network 200 used in training is described in detail, and the specific structure is shown in the above-mentioned feedforward neural network 200.
- the tactile sensor system includes the above-mentioned tactile sensor and a chip 117 connected to the tactile sensor.
- the chip 117 includes a programmable logic circuit and/or program instructions, and when the chip is running, it is used to execute the above-mentioned vision-based tactile measurement method.
- the system is composed of the aforementioned tactile sensor and a chip 117.
- the chip 117 is the same type of chip 117 as the chip in the structure of the vision-based tactile measurement device.
- the chip 117 includes Programming logic circuits and/or program instructions, when the chip is running, is used to implement the above-mentioned vision-based tactile measurement method.
- Fig. 26 shows a geometric model of the force/torque sensor measuring the contact point provided by an exemplary embodiment.
- the actual contact position is obtained by the torque sensor 116 installed below the tactile sensor, and a virtual spatial coordinate system is established on the tactile sensor.
- the tactile sensor measures the force f and the torque m generated by the contact force on the coordinate system established by the tactile sensor.
- the force f is the contact force
- the moment m can be written as follows:
- r is the three-dimensional vector, that is, the position of the contact point relative to the coordinate system of the fingertip. It is a 3 ⁇ 3 antisymmetric matrix, used to represent the cross product. Because the matrix The rank of is two, so the solution of the formula can be written as follows:
- the entire sphere is used to represent the sensing surface 111, therefore, there are two intersection points between the straight line and the sensing surface 111.
- the inner product of the inner normal at one point and the force f measured by the tactile sensor is positive, that is, the actual contact position, and the position must be on the real sensing surface 111.
- the other intersection point may or may not fall on the sensing surface 111, but the other intersection point can be omitted because the inner product of the other intersection point and the normal is negative.
- the position estimation model is trained, and the training result is shown in FIG. 27.
- the value of the correlation coefficient (R) of the position estimation model is close to 1, and the root mean square error (RMSE) is about 0.6mm, which means that the consistency of the trained position estimation model and the input data is highly related.
- the contact force estimation model is trained, and the training result is shown in FIG. 28.
- the performance results of the contact force estimation model in the dynamic force range of 1.5N to 8N are as follows: After training, the value of the correlation coefficient (R) is close to 1, and the root mean square error (RMSE) is about 0.25N. This means that the consistency of the trained contact force estimation model and the input data is highly correlated.
- the surface classification model is trained, and the accuracy of the training result is shown in FIG. 29.
- the spherical estimation model is trained, and the training result is shown in FIG. 30.
- the training results of the spherical estimation model are as follows: After training, the value of the correlation coefficient (R) is about 0.9, and the root mean square error (RMSE) is about 8mm. This means that the trained spherical estimation model and the input The consistency of the data is highly relevant.
- the cylindrical surface estimation model is trained, and the training result is shown in FIG. 31.
- the training results of the spherical estimation model are as follows. After training, the correlation coefficient (R) is about 0.9, and the root mean square error (RMSE) is about 10mm. This means that the trained cylindrical estimation model is The consistency of the input data is highly relevant.
- the flow chart of the method of using the tactile sensor system 300 is shown in FIG. 32.
- the tactile sensor is provided with a sensing surface 111, and the inner surface of the sensing surface is provided with marking points 112, and
- the image sensor assembly 115 facing the inner surface of the surface 111 collects the displacement of the marked point 112 on the image sequence formed by the inner surface of the flexible sensor 111 through the image sensor element 115, and inputs the displacement of the marked point 112 as a feature value to the above
- the feedforward neural network measures the position of the contact point, the magnitude and/or direction of the three-dimensional contact force, and the local radius of curvature of the contact surface.
- the feedforward neural network can also be trained by comparing the real value of the contact position measured by the torque sensor, the real value of the contact force, and the real value of the local curvature radius of the contact surface with the measured value measured by the tactile sensor.
- the structural diagram of the vision-based tactile sensor is connected to the chip.
- the vision-based tactile measurement device includes: a first acquisition module 311, a first calculation module 312, and a feedforward neural network 313, as shown in FIG. 33.
- the first acquisition module 311 is configured to acquire an image sequence collected by the image sensing component 115 on the sensing surface, and the image of the image sequence includes a marking pattern.
- the first calculation module 312 is configured to calculate the difference feature of the marking pattern according to the position of the marking pattern 112 in the adjacent images in the image sequence.
- the feedforward neural network 313 is used to process the difference features of the marking pattern 112 to obtain the tactile measurement result.
- the feedforward neural network is provided with a hidden layer and an output layer.
- the hidden layer is used for feature extraction of the different features of the mark pattern to obtain a feature representation; the output layer is used for Feature represents processing to obtain tactile measurement results.
- the hidden layer is constructed based on Sigmoid hidden neurons; the output layer is constructed based on Softmax output neurons or linear output neurons.
- the above-mentioned tactile measurement result includes the contact position;
- the feedforward neural network includes: a position estimation model, the position estimation model includes a first hidden layer and a first output layer, the first hidden layer is used for the description
- the difference features of the marking pattern are feature extracted to obtain the contact position characteristic representation; the first output layer is used to process the contact position characteristic representation to obtain the contact position.
- the above-mentioned tactile measurement results include three-dimensional information of contact force
- the feedforward neural network includes a contact force estimation model
- the contact force estimation model includes a second hidden layer and a second output layer.
- the second output layer is used to process the characteristic expression of the contact force to obtain the three-dimensional information of the contact force.
- the three-dimensional information includes the size and direction of the contact force. At least one.
- the above-mentioned tactile measurement results include the local radius of curvature of the contact surface
- the feedforward neural network includes: a surface classification model and at least two curvature estimation models; the surface classification model is used to detect differences in the marking patterns The feature performs surface recognition to obtain the surface type of the contact surface; the target curvature estimation model in at least two curvature estimation models is used to predict the curvature of the contact surface according to the surface type to obtain the local curvature radius of the contact surface.
- the curvature estimation model includes: a spherical surface estimation model and a cylindrical surface estimation model.
- the spherical surface estimation model is used to perform the first curvature prediction on the spherical surface when the surface type is a spherical surface to obtain the local curvature of the spherical surface Radius, when the surface type is a cylindrical surface, the cylindrical surface estimation model is used to predict the second curvature of the cylindrical surface to obtain the local radius of curvature of the cylindrical surface.
- the surface classification model includes a third hidden layer and a third output layer.
- the third hidden layer is used to perform surface recognition on the difference features of the marking pattern to obtain a surface type feature representation; the third output The layer is used to process the surface type feature representation to obtain the surface type of the contact surface.
- the spherical surface estimation model includes a fourth hidden layer and a fourth output layer.
- the fourth hidden layer is used to perform the first curvature prediction on the spherical surface to obtain the spherical curvature prediction feature representation; the fourth output layer , Used to process the prediction feature representation of spherical curvature to obtain the local radius of curvature of the spherical surface.
- the cylindrical surface estimation model includes a fifth hidden layer and a fifth output layer.
- the fifth hidden layer is used to perform a second curvature prediction on the cylindrical surface to obtain a cylindrical surface curvature prediction feature representation;
- Five output layers used to process the prediction feature representation of the cylindrical surface curvature to obtain the local radius of curvature of the cylindrical surface.
- the first calculation unit is configured to determine that the two closest marking patterns are the same marking pattern among the adjacent i-th frame image and the i+1-th frame image in the image sequence;
- the second calculating unit is used to calculate the difference feature of the marking pattern according to the position (or position and deformation) of the marking pattern in the i-th frame image and the i+1-th frame image.
- the marking pattern includes at least two marking points, and the sample difference feature of the marking pattern includes at least one of displacement and deformation of the marking point.
- the marking pattern includes a grid
- the difference feature of the marking pattern includes at least one of a displacement of a grid point in the grid and a deformation of a grid line.
- Fig. 34 shows a schematic structural diagram of a training module of a vision-based tactile measurement device provided by an exemplary embodiment.
- the training module of the vision-based tactile measurement device includes:
- the second acquisition module 411 is used to acquire training samples.
- the training samples include a sample image sequence and sample tactile results.
- the sample image sequence is an image sequence collected by an image sensing component in the tactile sensor.
- the tactile sensor includes a sensing surface and an image sensor. The component, the sensing surface is provided with a marking pattern, and the images of the image sequence include the marking pattern.
- the second calculation module 412 is configured to calculate the sample difference characteristics of the marking pattern according to the marking patterns in the adjacent images in the sample image sequence.
- the feedforward neural network model 413 is used to process the sample difference characteristics of the mark pattern to obtain the predicted tactile results, wherein the number of hidden layers in the feedforward neural network is less than the threshold.
- the error calculation module 414 is configured to perform error calculation on the predicted haptic result and the sample haptic result to obtain the error loss.
- the training module 415 is used to train the feedforward neural network model according to the error loss through the error back propagation algorithm to obtain the trained feedforward neural network model.
- the feedforward neural network model is provided with a hidden layer and an output layer.
- the hidden layer is used to extract the different features of the mark pattern to obtain a feature representation; the output layer is used to perform feature representation. Processing, get the tactile measurement result.
- the hidden layer is constructed based on Sigmoid hidden neurons; the output layer is constructed based on Softmax output neurons or linear output neurons.
- the feedforward neural network model includes a position estimation model for estimating the contact position
- the second acquisition module 411 is used for acquiring a first training sample
- the first training sample includes a first sample image sequence The contact position with the sample.
- the estimation model is used to process the sample difference characteristics of the marking pattern to obtain the predicted contact position.
- the error calculation module 414 is further configured to perform error calculation on the predicted contact position and the sample contact position to obtain a first error loss.
- the training module 415 is also used to train the position estimation model according to the first error loss through the Levenberg-Marquardt backpropagation algorithm to obtain the trained position estimation model.
- the feedforward neural network model includes a contact force estimation model for estimating the three-dimensional contact force
- the second acquisition module 411 is also used to acquire a second training sample.
- the second training sample includes a second sample.
- the image sequence and the three-dimensional information of the sample contact force is calibrated based on the data collected by the torque sensor installed at the tail of the tactile sensor.
- the three-dimensional information includes size and/or direction.
- the contact force estimation model is used to process the sample difference characteristics of the marking pattern to obtain predicted three-dimensional information.
- the error calculation module 414 is also used to perform error calculation on the predicted three-dimensional information and the sample three-dimensional information to obtain the second error loss.
- the training module 415 is also used to train the contact force estimation model according to the second error loss through the Levenberg-Marquardt backpropagation algorithm to obtain the trained contact force estimation model.
- the feedforward neural network model includes a surface classification model for classifying contact surfaces
- the second acquisition module 411 is also used to acquire a third training sample
- the third training sample includes a third sample image The type of sequence and sample contact surface.
- the surface classification model is used to process the sample difference characteristics of the marking pattern to obtain the predicted surface type.
- the error calculation module 414 is further configured to perform error calculation on the predicted surface type and the sample surface type to obtain a third error loss.
- the training module 415 is also used to train the surface classification model according to the third error loss through the scaled conjugate gradient backpropagation algorithm to obtain the trained surface classification model.
- the feedforward neural network model includes at least two curvature estimation models for estimating curvature
- the second acquisition module 411 is used to acquire training samples
- the training samples include the type of sample surface and sample tactile results
- the second calculation module 412 is used to train the feedforward neural network according to the error loss through the error back propagation algorithm to obtain the trained feedforward neural network.
- the curvature estimation model includes: a spherical estimation model, the second acquisition module 411 is used to acquire a fourth training sample, and the fourth training sample includes the type of the fourth sample surface and the local curvature of the sample sphere radius.
- the spherical surface estimation model is used to process the sample difference characteristics of the marking pattern to obtain the predicted radius of curvature when the predicted surface type is spherical.
- the error calculation module 414 is also used to perform error calculation on the predicted radius of curvature of the spherical surface and the local radius of curvature of the sample spherical surface to obtain the fourth error loss.
- the training module 415 is also used to train the spherical surface estimation model according to the fourth error loss through the Levenberg-Marquardt backpropagation algorithm to obtain the trained spherical surface estimation model.
- the curvature estimation model includes: a cylindrical surface estimation model, the second acquisition module 411 is used to acquire a fifth training sample, and the fifth training sample includes the type of the fifth sample surface and the sample cylindrical surface The local radius of curvature.
- the cylindrical surface estimation model is used to call the cylindrical surface estimation model to process the difference characteristics of the marking pattern when the predicted surface type is a cylindrical surface to obtain the predicted radius of curvature.
- the error calculation module 414 is also used to perform error calculation on the predicted radius of curvature of the cylinder and the local radius of curvature of the sample cylindrical surface to obtain the fifth error loss.
- the training module 415 is also used to train the cylindrical surface estimation model according to the fifth error loss through the Levenberg-Marquardt backpropagation algorithm to obtain the trained cylindrical surface estimation model.
- the embodiment of the present application also provides a computer device, which includes a memory and a processor, the memory stores computer-readable instructions, and the processor implements the steps of the above-mentioned feedforward neural network training method when the computer-readable instructions are executed.
- Fig. 35 shows a schematic structural diagram of a computer device provided by an exemplary embodiment of the present application.
- the computer device can be used to execute the above-mentioned feedforward neural network training method. Specifically:
- the computer device 3500 includes a central processing unit (CPU) 3501, a system memory 3504 including a random access memory (RAM) 3502 and a read only memory (ROM) 3503, and a system bus 3505 connecting the system memory 3504 and the central processing unit 3501.
- the computer equipment 3500 also includes a basic input/output system (I/O system) 3506 that helps transfer information between various devices in the computer, and a mass storage device for storing the operating system 3513, application programs 3514, and other program modules 3515 3507.
- I/O system basic input/output system
- the basic input/output system 3506 includes a display 3508 for displaying information and an input device 3509 such as a mouse and a keyboard for the user to input information.
- the display 3508 and the input device 3509 are both connected to the central processing unit 3501 through the input and output controller 3510 connected to the system bus 3505.
- the basic input/output system 3506 may also include an input and output controller 3510 for receiving and processing input from multiple other devices such as a keyboard, a mouse, or an electronic stylus.
- the input and output controller 3510 also provides output to a display screen, a printer, or other types of output devices.
- the mass storage device 3507 is connected to the central processing unit 3501 through a mass storage controller (not shown) connected to the system bus 3505.
- the mass storage device 3507 and its associated computer-readable medium provide non-volatile storage for the computer device 3500. That is, the mass storage device 3507 may include a computer-readable medium (not shown) such as a hard disk or a CD-ROM drive.
- Computer-readable media may include computer storage media and communication media.
- Computer storage media includes volatile and nonvolatile, removable and non-removable media implemented in any method or technology for storing information such as computer readable instructions, data structures, program modules or other data.
- Computer storage media include RAM, ROM, EPROM, EEPROM, flash memory or other solid-state storage technologies, CD-ROM, DVD or other optical storage, tape cartridges, magnetic tape, disk storage or other magnetic storage devices.
- RAM random access memory
- ROM read-only memory
- EPROM Erasable programmable read-only memory
- EEPROM electrically erasable programmable read-only memory
- the computer device 3500 may also be connected to a remote computer on the network through a network such as the Internet to operate. That is, the computer device 3500 can be connected to the network 3512 through the network interface unit 3511 connected to the system bus 3505, or in other words, the network interface unit 3511 can also be used to connect to other types of networks or remote computer systems (not shown).
- the aforementioned memory also includes one or more programs, and one or more programs are stored in the memory and configured to be executed by the CPU.
- the embodiment of the present application also provides a computer-readable storage medium storing computer-readable instructions, wherein the computer-readable instructions are characterized in that, when the computer-readable instructions are executed by a processor, the above vision-based tactile measurement method is implemented, or the above feedforward Training method of neural network.
- the embodiment of the present application also provides a robot system, the robot system includes: a chip and a tactile sensor, the tactile sensor is provided in at least one of the fingertip part and the skin part, the tactile sensor includes a flexible sensing surface and an orientation flexible sensor.
- the image sensing component is arranged on the inner surface of the sensing surface. Marking points are arranged in the flexible sensing surface.
- the image sensing component is connected to the chip.
- the chip includes at least one of a programmable logic circuit and program instructions. , Used to implement the above-mentioned vision-based tactile measurement method.
- the embodiment of the present application also provides a flow chart of a method for using the touch sensor system 300, as shown in FIG. 32.
- the method of use is suitable for the above-mentioned touch sensor, the feedforward neural network in the touch sensor, and the above-mentioned feedforward neural network.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Evolutionary Computation (AREA)
- Artificial Intelligence (AREA)
- Computing Systems (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Software Systems (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Life Sciences & Earth Sciences (AREA)
- Medical Informatics (AREA)
- Databases & Information Systems (AREA)
- Molecular Biology (AREA)
- Computational Linguistics (AREA)
- Mathematical Physics (AREA)
- Biophysics (AREA)
- Biomedical Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Evolutionary Biology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Human Computer Interaction (AREA)
- Robotics (AREA)
- Mechanical Engineering (AREA)
- Geometry (AREA)
- Image Analysis (AREA)
- User Interface Of Digital Computer (AREA)
Abstract
Description
Claims (31)
- 一种基于视觉的触觉测量方法,所述方法由芯片执行,所述芯片与触觉传感器相连,所述触觉传感器包括传感面和图像传感组件,所述传感面设置有标记图案;所述方法包括:获取所述图像传感组件对所述传感面采集的图像序列,所述图像序列的图像中包括所述标记图案;根据所述图像序列中的相邻图像内的所述标记图案,计算所述标记图案的差异特征;及调用前馈神经网络对所述标记图案的差异特征进行处理,得到触觉测量结果;其中,所述前馈神经网络内的隐藏层数量小于阈值。
- 根据权利要求1所述的方法,其特征在于,所述前馈神经网络内设置有所述隐藏层和输出层;所述调用前馈神经网络对所述标记图案的差异特征进行处理,得到触觉测量结果,包括:调用所述前馈神经网络中的所述隐藏层对所述标记图案的差异特征进行特征提取,得到特征表示;及调用所述前馈神经网络中的所述输出层对所述特征表示进行处理,得到触觉测量结果。
- 根据权利要求2所述的方法,其特征在于,所述隐藏层内设置有n个隐藏神经元,n为整数;所述隐藏层是基于逻辑斯蒂函数Sigmoid隐藏神经元构建的;及所述输出层是基于归一化指数函数Softmax输出神经元或线性输出神经元构建的。
- 根据权利要求2所述的方法,其特征在于,所述触觉测量结果包括接触位置;所述前馈神经网络包括:位置估计模型,所述位置估计模型包括第一隐藏层和第一输出层;所述调用所述前馈神经网络中的所述隐藏层对所述标记图案的差异特征 进行特征提取,得到特征表示,包括:调用所述位置估计模型中的所述第一隐藏层对所述标记图案的差异特征进行特征提取,得到接触位置特征表示;所述调用所述前馈神经网络中的所述输出层对所述特征表示进行处理,得到触觉测量结果,包括:调用所述位置估计模型中的所述第一输出层对所述接触位置特征表示进行处理,得到所述接触位置。
- 根据权利要求2所述的方法,其特征在于,所述触觉测量结果包括接触力的三维信息,所述前馈神经网络包括接触力估计模型,所述接触力估计模型包括:第二隐藏层和第二输出层;所述调用所述前馈神经网络中的所述隐藏层对所述标记图案的差异特征进行特征提取,得到特征表示,包括:调用所述接触力估计模型中的所述第二隐藏层对所述标记图案的差异特征进行特征提取,得到接触力特征表示;所述调用所述前馈神经网络中的所述输出层对所述特征表示进行处理,得到触觉测量结果,包括:调用所述接触力估计模型中的所述第二输出层对所述接触力特征表示进行处理,得到所述接触力的三维信息,所述三维信息包括大小和方向中的至少一种。
- 根据权利要求2所述的方法,其特征在于,所述触觉测量结果包括接触表面的局部曲率半径;所述前馈神经网络包括:表面分类模型和至少两个曲率估计模型;所述调用前馈神经网络对所述标记图案的差异特征进行处理,得到触觉测量结果,包括:调用所述表面分类模型对所述标记图案的差异特征进行表面识别,得到接触表面的表面类型;及根据所述表面类型调用所述至少两个曲率估计模型中的目标曲率估计模 型对所述接触表面进行曲率预测,得到所述接触表面的局部曲率半径。
- 根据权利要求6所述的方法,其特征在于,所述曲率估计模型包括:球面估计模型和圆柱面估计模型;所述根据所述表面类型调用所述至少两个曲率估计模型中的目标曲率估计模型对所述接触表面进行曲率预测,得到所述接触表面的局部曲率半径,包括:当所述表面类型是球面时,调用所述球面估计模型对所述球面进行第一曲率预测,得到所述球面的局部曲率半径;及当所述表面类型是圆柱面时,调用所述圆柱面估计模型对所述圆柱面进行第二曲率预测,得到所述圆柱面的局部曲率半径。
- 根据权利要求6所述的方法,其特征在于,所述表面分类模型包括第三隐藏层和第三输出层;所述调用所述表面分类模型对所述标记图案的差异特征进行表面识别,得到接触表面的表面类型,包括:调用所述表面分类模型中的所述第三隐藏层对所述标记图案的差异特征进行表面识别,得到表面类型特征表示;及调用所述表面分类模型中的所述第三输出层对所述表面类型特征表示进行处理,得到接触表面的表面类型。
- 根据权利要求7所述的方法,其特征在于,所述球面估计模型包括第四隐藏层和第四输出层;所述调用所述球面估计模型对所述球面进行第一曲率预测,得到所述球面的局部曲率半径,包括:调用所述球面估计模型中的第四隐藏层对所述球面进行第一曲率预测,得到球面曲率预测特征表示;及调用所述球面估计模型中的第四输出层对所述球面曲率预测特征表示进行处理,得到所述球面的局部曲率半径。
- 根据权利要求7所述的方法,其特征在于,所述圆柱面估计模型包括第五隐藏层和第五输出层;所述调用所述圆柱面估计模型对所述圆柱面进行第二曲率预测,得到所述圆柱面的局部曲率半径,包括:调用所述圆柱面估计模型中的第五隐藏层对所述圆柱面进行第二曲率预 测,得到圆柱面曲率预测特征表示;及调用所述圆柱面估计模型中的第五输出层对所述圆柱面曲率预测特征表示进行处理,得到所述圆柱面的局部曲率半径。
- 根据权利要求1至10任一所述的方法,其特征在于,所述标记图案包括至少两个标记点,所述标记图案的差异特征包括所述标记点的位移和形变中的至少一种。
- 根据权利要求1至10任一项所述的方法,其特征在于,所述标记图案包括网格,所述标记图案的差异特征包括所述网格中网格点的位移和网格线的形变中的至少一种。
- 根据权利要求1至10任一所述的方法,其特征在于,所述根据所述图像序列中的相邻图像内的所述标记图案,计算所述标记图案的差异特征,包括:在所述图像序列中相邻的第i帧图像和第i+1帧图像中,确定最接近的两个标记图案为相同的标记图案,i为整数;及根据所述标记图案在所述第i帧图像和所述第i+1帧图像中的位置和形变中的至少一种,计算所述标记图案的差异特征。
- 一种前馈神经网络的训练方法,由计算机设备执行,所述方法包括:获取训练样本,所述训练样本包括样本图像序列和样本触觉结果,所述样本图像序列是触觉传感器内的图像传感组件采集的图像序列,所述触觉传感器包括传感面和图像传感组件,所述传感面设置有标记图案,所述图像序列的图像中包括所述标记图案;根据所述样本图像序列中的相邻图像内的所述标记图案,计算所述标记图案的样本差异特征;调用前馈神经网络对所述标记图案的样本差异特征进行处理,得到预测触觉结果;所述前馈神经网络中的隐藏层数量少于阈值;对所述预测触觉结果和所述样本触觉结果进行误差计算,得到误差损失; 及通过误差反向传播算法根据所述误差损失对所述前馈神经网络进行训练,得到训练后的前馈神经网络。
- 根据权利要求14所述的方法,其特征在于,所述前馈神经网络内设置有所述隐藏层和输出层;所述调用前馈神经网络对所述标记图案的样本差异特征进行处理,得到预测触觉结果,包括:调用所述前馈神经网络中的所述隐藏层对所述标记图案的样本差异特征进行特征提取,得到特征表示;及调用所述前馈神经网络中的所述输出层对所述特征表示进行处理,得到预测触觉结果。
- 根据权利要求15所述的方法,其特征在于,所述隐藏层内设置有n个隐藏神经元,n为整数;所述隐藏层是基于逻辑斯蒂函数Sigmoid隐藏神经元构建的;及所述输出层是基于归一化指数函数Softmax输出神经元或线性输出神经元构建的。
- 根据权利要求14所述的方法,其特征在于,所述前馈神经网络包括用于估计接触位置的位置估计模型,所述获取训练样本,所述训练样本包括样本图像序列和样本触觉结果,包括:获取第一训练样本,所述第一训练样本包括第一样本图像序列和样本接触位置;所述调用前馈神经网络对所述标记图案的样本差异特征进行处理,得到预测触觉结果,包括:调用所述位置估计模型对所述标记图案的样本差异特征进行处理,得到预测接触位置;所述对所述预测触觉结果和所述样本接触位置进行误差计算,得到误差损失,包括:对所述预测接触位置和所述样本接触位置进行误差计算,得到第一误差 损失;所述通过误差反向传播算法根据所述误差损失对所述前馈神经网络进行训练,得到训练后的前馈神经网络,包括:通过莱文贝格-马夸特Levenberg–Marquardt反向传播算法根据所述第一误差损失对所述位置估计模型进行训练,得到训练后的位置估计模型。
- 根据权利要求14所述的方法,其特征在于,所述前馈神经网络包括用于估计接触力的三维信息的接触力估计模型,所述获取训练样本,所述训练样本包括样本图像序列和样本触觉结果,包括:获取第二训练样本,所述第二训练样本包括第二样本图像序列和样本三维信息,所述样本三维信息是基于设置在所述触觉传感器尾部的力矩传感器采集到的数据进行标定得到的,所述样本三维信息包括大小和方向中的至少一种;所述调用前馈神经网络对所述标记图案的样本差异特征进行处理,得到预测触觉结果,包括:调用所述接触力估计模型对所述标记图案的样本差异特征进行处理,得到预测三维信息;所述对所述预测触觉结果和所述样本触觉结果进行误差计算,得到误差损失,包括:对所述预测三维信息和所述样本三维信息进行误差计算,得到第二误差损失;所述通过误差反向传播算法根据所述误差损失对所述前馈神经网络进行训练,得到训练后的前馈神经网络,包括:通过莱文贝格-马夸特Levenberg–Marquardt反向传播算法根据所述第二误差损失对所述接触力估计模型进行训练,得到训练后的接触力估计模型。
- 根据权利要求14所述的方法,其特征在于,所述前馈神经网络包括用于分类接触表面的表面分类模型;所述获取训练样本,所述训练样本包括样本图像序列和样本触觉结果,包括:获取第三训练样本,所述第三训练样本包括第三样本图像序列和样本表面类型;所述调用前馈神经网络对所述标记图案的样本差异特征进行处理,得到预测触觉结果,包括:调用所述表面分类模型对所述标记图案的样本差异特征进行处理,得到预测表面类型;所述对所述预测触觉结果和所述样本触觉结果进行误差计算,得到误差损失,包括:对所述预测表面类型和所述样本表面类型进行误差计算,得到第三误差损失;所述通过误差反向传播算法根据所述误差损失对所述前馈神经网络进行训练,得到训练后的前馈神经网络,包括:通过缩放共轭梯度反向传播算法根据所述第三误差损失对所述表面分类模型进行训练,得到训练后的表面分类模型。
- 根据权利要求19所述的方法,其特征在于,所述前馈神经网络还包括曲率估计模型,所述曲率估计模型包括球面估计模型,所述方法还包括:获取第四训练样本,所述第四训练样本包括第四样本图像序列和样本球面的局部曲率半径;当所述预测表面类型是球面时,调用所述调用球面估计模型对所述标记图案的样本差异特征进行处理,得到预测曲率半径;对所述球面的预测曲率半径和所述样本球面的局部曲率半径进行误差计算,得到第四误差损失;及通过莱文贝格-马夸特Levenberg–Marquardt反向传播算法根据所述第四误差损失对所述球面估计模型进行训练,得到训练后的球面估计模型。
- 根据权利要求19所述的方法,其特征在于,所述前馈神经网络还包括曲率估计模型,所述曲率估计模型包括圆柱面估计模型,所述方法还包括:获取第五训练样本,所述第五训练样本包括第五样本图像序列和样本圆 柱面的局部曲率半径;当所述预测表面类型是圆柱面时,调用所述圆柱面估计模型对所述标记图案的差异特征进行处理,得到预测曲率半径;对所述圆柱的预测曲率半径和所述样本圆柱面的局部曲率半径进行误差计算,得到第五误差损失;及通过莱文贝格-马夸特Levenberg–Marquardt反向传播算法根据所述第五误差损失对所述圆柱面估计模型进行训练,得到训练后的圆柱面估计模型。
- 根据权利要求14至21任一所述的方法,其特征在于,所述标记图案包括至少两个标记点,所述标记图案的样本差异特征包括所述标记点的位移和形变中的至少一种。
- 根据权利要求14至21任一项所述的方法,其特征在于,所述标记图案包括网格,所述标记图案的样本差异特征包括:所述网格中网格点的位移和网格线的形变中的至少一种。
- 一种基于视觉的触觉测量装置,其特征在于,所述装置应用于芯片中,所述芯片与触觉传感器相连,所述触觉传感器包括传感面和图像传感组件,所述传感面设置有标记图案;所述装置包括:第一获取模块,用于获取所述图像传感组件对所述传感面采集的图像序列,所述图像序列的图像中包括所述标记图案;第一计算模块,用于根据所述图像序列中的相邻图像内的所述标记图案,计算所述标记图案的差异特征;及前馈神经网络,用于对所述标记图案的差异特征进行处理,得到触觉测量结果;其中,所述前馈神经网络内的隐藏层数量小于阈值。
- 一种前馈神经网络的训练装置,其特征在于,所述装置包括:第二获取模块,用于获取训练样本,所述训练样本包括样本图像序列和 样本触觉结果,所述样本图像序列是触觉传感器内的图像传感组件采集的图像序列,所述触觉传感器包括传感面和图像传感组件,所述传感面设置有标记图案,所述图像序列的图像中包括所述标记图案;第二计算模块,用于根据所述样本图像序列中的相邻图像内的所述标记图案,计算所述标记图案的样本差异特征;前馈神经网络模型,用于对所述标记图案的样本差异特征进行处理,得到预测触觉结果;所述前馈神经网络中的隐藏层数量少于阈值;误差计算模块,用于对所述预测触觉结果和所述样本触觉结果进行误差计算,得到误差损失;及训练模块,用于通过误差反向传播算法根据所述误差损失对所述前馈神经网络进行训练,得到训练后的前馈神经网络。
- 一种芯片,其特征在于,所述芯片包括可编程逻辑电路和程序指令中的至少一种,当所述芯片运行时所述芯片用于执行如权利要求1至13任一所述的基于视觉的触觉测量方法。
- 一种触觉传感器系统,其特征在于,所述系统包括:触觉传感器和芯片,所述触觉传感器包括柔性传感面和图像传感组件,所述传感面设置有标记图案,所述图像传感组件与所述芯片相连;所述芯片包括可编程逻辑电路和程序指令中的至少一种,当所述芯片运行时所述芯片用于执行如权利要求1至13任一所述的基于视觉的触觉测量方法。
- 一种计算机设备,包括存储器和处理器,所述存储器存储有计算机可读指令,其特征在于,所述处理器执行所述计算机可读指令时实现权利要求14至23中任一项所述的前馈神经网络的训练方法的步骤。
- 一种计算机可读存储介质,存储有计算机可读指令,其特征在于,所述计算机可读指令被处理器执行时实现权利要求1至13中任一项所述的基于视觉的触觉测量方法的步骤。
- 一种计算机可读存储介质,存储有计算机可读指令,其特征在于,所述计算机可读指令被处理器执行时实现权利要求14至23中任一项所述的前馈神经网络的训练方法的步骤。
- 一种机器人系统,其特征在于,所述机器人系统包括:芯片和触觉传感器,所述触觉传感器设置在指尖部位和皮肤部位中的至少一个部位,所述触觉传感器包括柔性传感面和朝向所述柔性传感面的内表面设置的图像传感组件,所述柔性传感面内设置有标记图案,所述图像传感组件与所述芯片相连;所述芯片包括可编程逻辑电路和程序指令中的至少一种,当所述芯片运行时所述芯片用于执行权利要求1至14任一所述的基于视觉的触觉测量方法。
Priority Applications (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP20804776.1A EP3971684B1 (en) | 2019-05-16 | 2020-04-20 | Vision-based tactile measurement method, apparatus, chip and storage medium |
| US17/369,837 US12214487B2 (en) | 2019-05-16 | 2021-07-07 | Vision-based tactile measurement method and apparatus, chip, and storage medium |
| US19/003,999 US20260115934A1 (en) | 2019-05-16 | 2024-12-27 | Vision-based tactile measurement method and apparatus, chip, and storage medium |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201910411693.6 | 2019-05-16 | ||
| CN201910411693.6A CN110162175B (zh) | 2019-05-16 | 2019-05-16 | 基于视觉的触觉测量方法、装置、设备及存储介质 |
Related Child Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US17/369,837 Continuation US12214487B2 (en) | 2019-05-16 | 2021-07-07 | Vision-based tactile measurement method and apparatus, chip, and storage medium |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2020228489A1 true WO2020228489A1 (zh) | 2020-11-19 |
Family
ID=67631149
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2020/085608 Ceased WO2020228489A1 (zh) | 2019-05-16 | 2020-04-20 | 基于视觉的触觉测量方法、装置、芯片及存储介质 |
Country Status (4)
| Country | Link |
|---|---|
| US (2) | US12214487B2 (zh) |
| EP (1) | EP3971684B1 (zh) |
| CN (2) | CN110162175B (zh) |
| WO (1) | WO2020228489A1 (zh) |
Cited By (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN112949197A (zh) * | 2021-03-08 | 2021-06-11 | 北京理工大学 | 一种基于深度学习的平凸透镜曲率半径在线测量方法 |
| WO2022148542A1 (en) * | 2021-01-08 | 2022-07-14 | MAX-PLANCK-Gesellschaft zur Förderung der Wissenschaften e.V. | Method for force inference, method for training a feed-forward neural network, force inference module, and sensor arrangement |
| WO2022191910A1 (en) * | 2021-03-08 | 2022-09-15 | Massachusetts Institute Of Technology | Enhanced depth estimation using deep learning |
| CN121068067A (zh) * | 2025-11-06 | 2025-12-05 | 哈尔滨工业大学 | 视触觉传感器 |
| US12620107B2 (en) | 2022-01-06 | 2026-05-05 | Massachusetts Institute Of Technology | Enhanced depth estimation using deep learning |
Families Citing this family (24)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN110162175B (zh) * | 2019-05-16 | 2022-04-19 | 腾讯科技(深圳)有限公司 | 基于视觉的触觉测量方法、装置、设备及存储介质 |
| CN110722591B (zh) * | 2019-10-12 | 2021-06-22 | 哈尔滨工业大学(深圳) | 柔性体的力感知方法、夹爪及机械手 |
| CN110737339B (zh) * | 2019-10-28 | 2021-11-02 | 福州大学 | 基于深度学习的视觉—触觉交互模型构建方法 |
| CN112060085B (zh) * | 2020-08-24 | 2021-10-08 | 清华大学 | 一种基于视触多尺度定位的机器人操作位姿控制方法 |
| CN112652060A (zh) * | 2021-01-06 | 2021-04-13 | 上海交通大学 | 一种基于粒子图像测速法的多模态视触觉传感系统及方法 |
| WO2022148541A1 (en) * | 2021-01-08 | 2022-07-14 | MAX-PLANCK-Gesellschaft zur Förderung der Wissenschaften e.V. | Sensor arrangement for sensing forces and methods for fabricating a sensor arrangement and parts thereof |
| CN113670928B (zh) * | 2021-06-29 | 2023-03-28 | 华中农业大学 | 一种基于视触觉的无损瓜果损伤识别与硬度检测装置 |
| CN114179082B (zh) * | 2021-12-07 | 2024-09-13 | 南京工程学院 | 一种基于接触力信息的图像力触觉检测装置及再现方法 |
| CN114544052B (zh) * | 2022-01-19 | 2023-03-28 | 清华大学 | 触觉传感器、机器人、实现触觉信息获取的方法及装置 |
| CN114407084B (zh) * | 2022-01-21 | 2025-03-28 | 广州市香港科大霍英东研究院 | 基于机器视觉的机器人关节形变传感器、机械臂及方法 |
| CN116803625B (zh) * | 2022-03-17 | 2024-11-19 | 腾讯科技(深圳)有限公司 | 机器人控制方法、装置、设备和存储介质 |
| CN114732396B (zh) * | 2022-04-02 | 2026-02-10 | 冯军峰 | 硬膜外和硬膜下血肿监测系统、计算机可存储介质及装置 |
| CN114659678B (zh) * | 2022-04-12 | 2024-11-12 | 深圳市松果体机器人科技有限公司 | 面形柔性触觉传感器 |
| CN114769021B (zh) * | 2022-04-24 | 2022-11-25 | 广东天太机器人有限公司 | 一种基于全角度模板识别的机器人喷涂系统及方法 |
| CN115018700B (zh) * | 2022-04-26 | 2025-07-01 | 大连理工大学 | 一种触觉图像超分辨率重建方法及采集系统 |
| CN115824470B (zh) * | 2022-10-25 | 2025-05-02 | 中国科学院自动化研究所 | 一种触觉传感器、制备方法和点云重建方法 |
| CN115719385B (zh) * | 2022-11-14 | 2025-10-31 | 北京通用人工智能研究院 | 基于视觉的触觉传感器标定方法及装置 |
| CN116399500B (zh) * | 2023-03-29 | 2026-04-21 | 杜承金 | 一种接触力检测方法及磁触觉传感器 |
| CN116337013B (zh) * | 2023-03-29 | 2026-01-30 | 北京信息科技大学 | 一种基于神经网络的软体操作臂的位姿测量方法 |
| US12481403B2 (en) * | 2023-04-21 | 2025-11-25 | Samsung Electronics Co., Ltd. | Continuous tactile feedback for electronic devices |
| CN116596022B (zh) * | 2023-05-18 | 2025-12-09 | 浙江大学 | 一种秀丽隐杆线虫仿生多感知-运动行为方法及系统 |
| CN118809592B (zh) * | 2024-07-09 | 2025-09-23 | 深圳市优必选科技股份有限公司 | 机器人控制方法、装置、电子设备及存储介质 |
| CN119473019A (zh) * | 2025-01-14 | 2025-02-18 | 华东交通大学 | 基于改进图卷积神经网络的穿戴设备触觉感知方法及系统 |
| CN119880231B (zh) * | 2025-03-28 | 2025-06-06 | 湖南大学 | 一种集力觉与视触觉的多功能传感器及其测量方法 |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20110029470A1 (en) * | 2009-07-30 | 2011-02-03 | Northwestern University | Systems, methods, and apparatus for reconstruction of 3-d object morphology, position, orientation and texture using an array of tactile sensors |
| CN106204620A (zh) * | 2016-07-21 | 2016-12-07 | 清华大学 | 一种基于微视觉的触觉三维力检测方法 |
| CN109716361A (zh) * | 2016-09-08 | 2019-05-03 | 谷歌有限责任公司 | 执行触摸运动预测的深度机器学习 |
| CN110162175A (zh) * | 2019-05-16 | 2019-08-23 | 腾讯科技(深圳)有限公司 | 基于视觉的触觉测量方法、装置、设备及存储介质 |
Family Cites Families (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN1057246C (zh) * | 1998-07-01 | 2000-10-11 | 中国科学院合肥智能机械研究所 | 一种多功能类皮肤型传感器 |
| CN1280069C (zh) * | 2003-11-01 | 2006-10-18 | 中国科学院合肥智能机械研究所 | 一种柔性触觉传感器及触觉信息检测方法 |
| EP3387516B1 (en) * | 2015-12-09 | 2022-04-20 | FlatFrog Laboratories AB | Improved stylus identification |
| RU2665273C2 (ru) * | 2016-06-03 | 2018-08-28 | Автономная некоммерческая образовательная организация высшего образования "Сколковский институт науки и технологий" | Обучаемые визуальные маркеры и способ их продуцирования |
| EP3267288A1 (en) | 2016-07-08 | 2018-01-10 | Thomson Licensing | Method, apparatus and system for rendering haptic effects |
| CN106092382B (zh) * | 2016-07-20 | 2018-09-11 | 山东大学 | 一种基于弹性体三维形变的触觉传感器及检测方法 |
| CN106650918B (zh) * | 2016-11-25 | 2019-08-30 | 东软集团股份有限公司 | 构建系统模型的方法和装置 |
| CN108229548A (zh) * | 2017-12-27 | 2018-06-29 | 华为技术有限公司 | 一种物体检测方法及装置 |
| CN108692644B (zh) * | 2018-03-26 | 2019-09-27 | 华中科技大学 | 一种复杂曲面三坐标测量装置及误差补偿方法 |
| WO2020222027A1 (en) * | 2019-04-29 | 2020-11-05 | Ecole Polytechnique Federale De Lausanne (Epfl) | Dynamic media content categorisation method |
-
2019
- 2019-05-16 CN CN201910411693.6A patent/CN110162175B/zh active Active
- 2019-05-16 CN CN202210431476.5A patent/CN114661169B/zh active Active
-
2020
- 2020-04-20 EP EP20804776.1A patent/EP3971684B1/en active Active
- 2020-04-20 WO PCT/CN2020/085608 patent/WO2020228489A1/zh not_active Ceased
-
2021
- 2021-07-07 US US17/369,837 patent/US12214487B2/en active Active
-
2024
- 2024-12-27 US US19/003,999 patent/US20260115934A1/en active Pending
Patent Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20110029470A1 (en) * | 2009-07-30 | 2011-02-03 | Northwestern University | Systems, methods, and apparatus for reconstruction of 3-d object morphology, position, orientation and texture using an array of tactile sensors |
| CN106204620A (zh) * | 2016-07-21 | 2016-12-07 | 清华大学 | 一种基于微视觉的触觉三维力检测方法 |
| CN109716361A (zh) * | 2016-09-08 | 2019-05-03 | 谷歌有限责任公司 | 执行触摸运动预测的深度机器学习 |
| CN110162175A (zh) * | 2019-05-16 | 2019-08-23 | 腾讯科技(深圳)有限公司 | 基于视觉的触觉测量方法、装置、设备及存储介质 |
Non-Patent Citations (3)
| Title |
|---|
| FANG, BIN ET AL.: "A Dual-Modal Vision-Based Tactile Sensor for Robotic Hand Grasping", 2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 21 May 2018 (2018-05-21), pages 1 - 9, XP033403444, ISSN: 2577-087X * |
| See also references of EP3971684A4 |
| WANG,FE ILU ET AL.: "Application of RBF neural network in decoupling research of flexible tactile sensor", COMPUTER ENGINEERING AND APPLICATIONS, vol. 51, no. 10, 31 May 2015 (2015-05-31), pages 20 - 25, XP055753127, ISSN: 1002-8331 * |
Cited By (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2022148542A1 (en) * | 2021-01-08 | 2022-07-14 | MAX-PLANCK-Gesellschaft zur Förderung der Wissenschaften e.V. | Method for force inference, method for training a feed-forward neural network, force inference module, and sensor arrangement |
| CN112949197A (zh) * | 2021-03-08 | 2021-06-11 | 北京理工大学 | 一种基于深度学习的平凸透镜曲率半径在线测量方法 |
| WO2022191910A1 (en) * | 2021-03-08 | 2022-09-15 | Massachusetts Institute Of Technology | Enhanced depth estimation using deep learning |
| US12620107B2 (en) | 2022-01-06 | 2026-05-05 | Massachusetts Institute Of Technology | Enhanced depth estimation using deep learning |
| CN121068067A (zh) * | 2025-11-06 | 2025-12-05 | 哈尔滨工业大学 | 视触觉传感器 |
| CN121068067B (zh) * | 2025-11-06 | 2026-02-10 | 哈尔滨工业大学 | 视触觉传感器 |
Also Published As
| Publication number | Publication date |
|---|---|
| EP3971684B1 (en) | 2026-02-18 |
| CN110162175B (zh) | 2022-04-19 |
| CN110162175A (zh) | 2019-08-23 |
| US20260115934A1 (en) | 2026-04-30 |
| CN114661169A (zh) | 2022-06-24 |
| EP3971684A1 (en) | 2022-03-23 |
| US12214487B2 (en) | 2025-02-04 |
| EP3971684A4 (en) | 2022-06-29 |
| CN114661169B (zh) | 2024-08-13 |
| US20210334584A1 (en) | 2021-10-28 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN110162175B (zh) | 基于视觉的触觉测量方法、装置、设备及存储介质 | |
| US11565407B2 (en) | Learning device, learning method, learning model, detection device and grasping system | |
| CN108764048B (zh) | 人脸关键点检测方法及装置 | |
| EP3853764A1 (en) | Training neural networks for vehicle re-identification | |
| Hoang et al. | Context-aware grasp generation in cluttered scenes | |
| JP7051751B2 (ja) | 学習装置、学習方法、学習モデル、検出装置及び把持システム | |
| JP7349423B2 (ja) | 学習装置、学習方法、学習モデル、検出装置及び把持システム | |
| CN110335314A (zh) | 基于目标物体的二维图像预测目标物体实时位姿的方法 | |
| US12524087B2 (en) | Pen state detection circuit, system, and method | |
| CN115082511A (zh) | 一种基于轻量级堆叠沙漏网络的机械臂姿态估计方法 | |
| CN114399628A (zh) | 复杂空间环境下的绝缘子高效检测系统 | |
| Wang et al. | Hierarchical viewpoint planning for complex surfaces in industrial product inspection | |
| Hu et al. | Active shape reconstruction using a novel visuotactile palm sensor | |
| Salah et al. | NeuTac: Zero-shot Sim2Real measurement for neuromorphic vision-based tactile sensors | |
| Yu et al. | Mtgrasp: Multiscale 6-DoF robotic grasp detection | |
| Liang et al. | AllTact fin ray: A compliant robot gripper with omni-directional tactile sensing | |
| CN115331021B (zh) | 基于多层特征自身差异融合的动态特征提取与描述方法 | |
| JP7254262B2 (ja) | 作業推定装置、作業推定方法、及び、作業推定プログラム | |
| Daqi et al. | An industrial intelligent grasping system based on convolutional neural network | |
| Chu et al. | ViTac-Gripper: A Vision-Based Tactile Gripper With Enhanced Multi-Physical Field Perception for Underwater Robots | |
| Tu et al. | Contact localization from soft tactile array sensor using tactile image | |
| Luo et al. | Efficacy of varying sensing features for enhanced performance of deep-learning-informed multidimensional force platform | |
| CN121936103A (zh) | 视触觉传感器的仿真方法及装置、存储介质、程序产品、计算设备 | |
| CN120706215A (zh) | 抓取参数模型训练方法、装置、电子设备和存储介质 | |
| KR20240059519A (ko) | 치수 측정 장치 및 방법 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 20804776 Country of ref document: EP Kind code of ref document: A1 |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 2020804776 Country of ref document: EP |
|
| ENP | Entry into the national phase |
Ref document number: 2020804776 Country of ref document: EP Effective date: 20211216 |
|
| WWG | Wipo information: grant in national office |
Ref document number: 2020804776 Country of ref document: EP |