WO2020087960A1 - 一种影像识别的方法、装置、终端设备和医疗系统 - Google Patents
一种影像识别的方法、装置、终端设备和医疗系统 Download PDFInfo
- Publication number
- WO2020087960A1 WO2020087960A1 PCT/CN2019/093602 CN2019093602W WO2020087960A1 WO 2020087960 A1 WO2020087960 A1 WO 2020087960A1 CN 2019093602 W CN2019093602 W CN 2019093602W WO 2020087960 A1 WO2020087960 A1 WO 2020087960A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- image
- lesion
- medical image
- recognition
- degree
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/0002—Inspection of images, e.g. flaw detection
- G06T7/0012—Biomedical image inspection
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0464—Convolutional networks [CNN, ConvNet]
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/09—Supervised learning
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/10—Segmentation; Edge detection
- G06T7/11—Region-based segmentation
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/40—Extraction of image or video features
- G06V10/50—Extraction of image or video features by performing operations within image blocks; by using histograms, e.g. histogram of oriented gradients [HoG]; by summing image-intensity values; Projection analysis
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20021—Dividing image into blocks, subimages or windows
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20081—Training; Learning
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20084—Artificial neural networks [ANN]
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G06T2207/30004—Biomedical image processing
- G06T2207/30092—Stomach; Gastric
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G06T2207/30004—Biomedical image processing
- G06T2207/30096—Tumor; Lesion
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V2201/00—Indexing scheme relating to image or video recognition or understanding
- G06V2201/03—Recognition of patterns in medical or anatomical images
Definitions
- the present invention relates to the field of image processing technology, and in particular, to a method, device, terminal device, and medical system for image recognition.
- the terminal device can determine the probability of a patient's cancer based on the patient's esophagus image through computer technology. In this way, the doctor can perform further diagnostic analysis based on the output result, which improves the accuracy and efficiency of medical diagnosis.
- a conventional machine learning method eg, wavelet operator
- a machine learning method is generally used to classify the image based on pre-extracted feature information to obtain a disease classification result.
- Embodiments of the present invention provide an image recognition method, device, terminal device, and medical system, which are used to improve the efficiency and accuracy of recognition when performing disease recognition based on medical images.
- the medical image is recognized through a second recognition model to generate a lesion degree recognition result indicating the degree of the lesion.
- Acquisition unit used to acquire medical images to be identified
- the discriminating unit is used to discriminate the medical image through the first recognition model and generate a disease recognition result indicating whether the medical image includes a disease;
- the recognition unit is configured to recognize the medical image through the second recognition model when the lesion recognition result indicates that the medical image includes a lesion, and generate a lesion degree recognition result indicating the degree of the lesion.
- Obtain a recognition model to identify the degree of lesion output after identifying the medical image to be identified, and the degree of lesion recognition result includes: the multiple image blocks included in the medical image, the degree of recognition of the lesion degree of each image block and its medical image Region information of the region, and / or the disease degree indicator image after corresponding indication information is set in the corresponding region according to the recognition result of the disease degree of each image block;
- the recognition model to recognize medical images specifically includes: dividing the medical image into multiple image blocks, and for each image block, extracting the feature information of the image block and determining the degree of lesion recognition of the image block according to the extracted feature information result.
- the input unit is used to obtain the medical image to be recognized, and to recognize the medical image to be recognized through the recognition model;
- the obtaining unit is used to obtain a lesion degree recognition result output after the recognition model recognizes the medical image to be recognized, and the lesion degree recognition result includes: among multiple image blocks included in the medical image, the lesion degree recognition result of each image block and The area information in the medical image, and / or the pathological degree indication image after corresponding indication information is set in the corresponding area according to the identification result of the pathological degree of each image block;
- the recognition model to recognize medical images specifically includes: dividing the medical image into multiple image blocks, and for each image block, extracting the feature information of the image block and determining the degree of lesion recognition of the image block according to the extracted feature information result.
- the terminal device of each embodiment may include at least one processing unit and at least one storage unit, where the storage unit stores a computer program, and when the program is executed by the processing unit, causes the processing unit to perform any of the steps of the image recognition method described above .
- the medical system of each embodiment may include an image acquisition device and an image recognition device, wherein,
- Image collection device used to collect medical images of patients
- the image recognition device is used to obtain the medical image collected by the image collection device, and to discriminate the medical image through the first recognition model, to generate a disease recognition result indicating whether the medical image includes a disease, and when the disease recognition result When indicating that the medical image includes a lesion, identify the medical image through a second recognition model to generate a lesion degree recognition result indicating the degree of the lesion;
- the display device is used for presenting the recognition result of the lesion degree.
- Another medical system of each embodiment may include an image acquisition device and an image recognition device, where,
- Image collection device used to collect medical images of patients
- the image recognition device is used to obtain the medical images collected by the image collection device, and to recognize the medical images including the lesion through the second recognition model to generate the recognition result of the lesion degree; wherein, the second recognition model is used to recognize the medical image The extent of the lesion;
- the display device is used for presenting the recognition result of the lesion degree.
- the computer-readable storage medium of each embodiment stores a computer program, and when the program is executed by one or more processors, the one or more processors may be caused to perform any of the above steps of the image recognition method.
- whether the medical image is a medical image with a lesion is determined by the first recognition model, and then, the medical image with the lesion is detected by the second recognition model Further identification is performed to obtain a lesion degree recognition result to indicate the extent to which the medical image includes the lesion. This does not require manual analysis and custom feature extraction schemes, which improves the efficiency and accuracy of medical image recognition.
- FIG. 1 is a schematic structural diagram of a terminal device according to an embodiment of the present invention.
- 3a is a schematic diagram of the principle of a first recognition model in an embodiment of the present invention.
- FIG. 3b is an example diagram 1 of an esophagus image in an embodiment of the present invention.
- 3c is a schematic diagram of the principle of a second recognition model in an embodiment of the present invention.
- 3d is a schematic diagram of an image block in an embodiment of the present invention.
- 3e is an example diagram 2 of an esophageal image in an embodiment of the present invention.
- FIG. 3f is an example of an image indicating the degree of lesion of an esophageal image in an embodiment of the present invention
- 3g is a schematic structural diagram of a medical system according to an embodiment of the present invention.
- 4a is a schematic structural diagram 1 of an image recognition device according to an embodiment of the present invention.
- 4b is a second schematic structural diagram of an image recognition device according to an embodiment of the present invention.
- FIG. 5 is a schematic structural diagram of a terminal device in an embodiment of the present invention.
- embodiments of the present invention provide an image identification method, device, terminal device, and medical system.
- Terminal device A device that can install various types of applications and can display the entities provided in the installed applications.
- the electronic device can be mobile or fixed. For example, mobile phones, tablet computers, in-vehicle devices, personal digital assistants (PDAs) or other electronic devices that can achieve the above functions.
- PDAs personal digital assistants
- Medical imaging It is the material reproduction of human visual perception, which can be obtained by optical equipment, such as cameras, mirrors, telescopes, and microscopes; it can also be created manually, such as hand-painted images. Pathology can be recorded and stored on paper media, film, and other media sensitive to light signals. With the development of digital acquisition technology and signal processing theory, more and more medical images are stored in digital form.
- CNN Convolutional Neural Network
- Dense convolutional network Each layer is connected to any other layer in a feed-forward form, that is, any layer is not only connected to the adjacent layer, but also to all subsequent layers All have direct connections.
- the first recognition model a model obtained by training DenseNet training on normal image samples and lesion image samples, which is used to determine whether the medical image has a lesion.
- whether or not a medical image has a lesion also referred to as whether the medical image includes a lesion, refers to whether the medical image shows that a disease has occurred in the organ, that is, whether the medical image includes the image content corresponding to the organ lesion.
- Second recognition model a model obtained by training a medical image sample with various degrees of lesions using a convolutional neural network, which is used to identify the degree of lesions in medical images.
- Lesion recognition result It is used to indicate whether the input medical image is a medical image with a lesion.
- Pathological degree recognition result used to indicate the degree of pathological changes in medical images.
- Esophageal cancer is a malignant tumor in the upper digestive tract.
- Patients with esophageal cancer at different times have very different treatment procedures. Early patients are mainly treated by endoscopic minimally invasive treatment. They can be discharged within 3 to 5 days after surgery. The cost of treatment is low and there are few complications. 90% can be cured.
- patients with advanced esophageal cancer are mainly treated by thoracotomy / abdominal / neck "three incision" surgery. This treatment is invasive, expensive, and unsatisfactory. The cure rate is less than 40%.
- the normal esophageal image is generally smooth on the surface mucosa, and the diseased esophageal image has obvious features, such as bumps and erosion.
- the difference between normal and diseased esophageal images may be a small area in the image ( For example, the color depth and the roughness of the skin) may also be due to the change in the overall smoothness of the image.
- This application uses artificial intelligence technology to identify lesions from medical images and identify the extent of the lesions.
- AI systems refer to computer systems that exhibit intelligent behavior.
- the functions of the AI system include learning, maintaining a large number of knowledge bases, performing reasoning, applying analytical capabilities, discerning the relationship between facts, exchanging ideas with others, understanding the communication of others, and perceiving and understanding the situation, etc.
- AI systems can make machines progress through their own learning and judgment.
- AI systems create new knowledge by looking for previously unknown patterns in data, and drive solutions by learning data patterns.
- the recognition rate of the AI system can be improved, and the user's taste can be more accurately understood. Therefore, the existing rule-based intelligent systems are gradually replaced by AI systems.
- neural networks are usually used. Neural networks are computer systems designed, constructed, and configured to simulate the human nervous system.
- the neural network architecture consists of an input layer, an output layer, and one or more hidden layers.
- the input layer inputs data into the neural network.
- the output layer produces guess results.
- the hidden layer assists in information dissemination.
- a neural network or artificial neural network is based on a collection of connected units called neurons or artificial neurons. Each connection (synapse) between neurons can transmit signals to another neuron.
- the receiving (post-synaptic) neuron can process the signal and then signal the downstream neuron connected to the neuron.
- Neurons can have states, usually represented by real numbers, usually between 0 and 1.
- Neurons and synapses can also have weights that change as the learning progresses, and the weights are used to increase or decrease the strength of the signal they send downstream.
- the neuron may have a threshold so that the downstream signal is only sent when the aggregated signal is below (or above) this level.
- neurons are organized in layers. Different layers can perform different types of conversions on their inputs. The signal may move from the first (input) layer to the last (output) layer after traversing the layers multiple times.
- the initial layer can detect primitives (eg, pupils, irises, eyelashes, etc. in the eye), and its output is fed forward to deeper layers that perform more abstract generalizations (eg, eyes, mouth). And so on, until the last layer performs complex object recognition (for example, face).
- Neural networks are trained using data such as a series of data points. The neural network guesses which response should be given, and compares the guess with the correct "best" guess for each data point. If an error occurs, the neuron is adjusted and the process is repeated.
- Some embodiments of the present application may use convolutional neural networks in neural networks to process and recognize medical images.
- Convolutional neural networks also known as ConvNets or CNN
- CNN convolutional neural networks
- CNNs convolutional neural networks
- CNNs convolutional neural networks
- DenseNet-121 is used to train normal image samples and lesion image samples to obtain a first recognition model, so as to determine whether a medical image has a lesion.
- a convolutional neural network Train the medical image with the lesion to obtain the second recognition model, and determine the canceration recognition result of the medical image based on the lesion degree of the image block with the most severe lesion in the medical image.
- the above description takes esophageal cancer as an example, but the diagnosis of other cancers also has similar problems.
- the method provided in the embodiments of the present application to perform preliminary lesion discrimination on medical images through the first recognition model, and to further identify the degree of lesions on the medical images where the lesions occur through the second recognition model to determine whether the medical images include
- the degree of the lesion greatly improves the efficiency and accuracy of cancer identification.
- the second recognition model can also be used to directly recognize the medical image to determine the degree of canceration of the medical image.
- An image recognition method provided by an embodiment of the present invention may be applied to a terminal device, and the terminal device may be a mobile phone, a tablet computer, a PDA (Personal Digital Assistant), and so on.
- the terminal device may be a mobile phone, a tablet computer, a PDA (Personal Digital Assistant), and so on.
- FIG. 1 shows a schematic structural diagram of a terminal device.
- the terminal device 100 includes a processor 110, a memory 120, a power supply 130, a display unit 140, and an input unit 150.
- the processor 110 is the control center of the terminal device 100, connects various components using various interfaces and lines, and executes various functions of the terminal device 100 by running or executing software programs and / or data stored in the memory 120, thereby The equipment is monitored overall.
- the processor 110 may include one or more processing units.
- the memory 120 may mainly include a storage program area and a storage data area.
- the storage program area may store an operating system, various application programs, etc .
- the storage data area may store data created according to the use of the terminal device 100 and the like.
- the memory 120 may include a high-speed random access memory, and may also include a non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other volatile solid-state storage device.
- FIG. 1 is only an example of a terminal device, and does not constitute a limitation on the terminal device, and may include more or fewer components than those illustrated, or combine some components, or different components.
- FIG. 2 it is an implementation flowchart of an image recognition method provided by the present invention.
- the specific implementation process of the method is as follows:
- Step 200 The terminal device obtains the medical image to be recognized.
- Medical imaging is the physical reproduction of human visual perception. Medical images can be acquired by optical devices, such as cameras, mirrors, telescopes, and microscopes; they can also be created manually, such as painting images by hand. Medical images can be recorded and stored on paper media, film, and other media sensitive to optical signals. With the development of digital acquisition technology and signal processing theory, more and more medical images are stored in digital form.
- the medical image may be a taken medical picture, for example, the internal body acquired through an endoscope (neuroscopy, urethral cystoscopy, resectoscope, laparoscope, arthroscopy, sinusoscope, and laryngoscope, etc.) image.
- endoscope neuroscopy, urethral cystoscopy, resectoscope, laparoscope, arthroscopy, sinusoscope, and laryngoscope, etc.
- the technical solution provided by the present invention can be applied to the recognition of various images.
- the recognition of the esophageal image is only used as an example for description, and the recognition of other images will not be repeated here.
- Step 201 The terminal device inputs the medical image into the first recognition model to obtain a lesion recognition result.
- the terminal device normalizes the medical image to be recognized to a specified size and inputs a pre-trained first recognition model to obtain a lesion recognition result indicating whether the input medical image is a medical image where a lesion has occurred.
- the designated size of the medical image can be normalized to 224 * 224 pixels.
- the lesion identification result may include a lesion label, and / or a lesion probability.
- the lesion label may include a normal image label and a medical image label.
- the first recognition model when the medical image is discriminated by the first recognition model to generate a lesion recognition result indicating whether the medical image includes a lesion, the first recognition model can use a trained deep learning network to Searching for a lesion feature in the medical image, and generating the lesion recognition result according to the search result.
- the deep learning network may be a neural network with multiple layers of perceptrons, and may include, but is not limited to, convolutional neural networks, recurrent neural networks, deep belief networks, and so on.
- the lesion feature is that the deep learning network learns from the first medical image set of the labeled normal organ and the second medical image set of the organ with the lesion during training, and exists in the second medical image set and Image features that do not exist in the first set of medical images.
- a constraint condition can be set in the objective function when training the deep learning network. For example, the feature response of the learned lesion features in the medical image of normal organs is less than the first The threshold (the first threshold may be a value close to 0), and the characteristic response in the medical image of the diseased organ is distributed in the dimension of the diseased area.
- the first recognition model is obtained by using DenseNet to train normal image samples and lesion image samples in advance.
- the terminal device obtains the first recognition error according to the average value of the cross entropy of the lesion recognition result of each medical image and the sum of the corresponding constraint expression values, and the regular value, and optimizes the first recognition model according to the first recognition error.
- the regular value is obtained through the L2 regular function.
- the first recognition error is negatively correlated with the above average value and positively correlated with the above regular value.
- the constrained expression value of the medical image is effective when the result of manual judgment of the medical image is normal.
- the cross entropy is obtained through the cross entropy expression.
- Cross entropy is positively correlated with the actual analysis results of medical images and the logarithm of the lesion recognition results.
- the constraint expression value is obtained through the constraint expression.
- the constraint expression value is positively correlated with the square of the norm of the feature vector extracted from the medical image, and negatively correlated with the actual analysis result of the medical image.
- the constraint expression value is non-zero to take effect.
- the constraint expression value of 0 does not take effect.
- the regular value is obtained through the L2 regular function.
- the regular value is positively related to the square of the norm of the model parameter of the first recognition model.
- the following objective function when the terminal device determines the first recognition error, the following objective function may be used:
- i is the serial number of the medical image (or medical image sample)
- n is the total number of medical images (or medical image samples)
- y is the actual analysis result of the medical image
- x is the medical image
- w is the number of the first recognition model Model parameters
- f represents the first recognition model
- p is the feature vector extracted from the medical image using the first recognition model
- r and ⁇ are the set weight values.
- y i log f (x i ; w) is the cross-entropy expression, Is a constraint expression, It is a L2 regular function.
- the constraint expression value when the actual analysis result is a lesion, the constraint expression value is 0, that is, it does not take effect.
- the constraint expression value is The constrained expression is used to make the first medical recognition model's lesion recognition result for normal medical images tend to 0, and conversely, the lesion recognition result for the medical image of the lesion is distributed in the dimension of each lesion area.
- the first recognition model when training the first recognition model, can be adjusted by the first recognition error obtained by the target function; when the first recognition model is used to recognize and apply medical images, it can be obtained by the target function The first recognition error corrects the first recognition model and determines the accuracy of the recognition.
- DenseNet-121 in DenseNet is used to train normal image samples and lesion image samples to obtain a first recognition model, and the first recognition model is further optimized through the first recognition error.
- the normal image sample includes at least: normal image and normal image label.
- the medical image samples include at least medical images and medical image tags.
- DenseNet-121 uses the configuration parameters shown in Table 1 for configuration.
- the DenseNet-121 network structure contains 4 dense blocks (dense Block), growth rate (growth-rate) is set to 12, 3 transition layers (transition layer), the characteristic compression ratio of transition layer is set to 0.5, and finally through the classification layer ( Classification (Layer) outputs the lesion recognition result.
- FIG. 3a it is a schematic diagram of a principle of a first recognition model.
- the terminal device inputs the medical image to the first recognition model, and passes through the convolutional layer, dense layer-1, convolutional layer, pooling layer, dense layer-N, convolutional layer, and linear layer in order to output lesion recognition result.
- FIG. 1 is an example of an esophageal image.
- the terminal device inputs the esophageal image (a) and the esophageal image (b) in FIG. 3b to the first recognition model, and outputs the esophageal image (a). It is a normal medical image, and the esophageal image (b) is the esophageal image of the lesion.
- the first recognition model can be used to classify the medical images according to the extracted features of each small area of the medical image and the high-level semantic features of the overall medical image to obtain the lesion recognition result.
- the medical images can be preliminarily screened through the first recognition model to screen out the medical images with lesions, so that the medical images with lesions can be directly recognized through the subsequent second recognition model, which improves the processing efficiency.
- Step 202 The terminal device determines whether the lesion recognition result is a medical image where the lesion has occurred, and if so, step 203 is performed, otherwise, step 204 is performed.
- Step 203 The terminal device inputs the medical image to the second recognition model to obtain the recognition result of the lesion degree.
- the terminal device recognizing the medical image through the second recognition model specifically includes: dividing the medical image whose lesion recognition result is the lesion into a plurality of image blocks, and separately extracting the feature information of each image block, and determining each according to the extracted feature information The recognition result of the lesion degree of the image block.
- the medical image when the lesion recognition result indicates that the medical image includes a lesion, the medical image can be recognized by the second recognition model to generate a lesion degree recognition result indicating the degree of the lesion.
- the second recognition model may use the trained second deep learning network to search the medical image for the lesion degree feature corresponding to the first lesion degree, and generate the lesion degree recognition result according to the search result.
- the second deep learning network may be a neural network with multiple layers of perceptrons, and may include, but is not limited to, a convolutional neural network, a recurrent neural network, a deep belief network, and so on.
- the lesion degree feature is that the second deep learning network is trained from a set of labeled third medical images without organs with the first degree of disease and a fourth set of medical images with organs with the first degree of disease during training
- the image features learned in which exist in the fourth medical image set and do not exist in the third medical image set.
- the learned lesion features of the first lesion degree are higher than those of the first lesion
- the characteristic response in the medical image of the diseased organ with a low degree is less than the first threshold (the first threshold may be a value close to 0), while the characteristic response in the medical image of the diseased organ with the first degree of disease is distributed in the lesion area Dimensionally.
- the second recognition model is obtained based on convolutional neural network training, and in the process of training and recognition application, the second recognition error is used to optimize the second recognition model.
- the lesion degree recognition result is used to indicate the degree to which the medical image includes the lesion.
- the recognition result of the lesion degree includes: the recognition result of the lesion degree of each image block and the area information in the medical image, and / or the lesion degree indication image after the corresponding indication information is set in the corresponding area according to the recognition result of each image block .
- the lesion degree recognition result also includes the lesion degree label of the medical image.
- the lesion degree label is: the first recognition result of the image block with the most serious lesion among the multiple image blocks segmented from the medical image; or, based on the feature information of all image blocks
- the second recognition result of the determined degree of lesion of the medical image or a comprehensive result determined according to the first recognition result and the second recognition result.
- the disease severity label can be inflammation, early cancer, intermediate cancer, and advanced cancer, and can also be the probability of canceration.
- the area information may be information such as coordinates or location names of image blocks.
- the degree of lesion indication indicates that each area in the image can be set with different colors or patterns for indication according to different recognition results.
- the terminal device may adopt the following methods when determining the recognition result of the lesion degree of each image block through the second recognition model:
- the first method is: first, according to the feature information of each image block, determine the canceration probability of each image block, and obtain the association relationship between the canceration probability range and the lesion degree label; then, according to the association relationship, determine The lesion degree label corresponding to the canceration probability range to which the highest canceration probability belongs, and the obtained lesion degree label is determined as the first recognition result; then, according to the canceration probability of each image block, the corresponding indication information is set in the corresponding area to obtain the lesion The degree indication image; finally, the canceration probability and area information of each image block, the lesion degree indication image and the first recognition result are determined as the disease degree recognition result of the medical image.
- FIG. 3c is a schematic diagram of the principle of a second recognition model.
- Figure 3d is a schematic diagram of an image block.
- the terminal device inputs the medical image to the second recognition model, divides the medical image into a plurality of image blocks shown in FIG. 3d through the second recognition model, and separately extracts the feature information of each image block, and According to the extracted feature information, estimate the canceration probability of each image block separately; then, the terminal device sorts the canceration probability of each image block, and tags the lesion degree label of the image block with the highest canceration probability as the lesion of the medical image Degree label, and then obtain the canceration degree recognition result.
- the second method is: according to the feature information of each image block, determine the canceration probability of the medical image, and obtain the association between the canceration probability range and the lesion degree label; then, according to the association, determine the canceration to which the canceration probability belongs The lesion degree label corresponding to the probability range, and the obtained lesion degree label is determined as the second recognition result; then, according to the canceration probability of each image block, corresponding indication information is set in the corresponding region to obtain the lesion degree indication image; finally, the The canceration probability and area information of each image block, the lesion degree indication image and the second recognition result are determined as the recognition result of the disease degree of the medical image.
- the disease degree label can be determined through the feature information of the overall medical image.
- the third method is to obtain the first recognition result and the second recognition result through the first method and the second method described above, respectively, and determine the recognition result with the most serious lesion degree among the first recognition result and the second recognition result as The comprehensive result, and the canceration probability and area information of each image block, the lesion degree indication image, and the comprehensive result are determined as the lesion degree recognition result of the medical image.
- FIG. 3e is an example image 2 of an esophageal image.
- (c) in FIG. 3e is an example cancer image of an esophageal image
- (d) is an example inflammation image of an esophageal image.
- the terminal device inputs the medical images of (c) and (d) in FIG. 3e to the second recognition model, and the recognition result of the degree of lesion corresponding to (c) in FIG. 3e is a medical image with cancer, and the corresponding image in (d) in FIG. 3e
- the result of recognition of the degree of lesion is a medical image of inflammation.
- FIG. 3f is an example of an indication image of the degree of lesion of an esophageal image.
- the terminal device inputs the left image of (e), the left image of (f), the left image of (g), and the left image of (h) of FIG. 3f into the second recognition model, and the output recognition results of the lesion degree are:
- the right picture of (e) of FIG. 3f is the lesion degree indication image of the left picture of FIG. 3f (e); the right picture of (f) of FIG. 3f is the lesion degree indication image of the left picture of (f) of FIG. 3f;
- the right picture of (g) of FIG. 3f is the lesion degree indication image of the left picture of (g) of FIG. 3f;
- the right picture of (h) of FIG. 3f is the disease degree indication image of the left picture of (h) of FIG. 3f;
- the user can determine the lesion degree of the medical image through the lesion degree label, and instruct the image according to the lesion degree to determine the judgment basis of the lesion degree.
- the second recognition model is obtained based on convolutional neural network training, and the second recognition error is used to optimize the second recognition model.
- the second recognition error is obtained by using a specified loss function, and the cross entropy in the loss function is determined based on the recognition result of the degree of lesion of the image block with the highest degree of lesion in each image block of the medical image.
- the loss function may be a maximum pooling loss function (max pooling loss), a label allocation loss function (labed assignment loss), and a sparse loss function (sparsity loss).
- the network structure shown in Table 2 contains two branches.
- the branch pointed by the right arrow is the configuration parameter for medical image recognition using the first method above
- the branch pointed by the left arrow is the configuration for medical image recognition by the second method. parameter.
- the common layer outputs the feature information extracted for each image block
- the branch pointed by the arrow on the right, through 1 * 1 convolution obtains the canceration probability of each image block according to the feature information of each image block output by the common layer, and then based on For each canceration probability, a lesion degree recognition result is obtained.
- the branch pointed by the arrow on the left, through the overall image convolution obtains the canceration probability of the medical image based on the feature information of each image block output by the common layer, and then obtains the lesion degree recognition result.
- the comprehensive result may be determined according to the first recognition result output by the branch pointed by the left arrow and the second recognition result output by the branch pointed by the right arrow.
- the model parameters of the common layer are updated according to the second recognition error determined by the synthesis result, and the branches pointed by the left and right arrows are based on the corresponding recognition
- the identification error determined by the results is optimized for the model parameters.
- Step 204 The terminal device outputs a lesion recognition result indicating that the medical image is normal.
- step 202 is used to make a preliminary judgment on the medical image to determine whether the medical image has a lesion.
- step 201 and step 202 may not be executed, that is, the canceration recognition of the medical image is directly performed through the subsequent step 203.
- the specific implementation process of image recognition can also be:
- the terminal device obtains the medical image to be recognized.
- the terminal device inputs the medical image to the second recognition model to obtain the recognition result of the lesion degree.
- the terminal device inputs the medical image to the second recognition model to obtain the recognition result of the lesion degree.
- the medical system 300 includes an image acquisition device 301, an image recognition device 302, and a display device 303.
- the image acquisition device 301 is used to shoot a patient's lesion (eg, inside the body) and other locations through a built-in camera or endoscope, etc., to collect medical images of the patient.
- the endoscope may be a neuroscope, a urethral cystoscope, a resectoscope, a laparoscope, an arthroscope, a sinusoscope, and a laryngoscope.
- the image recognition device 302 is used to obtain the medical image collected by the image collection device 301, and determine whether the medical image is a diseased medical image through the first recognition model, generate a disease recognition result, and use the second recognition model to treat the diseased medical
- the images are further identified to obtain a lesion degree recognition result to indicate the extent to which the medical image includes lesions. Further, the image recognition device 302 may also directly use the second recognition model to recognize the medical image where the lesion has occurred, and obtain a recognition result of the degree of the lesion.
- the display device 303 is used to obtain the lesion recognition result or the lesion degree recognition result output by the image recognition device 302, and present the lesion recognition result or the lesion degree recognition result to the user.
- the medical system 300 can collect the medical images of the patient through the image collection device 301, and recognize the collected medical images through the image recognition device 302 to obtain the lesion recognition result or the lesion degree recognition result, and present the lesion to the user through the display device 303 Identification result or lesion degree identification result.
- an embodiment of the present invention also provides a device for image recognition. Since the principle of the above device and equipment to solve the problem is similar to a method for image recognition, the implementation of the above device can be referred to the method implementation. The repetition is not repeated here.
- FIG. 4a it is a schematic structural diagram 1 of an image recognition device according to an embodiment of the present invention, including:
- the obtaining unit 410 is used to obtain medical images to be recognized
- the discriminating unit 411 is used to discriminate the medical image through the first recognition model to generate a lesion recognition result, and the first recognition model is used to discriminate whether the medical image has a lesion;
- the recognition unit 412 is used for recognizing the medical image of the lesion through the second recognition model to generate a recognition result of the degree of lesion.
- the second recognition model is used for recognizing the extent to which the medical image includes the lesion.
- the discriminating unit 411 may use the trained deep learning network in the first recognition model to search for lesion features in the medical image, and generate the lesion recognition result according to the search result.
- the lesion feature is that the deep learning network learns from the first medical image set of the labeled normal organ and the second medical image set of the organ with the lesion during training and exists in the second medical Image features in the image collection and not in the first medical image collection.
- the recognition unit 412 may use the second deep learning network trained in the second recognition model to search for the lesion degree feature corresponding to the first lesion degree in the medical image, and generate the lesion degree according to the search result Recognize the results.
- the lesion degree feature is the third medical image set of the second deep learning network from the labeled organs without the first lesion degree and the first Image features learned from the four medical image collections that exist in the fourth medical image collection and do not exist in the third medical image collection.
- the recognition unit 412 is specifically configured to: divide the medical image whose lesion recognition result is a lesion into multiple image blocks;
- the recognition result of the lesion degree includes: the recognition result of the lesion degree of each image block and the area information in the medical image, and / or, after the corresponding indication information is set in the corresponding area according to the recognition result of the lesion degree of each image block
- the degree of lesion indicates the image.
- the recognition result of the lesion degree further includes a lesion degree label of the medical image, and the lesion degree label of the medical image is:
- the second recognition result of the degree of lesion of the medical image determined according to the feature information of all image blocks; or,
- the integrated result determined according to the first recognition result and the second recognition result The integrated result determined according to the first recognition result and the second recognition result.
- the second recognition model is obtained based on convolutional neural network training, and the second recognition error is used to optimize the second recognition model
- the second recognition error is obtained by using a specified loss function, and the cross entropy in the loss function is determined based on the recognition result of the lesion degree of the image block with the highest degree of lesion among each image block of the medical image.
- the first recognition model is obtained based on DenseNet training, and the first recognition error is used to optimize the first recognition model;
- the first recognition error is obtained based on the average value of the cross entropy of each medical image's lesion recognition result and the sum of the corresponding constraint expression values, and the regular value, where the constraint expression value of the medical image is preset Constrained expressions are obtained.
- the constrained expressions are used to obtain valid constrained expression values when the artificial discrimination results of medical images are normal.
- FIG. 4b it is a schematic structural diagram 2 of an image recognition device according to an embodiment of the present invention, including:
- the input unit 420 is used to obtain the medical image to be recognized, and to recognize the medical image to be recognized through the recognition model;
- the obtaining unit 421 is used to obtain a lesion degree recognition result output after the recognition model recognizes the medical image to be recognized, and the lesion degree recognition result includes: among multiple image blocks included in the medical image, the lesion degree recognition result of each image block And the area information in the medical image, and / or the pathological degree indication image after corresponding indication information is set in the corresponding area according to the identification result of the pathological degree of each image block;
- the recognition model to recognize medical images specifically includes: dividing the medical image into multiple image blocks, and for each image block, extracting the feature information of the image block and determining the degree of lesion recognition of the image block according to the extracted feature information result.
- the first recognition model is used to determine whether the medical image is a diseased medical image, and then the second recognition model is used The medical image is further identified to obtain a lesion degree recognition result to indicate the degree of the lesion included in the medical image. This does not require manual analysis and custom feature extraction schemes, which improves the efficiency and accuracy of medical image recognition.
- an embodiment of the present invention also provides a terminal device 500.
- the terminal device 500 is used to implement the methods described in the foregoing method embodiments.
- the terminal shown in FIG. 2 is implemented.
- the device 500 may include a memory 501, a processor 502, an input unit 503, and a display panel 504.
- the memory 501 is used to store a computer program executed by the processor 502.
- the memory 501 may mainly include a storage program area and a storage data area, wherein the storage program area may store an operating system, application programs required for at least one function, etc .; the storage data area may store data created according to the use of the terminal device 500 and the like.
- the processor 502 may be a central processing unit (CPU) or a digital processing unit.
- the input unit 503 may be used to obtain user instructions input by the user.
- the display panel 504 is used to display information input by the user or provided to the user. In the embodiment of the present invention, the display panel 504 is mainly used to display the display interface of each application program in the terminal device and the control entity displayed in each display interface .
- the display panel 504 may be configured in the form of a liquid crystal display (LCD) or an OLED (organic light-emitting diode).
- LCD liquid crystal display
- OLED organic light-emitting diode
- a specific connection medium between the above-mentioned memory 501, processor 502, input unit 503, and display panel 504 is not limited.
- the memory 501, the processor 502, the input unit 503, and the display panel 504 are connected by a bus 505.
- the bus 505 is indicated by a thick line in FIG. 5, and the connection between other components is only It is for illustrative purposes, not for limitation.
- the bus 505 can be divided into an address bus, a data bus, and a control bus. For ease of representation, only a thick line is used in FIG. 5, but it does not mean that there is only one bus or one type of bus.
- the memory 501 may be volatile memory (volatile memory), such as random-access memory (RAM); the memory 501 may also be non-volatile memory (non-volatile memory), such as read-only memory, flash memory
- volatile memory volatile memory
- non-volatile memory non-volatile memory
- read-only memory flash memory
- flash memory flash memory
- HDD hard disk
- SSD solid-state drive
- the memory 501 can be used to carry or store the desired program code in the form of instructions or data structures and can be used by Any other media accessed by the computer, but not limited to this.
- the memory 501 may be a combination of the aforementioned memories.
- the processor 502 configured to implement the embodiment shown in FIG. 2, includes:
- the processor 502 is configured to call a computer program stored in the memory 501 to execute the embodiment shown in FIG. 2.
- An embodiment of the present invention also provides a computer-readable storage medium that stores computer-executable instructions required to execute the above-mentioned processor, which includes programs required to execute the above-mentioned processor.
- various aspects of an image recognition method provided by the present invention may also be implemented in the form of a program product, which includes program code.
- the program product runs on a terminal device, the program code is used
- the terminal device may execute the embodiment shown in FIG. 2.
- the program product may use any combination of one or more readable media.
- the readable medium may be a readable signal medium or a readable storage medium.
- the readable storage medium may be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, device, or device, or any combination of the above. More specific examples of readable storage media (non-exhaustive list) include: electrical connections with one or more wires, portable disks, hard disks, random access memory (RAM), read only memory (ROM), erasable Programmable read-only memory (EPROM or flash memory), optical fiber, portable compact disk read-only memory (CD-ROM), optical storage device, magnetic storage device, or any suitable combination of the foregoing.
- the program product for image recognition may use a portable compact disk read-only memory (CD-ROM) and include program code, and may run on a computing device.
- CD-ROM portable compact disk read-only memory
- the program product of the present invention is not limited to this.
- the readable storage medium may be any tangible medium containing or storing a program, which may be used by or in combination with an instruction execution system, apparatus, or device.
- the readable signal medium may include a data signal that is propagated in baseband or as part of a carrier wave, in which readable program code is carried. This propagated data signal can take many forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the above.
- the readable signal medium may also be any readable medium other than a readable storage medium, and the readable medium may send, propagate, or transmit a program for use by or in combination with an instruction execution system, apparatus, or device.
- the program code contained on the readable medium may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, RF, etc., or any suitable combination of the foregoing.
- the program code for performing the operations of the present invention can be written in any combination of one or more programming languages.
- the programming language includes entity-oriented programming languages such as Java, C ++, etc., as well as conventional procedural programming Language-such as "C" language or similar programming language.
- the program code may be executed entirely on the user's computing device, partly on the user's device, as an independent software package, partly on the user's computing device and partly on a remote computing device, or entirely on the remote computing device or server On the implementation.
- the remote computing device may be connected to the user computing device through any kind of network, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computing device (eg, using Internet services Provide entropy to connect via the Internet).
- LAN local area network
- WAN wide area network
- the embodiments of the present invention may be provided as methods, systems, or computer program products. Therefore, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware. Moreover, the present invention may take the form of a computer program product implemented on one or more computer usable storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) containing computer usable program code.
- computer usable storage media including but not limited to disk storage, CD-ROM, optical storage, etc.
- These computer program instructions can be provided to the processor of a general-purpose computer, special-purpose computer, embedded processing machine, or other programmable data processing device to produce a machine that enables the generation of instructions executed by the processor of the computer or other programmable data processing device
- These computer program instructions may also be stored in a computer-readable memory that can guide a computer or other programmable data processing device to work in a specific manner, so that the instructions stored in the computer-readable memory produce an article of manufacture including an instruction device, the instructions The device implements the functions specified in one block or multiple blocks of the flowchart one flow or multiple flows and / or block diagrams.
- These computer program instructions can also be loaded onto a computer or other programmable data processing device, so that a series of operating steps are performed on the computer or other programmable device to produce computer-implemented processing, which is executed on the computer or other programmable device
- the instructions provide steps for implementing the functions specified in one block or multiple blocks of the flowchart one flow or multiple flows and / or block diagrams.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Evolutionary Computation (AREA)
- Data Mining & Analysis (AREA)
- Software Systems (AREA)
- Mathematical Physics (AREA)
- General Engineering & Computer Science (AREA)
- Computing Systems (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- Molecular Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Medical Informatics (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Radiology & Medical Imaging (AREA)
- Quality & Reliability (AREA)
- Multimedia (AREA)
- Image Analysis (AREA)
- Measuring And Recording Apparatus For Diagnosis (AREA)
- Endoscopes (AREA)
Abstract
Description
Claims (20)
- 一种医疗影像的识别方法,包括:获取待识别的医疗影像;通过第一识别模型对所述医疗影像进行判别,生成用于指示所述医疗影像是否包括病变的病变识别结果;当所述病变识别结果指示所述医疗影像包括病变时,通过第二识别模型对所述医疗影像进行识别,生成用于指示所述病变的程度的病变程度识别结果。
- 如权利要求1所述的方法,其中,通过第一识别模型对所述医疗影像进行判别,生成用于指示所述医疗影像是否包括病变的病变识别结果包括:所述第一识别模型利用经过训练的深度学习网络在所述医疗影像中搜索病变特征,根据搜索结果生成所述病变识别结果;其中,所述病变特征为所述深度学习网络在训练时从经过标记的正常器官的第一医疗影像集合和发生病变的器官的第二医疗影像集合中学习得到的、存在于所述第二医疗影像集合中且不存在于所述第一医疗影像集合中的图像特征。
- 如权利要求1所述的方法,其中,通过第二识别模型对所述医疗影像进行识别,生成用于指示所述病变的程度的病变程度识别结果包括:所述第二识别模型利用经过训练的第二深度学习网络在所述医疗影像中搜索第一病变程度对应的病变程度特征,根据搜索结果生成所述病变程度识别结果;其中,所述病变程度特征为所述第二深度学习网络在训练时从经过标记的不具有所述第一病变程度的器官的第三医疗影像集合和具有所述第一病变程度的器官的第四医疗影像集合中学习得到的、存在于所述第四医疗影像集合中且不存在于所述第三医疗影像集合中的图像特征。
- 如权利要求1所述的方法,其中,通过第二识别模型对所述医疗影像进行识别包括:将所述医疗影像分割为多个影像块;分别提取所述多个影像块的特征信息,根据提取的特征信息确定各个影像块的病变程度识别结果;以及所述病变程度识别结果包括:每个影像块的病变程度识别结果及其在医疗影像中的区域信息,和/或,根据每个影像块的病变程度识别结果在其对应区域设置了相应指示信息后的病变程度指示影像。
- 如权利要求2所述的方法,其中,所述病变程度识别结果还包括医疗影像的病变程度标签,所述医疗影像的病变程度标签为:所述病例影像分割出的多个影像块中病变程度最严重的影像块的第一识别结果;或者,根据所有影像块的特征信息确定的所述医疗影像的病变程度的第二识别结果;或者,根据所述第一识别结果和所述第二识别结果确定的综合结果。
- 如权利要求2-5中任一权利要求所述的方法,其中,所述第二识别模型是基于卷积神经网络训练得到的,并且采用第二识别误差对所述第二识别模型进行优化;其中,所述第二识别误差是采用指定的损失函数获得的,所述损失函数中的交叉熵,是基于医疗影像的各个影像块中病变程度最高的影像块的病变程度识别结果确定的。
- 如权利要求2-5中任一权利要求所述的方法,其中,所述第一识别模型,是基于稠密卷积网络DenseNet训练获得的,并且采用第一识别误差对所述第一识别模型进行优化;其中,所述第一识别误差是根据各个医疗影像的病变识别结果的交叉熵和相应约束表达式值的加和的平均值,以及正则值获得的,其中,医疗影像的约束表达值是通过预设的约束表达式获得的,所述约束表达式用于在医疗影像的人工判别结果为正常时获得有效的约束表达值。
- 一种影像识别的装置,包括:获取单元,用于获取待识别的医疗影像;判别单元,用于通过第一识别模型对所述医疗影像进行判别,生成用于指示所述医疗影像是否包括病变的病变识别结果;识别单元,用于当所述病变识别结果指示所述医疗影像包括病变时,通过第二识别模型对所述医疗影像进行识别,生成用于指示所述病变的程度的病变程度识别结果。
- 如权利要求8所述的装置,其中,所述判别单元用于:利用所述第一识别模型中经过训练的深度学习网络在所述医疗影像中搜索病变特征,根据搜索结果生成所述病变识别结果;其中,所述病变特征为所述深度学习网络在训练时从经过标记的正常器官的第一医疗影像集合和发生病变的器官的第二医疗影像集合中学习得到的、存在于所述第二医疗影像集合中且不存在于所述第一医疗影像集合中的图像特征。
- 如权利要求8所述的装置,其中,所述识别单元用于:利用所述第二识别模型中经过训练的第二深度学习网络在所述医疗影像中搜索第一病变程度对应的病变程度特征,根据搜索结果生成所述病变程度识别结果;其中,所述病变程度特征为所述第二深度学习网络在训练时从经过标记的不具有所述第一病变程度的器官的第三医疗影像集合和具有所述第一病变程度的器官的第四医疗影像集合中学习得到的、存在于所述第四医疗影像集合中且不存在于所述第三医疗影像集合中的图像特征。
- 如权利要求8所述的装置,所述识别单元具体用于:将所述医疗影像分割为多个影像块;分别提取所述多个影像块的特征信息,根据提取的特征信息确定各个影像块的 病变程度识别结果;以及所述病变程度识别结果包括:每个影像块的病变程度识别结果及其在医疗影像中的区域信息,和/或,根据每个影像块的病变程度识别结果在其对应区域设置了相应指示信息后的病变程度指示影像。
- 如权利要求11所述的装置,所述病变程度识别结果还包括医疗影像的病变程度标签,所述医疗影像的病变程度标签为:所述病例影像分割出的多个影像块中病变程度最严重的影像块的第一识别结果;或者,根据所有影像块的特征信息确定的所述医疗影像的病变程度的第二识别结果;或者,根据所述第一识别结果和所述第二识别结果确定的综合结果。
- 如权利要求9-12中任一权利要求所述的装置,所述第二识别模型,是基于卷积神经网络训练得到的,并且采用第二识别误差对所述第二识别模型进行优化;其中,所述第二识别误差是采用指定的损失函数获得的,所述损失函数中的交叉熵,是基于医疗影像的各个影像块中病变程度最高的影像块的病变程度识别结果确定的。
- 如权利要求9-12中任一权利要求所述的装置,所述第一识别模型,是基于稠密卷积网络DenseNet训练获得的,并且采用第一识别误差对所述第一识别模型进行优化;其中,所述第一识别误差是根据各个医疗影像的病变识别结果的交叉熵和相应约束表达式值的加和的平均值,以及正则值获得的,其中,医疗影像的约束表达值是通过预设的约束表达式获得的,所述约束表达式用于在医疗影像的人工判别结果为正常时获得有效的约束表达值。
- 一种医疗影像的识别方法,包括:获取待识别的医疗影像,并通过识别模型对所述待识别的医疗影像进行识别;获得所述识别模型对所述待识别的医疗影像进行识别后输出的病变程度识别结果,且所述病变程度识别结果包括:所述医疗影像包括的多个影像块中,每个影像块的病变程度识别结果及其在医疗影像中的区域信息,和/或根据每个影像块的病变程度识别结果在其对应区域设置了相应指示信息后的病变程度指示影像;其中,所述识别模型对所述医疗影像进行识别包括:将所述医疗影像分割为多个影像块,并针对每个影像块,提取该影像块的特征信息并根据提取的特征信息确定该影像块的病变程度识别结果。
- 一种影像识别的装置,包括:输入单元,用于获取待识别的医疗影像,并通过识别模型对所述待识别的医疗影像进行识别;获得单元,用于获得所述识别模型对所述待识别的医疗影像进行识别后输出的 病变程度识别结果,且所述病变程度识别结果包括:所述医疗影像包括的多个影像块中,每个影像块的病变程度识别结果及其在医疗影像中的区域信息,和/或根据每个影像块的病变程度识别结果在其对应区域设置了相应指示信息后的病变程度指示影像;其中,所述识别模型对所述医疗影像进行识别具体包括:将所述医疗影像分割为多个影像块,并针对每个影像块,提取该影像块的特征信息并根据提取的特征信息确定该影像块的病变程度识别结果。
- 一种终端设备,包括至少一个处理单元、以及至少一个存储单元,其中,所述存储单元存储有计算机程序,当所述程序被所述处理单元执行时,使得所述处理单元执行权利要求1~7任一权利要求所述方法的步骤。
- 一种医疗系统,包括影像采集装置、影像识别装置和显示装置,其中,所述影像采集装置,用于采集病人的医疗影像;所述影像识别装置,用于获取所述影像采集装置采集的医疗影像,并通过第一识别模型对所述医疗影像进行判别,生成用于指示所述医疗影像是否包括病变的病变识别结果,以及当所述病变识别结果指示所述医疗影像包括病变时,通过第二识别模型对所述医疗影像进行识别,生成用于指示所述病变的程度的病变程度识别结果;所述显示装置,用于呈现所述病变程度识别结果。
- 一种医疗系统,包括影像采集装置、影像识别装置和显示装置,其中,所述影像采集装置,用于采集病人的医疗影像;所述影像识别装置,用于获取所述影像采集装置采集的医疗影像,并通过第二识别模型对包括病变的医疗影像进行识别,生成病变程度识别结果,其中,所述第二识别模型用于识别医疗影像所包括的病变的程度;所述显示装置,用于呈现所述病变程度识别结果。
- 一种计算机可读存储介质,存储有计算机程序,当所述程序被一个或多个处理器执行时,使得所述一个或多个处理器执行权利要求1~7任一权利要求所述方法的步骤。
Priority Applications (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP19879486.9A EP3876192B1 (en) | 2018-10-30 | 2019-06-28 | Image recognition method and device |
| JP2020561046A JP7152513B2 (ja) | 2018-10-30 | 2019-06-28 | 画像認識方法、装置、端末機器及び医療システム、並びにそのコンピュータプログラム |
| US17/078,878 US11410306B2 (en) | 2018-10-30 | 2020-10-23 | Method, apparatus, system, and storage medium for recognizing medical image |
| US17/856,043 US11610310B2 (en) | 2018-10-30 | 2022-07-01 | Method, apparatus, system, and storage medium for recognizing medical image |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201811278418.3 | 2018-10-30 | ||
| CN201811278418.3A CN109427060A (zh) | 2018-10-30 | 2018-10-30 | 一种影像识别的方法、装置、终端设备和医疗系统 |
Related Child Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US17/078,878 Continuation US11410306B2 (en) | 2018-10-30 | 2020-10-23 | Method, apparatus, system, and storage medium for recognizing medical image |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2020087960A1 true WO2020087960A1 (zh) | 2020-05-07 |
Family
ID=65514793
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2019/093602 Ceased WO2020087960A1 (zh) | 2018-10-30 | 2019-06-28 | 一种影像识别的方法、装置、终端设备和医疗系统 |
Country Status (5)
| Country | Link |
|---|---|
| US (2) | US11410306B2 (zh) |
| EP (1) | EP3876192B1 (zh) |
| JP (1) | JP7152513B2 (zh) |
| CN (1) | CN109427060A (zh) |
| WO (1) | WO2020087960A1 (zh) |
Cited By (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN112233194A (zh) * | 2020-10-15 | 2021-01-15 | 平安科技(深圳)有限公司 | 医学图片优化方法、装置、设备及计算机可读存储介质 |
| CN113130050A (zh) * | 2021-04-20 | 2021-07-16 | 皖南医学院第一附属医院(皖南医学院弋矶山医院) | 一种医学信息显示方法及显示系统 |
| CN115311268A (zh) * | 2022-10-10 | 2022-11-08 | 武汉楚精灵医疗科技有限公司 | 食管内窥镜图像的识别方法及装置 |
Families Citing this family (11)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN109427060A (zh) | 2018-10-30 | 2019-03-05 | 腾讯科技(深圳)有限公司 | 一种影像识别的方法、装置、终端设备和医疗系统 |
| CN110432977A (zh) * | 2019-08-07 | 2019-11-12 | 杭州睿笛生物科技有限公司 | 一种电脉冲消融设备以及适用其仿真方法 |
| CN110610489B (zh) * | 2019-08-30 | 2021-11-23 | 西安电子科技大学 | 基于注意力机制的光学喉镜图像病变区标注方法 |
| TWI705458B (zh) * | 2019-09-19 | 2020-09-21 | 沐恩生醫光電股份有限公司 | 一種醫療影像辨識的方法與系統 |
| CN111488912B (zh) * | 2020-03-16 | 2020-12-11 | 哈尔滨工业大学 | 一种基于深度学习神经网络的喉部疾病诊断系统 |
| CN111860209B (zh) * | 2020-06-29 | 2024-04-26 | 北京字节跳动网络技术有限公司 | 手部识别方法、装置、电子设备及存储介质 |
| CN112037167B (zh) * | 2020-07-21 | 2023-11-24 | 苏州动影信息科技有限公司 | 一种基于影像组学和遗传算法的目标区域确定系统 |
| CN112509688B (zh) * | 2020-09-25 | 2024-06-11 | 卫宁健康科技集团股份有限公司 | 压疮图片自动分析系统、方法、设备和介质 |
| CN113159238B (zh) * | 2021-06-23 | 2021-10-26 | 安翰科技(武汉)股份有限公司 | 内窥镜影像识别方法、电子设备及存储介质 |
| CN116342859B (zh) * | 2023-05-30 | 2023-08-18 | 安徽医科大学第一附属医院 | 一种基于影像学特征识别肺部肿瘤区域的方法及系统 |
| WO2025150130A1 (ja) * | 2024-01-11 | 2025-07-17 | 富士通株式会社 | 病変識別方法および病変識別プログラム |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN107609503A (zh) * | 2017-09-05 | 2018-01-19 | 刘宇红 | 智能癌变细胞识别系统及方法、云平台、服务器、计算机 |
| CN108573490A (zh) * | 2018-04-25 | 2018-09-25 | 王成彦 | 一种针对肿瘤影像数据的智能读片系统 |
| WO2018189551A1 (en) * | 2017-04-12 | 2018-10-18 | Kheiron Medical Technologies Ltd | Malignancy assessment for tumors |
| CN109427060A (zh) * | 2018-10-30 | 2019-03-05 | 腾讯科技(深圳)有限公司 | 一种影像识别的方法、装置、终端设备和医疗系统 |
Family Cites Families (12)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN105997128A (zh) * | 2016-08-03 | 2016-10-12 | 上海联影医疗科技有限公司 | 利用灌注成像识别病灶的方法及系统 |
| CA3040518C (en) * | 2016-10-21 | 2023-05-23 | Nantomics, Llc | Digital histopathology and microdissection |
| CN108095683A (zh) * | 2016-11-11 | 2018-06-01 | 北京羽医甘蓝信息技术有限公司 | 基于深度学习的处理眼底图像的方法和装置 |
| CN107203778A (zh) * | 2017-05-05 | 2017-09-26 | 平安科技(深圳)有限公司 | 视网膜病变程度等级检测系统及方法 |
| JP6890184B2 (ja) * | 2017-09-15 | 2021-06-18 | 富士フイルム株式会社 | 医療画像処理装置及び医療画像処理プログラム |
| CN107563123A (zh) * | 2017-09-27 | 2018-01-09 | 百度在线网络技术(北京)有限公司 | 用于标注医学图像的方法和装置 |
| WO2019087790A1 (ja) * | 2017-10-31 | 2019-05-09 | 富士フイルム株式会社 | 検査支援装置、内視鏡装置、検査支援方法、及び検査支援プログラム |
| CN108305249B (zh) * | 2018-01-24 | 2022-05-24 | 福建师范大学 | 基于深度学习的全尺度病理切片的快速诊断和评分方法 |
| CN108346154B (zh) * | 2018-01-30 | 2021-09-07 | 浙江大学 | 基于Mask-RCNN神经网络的肺结节分割装置的建立方法 |
| CN108510482B (zh) * | 2018-03-22 | 2020-12-04 | 姚书忠 | 一种基于阴道镜图像的宫颈癌检测装置 |
| CN108710901B (zh) * | 2018-05-08 | 2022-03-01 | 广州市新苗科技有限公司 | 一种基于深度学习的脊柱畸形筛查系统及方法 |
| CN108665457B (zh) * | 2018-05-16 | 2023-12-19 | 腾讯医疗健康(深圳)有限公司 | 图像识别方法、装置、存储介质及计算机设备 |
-
2018
- 2018-10-30 CN CN201811278418.3A patent/CN109427060A/zh active Pending
-
2019
- 2019-06-28 WO PCT/CN2019/093602 patent/WO2020087960A1/zh not_active Ceased
- 2019-06-28 EP EP19879486.9A patent/EP3876192B1/en active Active
- 2019-06-28 JP JP2020561046A patent/JP7152513B2/ja active Active
-
2020
- 2020-10-23 US US17/078,878 patent/US11410306B2/en active Active
-
2022
- 2022-07-01 US US17/856,043 patent/US11610310B2/en active Active
Patent Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2018189551A1 (en) * | 2017-04-12 | 2018-10-18 | Kheiron Medical Technologies Ltd | Malignancy assessment for tumors |
| CN107609503A (zh) * | 2017-09-05 | 2018-01-19 | 刘宇红 | 智能癌变细胞识别系统及方法、云平台、服务器、计算机 |
| CN108573490A (zh) * | 2018-04-25 | 2018-09-25 | 王成彦 | 一种针对肿瘤影像数据的智能读片系统 |
| CN109427060A (zh) * | 2018-10-30 | 2019-03-05 | 腾讯科技(深圳)有限公司 | 一种影像识别的方法、装置、终端设备和医疗系统 |
Non-Patent Citations (1)
| Title |
|---|
| See also references of EP3876192A4 * |
Cited By (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN112233194A (zh) * | 2020-10-15 | 2021-01-15 | 平安科技(深圳)有限公司 | 医学图片优化方法、装置、设备及计算机可读存储介质 |
| CN112233194B (zh) * | 2020-10-15 | 2023-06-02 | 平安科技(深圳)有限公司 | 医学图片优化方法、装置、设备及计算机可读存储介质 |
| CN113130050A (zh) * | 2021-04-20 | 2021-07-16 | 皖南医学院第一附属医院(皖南医学院弋矶山医院) | 一种医学信息显示方法及显示系统 |
| CN113130050B (zh) * | 2021-04-20 | 2023-11-24 | 皖南医学院第一附属医院(皖南医学院弋矶山医院) | 一种医学信息显示方法及显示系统 |
| CN115311268A (zh) * | 2022-10-10 | 2022-11-08 | 武汉楚精灵医疗科技有限公司 | 食管内窥镜图像的识别方法及装置 |
| CN115311268B (zh) * | 2022-10-10 | 2022-12-27 | 武汉楚精灵医疗科技有限公司 | 食管内窥镜图像的识别方法及装置 |
Also Published As
| Publication number | Publication date |
|---|---|
| EP3876192B1 (en) | 2026-02-25 |
| EP3876192A1 (en) | 2021-09-08 |
| EP3876192A4 (en) | 2021-11-17 |
| CN109427060A (zh) | 2019-03-05 |
| JP2021521553A (ja) | 2021-08-26 |
| US11610310B2 (en) | 2023-03-21 |
| US20220343502A1 (en) | 2022-10-27 |
| US11410306B2 (en) | 2022-08-09 |
| JP7152513B2 (ja) | 2022-10-12 |
| US20210042920A1 (en) | 2021-02-11 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| WO2020087960A1 (zh) | 一种影像识别的方法、装置、终端设备和医疗系统 | |
| CN111340819B (zh) | 图像分割方法、装置和存储介质 | |
| Younas et al. | A deep ensemble learning method for colorectal polyp classification with optimized network parameters: A deep ensemble learning method for colorectal polyp classification with optimized network parameters | |
| Yue et al. | Automated endoscopic image classification via deep neural network with class imbalance loss | |
| Ay et al. | Automated classification of nasal polyps in endoscopy video-frames using handcrafted and CNN features | |
| WO2020088288A1 (zh) | 内窥镜图像的处理方法、系统及计算机设备 | |
| CN108268870A (zh) | 基于对抗学习的多尺度特征融合超声图像语义分割方法 | |
| CN110689025A (zh) | 图像识别方法、装置、系统及内窥镜图像识别方法、装置 | |
| CN110363768A (zh) | 一种基于深度学习的早期癌病灶范围预测辅助系统 | |
| CN109948671B (zh) | 图像分类方法、装置、存储介质以及内窥镜成像设备 | |
| Meer et al. | Deep convolutional neural networks information fusion and improved whale optimization algorithm based smart oral squamous cell carcinoma classification framework using histopathological images | |
| Xie et al. | Optic disc and cup image segmentation utilizing contour-based transformation and sequence labeling networks | |
| Sornapudi et al. | DeepCIN: attention-based cervical histology image classification with sequential feature modeling for pathologist-level accuracy | |
| Haj‐Manouchehri et al. | Polyp detection using CNNs in colonoscopy video | |
| CN117218129B (zh) | 食道癌图像识别分类方法、系统、设备及介质 | |
| Chen et al. | Deep transfer learning for histopathological diagnosis of cervical cancer using convolutional neural networks with visualization schemes | |
| CN112668668B (zh) | 一种术后医学影像评估方法、装置、计算机设备及存储介质 | |
| Bhandari et al. | Improved diabetic retinopathy severity classification using squeeze-and-excitation and sparse light weight multi-level attention u-net with transfer learning from xception | |
| CN118657993B (zh) | 一种具备可解释性的病灶感知眼底图像分类方法及系统 | |
| WO2024016691A1 (zh) | 一种图像检索方法、模型训练方法、装置及存储介质 | |
| WO2022160070A1 (en) | Machine learning enabled system for skin abnormality interventions | |
| Deepa et al. | Pre-Trained Convolutional Neural Network for Automated Grading of Diabetic Retinopathy | |
| CN115908224A (zh) | 目标检测模型的训练方法、目标检测方法和训练装置 | |
| Yang et al. | An automatic method for sublingual image segmentation and color analysis | |
| Asif et al. | SFI-ensemble: Sugeno fuzzy integral-based ensemble of CNN models with meta-heuristic fuzzy measures for mouth and oral disease detection |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 19879486 Country of ref document: EP Kind code of ref document: A1 |
|
| ENP | Entry into the national phase |
Ref document number: 2020561046 Country of ref document: JP Kind code of ref document: A |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| ENP | Entry into the national phase |
Ref document number: 2019879486 Country of ref document: EP Effective date: 20210531 |
|
| WWG | Wipo information: grant in national office |
Ref document number: 2019879486 Country of ref document: EP |

