WO2021114592A1 - 视频降噪方法、装置、终端及存储介质 - Google Patents
视频降噪方法、装置、终端及存储介质 Download PDFInfo
- Publication number
- WO2021114592A1 WO2021114592A1 PCT/CN2020/095359 CN2020095359W WO2021114592A1 WO 2021114592 A1 WO2021114592 A1 WO 2021114592A1 CN 2020095359 W CN2020095359 W CN 2020095359W WO 2021114592 A1 WO2021114592 A1 WO 2021114592A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- image
- pixel
- noise reduction
- pixels
- target image
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
- G06T5/20—Image enhancement or restoration using local operators
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/222—Studio circuitry; Studio devices; Studio equipment
- H04N5/262—Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects ; Cameras specially adapted for the electronic generation of special effects
- H04N5/268—Signal distribution or switching
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
- G06T5/70—Denoising; Smoothing
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
- G06T5/50—Image enhancement or restoration using two or more images, e.g. averaging or subtraction
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
- G06T7/254—Analysis of motion involving subtraction of images
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
- G06T7/277—Analysis of motion involving stochastic approaches, e.g. using Kalman filters
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
- H04N7/15—Conference systems
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N9/00—Details of colour television systems
- H04N9/64—Circuits for processing colour signals
- H04N9/70—Circuits for processing colour signals for colour killing
- H04N9/71—Circuits for processing colour signals for colour killing combined with colour gain control
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10016—Video; Image sequence
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20024—Filtering details
- G06T2207/20028—Bilateral filtering
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20172—Image enhancement details
- G06T2207/20182—Noise reduction or smoothing in the temporal domain; Spatio-temporal filtering
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20212—Image combination
- G06T2207/20221—Image fusion; Image merging
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/14—Picture signal circuitry for video frequency region
- H04N5/21—Circuitry for suppressing or minimising disturbance, e.g. moiré or halo
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N9/00—Details of colour television systems
- H04N9/64—Circuits for processing colour signals
- H04N9/646—Circuits for processing colour signals for image enhancement, e.g. vertical detail restoration, cross-colour elimination, contour correction, chrominance trapping filters
Definitions
- This application relates to the field of multimedia technology, and in particular to a video noise reduction method, device, terminal and storage medium.
- the pixels in the neighborhood often include the pixels that have completed the filtering process before, so that the subsequent pixels
- the filtering process is a serial process, which results in a slower algorithm operation speed.
- a video noise reduction method device, terminal, and storage medium are provided.
- a video noise reduction method executed by a terminal, and the method includes:
- the spatial filtering is used to eliminate the dependency between the pixels of the target image
- time-domain filtering is performed on the pixels of the target image in parallel to obtain a second image
- the first denoised image is the image of the target image
- the first image and the second image are fused to obtain a second noise reduction image corresponding to the target image that has undergone noise reduction processing.
- a video noise reduction device including:
- the spatial filtering module is used to perform spatial filtering on the pixels of the target image in the video to be processed to obtain the first image; the spatial filtering is used to eliminate the dependency between the pixels of the target image;
- the time domain filtering module is configured to perform time domain filtering on the pixels of the target image in parallel according to the frame difference between the first image and the first noise reduction image to obtain a second image.
- the first noise reduction image The image is an image that has undergone noise reduction processing corresponding to the previous frame image of the target image;
- the fusion module is configured to predict the corresponding first gain of the pixel of the second image in the second noise-reduced image according to the second gain coefficient corresponding to the pixel of the target image in the first noise-reduced image Coefficient; and according to the first gain coefficient, the first image and the second image are fused to obtain a second noise reduction image corresponding to the target image that has undergone noise reduction processing.
- a non-volatile storage medium storing computer-readable instructions.
- the computer-readable instructions are executed by one or more processors, the one or more processors execute the steps of the video noise reduction method.
- a terminal includes a memory and a processor, and computer-readable instructions are stored in the memory.
- the processor executes the steps of the video noise reduction method.
- FIG. 1 is a video image collected by a low-performance camera configured by a laptop computer according to an embodiment of the present application
- FIG. 2 is a schematic diagram of a video conference process provided by an embodiment of the present application.
- FIG. 3 is a structural block diagram of a video noise reduction system provided by an embodiment of the present application.
- FIG. 4 is a flowchart of a video noise reduction method provided by an embodiment of the present application.
- FIG. 5 is a schematic diagram of image filtering before pixel dependence is released according to an embodiment of the present application.
- FIG. 6 is a schematic diagram of image filtering provided by an embodiment of the present application after pixel dependence is released;
- FIG. 7 is a schematic diagram of a spatial filtering effect comparison provided by an embodiment of the present application.
- FIG. 8 is a schematic diagram of a comparison before and after noise reduction processing provided by an embodiment of the present application.
- FIG. 9 is a schematic diagram of a key process of a video noise reduction method provided by an embodiment of the present application.
- FIG. 10 is a schematic diagram of an algorithm flow chart of a video noise reduction method provided by an embodiment of the present application.
- FIG. 11 is a block diagram of a video noise reduction device provided by an embodiment of the present application.
- FIG. 12 is a structural block diagram of a terminal provided by an embodiment of the present application.
- the embodiment of the present application mainly relates to a scene of performing noise reduction processing on a video, and takes the noise reduction processing on a remote conference video as an example for description.
- Remote video conferencing is an important part of the various functions of office collaboration products. It has very strict requirements on the captured video, and usually requires the use of high-definition cameras for video capture. When using a weaker camera for video capture, the captured video generally has noise. If these noises are not processed, the experience of the video conference will be poor.
- Figure 1 is a video image captured by a low-performance camera configured on a laptop computer. As can be seen from Figure 1, the collected video images contain a lot of noise.
- the embodiments of the present application can also be applied to perform noise reduction processing on the video collected by the mobile phone camera during a video call, or perform noise reduction processing on the video collected by the monitoring device, etc. The embodiment of the application does not limit this.
- the video noise reduction method provided by the embodiments of the present application solves the dependency between pixels in the image to meet the needs of parallel computing. Because the parallel computing capability of GPU (Graphics Processing Unit, graphics processing unit) is stronger than that of CPU Therefore, the video noise reduction method provided by the embodiments of the present application replaces the CPU by calling the Metal (an image processing interface provided by Apple) or DirectX (an image processing interface provided by Microsoft) provided by the GPU. Realize the parallel processing of each pixel. Thereby, the processing speed of the noise reduction processing on the video is improved and the CPU usage is reduced. That is, the video noise reduction method provided by the embodiments of this application can achieve fast video noise reduction with a very low CPU occupancy rate.
- FIG. 2 is a schematic diagram of a video conference process provided by an embodiment of the present application.
- the video image collected by the camera is displayed locally after noise reduction processing and other operations, for example, displayed on the screen of a laptop computer.
- the encoder encodes the video image after noise reduction processing, and transmits it to the remote end through the network.
- the remote decoder decodes the video image and displays the decoded video image at the remote end.
- the remote end can also be a notebook computer.
- the video noise reduction system 300 may be used to implement video noise reduction, and includes: a terminal 310 and a video service platform 320.
- the terminal 310 may be connected to the video service platform 320 through a wireless network or a wired network.
- the terminal 310 may be at least one of a smart phone, a video camera, a desktop computer, a tablet computer, an MP4 player, and a laptop portable computer.
- the terminal 310 installs and runs an application program that supports remote video conferences.
- the terminal 310 may be a terminal used by a user, and an account of the user is logged in an application program running on the terminal.
- the video service platform 320 includes at least one of a server, multiple servers, and a cloud computing platform.
- the video service platform 320 is used to provide background services for remote video conferences, such as user management, video stream forwarding, and so on.
- the video service platform 320 includes: an access server, a data management server, a user management server, and a database.
- the access server is used to provide access services for the terminal 310.
- the data management server is used to forward the video stream uploaded by the terminal, etc.
- the same service is provided in a load balancing manner or the same service is provided in the manner of a main server and a mirror server, which is not limited in the embodiment of the present application.
- the database is used to store user account information.
- the account information is the data information that the user has authorized to collect.
- the terminal 310 may generally refer to one of multiple terminals, and this embodiment only uses the local terminal 310 and two remote terminals 310 for illustration. Those skilled in the art may know that the number of the aforementioned terminals may be more or less. For example, the above-mentioned remote terminal may be only one, or the above-mentioned remote terminal may be dozens or hundreds, or more. The embodiment of the present application does not limit the number and types of terminals 310.
- FIG. 4 is a flowchart of a video noise reduction method provided by an embodiment of the present application, as shown in FIG. 4. The method includes the following steps:
- the terminal spatially filters the pixels of the target image in the video to be processed to obtain the first image; the spatial filtering is used to eliminate the dependency between pixels in the target image.
- the terminal may implement spatial filtering of the pixels of the target image based on the first filter, that is, input the target image into the first filter, and the output of the first filter is the first filter after the spatial filtering.
- the first filter may be an improved bilateral filter, and the first filter may process pixels of the target image in parallel.
- the first filter is described below:
- the bilateral filtering algorithm is a non-linear edge-preserving filtering algorithm, which is a compromise processing method that combines the spatial proximity of the image and the similarity of pixel values.
- the bilateral filtering algorithm considers both spatial information and gray-scale similarity to achieve the purpose of edge preservation and denoising. It has simple, non-iterative, and local characteristics. Among them, edge preservation and denoising refers to replacing the original pixel value of the pixel by the average value of the neighboring pixels of the currently processed pixel.
- the pixels of the image to be processed are usually first from left to right, then from top to bottom (or from top to bottom, then from top to bottom).
- spatial filtering it is often achieved by performing linear or nonlinear processing on neighboring pixels of the currently processed pixel. Because in the process of filtering the image to be processed, when the processing order is performed on the subsequent pixels, the pixels in the neighborhood of the pixel often include the pixels that have completed the spatial filtering process before, resulting in the subsequent order.
- the pixel points of has a dependency on the pixels that have been filtered, and this dependency causes the spatial filtering of the entire image to become a serial processing process.
- the elimination of pixel dependence refers to the elimination of the dependence relationship between pixels.
- I(p) represents the pixel value of the currently processed pixel in the image
- I(q) represents the neighboring pixel of the currently processed pixel in the image
- the pixel value of a point p represents the coordinates of the currently processed pixel in the image
- q represents the coordinate of the neighboring pixel of the currently processed pixel in the image
- ⁇ (p,q) represents the weight related to the position of the pixel
- g ( ⁇ ) represents the Gaussian function
- the ⁇ s and ⁇ r sub-tables represent the variance of the Gaussian function.
- the I(q) corresponding to the neighborhood pixel before the currently processed pixel is the pixel value after spatial filtering
- the I(q) corresponding to the neighborhood pixel after the currently processed pixel is (q) is the original pixel value of the neighborhood pixel.
- the neighborhood pixel of the currently processed pixel refers to the pixel within the neighborhood of the currently processed pixel.
- the size of the neighborhood of the pixel is different, and the number of the neighborhood of the pixel is different.
- the neighborhood of a pixel can be four neighborhoods, that is, the upper neighborhood, the lower neighborhood, the left neighborhood, and the right neighborhood; the neighborhood pixels of the pixel are the four pixels adjacent to the top, bottom, left, and right of the pixel.
- the neighborhood of a pixel can be eight neighborhoods, namely, the upper neighborhood, the upper left neighborhood, the upper right neighborhood, the lower neighborhood, the lower left neighborhood, the lower right neighborhood, the left neighborhood, and the right neighborhood; the neighborhood of the pixel
- the pixel points are eight pixels surrounding the pixel point.
- the neighborhood of the pixel can also be selected in other ways.
- FIG. 5 is a schematic diagram of an image filter provided by an embodiment of the present application before pixel dependence is released.
- the currently processed pixel is the central pixel, and the central pixel corresponds to 12 neighboring pixels.
- the neighboring pixels located on the left and above the center pixel are the pixels that have been processed.
- the neighboring pixels located to the right and below the center pixel are unprocessed pixels.
- the first improvement is made to the above-mentioned process, that is, the above-mentioned bilateral filter is improved.
- the pixel dependence between pixels is calculated, and the above-mentioned first filter is obtained.
- the first filter is also based on the bilateral filtering algorithm. The difference is that when the pixel of the target image is filtered by the above formulas (1) and (2), the pixel value of the neighboring pixel of the pixel is The value of I(q) uses the original pixel value of the image, that is, the pixel value after filtering is not used. In this way, each pixel no longer depends on the pixel that is arranged before the current pixel in the processing order, and the influence of the pixel that is arranged before the current pixel in the processing order on the current pixel after filtering is eliminated.
- FIG. 6 is a schematic diagram of an image filter provided by an embodiment of the present application after pixel dependence is released.
- the currently processed pixel is the central pixel, and the central pixel corresponds to 12 neighboring pixels.
- These 12 neighboring pixels are all unprocessed pixels, that is, the pixels of the neighboring pixels.
- the values are all initial pixel values.
- FIG. 7 is a schematic diagram of a spatial filtering effect comparison provided by an embodiment of the present application.
- FIG. 7 exemplarily shows the target image, the image filtered by the bilateral filter, and the image filtered by the first filter.
- the terminal can call the image processing interface provided by the GPU, such as Metal or DirectX. And so on, transfer the steps of performing spatial filtering on the pixels of the target image to the GPU for implementation.
- the terminal can also call the image processing interface of the graphics processor, through the image processing interface to obtain the pixels of the target image in the video to be processed in parallel, and perform spatial filtering on the pixels obtained in parallel, thereby realizing parallel processing
- the pixel points of the target image in the video are processed for spatial filtering, which accelerates the entire spatial filtering process, saves CPU resources and reduces the CPU occupancy rate.
- the terminal acquires a first noise reduction image, where the first noise reduction image is an image that has undergone noise reduction processing corresponding to a previous frame of the target image.
- the terminal after the terminal performs spatial filtering on the pixels of the target image, it may also perform temporal filtering on the pixels of the target image. Before performing temporal filtering on the pixels of the target image, the terminal may obtain the first denoising image corresponding to the previous frame of the target image. The subsequent step of temporally filtering the target image is performed based on the first noise-reduced image and the above-mentioned first image.
- the terminal determines the frame difference between the first image and the first denoising image.
- the terminal may store the noise-reduction pixel value of each pixel of the first noise-reduction image that has undergone noise reduction processing in the form of a two-dimensional array.
- the terminal may also store the filtered pixel value of each pixel of the first image in the form of a two-dimensional array, and the pixel of the first image corresponds to each pixel of the first denoising image one-to-one.
- the size of the array is the product of the height of the target image and the width of the image.
- the terminal can calculate the difference between the noise-reduced pixel value of the pixel in the first noise-reduced image and the corresponding filtered pixel value in the first image, and use the difference as the pixel corresponding to the pixel.
- Pixel frame difference the frame difference between the first image and the first denoising image is obtained, and the frame difference may be in the form of a two-dimensional array.
- the terminal performs temporal filtering on the pixels of the target image in parallel according to the frame difference between the first image and the first denoising image to obtain a second image.
- the terminal may input the frame difference between the first image and the first noise-reduced image and the target image into the second filter ,
- the time domain filtering is performed based on the second filter, and the output of the second filter is the second image.
- the second filter may be an improved Kalman filter based on the Kalman filter algorithm, that is, the third improvement in the embodiment of the present application is to improve the Kalman filter based on the Kalman filter algorithm to obtain The second filter described above.
- the second filter is described below:
- the process of time-domain filtering based on the Kalman filtering algorithm mainly includes two steps, one is prediction and the other is correction.
- the prediction step the terminal predicts the corresponding pixel value and variance of any pixel in the target image based on the noise-reduced pixel value and variance corresponding to any pixel in the first noise-reduced image.
- the correction step the terminal determines the gain coefficient corresponding to each pixel, and determines the gain coefficient, the corresponding pixel value of the pixel in the target image, and the corresponding noise reduction pixel value of the pixel in the first noise reduction image.
- the gain coefficient is a parameter of the relationship between the pixel values of the corresponding pixels between two frames of images.
- P k-1 represents the corresponding variance of the pixel in the first denoised image.
- Q represents the variance offset coefficient, which is an empirical parameter in the Kalman filter algorithm. In this embodiment, Q is a constant
- K k represents the corresponding gain coefficient of the pixel in the predicted noise reduction image of the target image.
- R represents the gain offset coefficient, which is also an empirical parameter in the Kalman filter algorithm.
- R is a parameter that changes following an iterative operation. It can be understood that both Q and R are empirical parameter factors, and Kalman filters with different performances can be obtained by adjusting them.
- P k represents the variance that the pixel needs to use in the next frame of image.
- the video noise reduction method provided by the embodiment of the present application optimizes the formula (4), introduces the frame difference when calculating the variance, and obtains the formula (8).
- ⁇ represents the frame difference between the first image and the first noise-reduction image.
- the video noise reduction method provided by the embodiment of the present application adds formula (9) and formula (10), and optimizes formula (5) to obtain formula (11).
- R k represents the corresponding gain bias coefficient of the pixel in the target image
- R k-1 represents the corresponding gain bias coefficient of the pixel in the first noise reduction image
- K k-1 represents the pixel in the first noise reduction image.
- U k represents the motion compensation coefficient.
- this step can be implemented through the following sub-step 4041 to sub-step 4043. Since the terminal can perform temporal filtering on the pixels of the target image in parallel, in sub-step 4041 to sub-step 4044, any pixel in the target image is taken as an example for description. The pixels are the same. When all the pixels of the target image are processed by the terminal, the second image is obtained.
- the terminal predicts the corresponding first gain coefficient of the pixel of the second image in the second noise-reduced image according to the second gain coefficient of the pixel of the target image in the first noise-reduced image. For details, refer to sub-step 4041 to sub-step 4044.
- the terminal determines the second variance of the pixel according to the corresponding first variance of the pixel in the first noise-reduction image, the frame difference between the first image and the first noise-reduction image, and the variance offset coefficient.
- the corresponding first variance of the pixel in the first noise-reduction image is P k-1
- the frame difference between the first image and the first noise-reduction image is ⁇
- the variance offset coefficient is Q.
- the terminal obtains the second gain coefficient and the second gain offset coefficient corresponding to the pixel in the first noise reduction image, and determines the first gain coefficient corresponding to the pixel according to the second gain coefficient and the second gain offset coefficient.
- Gain offset coefficient the first gain coefficient corresponding to the pixel according to the second gain coefficient and the second gain offset coefficient.
- the second gain coefficient corresponding to the pixel in the first noise reduction image is K k-1
- the second gain offset coefficient corresponding to the pixel in the first noise reduction image is R k-1
- the first gain offset coefficient R k corresponding to the pixel can be calculated.
- the terminal determines the motion compensation coefficient corresponding to the pixel point according to the frame difference.
- the motion compensation coefficient U k corresponding to the pixel can be calculated according to formula (10).
- the terminal determines the first gain coefficient corresponding to the pixel point according to the second variance, the first gain offset coefficient corresponding to the pixel point, and the motion compensation coefficient.
- the second variance obtained from the above sub-step 4041 to sub-step 4043 The first gain offset coefficient R k and the motion compensation coefficient U k are calculated to obtain the first gain coefficient K k corresponding to the pixel point.
- the terminal can also use formula (7) and the second variance To determine the third-party difference P k that the pixel needs to use in the next frame of image.
- the terminal fuses the first image and the second image according to the first gain coefficient to obtain a second noise reduction image corresponding to the target image that has undergone noise reduction processing.
- the terminal also obtains the first gain coefficient corresponding to the pixels of the second image in the process of performing temporal filtering on the pixels of the target image to obtain the second image.
- the terminal may use the product of the difference between the first gain coefficient K k corresponding to the pixel and the preset value and the first pixel value x k of the pixel as the first fusion value, and the pixel The product of the corresponding first gain coefficient and the second pixel value Z k of the pixel is used as the second fusion value.
- the first pixel value is the pixel value of the pixel after time-domain filtering
- the second pixel value is the pixel value of the pixel after being spatially filtered.
- the terminal sums the first fusion value and the second fusion value to obtain the noise reduction pixel value corresponding to the pixel.
- the above summation process can be realized according to formula (12).
- the terminal may use the first gain coefficient as a weighting coefficient for fusing the first image and the second image.
- the difference between the first gain coefficient corresponding to the pixel of the second image in the second noise reduction image and the preset value 1 is used as the fusion weight of the pixel of the second image;
- the first gain coefficient corresponding to the pixels in the second noise-reduced image is used as the fusion weight of the pixels of the first image, and the pixel values of the first image and the second image are weighted and fused to obtain the second noise-reduced image.
- FIG. 8 is a schematic diagram of a comparison before and after noise reduction processing provided by an embodiment of the present application.
- Figure 8 includes the target image before noise reduction and the target image after noise reduction. It can be seen from the figure that the noise in the target image after noise reduction is significantly reduced compared to the target image before noise reduction, that is, the embodiment of the present application provides The video noise reduction method effectively realizes the noise reduction of the target image.
- a third filter can be set.
- the third filter has the same structure as the first filter.
- the third filter, the first filter, and the second filter can be processed by calling the GPU.
- the interface processes each pixel in the target image in parallel to achieve noise reduction processing on the target image.
- FIG. 9 is a schematic diagram of a key process of a video noise reduction method provided by an embodiment of the present application.
- the figure includes input, noise reduction processing and output three parts, the input is the target image f C and the first noise reduction image
- the first filter and the third filter are represented by image noise reduction filters F1 and F2, respectively.
- the second filter is represented by the Kalman filter Fk. Parallel acceleration through GPU image processing interface.
- the terminal When the terminal performs noise reduction on the target image, it processes the target image f C through the image noise reduction filter F1 to obtain the first image Calculate the first denoised image according to the processing result And the first image Frame difference between f D and the frame difference f D f C and a target input image Fk Kalman filter, the Kalman filter output image and the second image Fk noise reduction filter output F2 is fused, Obtain the second denoising image corresponding to the target image that has been denoised
- the second denoising image may also be stored in the Kalman filter to participate in subsequent image operations.
- FIG. 10 is a schematic diagram of the algorithm flow of a video noise reduction method provided in an embodiment of the present application.
- Spatial filtering of the target image includes: Among them, the arrow indicates assignment.
- Time-domain filtering of the target image includes: Time-domain filtering of any pixel of the target image includes: Calculate the frame difference; R k ⁇ 1+R k-1 (1+K k-1 ) -1 , calculate the gain offset coefficient;
- the first noise reduction map Calculate the first gain coefficient; Calculate the pixel value after time-domain filtering of the pixel; Calculate the noise reduction pixel value; Calculate the variance to be used in the next frame of image and return
- the video noise reduction method provided by the embodiments of this application has a fourth improvement, that is, the format of the input image is set to adopt the YCbCr (YUV) format, and the image is processed for noise reduction.
- the first filter and the second filter respectively perform spatial filtering and temporal filtering on the brightness component of the target image, that is, only perform noise reduction processing on the Y channel that characterizes the brightness detail information.
- the pixels of the target image are filtered in a spatial domain that eliminates pixel dependence, so that there is no longer a dependency relationship between pixels in the target image, and the first image and the first image obtained by the spatial filtering are filtered according to the spatial domain.
- the frame difference between the noise reduction images is used to perform temporal filtering on the pixels of the target image in parallel, so that the video noise reduction process is converted from serial processing to parallel processing, and the noise reduction processing process is accelerated.
- FIG. 11 is a block diagram of a video noise reduction device provided by an embodiment of the present application.
- the device is used to perform the steps of the foregoing video noise reduction method.
- the device includes: a spatial filtering module 1101, a temporal filtering module 1102, and a fusion module 1103.
- the various modules included in the video noise reduction device may be implemented in whole or in part by software, hardware or a combination thereof.
- the spatial filtering module 1101 is used to perform spatial filtering on the pixels of the target image in the video to be processed to obtain the first image; the spatial filtering is used to eliminate the dependency between the pixels of the target image.
- the temporal filtering module 1102 is used to perform temporal filtering on the pixels of the target image in parallel according to the frame difference between the first image and the first noise-reduced image to obtain a second image, and the first noise-reduced image is the image of the target image.
- the previous image corresponds to the image that has been processed for noise reduction.
- the fusion module 1103 is used for predicting the corresponding first gain coefficient of the pixel of the second image in the second noise-reduced image according to the second gain coefficient of the pixel of the target image in the first noise-reduced image; and A gain coefficient is used to fuse the first image and the second image to obtain a second noise reduction image corresponding to the target image that has undergone noise reduction processing.
- the spatial filtering module 1101 is further configured to obtain the initial pixel value of the neighboring pixels of each pixel for all pixels of the target image in the to-be-processed video; and according to the neighboring pixels The initial pixel value of, the pixel is spatially filtered.
- the video noise reduction device further includes: an interface calling module for calling the image processing interface of the graphics processor; and a parallel acquisition module for acquiring pixels of the target image in the video to be processed in parallel through the image processing interface Points; and filtering the pixels obtained in parallel through the image processing interface.
- the temporal filtering module 1102 is also used to obtain each pixel of the target image in parallel; for any pixel of the target image, according to the first variance of the pixel in the first denoising image , The frame difference between the first image and the first denoising image and the variance offset coefficient to determine the second variance of the pixel; according to the second variance, the first gain offset coefficient corresponding to the pixel, and the motion compensation coefficient, determine The first gain coefficient corresponding to the pixel; according to the first gain coefficient, the initial pixel value of the pixel and the corresponding noise reduction pixel value of the pixel in the first noise reduction image, the first pixel value after the time domain filtering of the pixel is determined And obtain a second image according to the first pixel value after time domain filtering of each pixel of the target image.
- the video noise reduction device further includes: a first determining module, configured to determine the motion compensation coefficient according to the frame difference.
- the video noise reduction device further includes: an acquisition module for acquiring a second gain coefficient and a second gain offset coefficient corresponding to a pixel in the first noise reduction image; and a second determination module for According to the second gain coefficient and the second gain offset coefficient, the first gain offset coefficient corresponding to the pixel point is determined.
- the time-domain filtering module 1102 is also used for any pixel of the second image to calculate the difference between the first gain coefficient corresponding to the pixel and the preset value and the first pixel value of the pixel Take the product of the first gain coefficient corresponding to the pixel and the second pixel value of the pixel as the second fusion value, and the second pixel value is the pixel value of the pixel after spatial filtering; and The first fusion value and the second fusion value are summed to obtain the noise reduction pixel value corresponding to the pixel point.
- the spatial filtering and the temporal filtering respectively process the brightness components of the pixels.
- the pixels of the target image are filtered in a spatial domain that eliminates pixel dependence, so that there is no longer a dependency relationship between pixels in the target image, and the first image and the first image obtained by the spatial filtering are filtered according to the spatial domain.
- the frame difference between the noise reduction images is used to perform temporal filtering on the pixels of the target image in parallel, so that the video noise reduction process is converted from serial processing to parallel processing, and the noise reduction processing process is accelerated.
- the device provided in the above embodiment runs an application program, only the division of the above-mentioned functional modules is used as an example.
- the above-mentioned function allocation can be completed by different functional modules as required, that is, the device The internal structure is divided into different functional modules to complete all or part of the functions described above.
- the device and method embodiments provided in the above embodiments belong to the same concept, and the specific implementation process is described in the method embodiments.
- FIG. 12 is a structural block diagram of a terminal 1200 provided by an embodiment of the present application.
- the terminal 1200 can be: a smart phone, a tablet computer, an MP3 player (Moving Picture Experts Group Audio Layer III, moving picture expert compression standard audio layer 3), MP4 (Moving Picture Experts Group Audio Layer IV, moving picture expert compressing standard audio Level 4) Player, laptop or desktop computer.
- the terminal 1200 may also be called user equipment, portable terminal, laptop terminal, desktop terminal and other names.
- the terminal 1200 includes a processor 1201 and a memory 1202.
- the processor 1201 may include one or more processing cores, such as a 4-core processor, an 8-core processor, and so on.
- the processor 1201 may adopt at least one hardware form among DSP (Digital Signal Processing), FPGA (Field-Programmable Gate Array), and PLA (Programmable Logic Array, Programmable Logic Array). achieve.
- the processor 1201 may also include a main processor and a coprocessor.
- the main processor is a processor used to process data in the awake state, also called a CPU (Central Processing Unit, central processing unit); the coprocessor is A low-power processor used to process data in the standby state.
- the processor 1201 may be integrated with a GPU (Graphics Processing Unit, image processor), and the GPU is used to render and draw content that needs to be displayed on the display screen.
- the processor 1201 may further include an AI (Artificial Intelligence) processor, and the AI processor is used to process computing operations related to machine learning.
- AI Artificial Intelligence
- the memory 1202 may include one or more computer-readable storage media, which may be non-transitory.
- the memory 1202 may also include high-speed random access memory and non-volatile memory, such as one or more magnetic disk storage devices and flash memory storage devices.
- the non-transitory computer-readable storage medium in the memory 1202 is used to store at least one instruction, and the at least one instruction is used to be executed by the processor 1201 to implement the video reduction provided by the method embodiment of the present application. Noise method.
- the terminal 1200 may optionally further include: a peripheral device interface 1203 and at least one peripheral device.
- the processor 1201, the memory 1202, and the peripheral device interface 1203 may be connected by a bus or a signal line.
- Each peripheral device can be connected to the peripheral device interface 1203 through a bus, a signal line, or a circuit board.
- the peripheral device includes: at least one of a radio frequency circuit 1204, a display screen 1205, a camera component 1206, an audio circuit 1207, a positioning component 1208, and a power supply 1209.
- the peripheral device interface 1203 may be used to connect at least one peripheral device related to I/O (Input/Output) to the processor 1201 and the memory 1202.
- the processor 1201, the memory 1202, and the peripheral device interface 1203 are integrated on the same chip or circuit board; in some other embodiments, any one of the processor 1201, the memory 1202, and the peripheral device interface 1203 or The two can be implemented on a separate chip or circuit board, which is not limited in this embodiment.
- the radio frequency circuit 1204 is used for receiving and transmitting RF (Radio Frequency, radio frequency) signals, also called electromagnetic signals.
- the radio frequency circuit 1204 communicates with a communication network and other communication devices through electromagnetic signals.
- the radio frequency circuit 1204 converts electrical signals into electromagnetic signals for transmission, or converts received electromagnetic signals into electrical signals.
- the radio frequency circuit 1204 includes: an antenna system, an RF transceiver, one or more amplifiers, a tuner, an oscillator, a digital signal processor, a codec chipset, a user identity module card, and so on.
- the radio frequency circuit 1204 can communicate with other terminals through at least one wireless communication protocol.
- the wireless communication protocol includes, but is not limited to: metropolitan area networks, various generations of mobile communication networks (2G, 3G, 4G, and 5G), wireless local area networks, and/or WiFi (Wireless Fidelity, wireless fidelity) networks.
- the radio frequency circuit 1204 may also include a circuit related to NFC (Near Field Communication), which is not limited in this application.
- the display screen 1205 is used to display a UI (User Interface, user interface).
- the UI can include graphics, text, icons, videos, and any combination thereof.
- the display screen 1205 also has the ability to collect touch signals on or above the surface of the display screen 1205.
- the touch signal may be input to the processor 1201 as a control signal for processing.
- the display screen 1205 may also be used to provide virtual buttons and/or virtual keyboards, also called soft buttons and/or soft keyboards.
- the display screen 1205 may be one display screen 1205, which is provided with the front panel of the terminal 1200; in other embodiments, there may be at least two display screens 1205, which are respectively arranged on different surfaces of the terminal 1200 or in a folded design; In still other embodiments, the display screen 1205 may be a flexible display screen, which is arranged on the curved surface or the folding surface of the terminal 1200. Furthermore, the display screen 1205 can also be set as a non-rectangular irregular pattern, that is, a special-shaped screen.
- the display screen 1205 may be made of materials such as LCD (Liquid Crystal Display) and OLED (Organic Light-Emitting Diode).
- the camera assembly 1206 is used to capture images or videos.
- the camera assembly 1206 includes a front camera and a rear camera.
- the front camera is set on the front panel of the terminal, and the rear camera is set on the back of the terminal.
- the camera assembly 1206 may also include a flash.
- the flash can be a single-color flash or a dual-color flash. Dual color temperature flash refers to a combination of warm light flash and cold light flash, which can be used for light compensation under different color temperatures.
- the audio circuit 1207 may include a microphone and a speaker.
- the microphone is used to collect sound waves of the user and the environment, and convert the sound waves into electrical signals and input them to the processor 1201 for processing, or input to the radio frequency circuit 1204 to implement voice communication. For the purpose of stereo collection or noise reduction, there may be multiple microphones, which are respectively set in different parts of the terminal 1200.
- the microphone can also be an array microphone or an omnidirectional collection microphone.
- the speaker is used to convert the electrical signal from the processor 1201 or the radio frequency circuit 1204 into sound waves.
- the speaker can be a traditional thin-film speaker or a piezoelectric ceramic speaker.
- the speaker When the speaker is a piezoelectric ceramic speaker, it can not only convert the electrical signal into human audible sound waves, but also convert the electrical signal into human inaudible sound waves for distance measurement and other purposes.
- the audio circuit 1207 may also include a headphone jack.
- the positioning component 1208 is used to locate the current geographic location of the terminal 1200 to implement navigation or LBS (Location Based Service, location-based service).
- the positioning component 1208 may be a positioning component based on the GPS (Global Positioning System, Global Positioning System) of the United States, the Beidou system of China, the Granus system of Russia, or the Galileo system of the European Union.
- the power supply 1209 is used to supply power to various components in the terminal 1200.
- the power source 1209 may be alternating current, direct current, disposable batteries, or rechargeable batteries.
- the rechargeable battery may support wired charging or wireless charging.
- the rechargeable battery can also be used to support fast charging technology.
- the terminal 1200 further includes one or more sensors 1210.
- the one or more sensors 1210 include, but are not limited to: an acceleration sensor 1211, a gyroscope sensor 1212, a pressure sensor 1213, a fingerprint sensor 1214, an optical sensor 1215, and a proximity sensor 1216.
- the acceleration sensor 1211 can detect the magnitude of acceleration on the three coordinate axes of the coordinate system established by the terminal 1200.
- the acceleration sensor 1211 can be used to detect the components of gravitational acceleration on three coordinate axes.
- the processor 1201 may control the display screen 1205 to display the user interface in a horizontal view or a vertical view according to the gravity acceleration signal collected by the acceleration sensor 1211.
- the acceleration sensor 1211 may also be used for the collection of game or user motion data.
- the gyroscope sensor 1212 can detect the body direction and rotation angle of the terminal 1200, and the gyroscope sensor 1212 can cooperate with the acceleration sensor 1211 to collect the user's 3D actions on the terminal 1200. Based on the data collected by the gyroscope sensor 1212, the processor 1201 can implement the following functions: motion sensing (such as changing the UI according to the user's tilt operation), image stabilization during shooting, game control, and inertial navigation.
- the pressure sensor 1213 may be disposed on the side frame of the terminal 1200 and/or the lower layer of the display screen 1205.
- the processor 1201 performs left and right hand recognition or quick operation according to the holding signal collected by the pressure sensor 1213.
- the processor 1201 can control the operability controls on the UI interface according to the pressure operation of the user on the display screen 1205.
- the operability control includes at least one of a button control, a scroll bar control, an icon control, and a menu control.
- the fingerprint sensor 1214 is used to collect the user's fingerprint.
- the processor 1201 identifies the user's identity according to the fingerprint collected by the fingerprint sensor 1214, or the fingerprint sensor 1214 identifies the user's identity according to the collected fingerprint. When it is recognized that the user's identity is a trusted identity, the processor 1201 authorizes the user to perform related sensitive operations, including unlocking the screen, viewing encrypted information, downloading software, paying, and changing settings.
- the fingerprint sensor 1214 may be provided on the front, back or side of the terminal 1200. When a physical button or a manufacturer logo is provided on the terminal 1200, the fingerprint sensor 1214 can be integrated with the physical button or the manufacturer logo.
- the optical sensor 1215 is used to collect the ambient light intensity.
- the processor 1201 may control the display brightness of the display screen 1205 according to the ambient light intensity collected by the optical sensor 1215. Specifically, when the ambient light intensity is high, the display brightness of the display screen 1205 is increased; when the ambient light intensity is low, the display brightness of the display screen 1205 is decreased.
- the processor 1201 may also dynamically adjust the shooting parameters of the camera assembly 1206 according to the ambient light intensity collected by the optical sensor 1215.
- the proximity sensor 1216 also called a distance sensor, is usually arranged on the front panel of the terminal 1200.
- the proximity sensor 1216 is used to collect the distance between the user and the front of the terminal 1200.
- the processor 1201 controls the display screen 1205 to switch from the on-screen state to the off-screen state; when the proximity sensor 1216 detects When the distance between the user and the front surface of the terminal 1200 gradually increases, the processor 1201 controls the display screen 1205 to switch from the rest screen state to the bright screen state.
- FIG. 12 does not constitute a limitation on the terminal 1200, and may include more or less components than those shown in the figure, or combine certain components, or adopt different component arrangements.
- the embodiment of the present application also provides a computer-readable storage medium that stores computer-readable instructions, and when the computer-readable instructions are executed by a processor, the processor executes the steps of the video noise reduction method.
- the steps of the video noise reduction method may be the steps in the video noise reduction method of each of the foregoing embodiments.
- Non-volatile memory may include read only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory.
- Volatile memory may include random access memory (RAM) or external cache memory.
- RAM is available in many forms, such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate SDRAM (DDRSDRAM), enhanced SDRAM (ESDRAM), synchronous chain Channel (Synchlink) DRAM (SLDRAM), memory bus (Rambus) direct RAM (RDRAM), direct memory bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM), etc.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Image Processing (AREA)
- Picture Signal Circuits (AREA)
Abstract
Description
Claims (15)
- 一种视频降噪方法,其特征在于,由终端执行,所述方法包括:对待处理视频中的目标图像的像素点进行空域滤波,得到第一图像;所述空域滤波用于消除所述目标图像的像素点之间的依赖关系;根据所述第一图像和第一降噪图像之间的帧差,并行对所述目标图像的像素点进行时域滤波,得到第二图像,所述第一降噪图像为所述目标图像的上一帧图像对应的已经过降噪处理的图像;根据所述目标图像的像素点在所述第一降噪图像中对应的第二增益系数,预测所述第二图像的像素点在第二降噪图像中对应的第一增益系数;及根据所述第一增益系数,对所述第一图像和所述第二图像进行融合,得到所述目标图像对应的已经过降噪处理的所述第二降噪图像。
- 根据权利要求1所述的方法,其特征在于,所述对待处理视频中的目标图像的像素点进行空域滤波,包括:针对所述待处理视频中的目标图像的全部像素点,获取每个所述像素点的邻域像素点的初始像素值;及根据所述邻域像素点的初始像素值,对所述像素点进行空域滤波。
- 根据权利要求1所述的方法,其特征在于,所述对待处理视频中的目标图像的像素点进行空域滤波,包括:调用图形处理器的图像处理接口;通过所述图像处理接口并行获取待处理视频中的目标图像的像素点;及通过所述图像处理接口对并行获取的像素点进行滤波。
- 根据权利要求1所述的方法,其特征在于,所述根据所述第一图像和第一降噪图像之间的帧差,并行对所述目标图像的像素点进行时域滤波,得到第二图像,包括:并行获取所述目标图像的每个像素点;对于所述目标图像的任一像素点,根据所述像素点在所述第一降噪图像中对应的第一方差、所述第一图像和第一降噪图像之间的帧差以及方差偏置 系数,确定所述像素点的第二方差;根据所述第二方差、所述像素点对应的第一增益偏置系数以及运动补偿系数,确定所述像素点对应的第一增益系数;根据所述第一增益系数、所述像素点的初始像素值以及所述像素点在所述第一降噪图像中对应的降噪像素值,确定所述像素点时域滤波后的第一像素值;及根据所述目标图像的每个像素点时域滤波后的第一像素值,得到第二图像。
- 根据权利要求4所述的方法,其特征在于,所述根据所述第二方差、所述像素点对应的第一增益偏置系数以及运动补偿系数,确定所述像素点对应的第一增益系数之前,所述方法还包括:根据所述帧差,确定所述运动补偿系数。
- 根据权利要求4所述的方法,其特征在于,所述根据所述第二方差、所述像素点对应的第一增益偏置系数以及运动补偿系数,确定所述像素点对应的第一增益系数之前,所述方法还包括:获取所述像素点在所述第一降噪图像中对应的第二增益系数以及第二增益偏置系数;及根据所述第二增益系数和所述第二增益偏置系数,确定所述像素点对应的第一增益偏置系数。
- 根据权利要求4所述的方法,其特征在于,所述根据所述第一增益系数,对所述第一图像和所述第二图像进行融合,得到所述目标图像对应的已经过降噪处理的第二降噪图像,包括:对于所述第二图像的任一像素点,将所述像素点对应的第一增益系数与预设数值之间的差值和所述像素点的第一像素值的乘积作为第一融合值;将所述像素点对应的第一增益系数和所述像素点的第二像素值的乘积作为第二融合值,所述第二像素值为所述像素点在经过所述空域滤波后的像素值;及对所述第一融合值和所述第二融合值求和,得到所述像素点对应的降噪像素值。
- 根据权利要求1所述的方法,其特征在于,所述空域滤波和时域滤波分别对像素点的亮度分量进行处理。
- 一种视频降噪装置,其特征在于,所述装置包括:空域滤波模块,用于对待处理视频中的目标图像的像素点进行空域滤波,得到第一图像;所述空域滤波用于消除所述目标图像的像素点之间的依赖关系;时域滤波模块,用于根据所述第一图像和第一降噪图像之间的帧差,并行对所述目标图像的像素点进行时域滤波,得到第二图像,所述第一降噪图像为所述目标图像的上一帧图像对应的已经过降噪处理的图像;及融合模块,用于根据所述目标图像的像素点在所述第一降噪图像中对应的第二增益系数,预测所述第二图像的像素点在第二降噪图像中对应的第一增益系数;及根据所述第一增益系数,对所述第一图像和所述第二图像进行融合,得到所述目标图像对应的已经过降噪处理的所述第二降噪图像。
- 根据权利要求9所述的装置,其特征在于,所述空域滤波模块,还用于对于所述待处理视频中的目标图像的全部像素点,获取每个所述像素点的邻域像素点的初始像素值;及根据所述邻域像素点的初始像素值,对所述像素点进行空域滤波。
- 根据权利要求10所述的装置,其特征在于,所述装置还包括:接口调用模块,用于调用图形处理器的图像处理接口;及并行获取模块,用于通过所述图像处理接口并行获取待处理视频中的目标图像的像素点;及通过所述图像处理接口对并行获取的像素点进行滤波。
- 根据权利要求9所述的装置,其特征在于,所述时域滤波模块,还用于并行获取所述目标图像的每个像素点;对于所述目标图像的任一像素点,根据所述像素点在所述第一降噪图像中对应的第一方差、所述第一图像和第一降噪图像之间的帧差以及方差偏置系数,确定所述像素点的第二方差;根 据所述第二方差、所述像素点对应的第一增益偏置系数以及运动补偿系数,确定所述像素点对应的第一增益系数;根据所述第一增益系数、所述像素点的初始像素值以及所述像素点在所述第一降噪图像中对应的降噪像素值,确定所述像素点时域滤波后的第一像素值;及根据所述目标图像的每个像素点时域滤波后的第一像素值,得到第二图像。
- 根据权利要求9所述的装置,其特征在于,所述时域滤波模块,还用于对于所述第二图像的任一像素点,将所述像素点对应的第一增益系数与预设数值之间的差值和所述像素点的第一像素值的乘积作为第一融合值;将所述像素点对应的第一增益系数和所述像素点的第二像素值的乘积作为第二融合值,所述第二像素值为所述像素点在经过所述空域滤波后的像素值;及对所述第一融合值和所述第二融合值求和,得到所述像素点对应的降噪像素值。
- 一种终端,包括存储器和处理器,所述存储器中存储有计算机可读指令,所述计算机可读指令被所述处理器执行时,使得所述处理器执行如权利要求1至8中任一项所述的方法的步骤。
- 一种存储有计算机可读指令的非易失性存储介质,所述计算机可读指令被一个或多个处理器执行时,使得一个或多个处理器执行如权利要求1至8中任一项所述的方法的步骤。
Priority Applications (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP20898101.9A EP3993396B1 (en) | 2019-12-12 | 2020-06-10 | Video denoising method, device, terminal, and storage medium |
| US17/572,604 US12094085B2 (en) | 2019-12-12 | 2022-01-10 | Video denoising method and apparatus, terminal, and storage medium |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201911288617.7A CN110933334B (zh) | 2019-12-12 | 2019-12-12 | 视频降噪方法、装置、终端及存储介质 |
| CN201911288617.7 | 2019-12-12 |
Related Child Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US17/572,604 Continuation US12094085B2 (en) | 2019-12-12 | 2022-01-10 | Video denoising method and apparatus, terminal, and storage medium |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2021114592A1 true WO2021114592A1 (zh) | 2021-06-17 |
Family
ID=69863644
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2020/095359 Ceased WO2021114592A1 (zh) | 2019-12-12 | 2020-06-10 | 视频降噪方法、装置、终端及存储介质 |
Country Status (4)
| Country | Link |
|---|---|
| US (1) | US12094085B2 (zh) |
| EP (1) | EP3993396B1 (zh) |
| CN (1) | CN110933334B (zh) |
| WO (1) | WO2021114592A1 (zh) |
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN115330628A (zh) * | 2022-08-18 | 2022-11-11 | 盐城众拓视觉创意有限公司 | 基于图像处理的视频逐帧去噪方法 |
| CN119831905A (zh) * | 2024-12-19 | 2025-04-15 | 广东汇天航空航天科技有限公司 | 图像校正方法及装置 |
Families Citing this family (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN110933334B (zh) * | 2019-12-12 | 2021-08-03 | 腾讯科技(深圳)有限公司 | 视频降噪方法、装置、终端及存储介质 |
| CN112740265A (zh) * | 2020-04-28 | 2021-04-30 | 深圳市大疆创新科技有限公司 | 红外图像降噪方法、装置及设备 |
| CN113362260B (zh) * | 2021-07-21 | 2025-01-07 | Oppo广东移动通信有限公司 | 图像优化方法及装置、存储介质及电子设备 |
| AU2023327801A1 (en) * | 2022-08-26 | 2025-03-20 | Cuvos Pty Ltd | A signal processing system |
| CN117876243A (zh) * | 2022-09-30 | 2024-04-12 | 深圳市中兴微电子技术有限公司 | 视频降噪方法、电子设备及计算机可读存储介质 |
| CN116228589B (zh) * | 2023-03-22 | 2023-08-29 | 新创碳谷集团有限公司 | 一种视觉检测相机噪声点消除方法、设备及存储介质 |
| CN116777775B (zh) * | 2023-06-14 | 2025-10-10 | 杭州微影软件有限公司 | 一种针对红外图像的图像处理方法、装置及电子设备 |
| CN118365554B (zh) * | 2024-06-19 | 2024-08-23 | 深圳市超像素智能科技有限公司 | 视频降噪方法、装置、电子设备及计算机可读存储介质 |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN102769722A (zh) * | 2012-07-20 | 2012-11-07 | 上海富瀚微电子有限公司 | 时域与空域结合的视频降噪装置及方法 |
| CN104735300A (zh) * | 2015-03-31 | 2015-06-24 | 中国科学院自动化研究所 | 基于权重滤波的视频去噪装置及方法 |
| US20180220129A1 (en) * | 2017-01-30 | 2018-08-02 | Intel Corporation | Motion, coding, and application aware temporal and spatial filtering for video pre-processing |
| CN109410124A (zh) * | 2016-12-27 | 2019-03-01 | 深圳开阳电子股份有限公司 | 一种视频图像的降噪方法及装置 |
| CN110933334A (zh) * | 2019-12-12 | 2020-03-27 | 腾讯科技(深圳)有限公司 | 视频降噪方法、装置、终端及存储介质 |
Family Cites Families (14)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7110455B2 (en) * | 2001-08-14 | 2006-09-19 | General Instrument Corporation | Noise reduction pre-processor for digital video using previously generated motion vectors and adaptive spatial filtering |
| KR20100036601A (ko) * | 2008-09-30 | 2010-04-08 | 엘지전자 주식회사 | 영상 잡음 제거 장치 및 방법 |
| US9451183B2 (en) * | 2009-03-02 | 2016-09-20 | Flir Systems, Inc. | Time spaced infrared image enhancement |
| US8638395B2 (en) * | 2009-06-05 | 2014-01-28 | Cisco Technology, Inc. | Consolidating prior temporally-matched frames in 3D-based video denoising |
| CN101964863B (zh) * | 2010-05-07 | 2012-10-24 | 镇江唐桥微电子有限公司 | 一种自适应的时空域视频图像降噪方法 |
| CN102497497B (zh) * | 2011-12-05 | 2013-07-31 | 四川九洲电器集团有限责任公司 | 一种图像去噪算法中阈值动态调整的方法 |
| US9497429B2 (en) * | 2013-03-15 | 2016-11-15 | Pelican Imaging Corporation | Extended color processing on pelican array cameras |
| CN103369209B (zh) * | 2013-07-31 | 2016-08-17 | 上海通途半导体科技有限公司 | 视频降噪装置及方法 |
| CN103533214B (zh) * | 2013-10-01 | 2017-03-22 | 中国人民解放军国防科学技术大学 | 一种基于卡尔曼滤波和双边滤波的视频实时去噪方法 |
| WO2016109585A1 (en) * | 2014-12-31 | 2016-07-07 | Flir Systems, Inc. | Image enhancement with fusion |
| CN107979712B (zh) | 2016-10-20 | 2020-02-21 | 上海富瀚微电子股份有限公司 | 一种视频降噪方法及装置 |
| CN108174056A (zh) | 2016-12-07 | 2018-06-15 | 南京理工大学 | 一种时空域联合的微光视频降噪方法 |
| CN109743473A (zh) | 2019-01-11 | 2019-05-10 | 珠海全志科技股份有限公司 | 视频图像3d降噪方法、计算机装置及计算机可读存储介质 |
| CN110493494B (zh) * | 2019-05-31 | 2021-02-26 | 杭州海康威视数字技术股份有限公司 | 图像融合装置及图像融合方法 |
-
2019
- 2019-12-12 CN CN201911288617.7A patent/CN110933334B/zh active Active
-
2020
- 2020-06-10 WO PCT/CN2020/095359 patent/WO2021114592A1/zh not_active Ceased
- 2020-06-10 EP EP20898101.9A patent/EP3993396B1/en active Active
-
2022
- 2022-01-10 US US17/572,604 patent/US12094085B2/en active Active
Patent Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN102769722A (zh) * | 2012-07-20 | 2012-11-07 | 上海富瀚微电子有限公司 | 时域与空域结合的视频降噪装置及方法 |
| CN104735300A (zh) * | 2015-03-31 | 2015-06-24 | 中国科学院自动化研究所 | 基于权重滤波的视频去噪装置及方法 |
| CN109410124A (zh) * | 2016-12-27 | 2019-03-01 | 深圳开阳电子股份有限公司 | 一种视频图像的降噪方法及装置 |
| US20180220129A1 (en) * | 2017-01-30 | 2018-08-02 | Intel Corporation | Motion, coding, and application aware temporal and spatial filtering for video pre-processing |
| CN110933334A (zh) * | 2019-12-12 | 2020-03-27 | 腾讯科技(深圳)有限公司 | 视频降噪方法、装置、终端及存储介质 |
Non-Patent Citations (1)
| Title |
|---|
| See also references of EP3993396A4 |
Cited By (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN115330628A (zh) * | 2022-08-18 | 2022-11-11 | 盐城众拓视觉创意有限公司 | 基于图像处理的视频逐帧去噪方法 |
| CN115330628B (zh) * | 2022-08-18 | 2023-09-12 | 盐城众拓视觉创意有限公司 | 基于图像处理的视频逐帧去噪方法 |
| CN119831905A (zh) * | 2024-12-19 | 2025-04-15 | 广东汇天航空航天科技有限公司 | 图像校正方法及装置 |
Also Published As
| Publication number | Publication date |
|---|---|
| US20220130023A1 (en) | 2022-04-28 |
| EP3993396C0 (en) | 2025-10-15 |
| CN110933334A (zh) | 2020-03-27 |
| EP3993396B1 (en) | 2025-10-15 |
| US12094085B2 (en) | 2024-09-17 |
| EP3993396A4 (en) | 2022-08-31 |
| EP3993396A1 (en) | 2022-05-04 |
| CN110933334B (zh) | 2021-08-03 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US12094085B2 (en) | Video denoising method and apparatus, terminal, and storage medium | |
| CN110502954B (zh) | 视频分析的方法和装置 | |
| KR102635373B1 (ko) | 이미지 처리 방법 및 장치, 단말 및 컴퓨터 판독 가능 저장 매체 | |
| WO2021008456A1 (zh) | 图像处理方法、装置、电子设备及存储介质 | |
| WO2020221012A1 (zh) | 图像特征点的运动信息确定方法、任务执行方法和设备 | |
| CN111754386B (zh) | 图像区域屏蔽方法、装置、设备及存储介质 | |
| CN110839174A (zh) | 图像处理的方法、装置、计算机设备以及存储介质 | |
| CN110189348B (zh) | 头像处理方法、装置、计算机设备及存储介质 | |
| JP2022550126A (ja) | レンズの動きを制御する方法及び装置並びコンピュータ装置及びプログラム | |
| CN108320263A (zh) | 一种图像处理的方法、装置及移动终端 | |
| WO2022033272A1 (zh) | 图像处理方法以及电子设备 | |
| CN111565309A (zh) | 显示设备及其畸变参数确定方法、装置、系统及存储介质 | |
| CN110248197A (zh) | 语音增强方法及装置 | |
| CN111050211B (zh) | 视频处理方法、装置及存储介质 | |
| CN109754439B (zh) | 标定方法、装置、电子设备及介质 | |
| CN110797042B (zh) | 音频处理方法、装置及存储介质 | |
| CN111860064B (zh) | 基于视频的目标检测方法、装置、设备及存储介质 | |
| CN113407774B (zh) | 封面确定方法、装置、计算机设备及存储介质 | |
| CN107730443B (zh) | 图像处理方法、装置及用户设备 | |
| CN114897915A (zh) | 图像分割方法、装置、电子设备和存储介质 | |
| CN111757146B (zh) | 视频拼接的方法、系统及存储介质 | |
| CN113420172A (zh) | 图片分享方法、装置、计算机设备及介质 | |
| CN113129221A (zh) | 图像处理方法、装置、设备及存储介质 | |
| CN114219746B (zh) | 超广角图像获取方法、装置、存储介质及电子设备 | |
| CN110660031B (zh) | 图像锐化方法及装置、存储介质 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 20898101 Country of ref document: EP Kind code of ref document: A1 |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 20898101.9 Country of ref document: EP |
|
| ENP | Entry into the national phase |
Ref document number: 2020898101 Country of ref document: EP Effective date: 20220127 |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| WWG | Wipo information: grant in national office |
Ref document number: 2020898101 Country of ref document: EP |


