WO2015157058A1 - Contrast based image fusion - Google Patents
Contrast based image fusion Download PDFInfo
- Publication number
- WO2015157058A1 WO2015157058A1 PCT/US2015/023831 US2015023831W WO2015157058A1 WO 2015157058 A1 WO2015157058 A1 WO 2015157058A1 US 2015023831 W US2015023831 W US 2015023831W WO 2015157058 A1 WO2015157058 A1 WO 2015157058A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- image
- images
- infrared
- visible
- pixels
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
- G06T5/50—Image enhancement or restoration using two or more images, e.g. averaging or subtraction
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/25—Fusion techniques
- G06F18/251—Fusion techniques of input or preprocessed data
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T3/00—Geometric image transformations in the plane of the image
- G06T3/40—Scaling of whole images or parts thereof, e.g. expanding or contracting
- G06T3/4053—Scaling of whole images or parts thereof, e.g. expanding or contracting based on super-resolution, i.e. the output image resolution being higher than the sensor resolution
- G06T3/4061—Scaling of whole images or parts thereof, e.g. expanding or contracting based on super-resolution, i.e. the output image resolution being higher than the sensor resolution by injecting details from different spectral ranges
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
- G06T5/70—Denoising; Smoothing
-
- G—PHYSICS
- G08—SIGNALLING
- G08B—SIGNALLING SYSTEMS, e.g. PERSONAL CALLING SYSTEMS; ORDER TELEGRAPHS; ALARM SYSTEMS
- G08B13/00—Burglar, theft or intruder alarms
- G08B13/18—Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength
- G08B13/181—Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength using active radiation detection systems
-
- G—PHYSICS
- G09—EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
- G09G—ARRANGEMENTS OR CIRCUITS FOR CONTROL OF INDICATING DEVICES USING STATIC MEANS TO PRESENT VARIABLE INFORMATION
- G09G5/00—Control arrangements or circuits for visual indicators common to cathode-ray tube indicators and other visual indicators
- G09G5/02—Control arrangements or circuits for visual indicators common to cathode-ray tube indicators and other visual indicators characterised by the way in which colour is displayed
- G09G5/026—Control of mixing and/or overlay of colours in general
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10048—Infrared image
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20212—Image combination
- G06T2207/20221—Image fusion; Image merging
-
- G—PHYSICS
- G09—EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
- G09G—ARRANGEMENTS OR CIRCUITS FOR CONTROL OF INDICATING DEVICES USING STATIC MEANS TO PRESENT VARIABLE INFORMATION
- G09G2320/00—Control of display operating conditions
- G09G2320/02—Improving the quality of display appearance
- G09G2320/0271—Adjustment of the gradation levels within the range of the gradation scale, e.g. by redistribution or clipping
-
- G—PHYSICS
- G09—EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
- G09G—ARRANGEMENTS OR CIRCUITS FOR CONTROL OF INDICATING DEVICES USING STATIC MEANS TO PRESENT VARIABLE INFORMATION
- G09G2320/00—Control of display operating conditions
- G09G2320/06—Adjustment of display parameters
- G09G2320/066—Adjustment of display parameters for control of contrast
-
- G—PHYSICS
- G09—EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
- G09G—ARRANGEMENTS OR CIRCUITS FOR CONTROL OF INDICATING DEVICES USING STATIC MEANS TO PRESENT VARIABLE INFORMATION
- G09G2340/00—Aspects of display data processing
- G09G2340/10—Mixing of images, i.e. displayed pixel being the result of an operation, e.g. adding, on the corresponding input pixels
-
- G—PHYSICS
- G09—EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
- G09G—ARRANGEMENTS OR CIRCUITS FOR CONTROL OF INDICATING DEVICES USING STATIC MEANS TO PRESENT VARIABLE INFORMATION
- G09G2340/00—Aspects of display data processing
- G09G2340/14—Solving problems related to the presentation of information to be displayed
-
- G—PHYSICS
- G09—EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
- G09G—ARRANGEMENTS OR CIRCUITS FOR CONTROL OF INDICATING DEVICES USING STATIC MEANS TO PRESENT VARIABLE INFORMATION
- G09G2360/00—Aspects of the architecture of display systems
- G09G2360/16—Calculation or use of calculated indices related to luminance levels in display data
Definitions
- the invention relates to imaging and more particularly to contrast-based image fusion.
- LACE thermal local area contrast enhancement
- Like techniques are applied in the visible light channel.
- LACE local area contrast enhancement
- Both of these local area contrast enhancement (LACE) techniques involved histogram preprocessor functions to add contrast for improved detail.
- noise rejection functions and algorithms were used to correct for nonuniformity related to temperature changes and shifts.
- gain correction algorithms provided uniformity for each pixel, whereas row noise reduction algorithms normalized the levels of the rows.
- cluster de-noise algorithms removed flashing out of a family of pixels in low light scenarios, whereas optical distortion correction was applied between the co-registered visible light images and the IR images using translation, rotation and magnification. Finally, focal actuated vergence algorithms were utilized to correct for parallax errors.
- Embodiments of the present disclosure provide a system and method that maximizes information content in an image fusion process that blends co-registered low light level images in the visible region of the electromagnetic spectrum with thermal infrared images, said infrared and visible images constituting two different image types.
- a fusion module detects which of the two image types has a greater quantity of structural information and increases a weight of the pixels in the image type detected to have the greater quantity of structural information.
- the present disclosure can also be viewed as providing methods of maximizing information content in an image fusion process by blending co-registered low light level visible images in a visible region of the electromagnetic spectrum with thermal infrared images, said infrared and visible images constituting two different image types.
- one embodiment of such a method can be broadly summarized by the following steps: detecting, in a fusion module, which of the two image types of the visible images and the infrared images has a greater quantity of structural information; and increasing a weight of pixels in the image type detected to have the greater quantity of structural information.
- Figure 1 is a block diagram illustrating the preprocessing of visible images and infrared images, followed by a technique that emphasizes detail in the images, in accordance with a first exemplary embodiment of the present disclosure
- Figure 2 is a block diagram showing the utilization of contrast detection in the low level visible channel and the infrared channel, in which the weight of pixels in a channel having increased contrast is increased, thus to emphasize images having the better structural information, in accordance with the first exemplary embodiment of the present disclosure;
- Figure 3 is a block diagram showing the utilization of the weighting technique described in Figure 2, combined with additional processing techniques to increase and emphasize detail in both low light and strong illumination situations, in accordance with the first exemplary embodiment of the present disclosure
- Figure 4 is an enhanced digital night vision goggle system simplified functional block diagram, in accordance with the first exemplary embodiment of the present disclosure
- Figure 5 is a video processing pipeline diagram for the enhanced digital night vision goggle system, in accordance with the first exemplary embodiment of the present disclosure
- Figure 6 is an illustration of histogram segmentation, in accordance with the first exemplary embodiment of the present disclosure.
- Figure 7 is an illustration of compression of dynamic range segments, in accordance with the first exemplary embodiment of the present disclosure.
- Figure 8 is an illustration of expansion of dynamic range segments, in accordance with the first exemplary embodiment of the present disclosure.
- Figure 9 is a histogram pre-processor with IR LACE example image - dark lab w/hot objections, in accordance with the first exemplary embodiment of the present disclosure
- Figure 10 is a histogram pre-processor w/IR LACE example image cold sky, in accordance with the first exemplary embodiment of the present disclosure
- Figure 11 is a Green gain function for the IR channel, in accordance with the first exemplary embodiment of the present disclosure.
- Figure 12 is an IR scale for the Green channel, in accordance with the first exemplary embodiment of the present disclosure.
- Figure 13 is a weighting function for the low light level channel, in accordance with the first exemplary embodiment of the present disclosure
- Figure 14 is a fusion example image - dark lab, in accordance with the first exemplary embodiment of the present disclosure.
- Figure 15 is a fusion example image - mixed illumination, in accordance with the first exemplary embodiment of the present disclosure.
- Figure 16 is a fusion example image - dark woods, in accordance with the first exemplary embodiment of the present disclosure.
- Figure 17 is a fusion example image - lighted woods, in accordance with the first exemplary embodiment of the present disclosure.
- a fusion algorithm fuses thermal images and visible light images by utilizing a blending function based on the contrast, or average deviation, in each of two channels, namely, a thermal image infrared channel and a low light visible image channel, and gives more weight to the channel with the most structural information.
- a contrast detector is utilized for each of the two channels, with pixels in each of the two channels being weighted in accordance with the output of the contrast detector to increase the weight of the pixels in the channel having the most contrast, and thus most structural information.
- the images built up in the aforementioned weighting system are made available from a Green plane, which in general produces a situational awareness greenish image of the scene such as would be acquired by night vision goggles.
- the image displayed is switched from the scene mode to the target mode when the raw infrared pixel levels are above a predetermined threshold level.
- the target mode is what is displayed.
- infrared pixels change from their normal value to orange or red, whereas low light level visible pixels are shown in the green with a level equal to the low light level, divided by two.
- the fusion algorithm selects the blended visible/IR image that emphasizes images having the better structural information.
- This blended visible/IR image is based on the average deviation in the Red and Blue planes as computed globally over the entire image, or can be computed over local sub-image regions.
- color is used to emphasize a target.
- the scene mode and target mode are the two video or image presentation modes and they are selected by determining whether detected infrared exceeds a predetermined infrared threshold.
- a predetermined infrared threshold when the infrared value for pixel exceeds a user defined threshold, that pixel is considered belonging to a target and the fuse multiplexer chooses the target mode instead of the scene mode.
- the target color mode switches on and colors the particular pixel somewhere between orange and red depending on how much low light is present. These colors never exist in the scene mode.
- the targets are made to pop out.
- the visible light is brought in as Green, with the target pixels being somewhere between orange and red.
- scene mode pixels are from the visible light channel, with any little infrared being presented as Blue-Green. As the raw infrared level increases these pixels become more and more yellow.
- the target mode is operative only when the raw infrared is above the predetermined threshold. Otherwise the scene mode is used, with the test being on a pixel by pixel basis.
- limiters ensure that the color palette that is chosen is realized.
- the limiter associated with the Red plane prevents pixels from becoming red or orange, as this is reserved for the target mode.
- the limiter associated with the Blue plane reduces the color swing across varying levels of light. The minimum in the Blue plane insures that in an area of strong infrared, the visible aspect still will be represented.
- the technique is best described by comparing images fused using the standard A+B fusion method and the new local contrast-based fusion method.
- the contrast fusion method two images are fused together and displayed in the Red/Green/Blue (RGB) color planes using the following scheme. If the raw IR does not exceed the Target Threshold, the Green plane displays a weighted combination of the pixels from the thermal camera and low light level camera to increase the weight of images that have a high structural content.
- the weightings which are based on structural content are a function of the average deviation computed within each image, either globally or locally, and are designed to add more weight to the image region with the most structural content, as defined by the average deviation.
- the Red plane displays a color enhanced thermal camera pixel and the Blue plane displays a bracketed or limited version of the low light level camera pixel from the Blue plane.
- the resulting detail rich image is the result of simply adding the thermal image pixel values with the low light level pixel values in the Green plane utilizing the above contrast enhancement algorithm involving detecting average deviation in the image.
- the finally rendered image is the result of the new contrast-based fusion method which provides significantly more detail by increasing the weight of either the infrared pixels or the visible light pixels for those images having the stronger structural content as measured by average deviation.
- a system for two color image fusion blending co-registered low light level images in the visible region of the electromagnetic spectrum with thermal infrared images maximizes the information content by detecting in which of the two image types, IR and visible, there is more structural information and increasing the weight of the pixels in the image type having the most structural information.
- situational awareness is increased by categorizing image information as "scene” or “target” and colorizing the target images to highlight target features when raw IR values are above a predetermined threshold.
- the system utilizes Red, Green and Blue (RGB) planes to convey different information such that for targets the Red plane is used to colorize regions when the raw IR exceeds the predetermined threshold.
- RGB Red, Green and Blue
- the Green plane provides improved situational awareness due to the above weighted blend of the two image types.
- FIG. 1 is a block diagram illustrating the preprocessing of visible images and infrared images, followed by a technique that emphasizes detail in the images, in accordance with a first exemplary embodiment of the present disclosure.
- a fusion enhancement system for use in enhanced night vision goggles takes light from a low light visual channel 10, and thermal images from an infrared channel 12 and pre- processes them as illustrated at 14 and 16, after which the subject fusion algorithm is applied as illustrated at 18 to display at 20 an enhanced image in which detail is increased to aid in situational awareness.
- FIG. 2 is a block diagram showing the utilization of contrast detection in the low level visible channel and the infrared channel, in which the weight of pixels in a channel having increased contrast is increased, thus to emphasize images having the better structural information, in accordance with the first exemplary embodiment of the present disclosure.
- Central to the enhancement of the fusion process described in Figure 1 is the enhancement of structural detail in the finally rendered image.
- visible light 10 and infrared light 12 in two separate channels are applied to respective contrast detectors 22 and 24 which detect the contrast in the images in each of these channels.
- the contrast is detected in one embodiment utilizing standard deviation techniques, with images having increased structural information as detected by the contrast detectors resulting in increased weight shown at 26 and 28, respectively for the two channels.
- pixels are multiplied by the increased weights in the visible and infrared channels and are summed at 30, at which point they are ultimately used to drive display 20 of Figure 1, after having been coupled to a fuse multiplexer 38 of Figure 3.
- the result of so doing is to provide increased weight to those channels having increased structural information.
- the result is the highlighting or enhancement in a combined image of the infrared and visible image channels so that what is presented is an image having increased sharpness and clarity.
- FIG 3 is a block diagram showing the utilization of the weighting technique described in Figure 2, combined with additional processing techniques to increase and emphasize detail in both low light and strong illumination situations, in accordance with the first exemplary embodiment of the present disclosure.
- the weights W IR and W vis here shown at 32 and 34 generated through the contrast detection system of Figure 2, are applied to a Green plane situational awareness module 36 which generates a blended visible/IR image based on structural information.
- the weighted pixels come from infrared channel 12 and low light visible channel 10, with the light in these channels having been preprocessed as illustrated at 14 and 16 by sophisticated preprocessing techniques to be described hereinafter.
- Limiters 42 and 44 having as inputs the preprocessed raw IR from channel 12 and the preprocessed raw low light visible light from channel 10 process the infrared and visible light and couple them respectively to a Red plane 50 and a Blue plane 52.
- limiter 42 limits the Red plane pixels to the level associated with the Green plane pixels divided by two.
- Blue plane 52 the visible light is the greater of LLL/4 or LLL - IR, limited to the Green plane pixels divided by two.
- the limiters ensure that the color palette chosen is realized.
- the limiter on the red channel prevents pixels from becoming red or orange, as this is reserved for the target mode to be described hereinafter.
- the minimum for the blue channel ensures that in an area of strong IR, the visible aspect is still represented.
- Blue plane is that the corresponding scene images 58 are coupled to display 20 through fuse multiplexer 38 if the raw intensity of the raw IR is below a predetermined threshold.
- the blended visible/IR scene image 58 based on structural information is used to provide increased clarity and sharpness.
- fuse multiplexer 38 deselects the scene image 58 and selects a target image 62 which is the result of a color change operation provided by a color change module 60.
- the inputs to color change module 60 are the preprocessed infrared light from infrared channel 12 and the pre-processed visible light from low light visible channel 10.
- the scene image 58 is that which is coupled by the fuse multiplexer 38 to display 20.
- fuse multiplexer 38 selects the target image 62 to be coupled to display 20.
- the result for low IR is that the blended visible/IR image based on structural information is coupled to display 20, whereas in situations where the raw infrared intensity is greater than the threshold set by threshold detector 54, it is the color changed image which is coupled by fuse multiplexer 38 to display 20.
- the scene or target modes are determined by whether the IR exceeds the predetermined threshold.
- the RGB from both the scene and the target images are coupled to the fuse multiplexer which controls which of the two modes are used based on raw IR levels.
- the multiplexer is used to select between scene and target modes based on the predetermined threshold. It will be noted that when the raw IR value exceeds a user configured threshold, that pixel is considered a target, and the fuse multiplexer chooses the target mode instead of the scene mode. When the raw IR is above the threshold, the color switches making that pixel emphasized as being somewhere between orange and red. These colors never exist in the scene mode.
- FIG. 4 is an enhanced digital night vision goggle system simplified functional block diagram, in accordance with the first exemplary embodiment of the present disclosure.
- the present invention is used in an Enhanced digital night vision goggle system, or Enhanced Night Vision Goggle (digital) ENVG (digital) system, that is helmet mounted, battery powered, and uses a monocular Night Vision Goggle (NVG) that collects energy from the scene in multiple bands, converts this energy to electrical signals which are digitally processed, fused and presented in real time as full motion video on a display for viewing by the user.
- the enhanced digital night vision goggle is intended to provide man portable vision capability suitable to perform
- the night vision goggle system is housed in an image system housing module 70, which includes a thermal objective lens assembly 72 and visible light lens assembly 74 coupled respectively to a thermal sensor module 76 and a low light level sensor module 78.
- the outputs of modules 76 and 78 are applied to system electronics 80 that includes image fusion, power conversion, digital zoom electronics, and an LED control.
- Modules 76, 78 and 80 constitute core electronics 81 for the subject system.
- the output of system electronics 80 is coupled to a micro display 82 which in one embodiment is a 1280 x 1024 pixel display having a 24-bit RGB capability which is also gamma corrected.
- Micro display 82 is viewed by an eyepiece lens assembly 84 such that the result of the image fusion is visible by the naked eye 86.
- System electronics 80 also includes a land warrior interface 88 and user controls 90, as illustrated.
- the enhanced digital night vision goggle forms imagery from scene energy in the following bands at a minimum:
- VisNIR Visible-Near Infrared 600-900nm also referred to as Low Light Level (LLL) herein. Primarily using reflected light energy from night sky illumination or artificial sources.
- LLL Low Light Level
- TIR Thermal or Thermal Infrared
- the system provides a unity magnification, wide Field-Of View (FOV), high resolution, continuous, full motion, and video image of the scene.
- the system permits the operator to select viewing either of the two bands or the fused product of the two bands.
- the system is compatible with 820-860nm laser illuminators and pointers (e.g., AN/PEQ-2 and AN/PAQ-4).
- Figure 5 is a video processing pipeline diagram for the enhanced digital night vision goggle system, in accordance with the first exemplary embodiment of the present disclosure.
- the enhanced digital night vision goggle video processing pipeline is depicted in Figure 5, where it can be seen that there is a low light image pipeline and an IR image pipeline.
- the outputs of these outlines are coupled to a combined image focal actuated vergence parallax correction module 73.
- incoming light is corrected for fixed pattern noise and constant pixel non-uniformity. Further correction is provided by offset and gain sensor mapping module 75. Also involved is an automatic gain module 77.
- the outputs of modules 75 and 77 are coupled to a cluster de-noise module 79, with modules 75, 77 and 79, constituting a noise preprocessor 83. It is noted that the cluster de-noise module 79 removes flashing out of the family of pixels in the low light image channel.
- the output of noise preprocessor 83 is applied to LL LACE module 82 which adds an amount of contrast using histogram techniques. This concentrates on low light level local area contrast enhancement.
- the output of LL LACE module 82 is coupled to one input of module 73 used in combined image generation.
- a fine map module 84 is used to correct for non- uniformity related to temperature change or shifts.
- the output of module 84 is coupled to a gain module 86, which corrects for non-uniformity for each pixel.
- Module 86 is coupled to row noise rejection module 88, which is used for row noise reduction by normalizing the levels of the rows.
- This unit is coupled to a histogram preprocessor 90 which does a piecewise linear stretch to spread out most frequent intensity values within segments, and provides more uniform distribution of intensity across the histogram.
- the output of histogram preprocessor 90 is coupled to IR LACE module 92 which enhances IR local area contrast by pulling out detail from shadows and highlights.
- the output of IR LACE module 92 is applied to optical distortion correction module 94, in which optical distortion correction between the low light channel and the infrared channel is removed by translation, rotation and magnification.
- the output of optical distortion correction module 94 is applied to the other input of module 73.
- the combined image which is the output of module 73 is applied to a fusion module 96 so as to provide preprocessed low light image information and preprocessed IR image information.
- the NUC FPN 75 processing function includes the offset and gain sensor mapping.
- the offset map corrects for fixed pattern noise including read noise, noise associated with background and dark current.
- the offset map will also correct for constant pixel non-uniformities.
- the gain map corrects for pixel response non- uniformity as well as non-uniformity related to the lens. Through the gain operation one allows bit depth to grow to 16bits.
- the AutoGain module 77 controls the high voltage power supply on the LLL sensor.
- the major input to this control is light level which is determined through the mean of the LLL image.
- the high voltage duty cycle is then adjusted with a PID loop so that the best possible response can be achieved.
- the AutoGain module also includes sensor protection from saturation or damage.
- the Cluster De-Noise module 79 addresses flashing out of family pixels that are prevalent at very low light levels. A rank order filter is used to determine outlying pixels and they are then filtered out. This is a proven technique used in the visible camera industry to reduce noise.
- LL LACE module 82 is primarily adding an amount of contrast in the image.
- This operation operates on both a global and local level to enhance the contrast in the image. This operation is performed using a filter kernel and a global histogram stretch. The image is reduced to 8 bits during this operation.
- the fine map module 84 processing involves a thermal calibration done using the system shutter.
- the goal of the fine map is to correct for non-uniformity related to temperature change or system drift. Given the extreme sensitivity of the thermal sensor this map is required to be adjusted as the system operates.
- the Gain Map processing shown at 86 is a thermal calibration done during system build.
- the gain operation corrects for response non-uniformity for each pixel.
- the bit depth of the image goes from 14 bit to 16 bit in this step.
- the gain map also has the ability to substitute and replace unresponsive pixels.
- the RNR (Row Noise Reduction) module 88 processing is an algorithm that normalizes the level of the rows locally.
- the Histogram Preprocessor module 90 is essentially a Piecewise Linear Stretch of the histogram of the incoming image.
- the histogram of the incoming image is divided into eight (8) segments, and each segment is stretched using a linear function to spread out the more densely populated sections of the histogram over the entire segment. This essentially provides a more uniform distribution of intensity across the histogram, in preparation for contrast enhancement.
- IR LACE module 92 is primarily for adjusting the amount of contrast in the image.
- the block operates on both a global and local level to enhance the contrast, pulling out detail from the shadows and highlights. This operation is performed using a filter kernel and a global histogram stretch. The image is reduced to 8 bits during this operation.
- the optical distortion correction module 94 processing corrects for mechanical and optical differences between the LLL subsystem and the IR subsystem. Corrections include translation, rotation, magnification and distortion mismatch.
- the FAV (Focal Actuated Vergence) module 73 processing contains a technique employed to correct for parallax errors between the LLL and IR subsystems. This algorithm adjusts the vertical offset on the thermal image based on the focus point of the LLL lens. This provides proper alignment for any part of the image that is in focus.
- the Fusion algorithm in fusion module 96 provides Target Cueing (TC) and Situational Awareness (SA) under all weather and illumination conditions utilizing numerically efficient methods chosen to provide low-SWAP and low latency.
- TC Target Cueing
- SA Situational Awareness
- the Fusion algorithm utilizes metrics provided by the low light level (LLL) and thermal pipelines to adapt to dynamic scenes.
- LLL LACE utilizes several Signal to Noise Ratio (SNR) metrics to determine how much to rely on the LLL and the thermal sensors for the situation awareness channel.
- SNR Signal to Noise Ratio
- An algorithm used for the histogram pre-processing of the IR channel before local area contrast enhancement (LACE) is applied.
- the purpose of this preprocessing is both to analyze the distribution of the pixel values in the image for use in contrast enhancement and also to scale the distribution to reduce problems associated with large dynamic range distributions.
- Such conditions can be due to very hot objects or regions with large differences in average temperature, such as a warm forest in the foreground with a cold sky in the background. These conditions are known as bimodal distributions due the separation of the histogram for such images into two distinct Gaussian-like distributions, often with a large gap between them.
- Proper adjustment of the dynamic range of the image in such cases permits effective processing by subsequent algorithms to properly enhance the fine detail in the different regions and prevents over-saturating the values in bright regions and washing out darker regions.
- a piecewise histogram scaling method compresses regions of the distribution that are sparsely populated, such as the gaps between the distributions in a bimodal case, and expands regions that are densely populated. Additionally, the method is designed to restrict the expansion of dynamic range values so as to minimize the amplification of noise and creation of artifacts, which is a weakness of the standard histogram-based global contrast enhancement techniques such as plateau equalization.
- the technique also has the advantage that it does not exhibit large variations in illumination as the brightness of the scene varies and it does not create significant flickering in video sequences compared to other histogram based techniques. All the same, it retains the overall simplicity and a low burden of processing requirements commonly associated with such global methods.
- the algorithm involves four major steps in the processing:
- the histogram of the image is calculated using 4k bins over the entire 16- bit range of possible values in the image.
- This histogram actually comes from the previous frame in the hardware implementation due to the low-latency requirements of the goggle.
- the requirement for number of bins comes from the fact that one needs to accurately characterize the distribution for the scaling procedure, without overly burdening the processing requirements.
- a properly imaged region will have a dynamic range on the order of 512 values, so the quantization of the values by 16 results in roughly 32 bins over such regions, which provides sufficient resolution of the various dynamic -range regions to allow for accurate segmentation and scaling of the values.
- the distribution is segmented into K (equals 32 for the implementation) points which determine the illumination levels at discrete values of the population.
- the segmentation of the dynamic range distribution is based upon finding the discrete values in the dynamic range that correspond to certain predetermined values of the pixel population. To this end, one first creates the cumulative distribution function from the histogram in the usual way. Next, one determines the
- Equation 1 the k th segmentation value if ⁇ ' is determined as the value for which th f"
- the threshold value is defined by:
- the distribution of the threshold factors k is a configuration table for the current implementation, F 1 ' , and was and was originally chosen to be a linearly spaced set of values from 0 to 1. However, it was found that a logarithmically distributed set of values worked better.
- Figure 6 is an illustration of histogram segmentation, in accordance with the first exemplary embodiment of the present disclosure. This sort of distribution is typical for cases where a hot object such as a light is in the image, and in fact the long tail is typically several orders of magnitude larger than the distribution of the "hump" which contains most of the interesting image detail. Simple clipping of such an image will not solve the problem due to the fact that the hot object may occupy a significant proportion of the image.
- the next step in the process is to determine the scaling and offsets for each interval. This is accomplished by mapping the dynamic range segments identified by the segmentation procedure onto a "canonical grid" which represents some ideal distribution. This is actually a form of histogram specification, although the specification used was somewhat heuristic and was chosen based on observation of the distributions of a variety of well-formed IR images which did not have dynamic range issues. Another issue that influences the specification was the desire to not over enhance the values of the dynamic range at lower temperatures, which typically results in amplification of undesirable spatial noise.
- Figure 7 is an illustration of compression of dynamic range segments, in accordance with the first exemplary embodiment of the present disclosure.
- Figure 8 is an illustration of expansion of dynamic range segments, in accordance with the first exemplary embodiment of the present disclosure.
- the most important contribution of the algorithm is the compression of large dynamical range segments that have little or no image information, which is illustrated in Figure 7.
- the interval AL c k is the difference between the levels Z and L t k _ 1 which were shown in Figure 6.
- the k th segment is larger than the canonical segment AL c k .
- the scaling of this interval based on the ratio of the 2 intervals is also determined, as well as described in more detail below. First, what is explained is how to handle the opposite case for which the original segment is smaller than the canonical segment as illustrated in Figure 8.
- ⁇ is the value of the final scaled interval, is the value of the un-scaled, original interval found by the segmentation procedure, and is the value of the "canonical" or pre-specified segment interval which is a configuration value stored in a look-up table for the hardware implementation
- a ⁇ is the final scaled pixel value, is the original un-scaled value, '' is the value of the segment point that is just less than the original pixel value, and k is the new offset for that segment.
- the scaling factor is given by the ratio of the scaled to un-scaled interval for that segment.
- the scaled offsets are simply the accumulation of the values of the interval sizes for all of the segments below that segment:
- the value of the first offset is arbitrary, and is chosen to be the value of the lower clip value of the original distribution for simplicity. Another obvious choice would be to set it equal to zero.
- Figure 9 is a histogram pre-processor with IR LACE example image - dark lab w/hot objections, in accordance with the first exemplary embodiment of the present disclosure.
- Figure 10 is a histogram pre-processor w/IR LACE example image cold sky, in accordance with the first exemplary embodiment of the present disclosure.
- the first example shows how the algorithm solves the problem of very hot objects in an image causing fade in the detail for the rest of the image, thereby masking other objects such as a person.
- the original contrast enhancement algorithm did not handle this case well as shown by the image collected from the goggle on the left of Figure 9, while the right hand side shows the new pre-processing algorithm implementation.
- the subject system produces two image modalities (visible and thermal) and has three color planes (Red, Green, and Blue) to display the information.
- the fused image color scheme was designed to provide specific information regarding the scene based on colorization. Constraints were placed on the image color scheme, including:
- the primary situation awareness information should be shades of Green or hues close to Green.
- the Green channel was designated to contain the primary situation awareness information.
- Soldiers trained in the use of photon- intensified low light cameras are used to seeing a Green image. Therefore, subject goggle should provide familiar images that are primarily Green, but enhanced with Red to indicate targets, yellow to indicate higher levels of thermal, and Blues to indicate higher levels of visible light.
- the Green channel is composed of a blend of visible and thermal imagery.
- the Red channel contains the thermal image with each pixel limited to one half of the Green pixel level, providing shades of Green to yellow.
- the Blue channel shows the greater of excess visible over the thermal or one quarter of visible with each pixel limited to one half of the Green pixel level, thus providing shades of Blue.
- the reason Blue is an excess level is to prevent the colors from all being white when thermal and visible levels are similar.
- This coloring scheme provides the following perceptual information:
- the pixels are colored Red to indicate a "hot" target.
- the pixel values are shades of Red in low levels of visible light and turn orange-yellow as the visible light level increases.
- the Green channel is mainly composed of enhanced thermal combined with a fraction of the visible light image (minimize the speckle while still showing strong lasers).
- Green is composed of a blend of the visible image mixed with a fraction of the thermal image. • The Red channel contains the thermal image and each pixel level is bounded by G/2. Yellow in the fused image indicates areas where the thermal image is stronger than the visible image.
- the Blue channel is the excess of visible over thermal and bounded by ⁇ LL/4 ... G/2>. Blue or purple in the fused image indicates areas where the visible image is much stronger than the thermal image.
- the average deviation was calculated for both the pre- LACE visible and pre-LACE thermal images. In order to calculate the average deviation for the thermal image with dimensions 640x480 and not using any divides, only 640x410 pixels were used and result was shifted by 18 (divided by 512*512):
- Exhibit 1 Matial? code for average deviation of visible Image
- Nr 1024; % Number of rows in LL image
- mLL bit shif (mLL, nBitShiftLL) ; % Mean LL RNR
- sLL bitshift (sLL, nBitShiftLL ⁇ ; % ADEV LL NUC
- Nrl 36; % Start row in IR RNR image
- mIR bitshift iiaIR, nBitShiftNR ⁇ ; % Mean IR RNR % Compute the average deviation of IR RNR
- SIR SIR ⁇ abs (IRii, j ) -mIR; ;
- SIR bit shift (sIR, nBitShiftIR) ; % ADEV IR P.NR Contrast-based fusion uses the estimates of the average deviations to blend visible and thermal images into the Green channel. The complete fusion algorithm using these estimates is shown in the table in Exhibit 3.
- the raw thermal pixels exceed a user-controlled threshold, those pixels are colored a shade of Red.
- the Red channel is set to the thermal image intensity, while the Green channel is set to half the intensity of the visible image in which targets get an orange hue in strong visible light.
- Images with thermal pixel values less than the target threshold provide situation awareness, with the primary information in the Green channel.
- the Red channel provides yellow hues where the thermal image is high, while the Blue channel provides Blues and purples in regions where visible intensities exceed thermal intensities.
- the function for the Green channel enhances the thermal a little and diminishes visible a little when visible contrast is low.
- the following gain function was used to weight the thermal image based on the contrast of the two images:
- WgLUT ⁇ ( ⁇ (64./(1:256),.3 ⁇ , ⁇ ,25);
- Figure 11 is a Green gain function for the IR channel, in accordance with the first exemplary embodiment of the present disclosure, and shows a plot of this function.
- the thermal weighting function for the Green blend is:
- WgLUT min (max (6 ./ (1:256) , .3) , 1 . 25 ) ;
- Wg ggLU (iG ⁇ ;
- Figure 12 is an IR scale for the Green channel, in accordance with the first exemplary embodiment of the present disclosure.
- the resulting IR weighting for various levels of visible contrast are shown in Figure 12.
- the visible image is gradually reduced in intensity when the light level is very low.
- the purpose of this weighting function is to reduce image speckle where there is little information, but still allow strong laser pointers or spot lights to be clearly seen.
- the weighting function for the visible image in the Green channel is a function of the mean value of the image:
- This weighting function is shown in Figure 13, which depicts a weighting function for the low light level channel, in accordance with the first exemplary embodiment of the present
- the first example in Figure 14 which is a fusion example image - dark lab, in accordance with the first exemplary embodiment of the present disclosure, is a very low light level image taken in a darkroom with the following statistics:
- the original goggle algorithm on the left has little thermal, has lots of visible speckle, and a strong light under the door.
- the new algorithm on the right provides a good blend of enhanced IR, reduces the visible speckle, but preserves the strong light under the door.
- Figure 15 is a fusion example image - mixed illumination, in accordance with the first exemplary embodiment of the present disclosure, shows the strong thermal being blended in to the image even when there is relatively strong visible.
- the bright visible lights are still evident around the porch light and the headlights, but the details in the thermal background are also brought out in shades of yellow and Green.
- the image statistics are:
- Figure 16 is a fusion example image - dark woods, in accordance with the first exemplary embodiment of the present disclosure, was taken in the woods and the visible light was artificially lowered.
- the scene is primarily thermal, but there is a strong laser light in the middle left. Contrast fusion preserves the Green channel for situation awareness and uses shades of yellow to indicate strong thermal in the image.
- the statistics for the images are:
- Figure 17 is a fusion example image - lighted woods, in accordance with the first exemplary embodiment of the present disclosure.
- the image is primarily visible, but the thermal is fused in at a lower level.
- the image statistics are:
- the new contrast-based fusion algorithm uses a blending algorithm that gives more weight to image types (thermal or low light level) that have the most structural content.
- the new algorithm maximizes the scene information content, providing more detail in low contrast areas (inside buildings, caves, or under tunnels), or in washed out areas (in the presence of strong lights, in smoke, or in fog).
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Computer Hardware Design (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Data Mining & Analysis (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Evolutionary Computation (AREA)
- Evolutionary Biology (AREA)
- General Engineering & Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Bioinformatics & Computational Biology (AREA)
- Artificial Intelligence (AREA)
- Life Sciences & Earth Sciences (AREA)
- Image Processing (AREA)
Abstract
A system for two color image fusion blending co-registered low light level images in the visible region of the electromagnetic spectrum with thermal infrared images maximizes the information content of the scene by detecting in which of the two image types, IR and visible, there is more structural information and increasing the weight of the pixels in the image type having the most structural information. Additionally, situational awareness is increased by categorizing image information as "scene" or "target" and colorizing the target images to highlight target features when raw IR values are above a predetermined threshold. The system utilizes Red, Green and Blue (RGB) planes to convey different information such that for targets the Red plane is used to colorize regions when raw IR exceeds the predetermined threshold. For scene images, the Green plane provides improved situational awareness due to the above weighted blend of the two image types.
Description
CONTRAST BASED IMAGE FUSION
STATEMENT OF GOVERNMENT INTEREST
This invention was made with United States Government support under
Contract No. W91CRB-07-C-0098 awarded by the Department of the Army. The United States Government has certain rights in this invention.
RELATED APPLICATIONS
This Application claims rights under 35 USC § 119(e) from US Application
Serial No. 61/976,166 filed April 7, 2014, the contents of which are incorporated herein by reference.
FIELD OF INVENTION
The invention relates to imaging and more particularly to contrast-based image fusion.
BACKGROUND OF THE INVENTION
Heretofore considerable work has been done on night vision devices which collect energy from scenes in multiple bands and convert the energy to electrical signals which are digitally processed, fused and presented in real time as full motion video on a display for viewing by the user. One of these systems is a so-called two color system in which infrared images and visible light images are fused together in the final image. These prior multiband digitally processed fusion techniques are intended to increase image detail. A need still exists, however, for a way to blend co- registered low visible light level images with thermal infrared (IR) images in a way that maximizes the scene detail, especially in very low light conditions, in scenes with very bright lights, and in smoke or fog conditions.
Specifically, in the past, infrared light and visible light have been fused together in a two color image fusion process that blends co-registered low light level images. In these systems increased contrast enhancement is available through a
thermal local area contrast enhancement (LACE) algorithm, and is especially useful in low light and in well illuminated scenarios. Like techniques are applied in the visible light channel. Both of these local area contrast enhancement (LACE) techniques involved histogram preprocessor functions to add contrast for improved detail. Moreover, a number of noise rejection functions and algorithms were used to correct for nonuniformity related to temperature changes and shifts. Additionally, gain correction algorithms provided uniformity for each pixel, whereas row noise reduction algorithms normalized the levels of the rows. Further, cluster de-noise algorithms removed flashing out of a family of pixels in low light scenarios, whereas optical distortion correction was applied between the co-registered visible light images and the IR images using translation, rotation and magnification. Finally, focal actuated vergence algorithms were utilized to correct for parallax errors.
All of the above techniques were used to remove noise and other artifacts prior to being passed to a fusion algorithm to provide a co-registered fused image composed of infrared and visible light images.
However, there is a need for further improvement of the fused image to be able to emphasize structural content information in the final fused image, thus to further improve image detail.
SUMMARY OF THE INVENTION
Embodiments of the present disclosure provide a system and method that maximizes information content in an image fusion process that blends co-registered low light level images in the visible region of the electromagnetic spectrum with thermal infrared images, said infrared and visible images constituting two different image types. Briefly described, in architecture, one embodiment of the system, among others, can be implemented as follows. A fusion module detects which of the two image types has a greater quantity of structural information and increases a weight of the pixels in the image type detected to have the greater quantity of structural information.
The present disclosure can also be viewed as providing methods of maximizing information content in an image fusion process by blending co-registered
low light level visible images in a visible region of the electromagnetic spectrum with thermal infrared images, said infrared and visible images constituting two different image types. In this regard, one embodiment of such a method, among others, can be broadly summarized by the following steps: detecting, in a fusion module, which of the two image types of the visible images and the infrared images has a greater quantity of structural information; and increasing a weight of pixels in the image type detected to have the greater quantity of structural information.
Other systems, methods, features, and advantages of the present disclosure will be or become apparent to one with skill in the art upon examination of the following drawings and detailed description. It is intended that all such additional systems, methods, features, and advantages be included within this description, be within the scope of the present disclosure, and be protected by the accompanying claims.
BRIEF DESCRIPTION OF THE DRAWINGS
These and other features of the subject invention will be better understood in connection with the Detailed Description in conjunction with Drawings, of which:
Figure 1 is a block diagram illustrating the preprocessing of visible images and infrared images, followed by a technique that emphasizes detail in the images, in accordance with a first exemplary embodiment of the present disclosure;
Figure 2 is a block diagram showing the utilization of contrast detection in the low level visible channel and the infrared channel, in which the weight of pixels in a channel having increased contrast is increased, thus to emphasize images having the better structural information, in accordance with the first exemplary embodiment of the present disclosure;
Figure 3 is a block diagram showing the utilization of the weighting technique described in Figure 2, combined with additional processing techniques to increase and emphasize detail in both low light and strong illumination situations, in accordance with the first exemplary embodiment of the present disclosure;
Figure 4 is an enhanced digital night vision goggle system simplified functional block diagram, in accordance with the first exemplary embodiment of the present disclosure;
Figure 5 is a video processing pipeline diagram for the enhanced digital night vision goggle system, in accordance with the first exemplary embodiment of the present disclosure;
Figure 6 is an illustration of histogram segmentation, in accordance with the first exemplary embodiment of the present disclosure;
Figure 7 is an illustration of compression of dynamic range segments, in accordance with the first exemplary embodiment of the present disclosure;
Figure 8 is an illustration of expansion of dynamic range segments, in accordance with the first exemplary embodiment of the present disclosure;
Figure 9 is a histogram pre-processor with IR LACE example image - dark lab w/hot objections, in accordance with the first exemplary embodiment of the present disclosure;
Figure 10 is a histogram pre-processor w/IR LACE example image cold sky, in accordance with the first exemplary embodiment of the present disclosure;
Figure 11 is a Green gain function for the IR channel, in accordance with the first exemplary embodiment of the present disclosure;
Figure 12 is an IR scale for the Green channel, in accordance with the first exemplary embodiment of the present disclosure;
Figure 13 is a weighting function for the low light level channel, in accordance with the first exemplary embodiment of the present disclosure;
Figure 14 is a fusion example image - dark lab, in accordance with the first exemplary embodiment of the present disclosure;
Figure 15 is a fusion example image - mixed illumination, in accordance with the first exemplary embodiment of the present disclosure;
Figure 16 is a fusion example image - dark woods, in accordance with the first exemplary embodiment of the present disclosure; and
Figure 17 is a fusion example image - lighted woods, in accordance with the first exemplary embodiment of the present disclosure.
DETAILED DESCRIPTION
A fusion algorithm fuses thermal images and visible light images by utilizing a blending function based on the contrast, or average deviation, in each of two channels, namely, a thermal image infrared channel and a low light visible image channel, and gives more weight to the channel with the most structural information. In one embodiment, a contrast detector is utilized for each of the two channels, with pixels in each of the two channels being weighted in accordance with the output of the contrast detector to increase the weight of the pixels in the channel having the most contrast, and thus most structural information.
In one embodiment, the images built up in the aforementioned weighting system are made available from a Green plane, which in general produces a situational awareness greenish image of the scene such as would be acquired by night vision goggles.
While the above describes a scene mode, in one embodiment there is a color enhancing target mode for emphasizing targets when the terrain is highly illuminated. In order to accommodate the color enhancing target mode, the image displayed is switched from the scene mode to the target mode when the raw infrared pixel levels are above a predetermined threshold level.
Assuming for the moment, that the raw IR is below the threshold, then what is presented and displayed is the scene mode in which blended visible/IR emphasizes pixels in the channel having the more structural information.
When the raw infrared signal is above a user defined threshold, then the target mode is what is displayed. Here due to color change algorithms infrared pixels change from their normal value to orange or red, whereas low light level visible pixels are shown in the green with a level equal to the low light level, divided by two. These color changes resulted in an image that constitutes the target image, with a fuse multiplexer switching to the target mode from the scene mode when the raw infrared is above the aforementioned threshold.
The result is that when raw infrared is below a predetermined threshold, what is presented, is the contrast enhanced scene which blends visible and infrared based on structural information.
However, when the raw infrared exceeds a predetermined threshold, a fuse multiplexer displays the target image, which is the aforementioned color changed image. What happens in this case is that target image pops out due to the coloration.
According to one embodiment, when in the scene mode, the fusion algorithm selects the blended visible/IR image that emphasizes images having the better structural information. This blended visible/IR image is based on the average deviation in the Red and Blue planes as computed globally over the entire image, or can be computed over local sub-image regions.
More particularly, in addition to enhancement due to the detection of structural information in two channels, in one embodiment, color is used to emphasize a target. The scene mode and target mode are the two video or image presentation modes and they are selected by determining whether detected infrared exceeds a predetermined infrared threshold. In the illustrated embodiment, when the infrared value for pixel exceeds a user defined threshold, that pixel is considered belonging to a target and the fuse multiplexer chooses the target mode instead of the scene mode. When the raw infrared is above this predetermined threshold, the target color mode switches on and colors the particular pixel somewhere between orange and red depending on how much low light is present. These colors never exist in the scene mode. As a result, in high illumination situations when the detected raw infrared exceeds the predetermined threshold and the system switches to the target mode, the targets are made to pop out.
Note that in the target mode the visible light is brought in as Green, with the target pixels being somewhere between orange and red. On the other hand, when in the scene mode, scene mode pixels are from the visible light channel, with any little infrared being presented as Blue-Green. As the raw infrared level increases these pixels become more and more yellow.
As will be appreciated, the target mode is operative only when the raw infrared is above the predetermined threshold. Otherwise the scene mode is used, with the test being on a pixel by pixel basis.
For other types of enhancement, and referring now to the scene mode in which there is a Red plane, a Blue plane and a Green plane, limiters ensure that the
color palette that is chosen is realized. In the scene mode, the limiter associated with the Red plane prevents pixels from becoming red or orange, as this is reserved for the target mode. The limiter associated with the Blue plane reduces the color swing across varying levels of light. The minimum in the Blue plane insures that in an area of strong infrared, the visible aspect still will be represented.
Thus, in the scene mode and as to the limiters, for IR image pixels that do not exceed the thermal target threshold, their intensity is reduced to G/2. In the case of visible light pixels, when the IR pixel intensities do not exceed the thermal target threshold, their intensity is, LLL/4 or LLL-IR, whichever is larger, limited to G/2, where G is the intensity of the Green plane pixels and LLL refers to the intensity of the low light level Blue plane pixels.
The technique is best described by comparing images fused using the standard A+B fusion method and the new local contrast-based fusion method. In the contrast fusion method, two images are fused together and displayed in the Red/Green/Blue (RGB) color planes using the following scheme. If the raw IR does not exceed the Target Threshold, the Green plane displays a weighted combination of the pixels from the thermal camera and low light level camera to increase the weight of images that have a high structural content. The weightings which are based on structural content are a function of the average deviation computed within each image, either globally or locally, and are designed to add more weight to the image region with the most structural content, as defined by the average deviation.
If the raw IR exceeds a predefined threshold (Target Threshold), the Red plane displays a color enhanced thermal camera pixel and the Blue plane displays a bracketed or limited version of the low light level camera pixel from the Blue plane.
The resulting detail rich image is the result of simply adding the thermal image pixel values with the low light level pixel values in the Green plane utilizing the above contrast enhancement algorithm involving detecting average deviation in the image.
In addition to the detail enhancement associated with the Green plane, because there is a loss of detail in regions of strong light and in dark areas, generating the Green plane by fusing clipped and weighted versions of the thermal pixels in the
Red plane with clipped and weighted versions of the low light level pixels in the Blue plane, the effect of strong light or darkness is eliminated in the final rendered fused image.
Regardless of the other enhancements described above, the finally rendered image is the result of the new contrast-based fusion method which provides significantly more detail by increasing the weight of either the infrared pixels or the visible light pixels for those images having the stronger structural content as measured by average deviation.
In summary, a system for two color image fusion blending co-registered low light level images in the visible region of the electromagnetic spectrum with thermal infrared images maximizes the information content by detecting in which of the two image types, IR and visible, there is more structural information and increasing the weight of the pixels in the image type having the most structural information.
Additionally, situational awareness is increased by categorizing image information as "scene" or "target" and colorizing the target images to highlight target features when raw IR values are above a predetermined threshold. The system utilizes Red, Green and Blue (RGB) planes to convey different information such that for targets the Red plane is used to colorize regions when the raw IR exceeds the predetermined threshold. For scene images, the Green plane provides improved situational awareness due to the above weighted blend of the two image types.
Figure 1 is a block diagram illustrating the preprocessing of visible images and infrared images, followed by a technique that emphasizes detail in the images, in accordance with a first exemplary embodiment of the present disclosure. A fusion enhancement system for use in enhanced night vision goggles takes light from a low light visual channel 10, and thermal images from an infrared channel 12 and pre- processes them as illustrated at 14 and 16, after which the subject fusion algorithm is applied as illustrated at 18 to display at 20 an enhanced image in which detail is increased to aid in situational awareness.
Figure 2 is a block diagram showing the utilization of contrast detection in the low level visible channel and the infrared channel, in which the weight of pixels in a channel having increased contrast is increased, thus to emphasize images having the better structural information, in accordance with the first exemplary embodiment of
the present disclosure. Central to the enhancement of the fusion process described in Figure 1 is the enhancement of structural detail in the finally rendered image. Here visible light 10 and infrared light 12 in two separate channels are applied to respective contrast detectors 22 and 24 which detect the contrast in the images in each of these channels. The contrast is detected in one embodiment utilizing standard deviation techniques, with images having increased structural information as detected by the contrast detectors resulting in increased weight shown at 26 and 28, respectively for the two channels. In this way pixels are multiplied by the increased weights in the visible and infrared channels and are summed at 30, at which point they are ultimately used to drive display 20 of Figure 1, after having been coupled to a fuse multiplexer 38 of Figure 3.
The result of so doing is to provide increased weight to those channels having increased structural information. The result is the highlighting or enhancement in a combined image of the infrared and visible image channels so that what is presented is an image having increased sharpness and clarity.
Figure 3 is a block diagram showing the utilization of the weighting technique described in Figure 2, combined with additional processing techniques to increase and emphasize detail in both low light and strong illumination situations, in accordance with the first exemplary embodiment of the present disclosure. The weights W IR and W vis, here shown at 32 and 34 generated through the contrast detection system of Figure 2, are applied to a Green plane situational awareness module 36 which generates a blended visible/IR image based on structural information. As will be seen, the weighted pixels come from infrared channel 12 and low light visible channel 10, with the light in these channels having been preprocessed as illustrated at 14 and 16 by sophisticated preprocessing techniques to be described hereinafter.
Limiters 42 and 44, having as inputs the preprocessed raw IR from channel 12 and the preprocessed raw low light visible light from channel 10 process the infrared and visible light and couple them respectively to a Red plane 50 and a Blue plane 52. In the case of the Red plane, limiter 42 limits the Red plane pixels to the level associated with the Green plane pixels divided by two. For Blue plane 52 the visible light is the greater of LLL/4 or LLL - IR, limited to the Green plane pixels divided by two. It will be noted that the
limiters ensure that the color palette chosen is realized. The limiter on the red channel prevents pixels from becoming red or orange, as this is reserved for the target mode to be described hereinafter. The minimum for the blue channel ensures that in an area of strong IR, the visible aspect is still represented.
The result of having generated the Red plane, the Green plane and the
Blue plane is that the corresponding scene images 58 are coupled to display 20 through fuse multiplexer 38 if the raw intensity of the raw IR is below a predetermined threshold. Thus, in the case where there is low IR, the blended visible/IR scene image 58 based on structural information is used to provide increased clarity and sharpness.
As will be appreciated, what is coupled to display 20 is a fusion first and foremost of the weighted versions of the visible and infrared channels. It will be appreciated that the weighted blended visible/IR pixels in the Green plane may be used by themselves to drive display 20.
However, for situations in which the intensity of the raw infrared energy is above a predetermined threshold 54, fuse multiplexer 38 deselects the scene image 58 and selects a target image 62 which is the result of a color change operation provided by a color change module 60. It will be noted that the inputs to color change module 60 are the preprocessed infrared light from infrared channel 12 and the pre-processed visible light from low light visible channel 10.
The color change engendered by the color change module is such that the infrared color is the color red, whereas the green is LLL/2. In this case B = 0, with the target 62 defined to be the output of the color change module.
In operation, when the raw infrared light intensities are less than the threshold set by threshold detector 54, the scene image 58 is that which is coupled by the fuse multiplexer 38 to display 20. On the other hand, if the intensity of the raw infrared intensity is greater than the threshold set by threshold detector 54, then fuse multiplexer 38 selects the target image 62 to be coupled to display 20.
The result for low IR is that the blended visible/IR image based on structural information is coupled to display 20, whereas in situations where the
raw infrared intensity is greater than the threshold set by threshold detector 54, it is the color changed image which is coupled by fuse multiplexer 38 to display 20.
Thus, for low light situations, one has increased sharpness based on the weighting of the infrared or visible light depending on which channel has more structural information, whereas for highly lit scenarios, that which is presented by display 20 is a colorized version which highlights or pops up targets within the field of view of the cameras.
As described above, the scene or target modes are determined by whether the IR exceeds the predetermined threshold. The RGB from both the scene and the target images are coupled to the fuse multiplexer which controls which of the two modes are used based on raw IR levels. Thus, the multiplexer is used to select between scene and target modes based on the predetermined threshold. It will be noted that when the raw IR value exceeds a user configured threshold, that pixel is considered a target, and the fuse multiplexer chooses the target mode instead of the scene mode. When the raw IR is above the threshold, the color switches making that pixel emphasized as being somewhere between orange and red. These colors never exist in the scene mode.
In the scene mode if there is little IR which will be given a blue green tint. As IR increases, the IR pixels will become more and more yellow.
In summary, structural information is detected in each of the two channels and pixels having the better structural information are given greater weights. Secondly, if the infrared channel value is above a predetermined threshold, then color change algorithms enhance the colors to promote target awareness. Finally, limiters are provided to limit the Red plane and Blue plane components in low light situations and to prevent coloration in case the IR is below the predetermined threshold. The limiter on the red channel prevents pixels from becoming red or orange, as this is reserved for the target mode. The minimum for the blue channel ensures that in an area of strong IR, the visible aspect is still represented.
Figure 4 is an enhanced digital night vision goggle system simplified functional block diagram, in accordance with the first exemplary embodiment of the present disclosure. In one embodiment, the present invention is used in an Enhanced digital night vision goggle system, or Enhanced Night Vision Goggle (digital) ENVG (digital) system, that is helmet mounted, battery powered, and uses a monocular Night Vision Goggle (NVG) that collects energy from the scene in multiple bands, converts this energy to electrical signals which are digitally processed, fused and presented in real time as full motion video on a display for viewing by the user. The enhanced digital night vision goggle is intended to provide man portable vision capability suitable to perform
dismounted and mounted military missions in all terrains, under all light and visibility conditions.
As can be seen in Figure 4, the night vision goggle system is housed in an image system housing module 70, which includes a thermal objective lens assembly 72 and visible light lens assembly 74 coupled respectively to a thermal sensor module 76 and a low light level sensor module 78. The outputs of modules 76 and 78 are applied to system electronics 80 that includes image fusion, power conversion, digital zoom electronics, and an LED control.
Modules 76, 78 and 80 constitute core electronics 81 for the subject system. The output of system electronics 80 is coupled to a micro display 82 which in one embodiment is a 1280 x 1024 pixel display having a 24-bit RGB capability which is also gamma corrected. Micro display 82 is viewed by an eyepiece lens assembly 84 such that the result of the image fusion is visible by the naked eye 86. System electronics 80 also includes a land warrior interface 88 and user controls 90, as illustrated.
The enhanced digital night vision goggle forms imagery from scene energy in the following bands at a minimum:
• Visible-Near Infrared (VisNIR) 600-900nm also referred to as Low Light Level (LLL) herein. Primarily using reflected light energy from night sky illumination or artificial sources.
• Long Wave Infrared (LWIR) [8-12μηι] also referred to as Thermal or Thermal Infrared (TIR) herein. It primarily uses emitted infrared energy of scene objects.
The system provides a unity magnification, wide Field-Of View (FOV), high resolution, continuous, full motion, and video image of the scene. The system permits the operator to select viewing either of the two bands or the fused product of the two bands. The system is compatible with 820-860nm laser illuminators and pointers (e.g., AN/PEQ-2 and AN/PAQ-4).
Video Processing Pipeline
Figure 5 is a video processing pipeline diagram for the enhanced digital night vision goggle system, in accordance with the first exemplary embodiment of the present disclosure.
The enhanced digital night vision goggle video processing pipeline is depicted in Figure 5, where it can be seen that there is a low light image pipeline and an IR image pipeline. The outputs of these outlines are coupled to a combined image focal actuated vergence parallax correction module 73.
As to the low light image pipeline, incoming light is corrected for fixed pattern noise and constant pixel non-uniformity. Further correction is provided by offset and gain sensor mapping module 75. Also involved is an automatic gain module 77. The outputs of modules 75 and 77 are coupled to a cluster de-noise module 79, with modules 75, 77 and 79, constituting a noise preprocessor 83. It is noted that the cluster de-noise module 79 removes flashing out of the family of pixels in the low light image channel. The output of noise preprocessor 83 is applied to LL LACE module 82 which adds an amount of contrast using histogram techniques. This concentrates on low light level local area contrast enhancement. The output of LL LACE module 82 is coupled to one input of module 73 used in combined image generation.
As to the IR channel, a fine map module 84 is used to correct for non- uniformity related to temperature change or shifts. The output of module 84 is coupled to a gain module 86, which corrects for non-uniformity for each pixel.
Module 86 is coupled to row noise rejection module 88, which is used for row noise reduction by normalizing the levels of the rows. This unit is coupled to a histogram
preprocessor 90 which does a piecewise linear stretch to spread out most frequent intensity values within segments, and provides more uniform distribution of intensity across the histogram. The output of histogram preprocessor 90 is coupled to IR LACE module 92 which enhances IR local area contrast by pulling out detail from shadows and highlights. The output of IR LACE module 92 is applied to optical distortion correction module 94, in which optical distortion correction between the low light channel and the infrared channel is removed by translation, rotation and magnification. The output of optical distortion correction module 94 is applied to the other input of module 73. The combined image which is the output of module 73 is applied to a fusion module 96 so as to provide preprocessed low light image information and preprocessed IR image information.
More particularly, the functionality of the processing pipelines is now described in more detail: Low Light Pipeline: NUC FPN
The NUC FPN 75 processing function includes the offset and gain sensor mapping. The offset map corrects for fixed pattern noise including read noise, noise associated with background and dark current. The offset map will also correct for constant pixel non-uniformities. The gain map corrects for pixel response non- uniformity as well as non-uniformity related to the lens. Through the gain operation one allows bit depth to grow to 16bits.
Low Light Pipeline: NUC AutoGain
The AutoGain module 77 controls the high voltage power supply on the LLL sensor. The major input to this control is light level which is determined through the mean of the LLL image. The high voltage duty cycle is then adjusted with a PID loop so that the best possible response can be achieved. The AutoGain module also includes sensor protection from saturation or damage. Low Light Pipeline: Cluster De-Noise
The Cluster De-Noise module 79 addresses flashing out of family pixels that are prevalent at very low light levels. A rank order filter is used to determine
outlying pixels and they are then filtered out. This is a proven technique used in the visible camera industry to reduce noise.
Low Light Pipeline: LL LACE
LL LACE module 82 is primarily adding an amount of contrast in the image.
The block
operates on both a global and local level to enhance the contrast in the image. This operation is performed using a filter kernel and a global histogram stretch. The image is reduced to 8 bits during this operation.
Thermal Pipeline: Fine Map
In the IR image pipeline, the fine map module 84 processing involves a thermal calibration done using the system shutter. The goal of the fine map is to correct for non-uniformity related to temperature change or system drift. Given the extreme sensitivity of the thermal sensor this map is required to be adjusted as the system operates.
Thermal Pipeline: Gain Map
The Gain Map processing shown at 86 is a thermal calibration done during system build. The gain operation corrects for response non-uniformity for each pixel. The bit depth of the image goes from 14 bit to 16 bit in this step. The gain map also has the ability to substitute and replace unresponsive pixels.
Thermal Pipeline: RNR
The RNR (Row Noise Reduction) module 88 processing is an algorithm that normalizes the level of the rows locally.
Thermal Pipeline: Histogram Preprocessor
The Histogram Preprocessor module 90 is essentially a Piecewise Linear Stretch of the histogram of the incoming image. In this approach, the histogram of the incoming image is divided into eight (8) segments, and each segment is stretched using a linear function to spread out the more densely populated sections
of the histogram over the entire segment. This essentially provides a more uniform distribution of intensity across the histogram, in preparation for contrast enhancement. Thermal Pipeline: IR LACE
IR LACE module 92 is primarily for adjusting the amount of contrast in the image. The block operates on both a global and local level to enhance the contrast, pulling out detail from the shadows and highlights. This operation is performed using a filter kernel and a global histogram stretch. The image is reduced to 8 bits during this operation.
Thermal Pipeline: ODC+
The optical distortion correction module 94 processing corrects for mechanical and optical differences between the LLL subsystem and the IR subsystem. Corrections include translation, rotation, magnification and distortion mismatch.
Combined Pipeline: FAV
The FAV (Focal Actuated Vergence) module 73 processing contains a technique employed to correct for parallax errors between the LLL and IR subsystems. This algorithm adjusts the vertical offset on the thermal image based on the focus point of the LLL lens. This provides proper alignment for any part of the image that is in focus. Combined Pipeline: Fusion
Within the enhanced digital night vision goggle video processing pipeline, the Fusion algorithm in fusion module 96 provides Target Cueing (TC) and Situational Awareness (SA) under all weather and illumination conditions utilizing numerically efficient methods chosen to provide low-SWAP and low latency.
The Fusion algorithm utilizes metrics provided by the low light level (LLL) and thermal pipelines to adapt to dynamic scenes. The contrast detector located in LLL LACE utilizes several Signal to Noise Ratio (SNR) metrics to determine how
much to rely on the LLL and the thermal sensors for the situation awareness channel. In relatively high light conditions, almost the entire situational awareness image is mapped from the LLL sensor. As lighting conditions deteriorate, the LLL SNR decreases, the LLL sensor contribution is decreased and thermal sensor data fills the gap maintaining high situational awareness resolution and capability.
What is now discussed is the algorithm utilized for the histogram
preprocessor utilized in the IR channel.
Histogram Pre-Processor (Piecewise Dynamic Range Reduction)
An algorithm used for the histogram pre-processing of the IR channel before local area contrast enhancement (LACE) is applied. The purpose of this preprocessing is both to analyze the distribution of the pixel values in the image for use in contrast enhancement and also to scale the distribution to reduce problems associated with large dynamic range distributions. Such conditions can be due to very hot objects or regions with large differences in average temperature, such as a warm forest in the foreground with a cold sky in the background. These conditions are known as bimodal distributions due the separation of the histogram for such images into two distinct Gaussian-like distributions, often with a large gap between them. Proper adjustment of the dynamic range of the image in such cases permits effective processing by subsequent algorithms to properly enhance the fine detail in the different regions and prevents over-saturating the values in bright regions and washing out darker regions.
To this end a piecewise histogram scaling method compresses regions of the distribution that are sparsely populated, such as the gaps between the distributions in a bimodal case, and expands regions that are densely populated. Additionally, the method is designed to restrict the expansion of dynamic range values so as to minimize the amplification of noise and creation of artifacts, which is a weakness of the standard histogram-based global contrast enhancement techniques such as plateau equalization. The technique also has the advantage that it does not exhibit large variations in illumination as the brightness of the scene varies and it does not create significant flickering in video sequences compared to other histogram based
techniques. All the same, it retains the overall simplicity and a low burden of processing requirements commonly associated with such global methods.
Algorithm Details
The algorithm involves four major steps in the processing:
1. Histogram Development - First, the histogram of the image is
calculated using 4k bins over the entire 16-bit range of possible values in the image. This histogram actually comes from the previous frame in the hardware implementation due to the low-latency requirements of the goggle.
2. Segmentation of the Histogram - Next, the distribution is segmented into K points which determine the illumination levels at discrete values of the population. The distribution of these points gives an indication of the dynamic range regions which are either densely or sparsely populated, thereby indicating where one must compress the dynamic range and where one must expand the dynamic range.
3. Dynamic Range Specification - Then, the lengths of each segment are used to determine whether compression or expansion of the dynamic range is used for that interval. Based on that, the value of the length of each segment for the final image distribution is determined by a simple histogram specification procedure.
From this, the offset values and scaling coefficient for each interval can be calculated.
4. Piecewise Scaling - Finally, the values of each of the pixels in the image are adjusted by a scaling procedure which uses the offset values and scaling coefficients.
Histogram Development
First, the histogram of the image is calculated using 4k bins over the entire 16- bit range of possible values in the image. This histogram actually comes from the previous frame in the hardware implementation due to the low-latency requirements of the goggle. The requirement for number of bins comes from the fact that one needs to accurately characterize the distribution for the scaling procedure, without overly burdening the processing requirements. Generally speaking, a properly imaged region will have a dynamic range on the order of 512 values, so the quantization of the values
by 16 results in roughly 32 bins over such regions, which provides sufficient resolution of the various dynamic -range regions to allow for accurate segmentation and scaling of the values. Also, for one implementation one uses only a bin every 4^ pixel to reduce the memory requirement of the histogram buffer to an acceptable level. Segmentation of the Histogram
Next, the distribution is segmented into K (equals 32 for the implementation) points which determine the illumination levels at discrete values of the population. The segmentation of the dynamic range distribution is based upon finding the discrete values in the dynamic range that correspond to certain predetermined values of the pixel population. To this end, one first creates the cumulative distribution function from the histogram in the usual way. Next, one determines the
Equation 1 In this equation, the k th segmentation value if Λ' is determined as the value for which th f"
Equation 2
The distribution of the threshold factors k is a configuration table for the current implementation, F 1 ', and was and was originally chosen to be a linearly spaced set of values from 0 to 1. However, it was found that a logarithmically distributed set of values worked better.
An important issue is the determination of the extreme values of the image distribution to decide how to clip the minimum and maximum values to reduce the effects of outliers on the overall scaling of the illumination and brightness. This is determined by the values of the first and last thresholds, which were set to:
0.005. Fc'lp 0.98
Equation 3
These values were based on thresholds used for previous versions of our contrast enhancement algorithms that worked well for the enhanced digital night vision goggle.
The current implementation uses 32 segment points, which provides a decent balance between simplicity and effectiveness. Originally, 8 values were used which worked well enough to prove the concept but resulted in unacceptable amounts of artifacts due to the coarseness of the scaling. An illustration of this segmentation for 8 points is given in Figure 6.
Figure 6 is an illustration of histogram segmentation, in accordance with the first exemplary embodiment of the present disclosure. This sort of distribution is typical for cases where a hot object such as a light is in the image, and in fact the long tail is typically several orders of magnitude larger than the distribution of the "hump" which contains most of the interesting image detail. Simple clipping of such an image will not solve the problem due to the fact that the hot object may occupy a significant proportion of the image.
Now that one has the segmentation of the image distribution, it is necessary to decide how the distribution should be scaled. This is accomplished by first determining the values of the distribution points for the corrected image.
Dynamic Range Specification
The next step in the process is to determine the scaling and offsets for each interval. This is accomplished by mapping the dynamic range segments identified by the segmentation procedure onto a "canonical grid" which represents some ideal distribution. This is actually a form of histogram specification, although the specification used was somewhat heuristic and was chosen based on observation of the distributions of a variety of well-formed IR images which did not have dynamic range issues. Another issue that influences the specification was the desire to not over enhance the values of the dynamic range at lower temperatures, which typically results in amplification of undesirable
spatial noise.
Figure 7 is an illustration of compression of dynamic range segments, in accordance with the first exemplary embodiment of the present disclosure. Figure 8 is an illustration of expansion of dynamic range segments, in accordance with the first exemplary embodiment of the present disclosure. The most important contribution of the algorithm is the compression of large dynamical range segments that have little or no image information, which is illustrated in Figure 7. The interval ALc k is the difference between the levels Z and Lt k_1 which were shown in Figure 6. For example shown here, the kth segment is larger than the canonical segment ALc k., The scaling of this interval based on the ratio of the 2 intervals is also determined, as well as described in more detail below. First, what is explained is how to handle the opposite case for which the original segment is smaller than the canonical segment as illustrated in Figure 8.
In this case the condition would suggest expansion of the interval. However, experience with this was mostly negative, as this frequently leads to amplification of spatial noise in the image, while yielding only modest benefits in terms of contrast enhancement (in general). Thus, it was decided for this version of the algorithm to just leave the segment length alone for this case, as the improvements due to dynamic range compression was the pressing issue.
The result of this step of processing is simply the choice of the final scaled interval size for each segment. This can be summarized by the formula:
Δ¾. = Ik - = min(AZ , AL" k)
Equation 4
In this formula Λ is the value of the final scaled interval, is the value of the un-scaled, original interval found by the segmentation procedure, and is the value of the "canonical" or pre-specified segment interval which is a configuration value stored in a look-up table for the hardware implementation
Piecewise Scaling
Once one has the segments of the original distribution and values for the final dynamic range intervals, the calculation of the offsets and scaling coefficients for each segment is based on simple linear scaling. The formula for this is given by:
Equation 5
Here, A^ is the final scaled pixel value, is the original un-scaled value, '' is the value of the segment point that is just less than the original pixel value, and k is the new offset for that segment. The scaling factor is given by the ratio of the scaled to un-scaled interval for that segment.
Equation 6
The scaled offsets are simply the accumulation of the values of the interval sizes for all of the segments below that segment:
Equation 7
The value of the first offset is arbitrary, and is chosen to be the value of the lower clip value of the original distribution for simplicity. Another obvious choice would be to set it equal to zero.
Imagery Examples
Several example images comparing the previous goggle algorithm with the new algorithm are shown below. These examples show how the
method solves a couple of important problems for large dynamic range
imagery.
Figure 9 is a histogram pre-processor with IR LACE example image - dark lab w/hot objections, in accordance with the first exemplary embodiment of the present disclosure. Figure 10 is a histogram pre-processor w/IR LACE example image cold sky, in accordance with the first exemplary embodiment of the present disclosure. The first example shows how the algorithm solves the problem of very hot objects in an image causing fade in the detail for the rest of the image, thereby masking other objects such as a person. The original contrast enhancement algorithm did not handle this case well as shown by the image collected from the goggle on the left of Figure 9, while the right hand side shows the new pre-processing algorithm implementation.
Another important case is that of a warm foreground with a cold background. Here, what is shown is an image of a scene with interesting objects which is degraded due to the cold sky causing contrast reduction in the warmer foreground (left-hand side of Figure 10), and the same scene with the preprocessor active (right-hand side of Figure 10).
As illustrated in this case, many details that were completely washed out are now visible, and the person is clearly visible. Context-Based Fusion, Weighting based on structural information
Different smoothly varying blends of low light level (LLL) images and thermal (IR) images were evaluated in order to create a fused image for the goggle. The objective of the investigation was to address several user-identified deficiencies in the existing algorithm - primarily to provide more thermal in areas where the visible contrast is low. Low visible contrast primarily happens when the light level is very low (inside dense foliage, buildings, or tunnels) or when there is smoke or fog. The final blending algorithm was a function of the contrast for each image type, where contrast is defined as the standard deviation of the image. Blending gives more weight to the image type with the most structural information as measured by the standard deviation of that image.
Algorithm Approach
The subject system produces two image modalities (visible and thermal) and has three color planes (Red, Green, and Blue) to display the information. Using the theory of opponent colors, the fused image color scheme was designed to provide specific information regarding the scene based on colorization. Constraints were placed on the image color scheme, including:
• The primary situation awareness information should be shades of Green or hues close to Green.
• Red colors are reserved for "hot" thermal targets.
Based on these constraints, the Green channel was designated to contain the primary situation awareness information. Soldiers trained in the use of photon- intensified low light cameras are used to seeing a Green image. Therefore, subject goggle should provide familiar images that are primarily Green, but enhanced with Red to indicate targets, yellow to indicate higher levels of thermal, and Blues to indicate higher levels of visible light. In order to accomplish this, the Green channel is composed of a blend of visible and thermal imagery. The Red channel contains the thermal image with each pixel limited to one half of the Green pixel level, providing shades of Green to yellow. The Blue channel shows the greater of excess visible over the thermal or one quarter of visible with each pixel limited to one half of the Green pixel level, thus providing shades of Blue. The reason Blue is an excess level is to prevent the colors from all being white when thermal and visible levels are similar. This coloring scheme provides the following perceptual information:
• If thermal values exceed the IR threshold, the pixels are colored Red to indicate a "hot" target. The pixel values are shades of Red in low levels of visible light and turn orange-yellow as the visible light level increases.
• In very low light levels, the Green channel is mainly composed of enhanced thermal combined with a fraction of the visible light image (minimize the speckle while still showing strong lasers).
• As the visible light level increases, Green is composed of a blend of the visible image mixed with a fraction of the thermal image.
• The Red channel contains the thermal image and each pixel level is bounded by G/2. Yellow in the fused image indicates areas where the thermal image is stronger than the visible image.
• The Blue channel is the excess of visible over thermal and bounded by <LL/4 ... G/2>. Blue or purple in the fused image indicates areas where the visible image is much stronger than the thermal image.
Algorithm Details
The primary algorithm effort involved finding a smooth function to blend visible images with thermal in the Green channel based on their respective contrast levels. Since standard deviation is directly related to contrast, the robust average deviation of the image was calculated:
Equation 8
where N is the number of pixels, x(i) is the ith pixel level, and μ is the mean value of the image. In one embodiment, the average deviation was calculated for both the pre- LACE visible and pre-LACE thermal images. In order to calculate the average deviation for the thermal image with dimensions 640x480 and not using any divides, only 640x410 pixels were used and result was shifted by 18 (divided by 512*512):
Equation 9
A Matlab code segment to calculate the mean and average deviation for the visible images is shown in Exhibit 1.
Exhibit 1 : Matial? code for average deviation of visible Image
% Average Deviation Code for IX
Nr = 1024; % Number of rows in LL image
Kc = 1280; % Number of columns in LL image nBitShiftLL = -20; % Divide by 1024*IG24
% Compute the average value of LL NUC
mLL = 0;
for i = l:Nr % for each row
for j = 1 : Nc % For each pixel in each c ol laLL = mLL + LL{i,j);
eno
end
mLL = bit shif (mLL, nBitShiftLL) ; % Mean LL RNR
% Compute the average deviation, of LL. NUC sLL = 0;
for i = r : Nr % for each row
for j = 1 : Nc % For each pixel in each c ol sLL = sLL + abs {LL { i , j ) -mLL ) ;
end
e d
sLL = bitshift (sLL, nBitShiftLL} ; % ADEV LL NUC
A Matlab code segment to calculate the mean and average deviation for the thermal images is shown in Exhibit 2.
Exhibit 2: Matlab code for average deviation of thermal image
% Average Deviation Code for I
Nrl = 36; % Start row in IR RNR image
KP.2 = 445; % En row in IR RNR image cIR = 640; % Number of columns in IR image nEitShiftlR = -IS; % Divide by 512-512
% Compute the average value of IR RNR iTiIR = 0;
for i = NrI:Nr2 % for each row
for j = l:Hc∑R % For each pixel in each cor mIR = inI + IR (i, j ) ;
end
e d
mIR = bitshift iiaIR, nBitShiftNR} ; % Mean IR RNR % Compute the average deviation of IR RNR
SI = 0;
for i = K i :Nr2 % for each row
for j = l:HoIR. % For each pixel in each col
SIR = SIR ÷ abs (IRii, j ) -mIR; ;
end
end
SIR = bit shift (sIR, nBitShiftIR) ; % ADEV IR P.NR
Contrast-based fusion uses the estimates of the average deviations to blend visible and thermal images into the Green channel. The complete fusion algorithm using these estimates is shown in the table in Exhibit 3.
If the raw thermal pixels exceed a user-controlled threshold, those pixels are colored a shade of Red. The Red channel is set to the thermal image intensity, while the Green channel is set to half the intensity of the visible image in which targets get an orange hue in strong visible light.
Images with thermal pixel values less than the target threshold provide situation awareness, with the primary information in the Green channel. The Red channel provides yellow hues where the thermal image is high, while the Blue channel provides Blues and purples in regions where visible intensities exceed thermal intensities. The function for the Green channel enhances the thermal a little and diminishes visible a little when visible contrast is low. The following gain function was used to weight the thermal image based on the contrast of the two images:
WG - ffiin(inax(64 (sls -tsLL),.3), L25) Equation 10
The resulting values are placed in a lookup table and shifted to the left by 24 samples:
WgLUT = ϊηιη(ίΏΒχ(64./(1:256),.3},Ι,25);
Equation 11
gLUT(l:233) = WsLUT(24:256)
Equation 12
Figure 11 is a Green gain function for the IR channel, in accordance with the first exemplary embodiment of the present disclosure, and shows a plot of this function.
The thermal weighting function for the Green blend is:
Equation 13
The complete lookup table implementation of this algorithm is calculated as shown in Exhibit 4.
Exhibit 4: Matlab code to develop lookup table for IR Green Gain Function
% Compute the green gain function
iG = max ( 3LL÷sIP, 1 } % index must be at least I iG = min<iG, 128 ) ; % maximum index range is I2S
WgLUT = min (max (6 ./ (1:256) , .3) , 1 . 25 ) ;
WgLUT (1 :23'ii = WgLUT (24 : 256} ;
Wg = ggLU (iG} ;
% Compute the IP weighting function
D = roax isLL+sIP., i) ; % Denom must fo% at least i D = min (D, 1024.) ; % Force range to a maximum value Wir = Wg*sIP./D; % One mult and one div
Figure 12 is an IR scale for the Green channel, in accordance with the first exemplary embodiment of the present disclosure. The resulting IR weighting for various levels of visible contrast are shown in Figure 12. The visible image is gradually reduced in intensity when the light level is very low. The purpose of this weighting function is to reduce image speckle where there is little information, but still allow strong laser pointers or spot lights to be clearly seen.
The weighting function for the visible image in the Green channel is a function of the mean value of the image:
WLL = min(mLL/16+.5,l)
This weighting function is shown in Figure 13, which depicts a weighting function for the low light level channel, in accordance with the first exemplary embodiment of the present
disclosure.
Imagery Examples
Several example images comparing the previous goggle algorithm with the new algorithm are shown below. These examples show the corner cases of the algorithm. The first example in Figure 14, which is a fusion example image - dark lab, in accordance with the first exemplary embodiment of the present disclosure, is a very low light level image taken in a darkroom with the following statistics:
μΙΧ = 0.72, σΙΧ = 5, alR = 19
The original goggle algorithm on the left has little thermal, has lots of visible speckle, and a strong light under the door. The new algorithm on the right provides a good blend of enhanced IR, reduces the visible speckle, but preserves the strong light under the door.
The next example of Figure 15, which is a fusion example image - mixed illumination, in accordance with the first exemplary embodiment of the present disclosure, shows the strong thermal being blended in to the image even when there is relatively strong visible. In this case, the bright visible lights are still evident around the porch light and the headlights, but the details
in the thermal background are also brought out in shades of yellow and Green. The image statistics are:
μΙΧ = 65, σΙΧ =96, cJR = 254
The following example of Figure 16, which is a fusion example image - dark woods, in accordance with the first exemplary embodiment of the present disclosure, was taken in the woods and the visible light was artificially lowered. The scene is primarily thermal, but there is a strong laser light in the middle left. Contrast fusion preserves the Green channel for situation awareness and uses shades of yellow to indicate strong thermal in the image. The statistics for the images are:
μΙΧ = 0.3, σΙΧ =1, alR = 56
The same scene in much strong light is shown in Figure 17, which is a fusion example image - lighted woods, in accordance with the first exemplary embodiment of the present disclosure. The image is primarily visible, but the thermal is fused in at a lower level. The image statistics are:
μΙΧ = 175, σΙΧ =83, alR = 56
It will be appreciated that the new contrast-based fusion algorithm uses a blending algorithm that gives more weight to image types (thermal or low light level) that have the most structural content. The new algorithm maximizes the scene information content, providing more detail in low contrast areas (inside buildings, caves, or under tunnels), or in washed out areas (in the presence of strong lights, in smoke, or in fog).
While the present invention has been described in connection with the preferred embodiments of the various Figures, it is to be understood that other similar embodiments may be used or modifications and additions may be made to the described embodiment for performing the same function of the present invention without deviating there from. Therefore, the present invention should not be limited to any single embodiment, but rather construed in breadth and scope in accordance with the recitation of the appended claims.
Claims
1. A system that maximizes information content in an image fusion process that blends co- registered low light level images in the visible region of the electromagnetic spectrum with thermal infrared images, said infrared and visible images constituting two different image types, comprising:
a fusion module for detecting which of the two image types has a greater quantity of structural information and increasing a weight of the pixels in the image type detected to have the greater quantity of structural information.
2. The system of Claim 1, wherein the fusion module for detecting which of the two image types has the greater quantity of structural information includes contrast detectors for each of the two image types that detect a contrast for the two image types.
3. The system of Claim 2, wherein said contrast detectors include modules for determining an average standard deviation of pixel values for pixels in an image.
4. The system of Claim 3, wherein the average standard deviation is taken over an entire image.
5. The system of Claim 3, wherein the average standard deviation is taken over a predetermined region within an image.
6. The system of Claim 1, further comprising a camera having infrared and visible outputs therefrom, wherein said fusion module detects the structural information in said infrared and visible outputs.
7. The system of Claim 6, further comprising preprocessing modules for each of said infrared and visible outputs.
8. The system of Claim 6, further comprising a display, wherein an information content of said infrared and visible outputs is made available to said display in terms of a Red plane, a Green plane, and a Blue plane.
9. The system of Claim 8, wherein images from said Green plane are displayed at said display for situational awareness of a scene viewed by said camera.
10. The system of Claim 8, wherein images in said Red plane, said Green plane and said Blue plane are categorized as being either one of: scene images and target images.
11. The system of Claim 10, wherein scene images are displayed when raw infrared values from said camera are below a predetermined threshold.
12. The system of Claim 10, wherein target images are colorized when raw infrared values from said camera are above a predetermined threshold.
13. The system of Claim 12, wherein the color of infrared images is colorized so as to change the color from a normal color associated with the infrared image to an image which has an increased red color, wherein the increased red color emphasizes target features of a target when said raw infrared levels are above said predetermined threshold.
14. The system of Claim 13, wherein said target features are colored red to orange, thus to emphasize said target features.
15. The system of Claim 8, and further including a first limiter coupled to said infrared image for limiting the value of infrared pixels to be no greater than the value of Green plane pixels divided by two.
16. The system of Claim 8, wherein said visible image has visible image pixels and further includes a second limiter coupled to said visible image setting a value of said visible image pixels to the greater of the low light level image value divided by at least one of four and the low light level image value, minus a raw infrared value, whichever is greater, and limiting to the value of the Green plane pixels divided by two.
17. The system of Claim 1, wherein situational awareness is increased by categorizing image information as at least one of scene information and target information and colorizing the target images to highlight target features when raw IR values are above a predetermined threshold.
18. The system of claim 17, further comprising a color change module for categorizing said image information as at least one of scene information and target information based on infrared pixel value inputs.
19. The system of Claim 18, wherein said color change module changes infrared pixels to a shade of orange to red.
20. A method of maximizing information content in an image fusion process by blending co- registered low light level visible images in a visible region of the electromagnetic spectrum with thermal infrared images, said infrared and visible images constituting two different image types, the method comprising:
detecting, in a fusion module, which of the two image types of the visible images and the infrared images has a greater quantity of structural information; and
increasing a weight of pixels in the image type detected to have the greater quantity of structural information.
Priority Applications (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP15776749.2A EP3129954A4 (en) | 2014-04-07 | 2015-04-01 | Contrast based image fusion |
| US14/896,421 US9996913B2 (en) | 2014-04-07 | 2015-04-01 | Contrast based image fusion |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201461976166P | 2014-04-07 | 2014-04-07 | |
| US61/976,166 | 2014-04-07 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2015157058A1 true WO2015157058A1 (en) | 2015-10-15 |
Family
ID=54288277
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/US2015/023831 Ceased WO2015157058A1 (en) | 2014-04-07 | 2015-04-01 | Contrast based image fusion |
Country Status (3)
| Country | Link |
|---|---|
| US (1) | US9996913B2 (en) |
| EP (1) | EP3129954A4 (en) |
| WO (1) | WO2015157058A1 (en) |
Cited By (23)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN105528772A (en) * | 2015-12-05 | 2016-04-27 | 中国航空工业集团公司洛阳电光设备研究所 | Image fusion method based on guidance filtering |
| CN105631922A (en) * | 2015-12-21 | 2016-06-01 | 南京理工大学 | Vega-based infrared and low-light-level video synchronization simulation method |
| CN106097250A (en) * | 2016-06-22 | 2016-11-09 | 江南大学 | A kind of based on the sparse reconstructing method of super-resolution differentiating canonical correlation |
| CN106204509A (en) * | 2016-07-07 | 2016-12-07 | 西安电子科技大学 | Based on region characteristic infrared and visible light image fusion method |
| CN106296624A (en) * | 2015-06-11 | 2017-01-04 | 联想(北京)有限公司 | A kind of image interfusion method and device |
| CN106500852A (en) * | 2016-09-28 | 2017-03-15 | 北方夜视技术股份有限公司 | Infrared and visible light image registration and the System and method for for merging |
| WO2018064894A1 (en) * | 2016-10-08 | 2018-04-12 | 杭州海康威视数字技术股份有限公司 | Method, device and system for image fusion |
| CN109102484A (en) * | 2018-08-03 | 2018-12-28 | 北京字节跳动网络技术有限公司 | Method and apparatus for handling image |
| CN109147005A (en) * | 2018-08-24 | 2019-01-04 | 电子科技大学 | It is a kind of for the adaptive colouring method of infrared image, system, storage medium, terminal |
| CN110620885A (en) * | 2019-10-18 | 2019-12-27 | 中国科学院深圳先进技术研究院 | Infrared low-light-level image fusion system and method and electronic equipment |
| CN111611905A (en) * | 2020-05-18 | 2020-09-01 | 沈阳理工大学 | A target recognition method based on visible light and infrared fusion |
| CN111724333A (en) * | 2020-06-09 | 2020-09-29 | 四川大学 | Fusion method of infrared image and visible light image based on early visual information processing |
| CN111780877A (en) * | 2020-07-06 | 2020-10-16 | 广东智芯光电科技有限公司 | A method and system for measuring the temperature of an object based on a camera |
| CN113034371A (en) * | 2021-05-27 | 2021-06-25 | 四川轻化工大学 | Infrared and visible light image fusion method based on feature embedding |
| CN113255779A (en) * | 2021-05-28 | 2021-08-13 | 中国航天科工集团第二研究院 | Multi-source perception data fusion identification method and system and computer readable storage medium |
| CN113544734A (en) * | 2019-03-07 | 2021-10-22 | 三星电子株式会社 | Electronic device and method for adjusting color of image data by using infrared sensor |
| CN113850318A (en) * | 2021-09-23 | 2021-12-28 | 武汉九天高分遥感技术有限公司 | A temperature information enhancement method based on the fusion of visible light and thermal infrared |
| WO2022060444A1 (en) * | 2020-09-21 | 2022-03-24 | Microsoft Technology Licensing, Llc | Selective colorization of thermal imaging |
| CN114897735A (en) * | 2022-05-20 | 2022-08-12 | 河南科技学院 | Local self-adaptive underwater image contrast enhancement method |
| WO2022177705A1 (en) * | 2021-02-18 | 2022-08-25 | Microsoft Technology Licensing, Llc | Texture based fusion for images with cameras having differing modalities |
| CN115587945A (en) * | 2022-10-11 | 2023-01-10 | 西安中科立德红外科技有限公司 | High dynamic infrared image detail enhancement method, system and computer storage medium |
| CN115643811A (en) * | 2020-12-31 | 2023-01-24 | 华为技术有限公司 | Image processing method, data acquisition method and device |
| CN118154443A (en) * | 2024-05-09 | 2024-06-07 | 江苏北方湖光光电有限公司 | A method for improving the fusion viewing distance of fusion night vision device in real time |
Families Citing this family (60)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN103986874A (en) * | 2014-05-29 | 2014-08-13 | 宇龙计算机通信科技(深圳)有限公司 | An image acquisition device, image acquisition method and terminal |
| DE102014115292A1 (en) * | 2014-10-21 | 2016-04-21 | Connaught Electronics Ltd. | Method for providing image files from a camera system, camera system and motor vehicle |
| WO2016205419A1 (en) * | 2015-06-15 | 2016-12-22 | Flir Systems Ab | Contrast-enhanced combined image generation systems and methods |
| US10365487B1 (en) * | 2015-06-23 | 2019-07-30 | Rockwell Collins, Inc. | Night sky spatial orientation using color and surface fusion |
| US10853926B2 (en) * | 2016-03-29 | 2020-12-01 | Sony Corporation | Image processing device, imaging device, and image processing method |
| US10200632B2 (en) | 2016-08-01 | 2019-02-05 | Microsoft Technology Licensing, Llc | Low-illumination photo capture with reduced noise and blur |
| US10701244B2 (en) * | 2016-09-30 | 2020-06-30 | Microsoft Technology Licensing, Llc | Recolorization of infrared image streams |
| CN107976257B (en) * | 2016-10-21 | 2019-03-22 | 杭州海康威视数字技术股份有限公司 | Image display method, device and infrared thermal imager of an infrared thermal imager |
| CN106791318B (en) * | 2016-12-30 | 2019-06-25 | 南京大学 | A portable hyperspectral video real-time acquisition and processing device and method thereof |
| EP3410688B1 (en) * | 2017-06-01 | 2019-03-06 | Axis AB | Method for improving image quality in images acquired by a near-infrared sensitive video camera and such a camera |
| CN107369145B (en) * | 2017-06-16 | 2020-07-10 | Oppo广东移动通信有限公司 | Image processing method, device and terminal device |
| CN107230199A (en) * | 2017-06-23 | 2017-10-03 | 歌尔科技有限公司 | Image processing method, device and augmented reality equipment |
| CN115937776A (en) * | 2017-09-15 | 2023-04-07 | 杭州海康威视数字技术股份有限公司 | Monitoring method, device, system, electronic device and computer-readable storage medium |
| CN107909562B (en) * | 2017-12-05 | 2021-06-08 | 华中光电技术研究所(中国船舶重工集团公司第七一七研究所) | Fast image fusion algorithm based on pixel level |
| CN108254077B (en) * | 2018-01-02 | 2020-04-28 | 国网上海市电力公司 | GIS thermal fault diagnosis method based on local and global feature information fusion |
| CN110060210B (en) * | 2018-01-19 | 2021-05-25 | 腾讯科技(深圳)有限公司 | Image processing method and related device |
| AU2019248023B2 (en) * | 2018-04-05 | 2023-03-16 | Efficiency Matrix Pty Ltd | Computer implemented structural thermal audit systems and methods |
| CN110363731B (en) * | 2018-04-10 | 2021-09-03 | 杭州海康微影传感科技有限公司 | Image fusion method and device and electronic equipment |
| CN110363732B (en) * | 2018-04-11 | 2026-03-17 | 杭州海康威视数字技术股份有限公司 | An image fusion method and apparatus |
| CN108765358A (en) * | 2018-05-22 | 2018-11-06 | 烟台艾睿光电科技有限公司 | The double light fusion methods and plug-in type thermal imager system of visible light and infrared light |
| US10440293B1 (en) * | 2018-06-22 | 2019-10-08 | Jurgen R. Ihns | Avionics imaging control method and apparatus |
| US10623709B2 (en) * | 2018-08-31 | 2020-04-14 | Disney Enterprises, Inc. | Video color propagation |
| CN109377468A (en) * | 2018-10-09 | 2019-02-22 | 湖南源信光电科技股份有限公司 | The pseudo-colours fusion method of infra-red radiation and polarization image based on multiple features |
| CN111247558A (en) * | 2018-12-04 | 2020-06-05 | 深圳市大疆创新科技有限公司 | Image processing method, device, unmanned aerial vehicle, system and storage medium |
| CN109618099B (en) * | 2019-01-10 | 2021-01-08 | 深圳英飞拓科技股份有限公司 | Double-spectrum camera image fusion method and device |
| US11216920B2 (en) * | 2019-05-31 | 2022-01-04 | Apple Inc. | Enhanced local contrast |
| CN112241668B (en) * | 2019-07-18 | 2024-06-28 | 杭州海康威视数字技术股份有限公司 | Image processing method, device and equipment |
| US11017491B2 (en) * | 2019-08-22 | 2021-05-25 | The Boeing Company | Nonconformance detection system |
| US11270448B2 (en) * | 2019-11-26 | 2022-03-08 | Microsoft Technology Licensing, Llc | Using machine learning to selectively overlay image content |
| US10951869B1 (en) * | 2019-12-11 | 2021-03-16 | Karl Storz Imaging, Inc. | System for optimizing blended video streams |
| CN111161356B (en) * | 2019-12-17 | 2022-02-15 | 大连理工大学 | Infrared and visible light fusion method based on double-layer optimization |
| CN114830172A (en) * | 2019-12-18 | 2022-07-29 | 化学影像公司 | System and method for a combined imaging modality for improved tissue detection |
| CN111222511B (en) * | 2020-04-13 | 2020-07-24 | 中山大学 | A kind of infrared unmanned aerial vehicle target detection method and system |
| CN113160106B (en) * | 2020-04-29 | 2024-05-07 | 西安天和防务技术股份有限公司 | Infrared target detection method and device, electronic equipment and storage medium |
| US20210400210A1 (en) * | 2020-06-23 | 2021-12-23 | Immervision, Inc. | Infrared wide-angle camera |
| CN114119378B (en) * | 2020-08-31 | 2025-03-25 | 华为技术有限公司 | Image fusion method, image fusion model training method and device |
| CN112529987B (en) * | 2020-09-14 | 2023-05-26 | 武汉高德智感科技有限公司 | Method and system for fusing infrared image and visible light image of mobile phone terminal |
| US11877088B2 (en) | 2020-12-17 | 2024-01-16 | Microsoft Technology Licensing, Llc | Contrast enhanced images composited with artificial colorization |
| US11528435B2 (en) | 2020-12-25 | 2022-12-13 | Industrial Technology Research Institute | Image dehazing method and image dehazing apparatus using the same |
| JP7681405B2 (en) * | 2021-01-29 | 2025-05-22 | 株式会社小松製作所 | Display system and display method |
| CN112907497B (en) * | 2021-03-19 | 2022-08-16 | 苏州科达科技股份有限公司 | Image fusion method and image fusion device |
| CN112950519B (en) * | 2021-03-19 | 2022-03-15 | 中国科学院长春光学精密机械与物理研究所 | A Novel Infrared and Visible Image Fusion Algorithm |
| US12079969B2 (en) * | 2021-04-14 | 2024-09-03 | Microsoft Technology Licensing, Llc | Colorization to show contribution of different camera modalities |
| US11810278B2 (en) * | 2021-05-03 | 2023-11-07 | Microsoft Technology Licensing, Llc | Low light and thermal image normalization for advanced fusion |
| US11851215B2 (en) * | 2021-05-27 | 2023-12-26 | Honeywell International Inc. | Systems and methods for calibrating a synthetic image on an avionic display |
| JP2023004185A (en) * | 2021-06-25 | 2023-01-17 | 株式会社Jvcケンウッド | Image processing apparatus, image processing method, and program |
| CN113487530B (en) * | 2021-08-02 | 2023-06-16 | 广东工业大学 | Infrared and visible light fusion imaging method based on deep learning |
| US12412249B2 (en) * | 2021-12-17 | 2025-09-09 | Canon Kabushiki Kaisha | Image processing apparatus, image processing method, and image capture apparatus |
| CN114898189B (en) * | 2022-03-31 | 2025-03-21 | 西安电子科技大学 | Infrared and visible light fusion recognition system and method based on modal difference feature guidance |
| CN116993593B (en) * | 2022-04-19 | 2026-01-06 | 中国石油化工股份有限公司 | Image processing methods, apparatus, electronic devices and storage media |
| CN115690551B (en) * | 2022-11-16 | 2026-02-24 | 国网智能科技股份有限公司 | Dual-light image matching fusion method and system |
| CN115719313A (en) * | 2022-11-21 | 2023-02-28 | 上海微觅医疗器械有限公司 | Image defogging method, storage medium and system |
| US12555192B2 (en) | 2023-01-30 | 2026-02-17 | Bae Systems Information And Electronic Systems Integration Inc. | ROI weighted plateau equalization |
| CN116416136B (en) * | 2023-04-17 | 2023-12-19 | 北京卫星信息工程研究所 | Data amplification method for ship target detection of visible light remote sensing image and electronic equipment |
| CN116704048B (en) * | 2023-08-09 | 2023-11-17 | 四川元祉智慧科技有限公司 | Double-light registration method |
| CN117975040B (en) * | 2024-03-28 | 2024-06-18 | 南昌工程学院 | GIS infrared image recognition system and method based on improved YOLOv5 |
| CN118015474B (en) * | 2024-04-08 | 2024-06-21 | 北京理工大学 | Intelligent data weight method for high-bit wide space reconnaissance image information analysis |
| US12560817B2 (en) | 2024-05-13 | 2026-02-24 | Rivet Industries, Inc. | Color imagery in extremely low light conditions for a head mounted display |
| WO2025240384A1 (en) * | 2024-05-13 | 2025-11-20 | Rivet Industries, Inc. | Improved color imagery in extremely low light conditions for a head mounted display |
| CN120869093B (en) * | 2025-08-22 | 2026-03-20 | 九创智航(广西)科技有限公司 | AI-based UAV electro-optical tracking and aiming system and its target tracking method |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| KR20050103194A (en) * | 2003-02-06 | 2005-10-27 | 바이에리셰 모토렌 베르케 악티엔게젤샤프트 | Method and device for visualizing a motor vehicle environment with environment-dependent fusion of an infrared image and a visual image |
| US20090051760A1 (en) * | 2005-07-01 | 2009-02-26 | Ottney Joseph C | Fusion night vision system |
| US20120113266A1 (en) * | 2009-04-07 | 2012-05-10 | Nextvision Stabilized Systems Ltd | Methods of manufacturing a camera system having multiple image sensors |
| US20130057698A1 (en) * | 2011-03-09 | 2013-03-07 | Bae Systems Information And Electronic Systems Integration Inc. | System and method for situational awareness and target cueing |
| EP2690582A1 (en) * | 2012-07-27 | 2014-01-29 | Honda Research Institute Europe GmbH | System for controlling an automated device |
Family Cites Families (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7620265B1 (en) | 2004-04-12 | 2009-11-17 | Equinox Corporation | Color invariant image fusion of visible and thermal infrared video |
| US7613360B2 (en) * | 2006-02-01 | 2009-11-03 | Honeywell International Inc | Multi-spectral fusion for video surveillance |
| US8749635B2 (en) * | 2009-06-03 | 2014-06-10 | Flir Systems, Inc. | Infrared camera systems and methods for dual sensor applications |
| US20120098971A1 (en) * | 2010-10-22 | 2012-04-26 | Flir Systems, Inc. | Infrared binocular system with dual diopter adjustment |
| KR101990367B1 (en) * | 2014-05-08 | 2019-06-18 | 한화테크윈 주식회사 | Method of image fusion |
-
2015
- 2015-04-01 US US14/896,421 patent/US9996913B2/en active Active
- 2015-04-01 EP EP15776749.2A patent/EP3129954A4/en not_active Withdrawn
- 2015-04-01 WO PCT/US2015/023831 patent/WO2015157058A1/en not_active Ceased
Patent Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| KR20050103194A (en) * | 2003-02-06 | 2005-10-27 | 바이에리셰 모토렌 베르케 악티엔게젤샤프트 | Method and device for visualizing a motor vehicle environment with environment-dependent fusion of an infrared image and a visual image |
| US20090051760A1 (en) * | 2005-07-01 | 2009-02-26 | Ottney Joseph C | Fusion night vision system |
| US20120113266A1 (en) * | 2009-04-07 | 2012-05-10 | Nextvision Stabilized Systems Ltd | Methods of manufacturing a camera system having multiple image sensors |
| US20130057698A1 (en) * | 2011-03-09 | 2013-03-07 | Bae Systems Information And Electronic Systems Integration Inc. | System and method for situational awareness and target cueing |
| EP2690582A1 (en) * | 2012-07-27 | 2014-01-29 | Honda Research Institute Europe GmbH | System for controlling an automated device |
Non-Patent Citations (1)
| Title |
|---|
| See also references of EP3129954A4 * |
Cited By (39)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN106296624A (en) * | 2015-06-11 | 2017-01-04 | 联想(北京)有限公司 | A kind of image interfusion method and device |
| CN106296624B (en) * | 2015-06-11 | 2020-05-26 | 联想(北京)有限公司 | Image fusion method and device |
| CN105528772B (en) * | 2015-12-05 | 2018-10-16 | 中国航空工业集团公司洛阳电光设备研究所 | A kind of image interfusion method based on directiveness filtering |
| CN105528772A (en) * | 2015-12-05 | 2016-04-27 | 中国航空工业集团公司洛阳电光设备研究所 | Image fusion method based on guidance filtering |
| CN105631922A (en) * | 2015-12-21 | 2016-06-01 | 南京理工大学 | Vega-based infrared and low-light-level video synchronization simulation method |
| CN106097250B (en) * | 2016-06-22 | 2019-04-16 | 江南大学 | A kind of sparse reconstructing method of super-resolution based on identification canonical correlation |
| CN106097250A (en) * | 2016-06-22 | 2016-11-09 | 江南大学 | A kind of based on the sparse reconstructing method of super-resolution differentiating canonical correlation |
| CN106204509B (en) * | 2016-07-07 | 2019-12-17 | 西安电子科技大学 | Infrared and visible light image fusion method based on regional characteristics |
| CN106204509A (en) * | 2016-07-07 | 2016-12-07 | 西安电子科技大学 | Based on region characteristic infrared and visible light image fusion method |
| CN106500852A (en) * | 2016-09-28 | 2017-03-15 | 北方夜视技术股份有限公司 | Infrared and visible light image registration and the System and method for for merging |
| CN107918929B (en) * | 2016-10-08 | 2019-06-21 | 杭州海康威视数字技术股份有限公司 | An image fusion method, device and system |
| US10977781B2 (en) | 2016-10-08 | 2021-04-13 | Hangzhou Hikvision Digital Technology Co., Ltd. | Method, device and system for image fusion |
| EP3525130A4 (en) * | 2016-10-08 | 2019-09-11 | Hangzhou Hikvision Digital Technology Co., Ltd. | METHOD, DEVICE, AND IMAGE FUSION SYSTEM |
| CN107918929A (en) * | 2016-10-08 | 2018-04-17 | 杭州海康威视数字技术股份有限公司 | A kind of image interfusion method, apparatus and system |
| WO2018064894A1 (en) * | 2016-10-08 | 2018-04-12 | 杭州海康威视数字技术股份有限公司 | Method, device and system for image fusion |
| CN109102484A (en) * | 2018-08-03 | 2018-12-28 | 北京字节跳动网络技术有限公司 | Method and apparatus for handling image |
| CN109147005A (en) * | 2018-08-24 | 2019-01-04 | 电子科技大学 | It is a kind of for the adaptive colouring method of infrared image, system, storage medium, terminal |
| CN109147005B (en) * | 2018-08-24 | 2023-02-28 | 电子科技大学 | Self-adaptive dyeing method and system for infrared image, storage medium and terminal |
| CN113544734A (en) * | 2019-03-07 | 2021-10-22 | 三星电子株式会社 | Electronic device and method for adjusting color of image data by using infrared sensor |
| CN110620885A (en) * | 2019-10-18 | 2019-12-27 | 中国科学院深圳先进技术研究院 | Infrared low-light-level image fusion system and method and electronic equipment |
| CN110620885B (en) * | 2019-10-18 | 2022-04-26 | 中国科学院深圳先进技术研究院 | Infrared low-light-level image fusion system and method and electronic equipment |
| CN111611905A (en) * | 2020-05-18 | 2020-09-01 | 沈阳理工大学 | A target recognition method based on visible light and infrared fusion |
| CN111611905B (en) * | 2020-05-18 | 2023-04-18 | 沈阳理工大学 | Visible light and infrared fused target identification method |
| CN111724333A (en) * | 2020-06-09 | 2020-09-29 | 四川大学 | Fusion method of infrared image and visible light image based on early visual information processing |
| CN111724333B (en) * | 2020-06-09 | 2023-05-30 | 四川大学 | Infrared image and visible light image fusion method based on early visual information processing |
| CN111780877A (en) * | 2020-07-06 | 2020-10-16 | 广东智芯光电科技有限公司 | A method and system for measuring the temperature of an object based on a camera |
| US11375166B2 (en) | 2020-09-21 | 2022-06-28 | Microsoft Technology Licensing, Llc | Selective colorization of thermal imaging |
| WO2022060444A1 (en) * | 2020-09-21 | 2022-03-24 | Microsoft Technology Licensing, Llc | Selective colorization of thermal imaging |
| CN115643811A (en) * | 2020-12-31 | 2023-01-24 | 华为技术有限公司 | Image processing method, data acquisition method and device |
| WO2022177705A1 (en) * | 2021-02-18 | 2022-08-25 | Microsoft Technology Licensing, Llc | Texture based fusion for images with cameras having differing modalities |
| US12056879B2 (en) | 2021-02-18 | 2024-08-06 | Microsoft Technology Licensing, Llc | Texture based fusion for images with cameras having differing modalities |
| CN113034371A (en) * | 2021-05-27 | 2021-06-25 | 四川轻化工大学 | Infrared and visible light image fusion method based on feature embedding |
| CN113255779A (en) * | 2021-05-28 | 2021-08-13 | 中国航天科工集团第二研究院 | Multi-source perception data fusion identification method and system and computer readable storage medium |
| CN113255779B (en) * | 2021-05-28 | 2023-08-18 | 中国航天科工集团第二研究院 | Multi-source perception data fusion identification method, system and computer readable storage medium |
| CN113850318A (en) * | 2021-09-23 | 2021-12-28 | 武汉九天高分遥感技术有限公司 | A temperature information enhancement method based on the fusion of visible light and thermal infrared |
| CN114897735A (en) * | 2022-05-20 | 2022-08-12 | 河南科技学院 | Local self-adaptive underwater image contrast enhancement method |
| CN114897735B (en) * | 2022-05-20 | 2023-08-11 | 河南科技学院 | Locally adaptive underwater image contrast enhancement method |
| CN115587945A (en) * | 2022-10-11 | 2023-01-10 | 西安中科立德红外科技有限公司 | High dynamic infrared image detail enhancement method, system and computer storage medium |
| CN118154443A (en) * | 2024-05-09 | 2024-06-07 | 江苏北方湖光光电有限公司 | A method for improving the fusion viewing distance of fusion night vision device in real time |
Also Published As
| Publication number | Publication date |
|---|---|
| EP3129954A1 (en) | 2017-02-15 |
| US20160093034A1 (en) | 2016-03-31 |
| US9996913B2 (en) | 2018-06-12 |
| EP3129954A4 (en) | 2017-10-18 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US9996913B2 (en) | Contrast based image fusion | |
| US8334924B2 (en) | Exposure control for an imaging system | |
| CN110022469B (en) | Image processing method, device, storage medium and electronic device | |
| US9852499B2 (en) | Automatic selection of optimum algorithms for high dynamic range image processing based on scene classification | |
| JP5178170B2 (en) | White balance adjusting device and white balance adjusting method | |
| TWI459805B (en) | Image processing device, image capturing device, image processing method and image processing program | |
| JP6351903B1 (en) | Image processing apparatus, image processing method, and photographing apparatus | |
| US20070047803A1 (en) | Image processing device with automatic white balance | |
| US7081920B2 (en) | Illumination light color estimating method of obtaining white balance and image sensing apparatus | |
| KR102830051B1 (en) | Method for image processing of high dynamic range sensors for handling white balance | |
| WO2022067761A1 (en) | Image processing method and apparatus, capturing device, movable platform, and computer readable storage medium | |
| US9497433B2 (en) | Imaging device with color adjustment function, imaging method using the same, and non-transitory storage medium in which imaging program is stored | |
| CN102148933A (en) | Image processing apparatus | |
| US20180025476A1 (en) | Apparatus and method for processing image, and storage medium | |
| JP7307541B2 (en) | IMAGE PROCESSING DEVICE, IMAGING DEVICE, IMAGE PROCESSING METHOD, AND PROGRAM | |
| JP6965132B2 (en) | Image processing equipment, imaging equipment, image processing methods and programs | |
| JP2019040382A (en) | Image processing device | |
| JP3466167B2 (en) | Light source color estimation method for white balance and imaging apparatus using the estimation method | |
| WO2006067724A2 (en) | Automatic white balance control | |
| US20080079749A1 (en) | White balance method for image processing | |
| JP5050141B2 (en) | Color image exposure evaluation method | |
| EP4575981A1 (en) | Method and system for processing dual-stream images | |
| US20250209573A1 (en) | Method and system for processing dual-stream images | |
| CN119583963A (en) | An automatic image exposure method and hardware system based on SOC | |
| Lin et al. | Automatic White Balance for Image Pipeline of Digital Still Camera |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 15776749 Country of ref document: EP Kind code of ref document: A1 |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 14896421 Country of ref document: US |
|
| REEP | Request for entry into the european phase |
Ref document number: 2015776749 Country of ref document: EP |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 2015776749 Country of ref document: EP |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |





