
Airborne Infrared and Visible Image Fusion Combined with Region Segmentation

Yujia Zuo 1,2, Jinghong Liu 1,*, Guanbing Bai 1,2, Xuan Wang 1,2 and Mingchao Sun 1

1 Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, #3888 Dongnanhu Road, Changchun 130033, China; [email protected] (Y.Z.); [email protected] (G.B.); [email protected] (X.W.); [email protected] (M.S.)
2 University of Chinese Academy of Sciences, #19 Yuquan Road, Beijing 100049, China
* Correspondence: [email protected] or [email protected]; Tel.: +86-135-7878-9374

Academic Editors: Cheng Wang, Julian Smit, Ayman F. Habib and Michael Ying Yang Received: 24 March 2017; Accepted: 9 May 2017; Published: 15 May 2017

Abstract: This paper proposes an infrared (IR) and visible image fusion method that introduces region segmentation into the dual-tree complex wavelet transform (DTCWT) domain. This method should effectively improve both the target indication and scene spectrum features of fusion images, and the target identification and tracking reliability of the fusion system, on an airborne photoelectric platform. The method involves segmenting the region in an IR image by significance, and identifying the target region and the background region; then, fusing the low-frequency components in the DTCWT domain according to the region segmentation result. For high-frequency components, region weights are assigned according to the information richness of region details to conduct fusion based on both weights and adaptive phases, and a shrinkage function is then introduced to suppress noise. Finally, the fused low-frequency and high-frequency components are reconstructed to obtain the fusion image. The experimental results show that the proposed method can fully extract complementary information from the source images to obtain a fusion image with good target indication and rich information on scene details. It also gives a fusion result superior to existing popular fusion methods, based on either subjective or objective evaluation. With good stability and high fusion accuracy, this method can meet the fusion requirements of IR-visible image fusion systems.

Keywords: airborne optoelectronic platform; image fusion; image segmentation; saliency extraction; dual-tree complex wavelet transform (DTCWT)

1. Introduction

With the development of sensor imaging technology, an unmanned aerial vehicle (UAV) carrying an infrared (IR)-visible light sensor can work in both day and night, enabling the wide use of UAVs in fields such as military reconnaissance, target localization, and forest fire control [1–4]. The imaging mechanisms of infrared and visible light cameras differ: infrared cameras image by the principle of object thermal radiation, while visible light cameras image by the principle of light reflection from objects. The two images are strongly complementary to each other, as an IR image has good target indication, while a visible image gives clear background information. The fusion of IR and visible images combines their respective advantages to improve UAV reliability in target detection and identification.

Current studies on IR-visible light fusion often leave the registration of the source images out of consideration. However, factors such as high UAV speed, great imaging distance, and imaging sensor vibration in flight can lead to the distortion (in terms of translation, rotation, and scale) of airborne IR and visible images taken by a UAV, so as to seriously affect the fusion result. Therefore, IR and visible images must be accurately registered before being fused.



Existing registration methods mostly focus on algorithmic research with relatively low operation efficiency [5–9]. Registration based on the structural design and optical alignment of an airborne photoelectric imaging system can ensure registration accuracy and increase arithmetic speed.

By considering the image information on multiple scales, the multiresolution fusion method is effective in fusion, and has received extensive attention in the IR-visible image fusion field. Traditional multiresolution-based fusion algorithms mainly rely on the pyramid transform and the discrete wavelet transform (DWT) [10–12]. Strong in image reconstruction, DWT was the first to be applied to the IR-visible image fusion field. However, due to translation variance and poor direction selectivity, an image fused by DWT will have insufficient details and blurry edges. In recent years, researchers have successively proposed improved algorithms and better multiresolution decomposition tools such as the dual-tree complex wavelet transform (DTCWT) [13,14], Contourlet [15,16], Curvelet [17,18], and the non-subsampled contourlet transform (NSCT) [19–21]. However, the downsampling operation used in image decomposition through Contourlet can cause pseudo-Gibbs phenomena in a fusion image. To avoid this phenomenon, the non-subsampled contourlet transform (NSCT) was proposed; the fusion effect is improved, but the processing speed is too slow. In contrast, DTCWT relies on two groups of Q-shift filters, and uses two real wavelets to realize the complex wavelet transform. After being decomposed through DTCWT, an image yields detail information in six directions, showing better direction selectivity than DWT. In addition, the two groups of Q-shift filters have different delays that can remove the aliasing caused by translation, so DTCWT possesses translation invariance. With limited data redundancy, DTCWT is also more efficient than Contourlet.

Research on IR-visible image fusion methods focuses on keeping as much complementary information of the two images as possible. With low resolution and poor background definition, IR images are susceptible to noise. The difficulty in IR-visible image fusion research lies in correctly extracting the information on hot targets from the IR image, while keeping the clear background information of the visible image in the fusion image. To keep the hot-target information of the IR image, Li et al. [22] proposed a fusion method based on PCA (principal component analysis). After image decomposition through Contourlet, the low-frequency components are fused through a strategy based on PCA to keep the IR image structure, while the high-frequency components are fused through the average gradient method. In the image fused by this method, the information on hot targets is relatively complete, but the information on background details is blurred. To better keep the information on background details of the visible image, Liu et al. [23] proposed an image fusion method combining PCNN (pulse-coupled neural networks) with IHS transform. In this method, the image contrast is enhanced through IHS transform, low-frequency components are fused through a rule based on an adaptive threshold, and high-frequency components are fused through PCNN. Although this method obtains higher contrast of the fusion image and rich information on background details, some targets of interest in the image are still not clear enough. Li et al. [24] proposed a fusion method based on a guidance filter (GFF) that at first breaks down the source image into a basic layer and a detail layer, and then fuses the two layers through the GFF-based weighted average method. Although these methods improve the quality of the fusion image more or less, they fail to give attention to both the target indication of the IR image and the spectral features of the visible image, and they leave anti-noise performance out of consideration.

This paper proposes an IR-visible image fusion method introducing region segmentation into the DTCWT domain. After the quick registration of the IR and visible images, segment the IR image by significance into a region of interest (ROI) and a background region, so that the ROI can still be correctly extracted from an IR image with low SNR. Then decompose the images into high-frequency and low-frequency components through DTCWT [13,14]. For low-frequency components, fuse them in the ROI and background region according to different fusion rules and in line with the segmentation result, so that the structure of the IR source image can be better kept and the target indication of the fusion image can be highlighted. For high-frequency components, assign a weight to every region according to the richness of its detail information, fuse the high-frequency components through a 2D edge matrix based on the region weights and segmentation results, and then differentiate meaningful detail information from noise through a shrinkage function to remove the noise. In this way, the detail information in the visible source image can be better kept, and noise effectively suppressed. Finally, obtain the fusion image through the DTCWT inverse transform.

2. Fusion System on the Airborne Photoelectric Platform


The airborne photoelectric platform is a kind of photoelectric reconnaissance equipment loaded on UAVs and other aircraft. As shown in Figure 1, the airborne photoelectric platform fusion system adopted in this paper mainly consists of a visible light camera, an infrared thermal imager (IF), a laser range finder (LRF), a gyro-stabilized platform, an image processing module, a recorder, and a wireless link. The visible light camera and the infrared thermal imager are used to obtain images. The LRF measures the distance between the airborne photoelectric platform and the ground target, which is used to locate the ground target. The gyroscope controls the steering of the cameras through the cooperation of the servo control system and the image processing module to isolate the disturbance of the UAV, so that the images obtained by the infrared and visible light cameras mounted on the UAV are clear and stable. The recorder records the original image video taken by the infrared camera and the visible light camera. The recorded original images are used as a reference when the subsequent fusion accuracy is not high enough, when the fusion effect is not good enough, or when the visible and infrared image target characteristics are already highlighted without the need for complementary information fusion. The airborne photoelectric platform is connected with the ground station through the wireless link. The platform accepts ground station control commands via the wireless link, and transfers the images and platform control parameters to the ground station.

The airborne photoelectric platform is used to obtain images, and the ground station is used to observe the visible or infrared images and to search for the target under digital or manual control. After the target is detected via the platform, it is locked to the FOV (field of view) center; the camera direction is controlled through the coordination of the servo control system and the image processing module to keep the target at the FOV center; the image processing module then performs image registration and fusion on the GPU, after which the target can be observed, tracked, and imaged.

Figure 1. Structure of the airborne photoelectric platform.

It is hard to avoid vibration of the airborne photoelectric platform in the imaging process as a result of UAV flight. The platform vibration can lead to the distortion (in terms of translation, angle, and scale) of the images captured by the IR and visible light sensors. Therefore, the images shall be accurately registered before being fused. The operating range of airborne photoelectric image fusion systems is usually at least several thousand meters. As a result of harsh shooting conditions, the imaging quality is affected by airframe vibration, atmospheric disturbance, defocus blur, haze, and other factors, so that the obtained aviation images are seriously blurred with bad contrast.


On the other hand, the frequent use of this system to identify and track targets requires superior target indication in the fusion images, which raises a higher requirement for image fusion. By considering the image information on multiple scales, the multiresolution-based fusion method locks richer information into the fusion images and thus is applicable to aviation image fusion. To obtain fusion images with highlighted targets and a clear-cut scene environment, this fusion method needs strategy improvement and anti-noise consideration. Therefore, this paper proposes a fusion strategy based on ROI in the DTCWT domain, which highlights the target indication of IR images in the low-frequency components and the detail information of visible images in the high-frequency components. The method also fully considers anti-noise performance to effectively suppress the influence of noise on the quality of fusion images.

3. Fast Registration of IR and Visible Images

The airborne photoelectric platform is always moving in the imaging process. The vibration and angular variation caused by flight, and the scale variation resulting from the error of focusing, render the captured IR and visible images out of full alignment. Thus, registration shall precede fusion. As the IR and visible images captured by the airborne photoelectric platform are distorted (in terms of translation, angle, and scale), the registration algorithm shall be provided with rotation invariance and scale invariance. Moreover, it shall be efficient in operation to meet the engineering demand.
In this paper, optical alignment is used to register the IR and visible images. As the resolution of a visible image is often higher than that of an IR image, the former shall be lowered to the same resolution as the latter. Further, the IR and visible channels are designed to share the same optical axis and path, which is used to ensure a common LOS (line of sight) between the optical paths. However, in the processes of manufacture, adjustment, and installation, some deviation in the sensor installation location will render the two FOV centers out of complete alignment. To solve this problem, the star method is used to mark and align the FOV centers of the two images to obtain their direct affine transformation matrix quickly, thus realizing the fast registration of IR and visible images.

The principle of the star method is shown in Figure 2, where the star in the visible light FOV is illustrated in Figure 2a,c, while the star in the IR FOV is illustrated in Figure 2b,d.

Figure 2. Principle of the star method. (a) Visible star point; (b) Infrared star point; (c) Visible star points; (d) Infrared star points.

When a star is observed through both visible light and IR light, it is found at the center of the visible light FOV, but somewhat off the IR FOV center. The translation between the two images can be derived from this deviation. Furthermore, when the star is turned to three different positions, the triangle formed by the three star points observed in the visible light FOV will deviate from that in the IR FOV. In this case, the degrees of rotation and zooming between the two images can be determined through the direct relationship between the two triangles. The registration effect is shown in Figure 3.

This registration method can register the two images as soon as the data are read out of the sensors, taking only 2 ms. With simple operation, efficient computation, high registration accuracy (the registration precision can reach the pixel level), and good stability (only the error matching frame will be changed when the zoom lens group is switched), it can satisfy the registration demand of the image fusion system.
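As an illustration of this registration step, the sketch below derives the direct affine transformation matrix from three corresponding star-point centers and warps the IR image into the visible frame. It is a minimal sketch assuming OpenCV; the point coordinates are purely illustrative placeholders, not values from the platform calibration.

```python
import cv2
import numpy as np

# Centers of the three star points measured in each FOV (illustrative values;
# in practice they come from the star calibration described above).
pts_ir  = np.float32([[320.0, 256.0], [389.1, 201.4], [281.6, 170.9]])
pts_vis = np.float32([[512.0, 384.0], [620.5, 300.2], [450.3, 250.8]])

# Direct affine transformation (rotation, scale, translation) from IR to visible.
affine_ir_to_vis = cv2.getAffineTransform(pts_ir, pts_vis)

def register_ir(ir_image, vis_shape):
    """Warp the IR image into the (resolution-matched) visible image frame."""
    h, w = vis_shape[:2]
    return cv2.warpAffine(ir_image, affine_ir_to_vis, (w, h))
```

Because the matrix is fixed once the star calibration is done, the per-frame cost reduces to a single warp, which is consistent with the millisecond-level registration time reported above.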


Figure 3. Registration effect. (a) Infrared (IR) image; (b) Visible image; (c) Registered image.


4. IR-Visible Image Fusion Method Combined with Region Segmentation

The flow chart of this algorithm is shown in Figure 4. Firstly, segment the region in the IR image by significance to obtain its ROI and background region. Then map the segmentation result to the visible light image. Transform the IR and visible images through DTCWT. Every transform produces two low-frequency coefficients $\{L^{l,1}, L^{l,2}\}$ and high-frequency coefficients in six directions $\{H^{l,\theta},\ \theta = \pm 15^{\circ}, \pm 45^{\circ}, \pm 75^{\circ}\}$, where $l$ is the number of DTCWT decomposition layers. Low-frequency coefficients reflect the essential information of an image, while high-frequency coefficients show its detail information. Low-frequency components are fused in the ROI and background region under different rules, according to the image segmentation result. High-frequency components are fused under a rule that considers the weight assigned to every region by the richness of its detail information. Section 4.1 of this paper introduces the region segmentation method based on ROI, while Sections 4.2 and 4.3 describe the low-frequency and high-frequency fusion rules, respectively.
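The decomposition step can be sketched with the open-source `dtcwt` Python package, as below (an assumption; the paper's own implementation is in Matlab). Note that the package exposes a single combined lowpass band per decomposition rather than the two per-tree low-frequency coefficients $\{L^{l,1}, L^{l,2}\}$ written above, so the sketch only approximates the paper's notation.

```python
import numpy as np
import dtcwt  # pip install dtcwt

def dtcwt_decompose(image, nlevels=4):
    """Forward DTCWT: returns the transform object and a pyramid whose
    highpasses[l] holds six complex sub-bands oriented at +-15, +-45, +-75 deg."""
    transform = dtcwt.Transform2d()
    pyramid = transform.forward(image.astype(np.float64), nlevels=nlevels)
    # pyramid.lowpass       -> real low-frequency band
    # pyramid.highpasses[l] -> complex array of shape (H_l, W_l, 6)
    return transform, pyramid

def dtcwt_reconstruct(transform, lowpass, highpasses):
    """Inverse DTCWT from fused low- and high-frequency coefficients."""
    return transform.inverse(dtcwt.Pyramid(lowpass, highpasses))
```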

Figure 4. Fusion algorithm flowchart.

4.1. Image Region Segmentation Based on ROI

Because the IR image provides information on object heat radiation connected with good target indication, this paper first segments the region in the IR image to obtain the ROI and background region, and then fuses the IR and visible images according to the region features. In recent years, image segmentation has also been a major research focus in the image processing field. Methods such as region growing [25,26] and K-means [27,28] are commonly used in IR image segmentation, but are not very effective for aviation IR images with low SNR and complex background information.


This paper proposes a region segmentation strategy based on ROI, which at first extracts significant features from the IR image and then segments the image according to the significance result. After the significance extraction, the information on a hot target will be enhanced and the background information will be blurred, allowing a stronger contrast for the entire IR image. This strategy can effectively suppress the influence of noise on IR image segmentation.

Cheng et al. [29] proposed an ROI detection method based on global contrast. Aiming to separate a large target area from its surroundings, their method is superior to local contrast methods, which show high significance only near contours, and it processes images very quickly. By using the color contrast between image pixels to define the significance value, a pixel-significance detection method based on histogram contrast (HC) was proposed. The significance of pixel $I_c$ in the image $I$ is defined as:

$$S_{hc}(I_c) = \sum_{\forall I_i \in I} \mathrm{Dis}(I_c, I_i) \qquad (1)$$

where $\mathrm{Dis}(I_c, I_i) = \| I_c - I_i \|$ is the color distance of $I_c$ and $I_i$ in the Lab space, indicating the color difference between $I_c$ and $I_i$. Then, Equation (1) can be rewritten as:

$$S_{hc}(I_c) = S_{hc}(a_c) = \sum_{j=1}^{n} f_j \, \mathrm{Dis}(a_c, a_j) \qquad (2)$$

where $a_c$ is the color value of pixel $I_c$, $n$ is the total number of colors contained in the image, and $f_j$ is the occurrence probability of $a_j$ in the image.

This paper applies the above method to the significance extraction and segmentation of an IR image. In doing so, Equation (2) is changed to calculate the significance of every gray level. The extraction of significance from an IR image is mainly intended to highlight the information on a hot target and to clearly distinguish it from background information. To increase the calculation speed, this paper proposes to quantize the gray values 0–255 to 0–15. In Equation (2) there is then $n = 16$, showing that a gray scale of 16 levels can meet the requirement of hot-target significance detection in the IR image. The significance map quantified in this way may have some blemishes in the form of gray-value jumping, but this can be improved through smoothing. Convert the significance value of every gray level into the weighted average significance value of similar gray levels. Choose the significance values of the 4 adjacent gray levels to improve the significance of gray level $a$ through Equation (3):

$$S_{hc}'(a) = \frac{1}{3L} \sum_{i=1}^{4} (L - \mathrm{Dis}(a, a_i)) \, S(a_i) \qquad (3)$$

where $L = \sum_{i=1}^{4} \mathrm{Dis}(a, a_i)$ is the distance between gray level $a$ and its four nearest gray levels $a_i$. Thus, in the gray space, a bigger weight will be assigned to the gray levels closer to $a$, and similar gray levels will be assigned similar significance values. In this way, the flaws caused by quantization can be reduced.

After obtaining the ROI through significance detection, we segment the region in the IR image with the help of the ROI, take the initial segmentation result obtained after binarization of the significance map as the initial image, and conduct iterative operations through GrabCut [30] to further optimize the segmentation result. Every iterative operation is followed by a corrosion and dilation operation to obtain a new target region, which, in turn, is followed by the next iteration. In this paper, no more than four iterations are allowed. The ROI-based region segmentation result in three different situations is shown in Figure 5. Quad is an infrared image of multiple targets in an urban environment, Airport is a real aerial image, and Noisy UNCamp is an infrared image with a complex field background and artificially added Gaussian noise.
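A minimal sketch of this significance extraction and ROI refinement is given below, assuming NumPy and OpenCV. The quantization to 16 gray levels, the histogram-contrast score of Equation (2), the smoothing of Equation (3), and the GrabCut refinement follow the description above; the binarization threshold and the strong-foreground seeding are assumptions not specified in the paper.

```python
import numpy as np
import cv2

def hc_saliency_16(ir_gray):
    """Histogram-contrast significance on an IR image quantized to 16 gray
    levels (Eq. (2)), followed by the 4-nearest-level smoothing of Eq. (3)."""
    q = ir_gray.astype(np.int32) // 16                  # 0..255 -> 0..15
    f = np.bincount(q.ravel(), minlength=16) / q.size   # occurrence probability f_j
    levels = np.arange(16, dtype=np.float64)
    dist = np.abs(levels[:, None] - levels[None, :])    # Dis(a_c, a_j)
    s = (f[None, :] * dist).sum(axis=1)                 # Eq. (2) per gray level

    s_smooth = np.empty_like(s)
    for a in range(16):                                 # Eq. (3)
        nearest = np.argsort(dist[a])[1:5]              # 4 closest gray levels
        L = dist[a, nearest].sum()
        s_smooth[a] = ((L - dist[a, nearest]) * s[nearest]).sum() / (3.0 * L)
    sal = s_smooth[q]
    return (sal - sal.min()) / (sal.max() - sal.min() + 1e-12)

def segment_roi(ir_bgr, n_iter=4):
    """Binarize the significance map and refine the ROI with GrabCut iterations,
    each followed by corrosion (erosion) and dilation."""
    sal = hc_saliency_16(cv2.cvtColor(ir_bgr, cv2.COLOR_BGR2GRAY))
    mask = np.where(sal > sal.mean(), cv2.GC_PR_FGD, cv2.GC_PR_BGD).astype(np.uint8)
    mask[sal > 0.8] = cv2.GC_FGD                        # assumed strong-target seed
    bgd, fgd = np.zeros((1, 65), np.float64), np.zeros((1, 65), np.float64)
    kernel = np.ones((3, 3), np.uint8)
    fg = None
    for _ in range(n_iter):                             # at most four iterations
        cv2.grabCut(ir_bgr, mask, None, bgd, fgd, 1, cv2.GC_INIT_WITH_MASK)
        fg = np.where((mask == cv2.GC_FGD) | (mask == cv2.GC_PR_FGD), 255, 0).astype(np.uint8)
        fg = cv2.dilate(cv2.erode(fg, kernel), kernel)  # corrosion then dilation
        mask = np.where(fg > 0, cv2.GC_PR_FGD, cv2.GC_PR_BGD).astype(np.uint8)
    return fg > 0                                       # boolean ROI mask
```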


Figure 5. Region segmentation result based on region of interest (ROI): (a) infrared (IR) image; (b) Significance map of the IR image; (c) Region segmentation result based on ROI.

In Figure 5, it can be clearly seen that in the segmentation result, the ROI of the IR image has been completely extracted, and the correct segmentation result can be obtained for images with complex backgrounds and low SNR (signal-to-noise ratio). Thus, this lays a foundation for the subsequent fusion to highlight the target indication of the IR image. Table 1 shows the calculation time of the region segmentation method for the three sets of images in Figure 5. The experiment was completed in the Microsoft Visual 2010 environment, where the computer CPU is an Intel Core i3-2120, 3.3 GHz, with 4 GB of RAM.

Table 1. The calculation time of the region segmentation method based on ROI.

Image         Quad        Airport     Noisy UNCamp
Resolution    640 × 496   640 × 436   360 × 270
Time/s        0.036       0.029       0.019

4.2. Low-Frequency Fusion Rule

After the multiresolution decomposition, the low-frequency components of an image reflect the essential information of the image, while the high-frequency components show its detail information. Most IR-visible image fusion methods based on multiresolution decomposition focus on how to design the fusion rule of the high-frequency components; the low-frequency components are mainly fused through simple weighted averaging or arithmetic averaging. For aviation IR-visible image fusion, it is important to acquire the complementary information of the two images, namely, the information on a hot target in the IR image and on clear edges, background, and other details of the visible image. Therefore, the ability to acquire complementary information for the fusion image depends more on the fusion effect of the low-frequency information. The majority of existing low-frequency fusion rules are based on simple weighted averaging and arithmetic averaging, which bring down the contrast of the IR hot-target information in the fusion image. If the target brightness is enhanced by increasing the low-frequency coefficient weight of the IR image through weight assignment, part of the detail information in the visible image will be lost. To better preserve the information on a high-brightness target in the IR image and the information on clear background details of the visible image, this paper proposes a new low-frequency fusion rule based on ROI region segmentation. At first, segment the IR image into a ROI and a background region by using the image segmentation method described in Section 4.1. Then map the segmentation result to the visible light image, and design different fusion rules to fuse the low-frequency components in the ROI and background region of the two images.

To achieve the goal of low-frequency component fusion, we shall keep as much information as possible on the hot-target brightness of the IR image in the ROI, and as much information as possible on the details and scene of the visible image in the background region.

Therefore, the low-frequency coefficient of the IR image is directly chosen as the low-frequency fusion coefficient in the ROI, as expressed by Equation (4):

$$L_F^{l,i}(x, y) = L_{ir}^{l,i}(x, y), \quad (x, y) \in \mathrm{ROI} \qquad (4)$$

where $L_F^{l,i}$ is the low-frequency coefficient of the fusion image ($i = 1, 2$) and $L_{ir}^{l,i}$ is the low-frequency coefficient of the IR image at layer $l$.

For the background region, an adaptive weighted fusion rule based on region variance is proposed. Region variance shows how fast a pixel in the region changes. A bigger region variance means a greater change of the pixel, higher contrast, and richer information in the region. Therefore, pixels with a bigger region variance shall have a greater weight in the fusion image. According to the above principle, an image fusion rule based on the adaptive weighting of region variance can be established:

$$L_F^{l,i}(x, y) = \omega_{ir} L_{ir}^{l,i}(x, y) + \omega_{vis} L_{vis}^{l,i}(x, y) \qquad (5)$$

where $L_F^{l,i}$ is the low-frequency coefficient of the fusion image ($i = 1, 2$), $L_{ir}^{l,i}$ is the low-frequency coefficient of the IR image at layer $l$, and $L_{vis}^{l,i}$ is the low-frequency coefficient of the visible image at layer $l$. The weight $\omega$ is determined by both the region variance $\sigma$ and the correlation coefficient $r$, whose definitions are shown in Equations (6) and (7), respectively:

$$\sigma^2 = \frac{1}{M \times N} \sum_{i=1}^{M} \sum_{j=1}^{N} \left( I(i, j) - \overline{x} \right)^2 \qquad (6)$$

$$r = \frac{\displaystyle\sum_{i=1}^{M} \sum_{j=1}^{N} (I_{ir}(i, j) - \overline{x}_{ir})(I_{vis}(i, j) - \overline{x}_{vis})}{\sqrt{\left( \displaystyle\sum_{i=1}^{M} \sum_{j=1}^{N} (I_{ir}(i, j) - \overline{x}_{ir})^2 \right) \left( \displaystyle\sum_{i=1}^{M} \sum_{j=1}^{N} (I_{vis}(i, j) - \overline{x}_{vis})^2 \right)}} \qquad (7)$$

where $\omega_{vis}$ is the weight of the low-frequency coefficient of the visible image; $\omega_{ir}$ is the weight of the low-frequency coefficient of the IR image; $\sigma$ is the region variance in a window whose size is chosen in this paper as 3 × 3; $r$ is the correlation coefficient; $I_{ir}$ and $I_{vis}$ are the IR image and the visible image, respectively, of size $M \times N$; and $\overline{x}$ is the average gray level of the image. The weights in Equation (5) are assigned as below:

$$\omega_{vis} = \begin{cases} 0.5, & r \geq 0.5 \\ 1 - r, & r < 0.5 \ \&\ \sigma_{vis}^2 \geq \sigma_{ir}^2 \\ r, & r < 0.5 \ \&\ \sigma_{vis}^2 < \sigma_{ir}^2 \end{cases}, \qquad \omega_{ir} = 1 - \omega_{vis} \qquad (8)$$

where $\omega_{vis}$ is the weight of the low-frequency coefficient of the visible image, $\omega_{ir}$ is the weight of the low-frequency coefficient of the IR image, $\sigma$ is the region variance, and $r$ is the correlation coefficient.
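The low-frequency rule of Equations (4)–(8) could be sketched as follows (an illustrative implementation, not the authors' code). Here the correlation coefficient r is computed globally between the two low-frequency bands, the 3 × 3 region variance is computed with a box filter, and the ROI mask is assumed to have been resized to the resolution of the lowpass band.

```python
import numpy as np
from scipy.ndimage import uniform_filter

def local_variance(band, size=3):
    """Region variance in a size x size window (Eq. (6)) at every pixel."""
    mean = uniform_filter(band, size)
    return uniform_filter(band * band, size) - mean * mean

def fuse_lowpass(low_ir, low_vis, roi_mask):
    """Low-frequency fusion rule of Eqs. (4)-(8); roi_mask is a boolean map of
    the ROI already resized to the resolution of the lowpass bands."""
    # Correlation coefficient r between the two low-frequency bands (Eq. (7)).
    a, b = low_ir - low_ir.mean(), low_vis - low_vis.mean()
    r = float((a * b).sum() / np.sqrt((a * a).sum() * (b * b).sum()))

    var_ir, var_vis = local_variance(low_ir), local_variance(low_vis)
    if r >= 0.5:
        w_vis = np.full_like(low_vis, 0.5)
    else:                                      # Eq. (8), pixel-wise on the variances
        w_vis = np.where(var_vis >= var_ir, 1.0 - r, r)
    w_ir = 1.0 - w_vis

    fused = w_ir * low_ir + w_vis * low_vis    # background rule, Eq. (5)
    fused[roi_mask] = low_ir[roi_mask]         # ROI keeps the IR coefficient, Eq. (4)
    return fused
```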


4.3. High-Frequency Fusion Rule

After the multiresolution decomposition, the high-frequency components reflect the information on edges, contours, and other details of the decomposed image. To retain as much information on the details and texture of the source image as possible, this paper proposes a new high-frequency fusion rule based on region weighting. Through the above image segmentation method based on ROI, the IR and visible images are segmented into $n$ regions, namely, $R = \{r_1, r_2, \ldots, r_n\}$. The region weight $Q$ is defined as:

$$Q = \{ Q_{r_1}, Q_{r_2}, \cdots, Q_{r_n} \} \qquad (9)$$

The region weight $Q$ corresponds to every small region $r_i$ ($i = 1, 2, \ldots, n$), and is calculated through the following equation:

$$Q(r_i^{\theta}) = \frac{1}{|r_i^{\theta}|} \sum_{\forall l,\, (x, y) \in r_i^{\theta}} \left| H^{l,\theta}(x, y) \right| \qquad (10)$$

where $H^{l,\theta}(x, y)$ is the high-frequency coefficient derived from the DTCWT image decomposition, $l$ is the number of decomposition levels ($l$ = 1, 2, 3, 4), $\theta$ is the decomposed directional sub-band ($\theta = \pm 15^{\circ}, \pm 45^{\circ}, \pm 75^{\circ}$), and $|r_i^{\theta}|$ denotes the size of region $r_i^{\theta}$.

The fusion rule $f$ is defined according to the $Q$ value calculated through Equation (10). According to the fusion rule $f$, to ensure richer texture information in every subregion of the fusion image, the source whose region has the bigger $Q_{r_i}$ among the IR and visible images is chosen to calculate the fusion coefficient of region $r_i$, and the average of the high-frequency coefficients in that region is taken as the high-frequency coefficient of region $r_i$. To ensure that the edge details of every subregion in the fusion image are more remarkable and clearer, the binary edge matrix $S^{l,\theta}$ is introduced as a weight into the calculation of the high-frequency coefficient. To ensure the edge continuity and integrity of every subregion, $S^{l,\theta}$ shall be obtained through the dilation operation and 2D average filtering of the ROI-based image segmentation result described in Section 4.1 of this paper. The high-frequency coefficient of the fusion image is rewritten as:

$$H_F^{l,\theta} = (1 - S^{l,\theta}) \sum_{R} f\left(H_{ir}^{l,\theta}, H_{vis}^{l,\theta}\right) + S^{l,\theta} \max\left(H_{ir}^{l,\theta}, H_{vis}^{l,\theta}\right) \qquad (11)$$
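A per-level sketch of the region-weighted selection of Equations (10) and (11) is shown below (illustrative only). Two simplifications are assumed: the region weight Q is computed per level rather than summed over all levels l, and the fusion rule f and the max(·) in Equation (11) are read as selecting the coefficient with the larger magnitude, since the DTCWT coefficients are complex.

```python
import numpy as np

def fuse_highpass(h_ir, h_vis, region_labels, edge_matrix):
    """Region-weighted high-frequency fusion for one DTCWT level (Eqs. (10)-(11)).
    h_ir / h_vis: complex arrays of shape (H, W, 6); region_labels: integer map of
    the n segmented regions at this resolution; edge_matrix: binary edge weight S."""
    fused = np.empty_like(h_ir)
    for t in range(h_ir.shape[2]):                        # six directional sub-bands
        band_ir, band_vis = h_ir[:, :, t], h_vis[:, :, t]
        selected = np.empty_like(band_ir)
        for region in np.unique(region_labels):
            idx = region_labels == region
            q_ir = np.abs(band_ir[idx]).mean()            # region weight Q, Eq. (10)
            q_vis = np.abs(band_vis[idx]).mean()
            # f: keep the coefficients of the source whose region weight is larger.
            selected[idx] = band_ir[idx] if q_ir >= q_vis else band_vis[idx]
        # On edges, take the coefficient with the larger magnitude (max term of Eq. (11)).
        larger = np.where(np.abs(band_ir) >= np.abs(band_vis), band_ir, band_vis)
        fused[:, :, t] = (1 - edge_matrix) * selected + edge_matrix * larger
    return fused
```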

In the image fusion process, edge ups and downs may lead to distortion of the fusion image. The phase of a DTCWT coefficient corresponds to the exact location of its directional feature in the region. Therefore, to reduce the distortion caused by edge ups and downs, the high-frequency coefficients derived from the IR-visible image DTCWT are used to calculate a unit vector and the expected high-frequency coefficient of the fusion image with an improved average phase, as shown in the following rewritten form:

$$\widetilde{H}_F^{l,\theta}(x, y) = \frac{H_{ir}^{l,\theta}(x, y) + H_{vis}^{l,\theta}(x, y)}{\left| H_{ir}^{l,\theta}(x, y) + H_{vis}^{l,\theta}(x, y) \right|} \left| H_F^{l,\theta}(x, y) \right| \qquad (12)$$

The environment where the aviation images are taken is very complex, so the high-frequency information of these images often contains some noise. This paper uses the shrinkage function proposed by Loza et al. [31], derived as a maximum a posteriori estimation, to reduce the influence of noise on the high-frequency information. Suppose the shrinkage function is $T_s$. To highlight the contrast of image details, a given value $C$ ($C > 1$) is set to increase the high-frequency coefficient. But this simple method will amplify the remaining noise as well. To tackle this problem, we only increase the high-frequency coefficients for those pixel points capable of being connected into a large region, by defining a binary matrix $M^{l,\theta}$. When $|H_F^{l,\theta}| > \tau$, there will be $M^{l,\theta} = 1$. Then the isolated points whose value is 1 are removed from the matrix $M^{l,\theta}$. Finally, the high-frequency coefficient of the fusion image becomes:

$$H_F^{l,\theta} = \left( C^{l,\theta} M^{l,\theta} + (1 - M^{l,\theta}) \right) T_s^{l,\theta}\left( \widetilde{H}_F^{l,\theta} \right) \qquad (13)$$

The low-frequency and high-frequency coefficients determined through the above fusion rules undergo the DTCWT inverse transform to finally obtain the fusion image.

5. Analysis of Experimental Results

To verify the correctness and effectiveness of the above theoretical analysis and of the proposed method, this paper tests a large number of IR and visible images. For lack of space, only three groups of representative images are listed here for experimental verification. The first group of tested images is the "Quad" pair in the IR-visible image fusion dataset, the second group is a real aerial image pair named "Airport", and the third group is the "UNcamp" pair in the image fusion dataset, to which white Gaussian noise has been added. (Source of the IR and visible image dataset: http://www.imagefusion.org/.) The experiment was done in the Matlab 2014a environment, where the computer CPU is an Intel Core i3-2120, 3.3 GHz, with 4 GB of RAM.


To further demonstrate the performance of the proposed method, this paper chooses some typical image fusion methods used in recent years for comparison. The comparison methods include: the classic multiresolution decomposition image fusion method DWT [12]; the PCA-MST method proposed by Li et al. [22], which performs image multiresolution decomposition in the Contourlet transform domain and determines the low-frequency fusion coefficient through PCA and the high-frequency fusion coefficient through the average gradient method; the IHS-PCNN method proposed by Liu et al. [23], which enhances the image contrast through IHS transform and fuses low-frequency components through the adaptive threshold rule and high-frequency components through PCNN; and the GFF method proposed by Li et al. [24], which breaks down the source image into a basic layer and a detail layer and then fuses the two layers by using the GFF-based weighted average method.

5.1. Evaluation Indicators

To fully demonstrate the computational efficiency and the fusion accuracy of our method, we objectively evaluate the fusion effect of our method and the other comparative methods by using the running time together with the image standard deviation (SD), information entropy (IE), average gradient (AG), and edge-preserving degree (QAB/F). SD is the measure of dispersion of relative gray values; the bigger the SD, the better the image contrast and the higher the fusion quality. IE is the measure of the amount of image information; the bigger the IE, the richer the information contained in the fusion image. AG is the measure of the expressive ability of minor image details; the bigger the AG, the stronger their expressive ability. QAB/F is the measure of how well the fusion image preserves the information on edge details of the source images; the bigger the QAB/F, the higher the fusion quality.

5.2. Experiment of Image Fusion Contrast

5.2.1. Experiment 1: Quad Image Fusion Comparison

This paper at first chooses images from the representative Quad dataset to verify the effectiveness and fusion effect of the proposed method. The size of Quad is 640 × 496. The Quad infrared and visible images are shown in Figure 6a,b, respectively. In the IR image, pedestrians, signal lights, and running cars can be seen clearly, while other background information is quite blurred. In contrast, in the visible light image of the same scene, the pedestrians in the dark can't be seen clearly, but the billboard information, completely invisible in the IR image, is legible here. The Quad images are then fused by using the proposed method and the four other methods, including DWT [12], PCA-MST [22], IHS-PCNN [23], and GFF [24]. The results of the fusion comparisons are shown in Figure 6. The fusion details of important complementary information (billboards and pedestrians) in the IR and visible images are compared in Figure 7.

It can be subjectively seen from Figures 6 and 7 that the five fusion methods can keep most of the important complementary information of the source images, and fuse the source image information very well. However, the fusion image obtained through DWT has a blurred target edge and low image contrast, and loses much background detail information of the visible-light source image. The PCA-MST method can keep all of the target information of the IR image, but can't capture all the scene information of the visible image, so that the billboard words are very blurred. The GFF and IHS-PCNN methods, as a whole, can keep all the target information of the IR image and the background information of the visible image. However, the fusion image obtained through IHS-PCNN shows obvious zoning and partial distortion, and the fusion image formed through GFF has insufficient scene detail information and bad contrast, compared with the method proposed by this paper. Taken together, with high image contrast, well-preserved target brightness, high background contrast, and rich detail information, the fusion image obtained through the proposed method has captured the target information of the IR image and the background information of the visible image. Thus, it has achieved a good visual effect.


Figure 6. Quad image fusion comparisons: (a) IR image; (b) Visible image; (c) discrete wavelet transform (DWT) [12]; (d) PCA-MST [22]; (e) guidance filter (GFF) [24]; (f) IHS-PCNN [23]; (g) The proposed method.

Figure 7. Fusion detail comparisons: (a) IR image; (b) Visible image; (c) DWT [12]; (d) PCA-MST [22]; (e) GFF [24]; (f) IHS-PCNN [23]; (g) The proposed method.

Table 2 evaluates the performance of the various fusion methods illustrated in Figure 6, by using the computation time and four objective evaluation indicators: standard deviation (SD), information entropy (IE), average gradient (AG), and edge-preserving degree (QAB/F). Judging from the evaluation indicators in Table 2, all the indicators of the fusion image obtained through the proposed method are higher than those of the other four methods, except for SD, which is slightly lower than that of IHS-PCNN. This is mainly because the fusion image obtained through IHS-PCNN is partly distorted, leading to a contrast change in the distorted region and an increase in image SD. Therefore, the evaluation result of the proposed method is the best in general.

Table 2. Evaluation of Quad IR-visible image fusion performance.

Method          SD        IE       QAB/F    AG       Time/s
DWT [12]        21.4123   5.8733   0.2108   2.6492   1.943
PCA-MST [22]    31.2098   6.3987   0.2974   3.7382   1.019
GFF [24]        32.0211   6.4520   0.2752   3.2129   1.373
IHS-PCNN [23]   39.5745   6.6929   0.3269   3.6875   4.693
Proposed        35.0284   6.8894   0.5645   5.3487   0.977

Note: Bold values indicate the best result for each objective criterion.
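For reference, three of the objective indicators defined in Section 5.1 can be computed as below, using their standard definitions (an assumption, since the paper does not give the formulas); QAB/F is omitted because it additionally requires the edge-strength and orientation maps of both source images.

```python
import numpy as np

def standard_deviation(img):
    """SD: dispersion of gray values (image contrast)."""
    return float(img.std())

def information_entropy(img):
    """IE: Shannon entropy of the 8-bit gray-level histogram."""
    p = np.bincount(img.astype(np.uint8).ravel(), minlength=256) / img.size
    p = p[p > 0]
    return float(-(p * np.log2(p)).sum())

def average_gradient(img):
    """AG: mean magnitude of the horizontal and vertical gray-level differences."""
    img = img.astype(np.float64)
    gx = img[:-1, 1:] - img[:-1, :-1]
    gy = img[1:, :-1] - img[:-1, :-1]
    return float(np.sqrt((gx * gx + gy * gy) / 2.0).mean())
```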


5.2.2. Experiment 2: Aviation IR-Visible Image Fusion Comparison

Since the proposed method is expected to fuse aviation images taken by IR and visible light sensors on airborne photoelectric platforms, a set of IR and visible-light runway images (named "Airport" in this paper, with the size of Airport being 640 × 436) taken by a UAV photoelectric platform was chosen for Experiment 2. The infrared image was taken by a long-wave infrared sensor; the pixel size was 15 µm, the resolution 640 × 512, and the focal length f = 235 mm. The visible image was taken by the visible light sensor; the pixel size was 4.65 µm, the resolution was 1024 × 768, and the focal length f = 72.9 mm. The experimental weather conditions were clear, and the flight height AGL was 2.3 km. Vibration and flying angle change in UAV flight can cause translation, angular rotation, scale zooming, and other forms of distortion of the images captured by the IR and visible light sensors, so that the whole fusion method may fail. Thus, these images must be registered before being fused. This paper registered the IR and visible images through the method described in Section 3, and then fused them for experimental purposes. The IR and visible images taken by the photoelectric platform are shown in Figure 8a,b, respectively. It is observed that the runway in the IR image is well-defined, and the marking information on the runway is very clear in the visible image. The Airport images were fused through the proposed method and the four other methods, including DWT [12], PCA-MST [22], IHS-PCNN [23], and GFF [24], separately. The results of the fusion comparisons are shown in Figure 8.

Figure 8. Airport image fusion comparisons: (a) IR image; (b) Visible image; (c) Registered image; (d) DWT [12]; (e) PCA-MST [22]; (f) GFF [24]; (g) IHS-PCNN [23]; (h) The proposed method.

It can be clearly seen from Figure 8 that, in the fusion images obtained through DWT and PCA-MST, ghosting and distortion occur in some areas, and most of the visible-image information is lost. In the fusion image obtained through IHS-PCNN, almost all of the detail information of the visible-light source image is lost, the marks on the runway are blurred, and obvious zoning is seen. Only the GFF and the proposed method have a good fusion effect, but the GFF provides unclear information on the runway marking details in the visible image. Taken together, in this experiment, the method proposed by this paper has the best fusion effect. Table 3 gives the computation time and fusion evaluation results of the proposed method and the four other comparative methods. It can also be seen from Table 3 that the proposed method obtains the best SD and AG values, remains competitive in the other indicators, and has a low computation time, confirming its strong fusion performance.


Table 3. Evaluation of aviation IR-visible image fusion performance.

Method          SD        IE       QAB/F    AG       Time/s
DWT [12]        40.3748   5.1037   0.2964   3.0552   1.526
PCA-MST [22]    48.6327   6.3254   0.3978   3.7553   0.847
GFF [24]        49.2637   7.0035   0.6755   3.3247   0.916
IHS-PCNN [23]   45.9653   6.1276   0.4458   3.7514   3.932
Proposed        53.3248   6.9054   0.6326   6.3072   0.852

Note: Bold values indicate the best result for each objective criterion.

5.2.3. Experiment 3: Anti-Noise Performance Experiment

As a UAV flies under volatile conditions and the information on the air-to-ground target background is complex, the images captured by the airborne photoelectric platform often contain noise. To demonstrate the anti-noise performance of the proposed method, white Gaussian noise is added to a group of IR and visible images belonging to the fusion dataset UNCamp, as shown in Figure 9a. The noise-added image set is named Noisy UNCamp in this article, and the size of the images is 360 × 270. The fusion results of the proposed method and the four other comparative methods on the noisy images are compared in Figure 9.
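As a point of reference, the following is a minimal sketch of how zero-mean white Gaussian noise can be added to a source image for this kind of anti-noise test. The noise standard deviation (sigma) and the function name are assumptions; the paper does not state the noise level it used in this section.

```python
# Minimal sketch: add zero-mean white Gaussian noise to a greyscale source image.
# The sigma value is an assumption chosen only for illustration.
import numpy as np

def add_white_gaussian_noise(image, sigma=10.0, seed=None):
    rng = np.random.default_rng(seed)
    noisy = image.astype(np.float64) + rng.normal(0.0, sigma, image.shape)
    return np.clip(noisy, 0, 255).astype(np.uint8)
```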

Figure 9. Noisy UNCamp noise-added image fusion comparisons: (a) IR image; (b) Visible image; (c) DWT [12]; (d) PCA-MST [22]; (e) GFF [24]; (f) IHS-PCNN [23]; (g) The proposed method.

It is observed from Figure 9 that the proposed method has the best anti-noise performance, while the other fusion methods are affected by noise, so that their fusion quality is generally lower. Given the low SNR of the tested images, the proposed method can still keep all the target region information of the source images and richer background information, providing the fusion image with the best visual effect. In Table 4, the fusion performance of the proposed method and the four other comparative methods is evaluated through four objective evaluation indicators. As shown in Table 4, all the indicators of the proposed method are higher than those of the comparative methods, except for IE and AG, which are slightly lower than those of the comparative algorithms. This, however, is because IE measures image information richness and AG measures the expressive ability of image details. The comparative algorithms are susceptible to noise, so their fusion images contain more noise, which increases the apparent detail information in the image and, accordingly, pushes the relevant evaluation indicators up. This means the anti-noise performance of the comparative algorithms is lower than that of the proposed algorithm, whose fusion performance further proves to be excellent.
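For readers unfamiliar with these indicators, the sketch below gives the common textbook definitions of SD (standard deviation), IE (information entropy of the grey-level histogram), and AG (average gradient) used throughout Tables 2-4. These are standard forms from the fusion literature; the authors' exact implementations, and the edge-based QAB/F metric, are not reproduced here.

```python
# Sketch of the usual definitions of three of the objective indicators:
# SD (grey-level standard deviation), IE (histogram entropy in bits), and
# AG (average gradient of the image). Standard forms, not the authors' code.
import numpy as np

def standard_deviation(img):
    return float(np.std(img.astype(np.float64)))

def information_entropy(img, levels=256):
    hist, _ = np.histogram(img, bins=levels, range=(0, levels))
    p = hist / hist.sum()
    p = p[p > 0]                      # drop empty bins before taking the log
    return float(-np.sum(p * np.log2(p)))

def average_gradient(img):
    f = img.astype(np.float64)
    gx = np.diff(f, axis=1)[:-1, :]   # horizontal differences, cropped to match
    gy = np.diff(f, axis=0)[:, :-1]   # vertical differences, cropped to match
    return float(np.mean(np.sqrt((gx**2 + gy**2) / 2.0)))
```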


Table 4. Evaluation of Noisy UNCamp (noise-added) IR-visible image fusion performance.

Method          SD        IE       QAB/F    AG        Time/s
DWT [12]        33.6973   7.0965   0.3575   16.4053   0.960
PCA-MST [22]    37.3488   7.2314   0.3204   16.6699   0.499
GFF [24]        37.9777   7.2664   0.3870   16.6331   0.693
IHS-PCNN [23]   37.2536   7.1973   0.4544   11.2374   3.137
Proposed        39.3742   7.2013   0.6447   15.2238   0.547

Note: Bold values indicate the best result for each objective criterion.

6. Conclusions

To address the problems faced by IR-visible fusion images on airborne photoelectric platforms (such as poor contrast, blurred background details, and susceptibility to noise), this paper proposes a new fusion method combined with region segmentation. In terms of low-frequency components, the method uses significance-guided region segmentation to set up a fusion rule that can highlight the hot-target information of the IR image and keep the background detail information of the visible image. In terms of high-frequency components, this paper proposes a fusion rule based on region weighting. This rule strengthens the weights of those regions with rich detail information in the fusion image according to the self-defined region weights, uses adaptive phases to correct the fusion image distortion induced by edge deformation, effectively retains the detail information of the source images, and introduces a shrinkage function to enhance the anti-noise performance of the fusion method. The experimental results show that the proposed method efficiently extracts complementary information from IR and visible images to generate a fusion image with a clear target, moderate brightness, clear background details, and effective noise suppression. According to the evaluation data, the proposed method has a better fusion effect and somewhat higher fusion accuracy compared with several representative fusion methods used in recent years. With high algorithm stability, the proposed method can be applied to diversified scenes to meet the operational requirements of a fusion system on an airborne photoelectric platform. Thus, it has a high application value.

Author Contributions: Yujia Zuo designed the algorithm and wrote the source code and manuscript; Jinghong Liu and Guanbing Bai contributed to the experiment design and paper writing; Xuan Wang and Mingchao Sun analyzed the experimental results and revised the paper.

Conflicts of Interest: The authors declare no conflict of interest.

References

1. Huang, X.; Netravali, R.; Man, H.; Lawrence, V. Multi-sensor fusion of infrared and electro-optic signals for high resolution night images. Sensors 2012, 12, 10326–10338. [CrossRef] [PubMed]
2. Ma, Y.; Wu, X.; Yu, G.; Xu, Y.; Wang, Y. Pedestrian detection and tracking from low-resolution unmanned aerial vehicle thermal imagery. Sensors 2016, 16, 446. [CrossRef] [PubMed]
3. Jin, B.; Kim, G.; Cho, N.I. Wavelet-domain satellite image fusion based on a generalized fusion equation. J. Appl. Remote Sens. 2014, 8, 080599. [CrossRef]
4. Smeelen, M.A.; Schwering, P.B.W.; Toet, A.; Loog, M. Semi-hidden target recognition in gated viewer images fused with thermal IR images. Inf. Fusion 2014, 18, 131–147. [CrossRef]
5. Zitova, B.; Flusser, J. Image registration methods: A survey. Image Vis. Comput. 2003, 21, 977–1000. [CrossRef]
6. Wu, F.; Wang, B.; Yi, X.; Li, M.; Hao, H.; Zhou, H. Visible and infrared image registration based on visual salient features. J. Electron. Imaging 2015, 24, 053017. [CrossRef]
7. Guo, J.; Yang, F.; Tan, H.; Wang, J.; Liu, Z. Image matching using structural similarity and geometric constraint approaches on remote sensing images. J. Appl. Remote Sens. 2016, 10, 045007. [CrossRef]
8. Chen, Y. Multimodal Image Fusion and Its Applications. Ph.D. Thesis, The University of Michigan, Ann Arbor, MI, USA, 2016.
9. Zhen, J. Image Registration and Image Completion: Similarity and Estimation Error Optimization. Ph.D. Thesis, University of Cincinnati, Cincinnati, OH, USA, 2014.
10. Xu, H.; Wang, Y.; Wu, Y.; Qian, Y. Infrared and multi-type images fusion algorithm based on contrast pyramid transform. Infrared Phys. Technol. 2016, 17, 133–146. [CrossRef]
11. Mukane, S.M.; Ghodake, Y.S.; Khandagle, P.S. Image enhancement using fusion by wavelet transform and laplacian pyramid. Int. J. Comput. Sci. Issues 2013, 10, 122.
12. Zehua, Z.; Min, T. Infrared image and visible image fusion based on wavelet transform. Adv. Mater. Res. 2014, 756–759, 2850–2856.
13. Kingsbury, N.Q. The Dual-Tree Complex Wavelet Transform: A New Technique for Shift Invariance and Directional Filters. In Proceedings of the 8th IEEE Digital Signal Processing Workshop, Bryce Canyon, UT, USA, 9–12 August 1998; pp. 120–131.
14. Selesnick, W.; Baraniuk, R.G.; Kingsbury, N.Q. The dual-tree complex wavelet transform. IEEE Signal Process. Mag. 2005, 22, 123–151. [CrossRef]
15. Do, M.N.; Vetterli, M. The contourlet transform: An efficient directional multiresolution image representation. IEEE Trans. Image Process. 2005, 14, 2091–2106. [CrossRef] [PubMed]
16. Yang, L.; Guo, B.L.; Ni, W. Multimodality medical image fusion based on multiscale geometric analysis of contourlet transform. Neurocomputing 2008, 72, 203–211. [CrossRef]
17. Starck, J.L.; Candes, E.J.; Donoho, D.L. The curvelet transform for image denoising. IEEE Trans. Image Process. 2002, 11, 670–684. [CrossRef] [PubMed]
18. Bhadauria, H.S.; Dewal, M.L. Medical image denoising using adaptive fusion of curvelet transform and total variation. Comput. Electr. Eng. 2013, 39, 1451–1460. [CrossRef]
19. Cunha, A.L.D.; Zhou, J.P.; Do, M.N. The nonsubsampled contourlet transform: Theory, design, and applications. IEEE Trans. Image Process. 2006, 15, 3089–3101. [CrossRef] [PubMed]
20. Xiang, T.; Yan, L.; Gao, R. A fusion algorithm for infrared and visible images based on adaptive dual-channel unit-linking PCNN in NSCT domain. Infrared Phys. Technol. 2015, 69, 53–61. [CrossRef]
21. Yang, Y.; Tong, S.; Huang, S.Y.; Lin, P. Multifocus image fusion based on NSCT and focused area detection. IEEE Sens. J. 2015, 15, 2824–2838. [CrossRef]
22. Li, H.; Liu, L.; Huang, W.; Yue, C. An improved fusion algorithm for infrared and visible images based on multi-scale transform. Infrared Phys. Technol. 2016, 74, 28–37. [CrossRef]
23. Shuo, L.; Yan, P.; Muhanmmad, T. Research on fusion technology based on low-light visible image and infrared image. Opt. Eng. 2016, 55, 123104. [CrossRef]
24. Li, S.; Kang, S.; Hu, J. Image fusion with guided filtering. IEEE Trans. Image Process. 2013, 22, 2864–2874. [CrossRef] [PubMed]
25. Montoya, M.D.G.; Gill, C.; Garcia, I. The load unbalancing problem for region growing image segmentation algorithm. J. Parallel Distrib. Comput. 2003, 63, 387–395. [CrossRef]
26. Dawoud, A.; Netchaev, A. Preserving objects in Markov random fields region growing image segmentation. Pattern Anal. Appl. 2012, 15, 155–161. [CrossRef]
27. Chitade, A.Z.; Katiyar, D.S.K. Colour based image segmentation using K-means clustering. Int. J. Eng. Sci. Technol. 2010, 2, 5319–5325.
28. Sulaiman, S.N.; Isa, N.A.M. Adaptive fuzzy-K-means clustering algorithm for image segmentation. IEEE Trans. Consum. Electron. 2010, 56, 2661–2668. [CrossRef]
29. Chen, M.-M.; Zhang, G.; Mitra, N.J.; Huang, X.; Hu, S.-M. Global contrast based salient region detection. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Colorado Springs, CO, USA, 20–25 January 2011.
30. Carten, R.; Vladimir, K.; Andrew, B. "GrabCut": Interactive foreground extraction using iterated graph cuts. ACM Trans. Graph. 2004, 23, 309–314. [CrossRef]
31. Loza, A.; Bull, D.; Canagarajah, N.; Achim, A. Non-Gaussian model based fusion of noisy images in the wavelet domain. Comput. Vis. Image Underst. 2010, 114, 54–65. [CrossRef]

© 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
