ISSN: 2319-9873
Department of Electrical Engineering, Mahabad Branch, Islamic Azad University, Mahabad, Iran
Received: 13/07/2014; Revised: 12/08/2014; Accepted: 16/08/2014
Visit for more related articles at Research & Reviews: Journal of Engineering and Technology
Content based image retrieval, the problem of finding images from data base according to their content, has been the subject of a significant amount of research in the last decade. Image retrieval based on region is one of the most promising and active research directions in recent years. As literature prove that region segmentation will produce better results. Human visual perception is more effective than any machine vision systems for extracting semantic information from image, hitherto no specific system has been suggested with the ability of extracting object individually. We introduce a new idea, hence object detection has been obtained as main contribution of this paper and a new feature extraction based on spectral analysis is presented.
machine, image, High definition
Expectation Maximization (EM) algorithm is utilized to segment image into different regions. A new image representation which provides a transformation from the raw pixel data to a small set of image regions which are coherent in color and texture space is presented. In addition the EM algorithm performs automatic segmentation based on image features [1-6]. EM iteratively models the joint distribution of color and texture with a mixture of Gaussians. The resulting pixel cluster memberships provide a segmentation of the image. After the image is segmented into regions, system select the region where contain main object. More over a description of chosen region, based on novel feature extraction is produced [7]. The other option of the proposed system is that the user can access the regions directly in order to see the segmentation of the query image and specify which aspects of the image are important to the query. The deficiency of traditional retrieval systems is due to either both image representation and method of accessing those representations to find images, while users generally want to find images containing particular objects [15]. Most existing image retrieval systems represent images based only on their low-level features, with little regard for the spatial organization of those features. Systems based on user querying are often unintuitive and offer little help in understanding why certain images were returned and how to refine the query [13-15]. Often the user knows only that he has submitted a query for, say, a horse and retrieved very few pictures of horses in return. For general image collections, there are currently no systems that can automatically classify images or recognize the objects they contain. In particular, this letter demonstrates how the segmentation and new feature extraction can considerably enhance object based retrieval system.
The EM algorithm is used for finding maximum likelihood parameter estimates when there is missing or incomplete data [1]. In our case, the missing data is the region to which the points in the feature space belong. We estimate values to fill in for the incomplete data (the “E-Step”), compute the maximum likelihood parameter estimates using this data (the “M-Step”), and repeat until a suitable stopping criterion is reached. Based on essence of EM algorithm we can segment each image to different parts. Simulation and Fig.1 show, 3 Gaussian function will produce the best result to extract the main object of image. Choosing different number of Gaussian function will show worst segmentation results. After extracting object we apply a novel textural feature extraction method based on spectral function (SF). SF is fully studied in [15,3,5,6].
For feature extraction method, two, one dimensional signals are obtained from image by ordering of pixels row by row and column by column. Then the SCF of each signal is calculated by FAM algorithm [3]. For constructing the feature vector, the Norm-1 Energy and standard deviation are computed separately on each diamond shape region of SCFs from each signal and then feature vector is formed using these two parameter values. The basic assumption of using energy as a feature for texture discrimination is that the energy distribution in the frequency-domain identifies textural information of an image. Resulting feature vectors from N number of regions from each partition as follows.
Where and n = 1,2,…,N, N is the number of diamond regions at FAM algorithm. Respectively, Ein and σin are the Norm-1 energy and standard deviation of the SCF amplitude that are calculated for the ith signal at the nth region namely D:
(2)
Where are the number of estimate points and the mean of spectral function at corresponding D, respectively. Advantages of using cyclostationary analysis are the simplicity of generating features and feature vector dimension. Feature vector dimension is the one of the most important factors that determined the amount of storage space for the vector, the retrieval accuracy and retrieval time (or computational complexity).it will be shown that how simply we extract features from spectral function. The spectral function (SCF), which is the cross spectral of a signal and a frequency shifted version of itself, provides a second-order statistical description in the frequency domain of such signals. The theory and utility of cyclostationary signal models and the SCF, also called the cyclic spectrum, are discussed in references [8]. The basic time-smoothing and frequency-smoothing methods of spectral analysis were introduced in [8] and proof of their equivalence was given in [9,10]. Methods which more fully exploit the computational efficiency of the FFT, namely, the FFT accumulation method (FAM) and the strip spectral analyzer (SSCA) were introduced in [11] and discussed in.
Spectral Function
The SCF for a discrete-time real-valued signal x(n) is defined as the Fourier-series transform of the cyclic function
Where
thus is the cross spectrum of the pair of complex valued frequency-shifted signals and where f is the cross spectrum frequency variable and the parameter α, called the cyclic frequency, is the relative frequency shift [3,5,6]. If the signal has finite average power then there are at most a countable number of values of α for which . For α=0 the SCF reduces to the ordinary power spectrum:
The symmetry relationships and and periodicity associated with discrete time for any integer m and n, which are easily verified from the defining expressions, imply that the entire function is determined by where f is the normalized frequency.
Several computationally efficient algorithms for cyclic spectral analysis have been introduced that generally fall into two categories: those that average in frequency (frequency smoothing methods) and those that average in time (time smoothing methods). In this paper we utilize one efficient algorithm from time smoothing category namely FAM. We explain only the fundamentals of time smoothing algorithms that tends to FAM method. The precise discussion can be studied at [13]. All time smoothing algorithms are based on the time smoothed cyclic cross periodogram [10].
In (8), the operation < .>is the time averaging operation defined as follows:
and, the operation * is conjugate operator.
The physical interpretation of time smoothed cyclic periodogram is the spectral components of x(n) over a time span of Δt seconds. The spectral components and are the complex envelopes of narrow-band, bandpass components of a signal and called complex demodulates. For the calculating of a data tapering window of length T slides over the data for a time span of each instant the complex demodulates of the data within the window are computed. After calculation of the complex demodulates, they are correlated to produce estimates of the cyclic spectrum function. is done by time averaging of conjugate products over an interval of Δt seconds. To estimate the spectrum at the point (f0, α0), demodulates separated in frequency by an amount α0 and centered about a midpoint of f0 are correlated. The quantity Δt and Δf are referred to as the time and frequency resolutions of the point estimate. Mathematically, computation of the complex demodulates is expressed as:
Where a(r) is a data tapering window of length T = NTs . Then, the complex demodulates are correlated over a time span of Δt seconds:
Whereg(n) is a data tapering window of width . It is shown in [16] that the time smoothed cyclic cross period gram converges to the cyclic cross spectrum in the limit, as Δt→∞ followed by Δf →0 , if the time windows a(n) and g(n) are properly normalized. Therefore, Then,
(9)
In the FAM algorithm, time smoothing is done by Fourier transform. If frequency is shifted from α0 to α0+ε, the output of the system is given by:
(10)
If several values of ε are desired, evaluation of the sum can be simplified by discretizing the values of ε to be ε = qΔα . In this case the output of the algorithm is expressed as:
(11)
In which the sum can be evaluated with an N-point FFT. Thus, point estimates with constant cycle frequency can be computed in blocks by Fourier transforming the product sequences instead of averaging the product sequences individually. For complete coverage of the bifrequency plane a bank of bandpass filters is required to produce the necessary complex demodulates. An efficient method for producing the required complex demodulates is based on a sliding FFT [7]. In this approach the frequencies of the filter bank are discretized to:
(12)
The locations of SCF estimation associated with the pairs of complex demodulates are (fj, αi) where the frequency coordinates are: (13)
And the cycle frequency coordinates are:
(14)
Tilling the bifrequency plane with the FFT accumulation method for N′ = 8 and the locations of SCF estimation. For an N′pointchannelizer there are (N')2 possible combinations of channelizer streams; hence, there are at most (N')2 estimation regions (diamond regions). Due to symmetry, estimation of the cyclic spectrum of a single real signal requires only (N')2/4 diamond regions (one quadrant of the bifrequency plane) [2]. The more details of the FAM algorithm can be studied at [13].
To demonstrate the efficiency of system, sub set of Corel date base containing 600 image (grouped into six classes each consisting of 100 images) was used for query images in a simulated analysis. For experimentation, the images have been categorized into six classes. For each class, the performance is evaluated, as a percentage of the correct number of returned images and is presented in Table I. Average precision in present method significantly is better than simplicity and edge based methods in all categories. Comparison between present method and Horng Lin method demonstrate to some extend same results and average precision at four categories from six increased. Main advantage of present system is computation complexity which is the least one among different methods. TableII demonstrate Comparison of accuracy (ACC%) of retrieved image.Fig.1 show result of Object extraction using EM Algorithm for different Gaussian functions.Upper line of Fig.1, present three functions and lower two functions. Meanwhile, Fig.2 display Average precision of different methods on Image set.
It is also interesting to notice that the proposed method has a high speed performance, which is faster than presented method in [7]. Moreover, unlike the proposed method in [7] the proposed method has a simple structure.
This paper presents a full-automatic tool for object-based content retrieval through the structure of EM algorithm and spectral analysis. Novel features extraction based on spectral function has been introduced. Subsequently, statistical information from spectral function descriptors are retrieved and matched accordingly. The use of defined objects rather than the entire image leads to increase the flexibility of the system, it make simple to search for an object in an already prepared different kind of objects. Object-based image retrieval is not limited by the averaging properties associated with analyzing the entire image and can use local properties.