Skip to main content

A modified shape context method for shape based object retrieval

Abstract

The complexity in shape context method and its simplification is addressed. A novel, but simple approach to design shape context method including Fourier Transform for the object recognition is described. Relevance of shape context, an important descriptor for the recognition process is detailed. Inclusion of information regarding all the contour points (with respect to a reference point) in computing the distribution is discussed. Role of similarity checking the procedure details regarding the computation of matching errors through the alignment transform are discussed. Present case of shape context (for each point with respect to the centroid) descriptor is testified for its invariance to translation, rotation and scaling operations. Euclidean distance is used during the similarity matching. Modified shape context based descriptor is experimented over three standard databases. The results evidence the relative efficiency of the modified shape context based descriptor than that reported for other descriptor of concurrent interests.

Introduction

Although significant progress is witnessed in the field of automated object recognition, it is still remains challenging task (Zhang and Lu2004; Iyer et al.2005) from the broad purview of machine learning and computer vision processes of contemporary requirements. The shape of an object contains (Forsyth and Mundy1999) an important, unique and characteristic features of the object. The shape based methods consider either the contour or the entire region of the object. The consideration of contour involves less representative points in comparison with the region based methods (Nixon and Aguado2002). The region-based methods consider the global information (all the pixels within a shape) for the design of the descriptor which involves the geometrical moments (Hu1962; Flusser2000), Zernike moments (ZM) (Teague1980; Khotanzad1990), pseudo- Zernike moments (Belkasim et al.1991), Legendre moments (Teague1980), and Tchebichef moments (Mukundan et al.2001), generic Fourier descriptor (FD) (Zhang and Lu2002), compounded image descriptor (Li and Lee2005), shape matrix (Goshtasby1985), the grid technique (Lu and Sajjanhar1999) and shock graph (Sebastian et al.2004; Siddiqi et al.1999) etc. However, the contour based representation is reported to be more efficient (Yang et al.2008). Several recently reported contour based methods rely on viz., Fourier transform (Zahn and Roskies1972; Wallace and Wintz1980; Kunttu et al.2006), curvature scale space (CSS) (Mokhtarian and Mackworth1986; Abbasi et al.1999,2000), wavelet transform (Chauang and Kuo1996; Yadav et al.2007), contour displacement (Adamek and O’Connor2004), chain codes (Junding and Xiaosheng2006), autoregressive (Dubois and Glanz1986), Delaunay triangulation (Tao and Wi1999), multi-resolution polygonal (Day et al.2004) robust symbolic representation (Daliri and Torre2008), distance sets (Grigorescu and Petkov2003), elastic matching (Attalla and Siy2005) etc techniques for the design of the shape descriptor. Basing on the consideration of shape boundaries (Petrakis et al.2002; Arica and Vural2003; Bartolini et al.2005; Lateckia et al.2005; Alajlan et al.2007), dynamic programming (DP) technique is also adopted to achieve high accuracy rate. The DP based techniques suffer from being computationally expensive and get reduced to be impractical for large databases, despite the fact that they offer better performance.

Generally, the descriptor relevant to the shape context (Belongie et al.2002) method for object recognition is developed with an established correspondence between the point sets. The procedure combines the shape context information with the information formatted by using thin plate spline (Bookstein1989) processing. Due to the proven simplicity and capability of discrimination, the shape context based methods proficiently proposed in the literature (Dubois and Glanz1986; Tao and Wi1999; Day et al.2004; Daliri and Torre2008; Grigorescu and Petkov2003; Attalla and Siy2005; Petrakis et al.2002; Arica and Vural2003; Bartolini et al.2005; Lateckia et al.2005; Alajlan et al.2007; Belongie et al.2002; Bookstein1989; Mori and Malik2003; Thayananthan et al.2003; Zhang and Malik2003; Salve and Jondhale2010). Recently, Xin Shu proposed Contour Points Distribution Histogram (CPDH) (Shu and Xiao Jun2011) for the shape context method. The shape matching process which speaks out the performance of a descriptor is dealt in different ways. The Zucker et al (Siddiqi et al.1999) has developed shock graph grammar and the relevant tree matching algorithm. The spectral distance (based on diffusion geometry, heat trace) estimated through the Laplacian transform is also used for matching (Bronstein and Kokkinos2010; Bronstein and Bronstein2011; Konukoglu et al.2013). On the other hand, the Fourier transform based matching procedures are is also popular (Cem Direko glu and Nixon2011; Xingyuan and Zongyu2013; Ghazal et al.2009; Ghazal et al.2012).

In the wake of the results reported in the area of shape context based object recognition techniques involving a wide variety of design of description and matching measures, it serves that the utility of the Fourier based descriptors for the shape context based recognition presents a superior method rather than the contour based methods. Hence, the authors propose for the design of a novel hybrid contour based shape descriptor which is constructed with respect to the centroid, while the feature vector is estimated by a 1D Fourier transform. The shape toning phase is involves the Euclidean Distance to enhance the quality.

The paper is organized in three sections. Introduction to the computerized object recognition method is presented in section-Introduction. Methodology adopted for the present shape context technique is presented in section-Methodology along with the information for indices to evaluate its performance. The results obtained by adopting present method to the standard databases and their trends are presented in section-Results and discussion along with the relevant discussions of performance.

Methodology

A multi staged novel and hybrid shape context based scheme for the object recognition process is proposed. The phase wise information during the processing is presented in section-Design of system, while the proposed indices to estimate the performance are presented in section-Performance.

Design of system

The details of various stages involved with the proposed object recognition by using shape context are schematically depicted in Figure 1. The proposed system consists of four successive steps viz:

Figure 1
figure 1

Schematic diagram of the proposed system with shape context.

  1. (i)

    Shape representation with contour

  2. (ii)

    Computation of Shape Context

  3. (iii)

    Construction of histogram for each bin of shape context

  4. (iv)

    Shape description by using Fourier Transform

The descriptor is further expected to a training stage viz., shape toning and ranking. An overview of all these stages of processing implies that the shape based object recognition system includes the salient features of stages, such as shape representation, shape description and shape toning. Contour based shape representation is considered as the initial step of processing. The second step includes description of the shape representation points. Belongie Shape Context (BSC) (Belongie et al.2002) is popularly used method for describing the shape of the object. During the second step, the contour of the given object is described (Figure 2(a)) by constructing the BSC. During the construction of BSC, the angle between any two points (on log polar transform) is measured with respect to constant center on x axis given by the Equation-(1).

θ x , y = tan - 1 y 2 y 1
(1)

where:

Figure 2
figure 2

Example of constructing (a) Belongie Shape Context (b) Modified Shape Context signature for a contour point wrt the corresponding farthest point.

θ(x, y) is the angle measurement between two points x and y,

y2 is the y coordinate of the first point,

y1 is the y coordinate of the second point.

To test the invariance property of the BSC, a Modified Shape Context (MSC) is presently proposed (Figure 2(b)). The MSC measures the angle between any two points with respect to the centroid. If the total no. of contour points are Z then the farthest two points will be selected and the angle between these points is measured by the Equation-(2).

θ x , y =ta n ‒ 1 m 2 ‒ m 1 1 + m 2 + m 1
(2)

where:

θ(x,y) is the angle measurement of a point (x, y),

m1 is the slope of the line between first point and second point,

m2 is the slope of the line between first point and center point.

A histogram containing each bin of the shape context (SCH) is constructed for each part of the shape context to enable the shape context to viable to various transformations. In the wake of the fact that the Fourier Transform is widely used transformations (Ghazal et al.2009; Zhang and Lu2005) for object recognition problems and its coefficients are found to be invariant to symmetry operations (i.e. translation, rotation and scaling etc). The size of the shape representation points is an important and influential factor that optimizes the utility of Fourier transformation. Hence, in the shape signature generation process, the sampling is considered as a mandatory step. Some of the sampling methods like Equal Point (EP), Equal Angle (EA) and Equal Arc Length (EAL) (Zhang and Lu2003) are considered presently. EAL is expected to yield for a better equal space (Peter and Otterloo1991) than the other two methods. By using EAL, the representation of the contour is restricted to N-number of points. The proposed method uses EAL method to sample the finite number of contour points. For a given contour signal, the 1-D Fourier transform is given as;

F D n = 1 N ∑ t = 0 N ‒ 1 s t × exp ‒ j 2 π n t N
(3)

where:

s(t) represents the 1-D contour signal,

N represents number of representative points of the contour,

n = 0,1,2,…,N-1 and,

FDn represents nth Fourier descriptor.

Using Equation (3), the required Fourier Descriptors of size ‘N’ are generated. Further, the extracted features are testified for their invariance to translation, rotation and scaling operations (performed over the set of images). In the wake of the fact that the proposed descriptor is obtained with respect to the centroid, the obtained features are expected to be invariant to translation. The possible finite (and stipulated) magnitude of the values for the features vouches for the rotation invariance. For the present method, the scaling invariance is also presented by involving the process of dividing the features with the first feature value. In the third step, the feature vector is constructed, which describes the entire shape features of the object.

To further improve the quality of proposed methods, the global information of the object is also considered. For this, experiments are conducted with considering different global descriptors and identified that three global descriptors (GD) are efficient to represent the global shape information. The GD feature vector, viz., {S, C, A} contains the measures of solidity, circularity and aspect ratio is computed for the given object.

In the fourth step, the shape toning process is executed. In the shape toning process, the distance measures (Ghazal et al.2009,2012) used is viz., the Euclidean distance (ED). The distance measure between two objects shape context vectors is given by the Equation (4). In this, the average global distance of global feature vectors is directly added to the Euclidean distance of the Fourier descriptor feature vector.

The distance between two shape context vectors including the object global feature vectors is given by the Equation (5).

D T E , T R =ED T E , T R + D X T E , T R 3
(4)

where:

D X T E , T R = ∑ X X T E ‒ X T R max X T R
(5)

ED (TE, TR) represents the Euclidean Distance between the test and trained shapes and,

DX (TE,TR) represents the Global distance between the test and trained shapes.

where:

X represents the GD vector {S, C, AR},

XTE represents the GD feature of the test shape and,

XTR represents the GD feature of the trained shape.

According to the specificity of the data of distance measurement, the distances are further rearranged in ascending order and are assigned with ranks. However, the system is also enabled to recognize and register the top ranked images.

The standard databases (Sikora2001; Sebastian et al.2001) used for the evaluation of shape descriptors presently are Kimia {K-99, K-216} and MPEG CE-1 Set B. It is noticed that the Set B database with 70 groups and each group with 20 images. It characteristically includes rotated, scaled, skewed and defected shapes. However the K-99 database which consists of 9 groups, each group with 11 images. It is known to include the partially occluded shapes. The K-216 database with 18 groups, each group with 12 images, it represents a sub database of Set B, and contains partially occluded shapes.

Performance

The performance of various object recognition schemes reported (Shu and Xiao Jun2011; Ghazal et al.2009,2012) so far employ different measures. Among them, precision and recall are considered as important measures, while they verbally quantify the similarity measurement. Precision (P) and Recall (R) are defined by:

P= x y
(6)

where:

R= x groupsize
(7)

x denotes the true recognition results,

y denotes the total recognized result and,

P denotes the precision.

where:

R denotes the Recall,

x denotes the true recognition results and,

group size denotes the maximum true recognition result.

The Average Precision value for each recall is computed. This value is affirmatively grouped as two categories viz., Low Recall (LR), High Recall (HR). The Average Precision for Low Recall (APLR) denotes the average precision for recalls less than or equal to 50. In contrast, the Average Precision for High Recall (APHR) represents the average precision for recalls greater than 50. The False Detection Rate (FDR) for each of the image is also estimated by:

FDR= z y
(8)

where:

FDR denotes the False Detection Rate,

z denotes the false recognition result and,

y denote the total recognized result.

The average FDR (AFDR) value of all test images corresponding to each database is estimated. Apart from the usual recognition rate, the Average Processing Time (APT) is also estimated for each query in the shape toning stage. The proposed descriptor is compared with 4 standard descriptors viz., Angular Radial Transform Descriptor (ARTD) (Zhang et al.2008), Moment Invariant Descriptor (MID) (Zhang et al.2008), Zernike Moment Descriptor (ZMD) (Tiagrajah and Razeen2011) and Curvature-Scale-Space-Descriptor (CSSD) (Tiagrajah and Razeen2011). A specific feature size of 35 for ARTD (n < 3, m < 12), 6 for MID, 34 for ZMD (order from 2 to 10) is adopted. The CSSD feature size is varying from that for one image to another image since number of peaks is varying. All the cited metrics viz., APLR, APHR, AFDR and APT are evaluated to estimate the performance for the proposed descriptor (with inclusion of GD), ARTD, MID, ZMD and CSSD.

Results and discussion

Shape context based object recognition is estimated as detailed in section-Design of system for the input of standard databases. The trends of the results that follow various approaches are presented in the following sub section-Processing of modified shape context based object recognition. The relative performance of the proposed descriptor is also analyzed in the section-Performance evaluation in the wake of the other reported methods.

Processing of modified shape context based object recognition

The shape context is constructed with 60 bins. Then for each contour point, the angle is measured (i.e. BSC and MSC) within the range of one full rotation i.e. from 0° to 360°. A histogram is generated that corresponds to each and every bin of shape context. The histograms, thus constructed are presented in Figures 3,4,5, and6 corresponding to four different image groups (i.e. animal, hand, heart and glass) as accessed from set B, K-99 and K-216 databases. Figures 3(a1), -3(b1) and -3(c1) contains three original images of animal group (animal-3, animal-5 and animal-7); the Figures 4(a1), -(b1) and -(c1) pertain to the three original images of hand group (hand8, hand9, hand11); the Figures 5(a1), -(b1) and -(c1) give three original images of heart group (heart-7, heart-11 and heart-12); and Figures 6(a1), -(b1) and -(c1) contain three original images of fly group (Fly1, Fly4 and Fly10). The corresponding Modified shape context (MSC) histogram is illustrated in Figures 3(a2),4,5,6(a2), Figures 3(b2),4,5,6(b2) and Figures 3(c2),4,5,6(c2) respectively. It is clearly noticed from Figures 3,4,5,6, that the MSC histogram is found to be similar for the different shapes within the same group; while it exhibits difference between those of one group compared to the other. Basing on 1-D shape signal, the Fourier descriptors (FD) are generated. The experiments are conducted with varying size of feature vector. From this, it is identified that the first ten features of the FD are consistent. Hence, they are used to design the feature vector.

Figure 3
figure 3

Proposed descriptor for three Animal group Images (a1) Animal 3 (a2) MSC signature of Animal 3 (b1) Animal 5 (b2) MSC signature of Animal 5 (c1) Animal 7 (c2) MSC signature of Animal 7.

Figure 4
figure 4

Proposed descriptor for three Hand group Images (a1) Hand 8 (a2) MSC signature of Hand 8 (b1) Hand 9 (b2) MSC signature of Hand 9 (c1) Hand 11 (c2) MSC signature of Hand 11.

In the present object recognition process, Euclidean Distance (ED) measure of performance is estimated between the target and test objects, while they are allocated with ranking according to their distance. In accord with the established procedures, the top n-ranked objects are used to estimate the Precision and Recall parameters. The ‘n’ notifies the group size for {Set B:20, K-99:11 and K-216:12} sets. For each database, the accuracy for the retrieval results corresponding to top ‘n’ (group size of the corresponding database) number of images is estimated and illustrated in Figures 7,8,9 respectively. For K-99 database:- The top 11 ranked images correspond to the query image Key2 (as estimated by using ZMD, BSC + GD and MSC + GD), while they are presented in the Figures 7(a)-(d). The Figure 7(a) corresponds to the Key2 query image and Figure 7(b) corresponds to the retrieval results with ZMD descriptor. Figure 7(c) gives the retrieval results for BSC + GD descriptor; and Figure 7(d) vouches for the retrieval results with MSC + GD descriptor. For K-216 database:- The top 12 ranked images corresponding to the query image Fork12 are estimated by using ZMD, BSC + GD and MSC + GD as presented in Figures 8(a)-(d). The Figure 8(a) illustrates the Fork12 query image and Figure 8(b) speaks for the retrieval results with ZMD descriptor. Figure 8(c) gives the retrieval results with BSC + GD descriptor, and Figure 8(d) gives the retrieval results with MSC + GD descriptor.

Figure 5
figure 5

Proposed descriptor for three Heart group Images (a1) Heart 7 (a2) MSC signature of Heart 7 (b1) Heart 11 (b2) MSC signature of Heart 11 (c1) Heart 12 (c2) MSC signature of heart 12.

Figure 6
figure 6

Proposed descriptor for three Fly group Images (a1) Fly 1 (a2) MSC signature of Fly 1 (b1) Fly 4 (b2) MSC signature of Fly 4 (c1) Fly 10 (c2) MSC signature of Fly 10.

Figure 7
figure 7

Retrieval results of Key2 test image from K-99 database (a) Original image (b) ZMD result (c) BSC+GD result (d) MSC+GD result.

For Set B database:- the top 20 ranked images corresponds to the query image of Carriage16 by using ZMD, BSC + GD and MSC + GD as presented in the Figures 9(a)-(d). The Figure 9(a) gives the Carriage16 query image, Figure 9(b) gives the retrieval results with ZMD descriptor, Figure 9(c) gives the retrieval results with BSC + GD descriptor and Figure 9(d) gives the retrieval results with MSC + GD descriptor. Overview of cited Figures 7(a),8,9(d) suggests that, the MSC + GD descriptor performs better for retrieval of more relevant images with relatively strong correspondence than that with the other descriptors.

Figure 8
figure 8

Retrieval results of Fork12 test image from K-216 database (a) Original image (b) ZMD result (c) BSC+GD result (d) MSC+GD result.

Figure 9
figure 9

Retrieval results of Carriage16 test Image from Set B database (a) Original image (b) ZMD result (c) BSC+GD result (d) MSC+GD result.

Performance evaluation

The yield of APLR and APHR values for the descriptor with the currently proposed distance measure is analyzed. In the wake of the four other standard descriptors, the aspect of compatibility (with three databases) is also explored, while the results are presented in Tables 1,2,3 respectively. From these results, it is clearly evident that the presently proposed descriptor out performs the other descriptors regarding all the three standard databases. However, among the presently considered descriptors, the CSSD descriptor is found to accompany with a lower performance, followed by that of MID, ARTD. However, the case of ZMD resulted for the next higher performance. But, for Set B database, the ZMD is yielding the highest result. From Table 1, it is found that the proposed MSC + GD happen to be influential to increase the APLR value of ZMD. It is also found to significantly increase the APHR value of ARTD. For K-99 and K-216 databases, the ZMD is giving distinctly improved APLR and APHR values than with the other descriptors. From the Tables 2 and3; it is evident that the proposed MSC + GD is accompanied with an improved performance in terms of enhancement of APLR and APHR.

The PR plots for these five descriptors comparing to the set B, K-99 and K-216 are presented in Figures 10,11 and12 respectively. Figure 10 reveals that all the five descriptors are yielding considerable enhanced performance with regard to the precision measure for the Set B database in the range of low recalls. At higher recalls, the proposed MSC + GD measure is found to result for improved precision measure, rather than the other standard descriptors. The proposed CSD is found to increase the precision measure marginally at lower recalls i.e. ≤50, bit, it is found to significantly increase the precision at higher recalls i.e. >50. The PR plot for K-99 database is depicted in Figure 11. From this figure, it is observed that the ZMD outperforms other standard descriptors with increased precision measure in the range of both lower and higher recalls. An overview of the results infers that the proposed MSC + GD measure is considerable increase in precision measure at lower recalls ranged between 40 and 50 and higher recalls ranged between 60 to 70 and 90 to 100. The precision measure is found to attain considerable improvement at higher recalls i.e. at 80 to 100. Figure 12 describes the PR plot of various descriptors for K-216 database. The ZMD in this PR plot is found to be superior, rather than other standard descriptors at lower and higher recalls. Thus the proposed MSC + GD measure is giving increased precision measure regarding PR plots at both lower and higher recalls.

Table 1 The APLR and APHR values for various descriptors with Set B database
Table 2 The APLR and APHR values for various descriptors with K-99 database
Table 3 The APLR and APHR values for various descriptors with K-216 database
Figure 10
figure 10

The PR graph for various descriptors with set B database.

Figure 11
figure 11

The PR graph for various descriptors with K-99 database.

Figure 12
figure 12

The PR graph for various descriptors with K-216 database.

Other Performance measures viz., Average False Discovery Rate (AFDR) and Average Processing Time (APT) are also estimated as detailed in section-2.2, while the estimated AFDR values for the three databases are presented in Table 4. It is observed that the proposed MSC + GD results for a lower value for all the three databases. Since, the APT measure is argued to exhibit profound influence on shape toning stage (for each of the shape descriptor), it is also estimated for all the three databases, and presented in Table 5. The proposed MSC + GD is found to yield for less APT value in comparison with other descriptors. Therefore, basing the observed trends of performance measures, it is argues that the proposed descriptor exhibits higher efficiency. As a measure of performance, the retrieval rate with bull’s eye score (Cem Direko glu and Nixon2011) is also estimated. This measure involves the calculation of the ratio of the total number of shapes (i.e. from the same class) to the highest possible number of shapes in the same database. The estimated bull’s eye score for top 40 results in Set B database is presented in Table 6. It is clear that Inner Distance Shape Context (IDSC) is yielding highest score when compared with the others. However, as this includes complicated dynamic programming procedure, the simple Euclidean distance measure for the proposed descriptor is argued to be more efficient retrieval parameter.

Table 4 The AFDR for various descriptors with three databases
Table 5 APT of various descriptors with set B database
Table 6 Bull’s eye score for various descriptors with set B database

Conclusions

  • Shape context based description is proved to be efficient when compared with various other standard descriptors with respect to various performance measures viz., APLR, APHR, AFDR and APT.

  • The proposed descriptor improves the precision measures at high recalls when compared with the low recalls thus enabling more relevant objects to be recognized.

  • With less feature vector size, the proposed descriptor enables the object recognition system to be efficient with less APT and AFDR measures.

References

  • Abbasi S, Mokhtarian F, Kittler J: Curvature scale space image in shape similarity retrieval. Multimed Syst 1999, 7: 467-476.

    Article  Google Scholar 

  • Abbasi S, Mokhtarian F, Kittler J: Enhancing CSS-based shape retrieval for objects with shallow concavities. Image Vis Comput 2000, 18: 199-211.

    Article  Google Scholar 

  • Adamek T, O’Connor NE: A multi scale representation method for non rigid shapes with a single closed contour. IEEE Trans Circuits Syst Video Technol 2004, 14: 742-753.

    Article  Google Scholar 

  • Alajlan N, Rube IE, Kamel MS, Freeman G: Shape retrieval using triangle-area representation and dynamic space warping. Pattern Recogn 2007, 40: 1911-1920.

    Article  Google Scholar 

  • Arica N, Vural FTY: BAS: a perceptual shade descriptor based on the beam angle statistics. Pattern Recogn Lett 2003, 24: 1627-1639.

    Article  Google Scholar 

  • Attalla E, Siy P: Robust shape similarity retrieval based on contour segmentation polygonal multi resolution and elastic matching. Pattern Recogn 2005, 38: 2229-2241.

    Article  Google Scholar 

  • Bartolini I, Ciaccia P, Patella M: WARP: accurate retrieval of shapes using phase of Fourier descriptors and time warping distance. IEEE Trans Pattern Anal Mach Intell 2005, 27: 142-147.

    Article  Google Scholar 

  • Belkasim SO, Shridhar M, Ahmadi M: Pattern recognition with moment invariants: a comparative study and new results. Pattern Recogn 1991, 24: 1117-1138.

    Article  Google Scholar 

  • Belongie S, Malik J, Puzicha J: Shape matching and object recognition using shape contexts. IEEE Trans Pattern Anal Mach Intell 2002, 24(4):509-522.

    Article  Google Scholar 

  • Bookstein FL: Principal warps: thin-plate-splines and decomposition of deformations. IEEE Trans Pattern Anal Mach Intell 1989, 11(6):567-585.

    Article  Google Scholar 

  • Bronstein MM, Bronstein AM: Shape recognition with spectral distances. IEEE Trans Pattern Anal Mach Intell 2011, 33(5):1065-1071.

    Article  Google Scholar 

  • Bronstein MM, Kokkinos I: Scale invariant heat kernel signatures for non rigid shape recognition. IEEE Conf Comput Vision Pattern Recogn (CVPR) 2010, 1704-1711. doi:10.1109/CVPR.2010.5539838

    Google Scholar 

  • Nixon MS, Cem Direko glu: Shape classification via image-based multi scale description. Pattern Recogn 2011, 44: 2134-2146.

    Article  Google Scholar 

  • Chauang G, Kuo C: Wavelet descriptor of planar curves: theory and applications. IEEE Trans Image Process 1996, 5: 56-70.

    Article  Google Scholar 

  • Daliri MR, Torre V: Robust symbolic representation for shape recognition and retrieval. Pattern Recogn 2008, 41: 1782-1798.

    Article  Google Scholar 

  • Day AM, Arnold DB, Havemann S, Fellner DW: Combining polygonal and subdivision surface approaches to modeling and rendering of urban environments. Comput Graph 2004, 28(4):497-507.

    Article  Google Scholar 

  • Dubois SR, Glanz FH: An autoregressive model approach to two- dimensional shape classification. IEEE Trans Pattern Anal Mach Intell 1986, 8: 55-65.

    Article  Google Scholar 

  • Flusser J: On the independence of rotation moment invariants. Pattern Recogn 2000, 33: 1405-1410.

    Article  Google Scholar 

  • Forsyth D, Mundy J: Shape, contour and grouping in computer vision. Lect Notes Comput Sci 1999, 1681: 1-3.

    Google Scholar 

  • Ghazal AE, Basir O, Belkasim S: Farthest point distance: a new shape signature for Fourier descriptors. Signal Process Image Comm 2009, 24: 572-586.

    Article  Google Scholar 

  • Ghazal AE, Basir O, Belkasim S: Invariant curvature-based Fourier shape descriptors. J Vis Commun Image R 2012, 23: 622-633.

    Article  Google Scholar 

  • Goshtasby A: Description and discrimination of planar shapes using shape matrices. IEEE Trans Pattern Anal Mach Intell 1985, 7: 738-743.

    Article  Google Scholar 

  • Grigorescu C, Petkov N: Distance sets for shape filters and shape recognition. IEEE Trans Image Process 2003, 12(10):1274-1286.

    Article  Google Scholar 

  • Hu M: Visual pattern recognition by moment invariants. IRE Trans Inform Theor IT 1962, 8: 115-147.

    Google Scholar 

  • Iyer N, Jayanti S, Lou K, Kalyanaraman Y, Ramani K: Three dimensional shape searching: state of the art review and future trends. Comput Aided Des 2005, 37: 509-530.

    Article  Google Scholar 

  • Junding S, Xiaosheng W: Chain Code Distribution-Based Image Retrieval. International Conference on Intelligent Information Hiding and Multimedia Signal Processing, China; 2006:139-142.

    Google Scholar 

  • Khotanzad A: Invariant image recognition by Zernike moments. IEEE Trans Pattern Anal Mach Intell 1990, 12: 489-497.

    Article  Google Scholar 

  • Konukoglu E, Glocker B, Criminisi A, Pohl KM: WESD-weighted spectral distance for measuring shape dissimilarity. IEEE Trans Pattern Anal Mach Intell 2013, 35(9):2284-2297.

    Article  Google Scholar 

  • Kunttu I, Lepisto L, Rauhamaa J, Visa A: Multi scale fourier descriptors for defect image retrieval. Pattern Recogn Lett 2006, 27: 123-132.

    Article  Google Scholar 

  • Latecki LJ, Lakamper R: Shape similarity measure based on correspondence of visual parts. IEEE Trans Pattern Anal Mach Intell 2000, 22(10):1185-1190.

    Article  Google Scholar 

  • Lateckia LJ, Lakaempera R, Wolter D: Optimal partial shape similarity. Image Vis Comput 2005, 23: 227-236.

    Article  Google Scholar 

  • Li S, Lee MC: Effective invariant features for shape-based image retrieval. J Am Soc Inf Sci Technol 2005, 56: 729-740.

    Article  Google Scholar 

  • Ling H, Jacobs DW: Shape classification using the inner distance. IEEE Trans Pattern Anal Mach Intell 2007, 29(2):286-299.

    Article  Google Scholar 

  • Lu G, Sajjanhar A: Region-based shape representation and similarity measure suitable for content based image retrieval. Multimed Syst 1999, 7: 165-174.

    Article  Google Scholar 

  • Mokhtarian F, Mackworth A: Scale-based description and recognition of planar curves and two-dimensional shapes. IEEE Trans Pattern Anal Mach Intell 1986, 8: 34-43.

    Article  Google Scholar 

  • Mokhtarian F, Abbasi F, Kittler J, Smeulders AWM, Jain R: Efficient and robust retrieval by shape content through curvature scale space. In Image Databases and Multi-Media Search. Singapore: World Scientific Publishing; 1997:51-58.

    Google Scholar 

  • Mori G, Malik J: Recognizing objects in adversarial clutter: breaking a visual CAPTCHA. IEEE Conf Comput Vision Pattern Recogn 2003, 1: 1063-6919.

    Google Scholar 

  • Mukundan R, Ong SH, Lee PA: Image analysis by Tchebichef moments. IEEE Trans Image Process 2001, 10: 1357-1364.

    Article  Google Scholar 

  • Nixon MS, Aguado AS: Feature Extraction and Image Processing. 1st edition. Newnes Publishers, Burlington MA; 2002:247-287.

    Book  Google Scholar 

  • Peter J, Otterloo V: A contour- Oriented Approach to Shape Analysis. Prentice Hall, Hertfordshire UK; 1991.

    Google Scholar 

  • Petrakis EGM, Diplaros A, Milios E: Matching and retrieval of distorted and occluded shapes using dynamic programming. IEEE Trans Pattern Anal Mach Intell 2002, 24: 1501-1516.

    Article  Google Scholar 

  • Salve SG, Jondhale KC: Shape matching and object recognition using shape contexts. In Computer Science and Information Technology (ICCSIT), vol 9. 3rd IEEE International Conference; 2010:471-474.

    Google Scholar 

  • Sebastian TB, Klein PN, Kimia BB: Recognition of shapes by editing shock graphs. ICCV 2001, 1: 755-762.

    Google Scholar 

  • Sebastian T, Klein P, Kimia B: On aligning curves. IEEE Trans Pattern Anal Mach Intell 2003, 25(1):116-125.

    Article  Google Scholar 

  • Sebastian TB, Klein PN, Kimia BB: Recognition of shapes by editing their shock graphs. IEEE Trans Pattern Anal Mach Intell 2004, 26(5):550-571.

    Article  Google Scholar 

  • Shu X, Xiao Jun W: A novel contour descriptor for 2D shape matching and its application to image retrieval. Image Vis Comput 2011, 29: 286-294.

    Article  Google Scholar 

  • Siddiqi K, Shokoufandeh A, Dickinson SJ, Zucker SW: Shock graphs and shape matching. Int J Comput Vis 1999, 35(1):13-32.

    Article  Google Scholar 

  • Sikora T: The MPEG-7 visual standard for content description- an overview. IEEE Trans Circuits Syst Video Technol 2001, 11(6):696-702.

    Article  Google Scholar 

  • Tao Y, Wi G: Delaunay Triangulation for Image Object Indexing: A Novel Method for Shape Representation. Seventh SPIE Symposium on Storage and Retrieval for Image and Video Databases San Jose, CA; 1999:631-642.

    Google Scholar 

  • Teague M: Image analysis via the general theory of moments. J Opt Soc Am 1980, 70: 920-930.

    Article  Google Scholar 

  • Thayananthan A, Stenger B, Torr PHS, Cipolla R: Shape context and chamfer matching in cluttered scenes. IEEE Conf Comput Vision Pattern Recogn 2003, 1: 127-133.

    Google Scholar 

  • Tiagrajah VJ, Razeen AASM: An enhanced shape descriptor based on radial distances. IEEE Int Conf Signal Image Process Appl (ICSIPA) 2011, 472-477. doi:10.1109/ICSIPA.2011.6144073

    Google Scholar 

  • Tu Z, Yuille A: Shape matching and recognition-using generative models and informative features. Proc Eur Conf Comput Vis 2004, 3: 195-209.

    Google Scholar 

  • Wallace TP, Wintz PA: An efficient three dimensional aircraft recognition algorithm using normalized Fourier descriptors. Comput Graph Image Process 1980, 13: 99-126.

    Article  Google Scholar 

  • Xie J, Heng P, Shah M: Shape matching and modeling using skeletal context. Pattern Recogn 2008, 41(5):1756-1767.

    Article  Google Scholar 

  • Xingyuan W, Zongyu W: A novel method for image retrieval based on structure elements’ descriptor. J Vis Commun Image R 2013, 24: 63-74.

    Article  Google Scholar 

  • Yadav RB, Nishchal NK, Gupta AK, Rastogi VK: Retrieval and classification of shape-based objects using Fourier, generic Fourier, and wavelet-Fourier descriptors technique: a comparative study. Opt Lasers Eng 2007, 45: 695-708.

    Article  Google Scholar 

  • Yang M, Kpalma K, Ronsin J: A Survey of shape feature extraction techniques. In Pattern Recognition Edited by: Pen Y. 2008, 43-90.

    Google Scholar 

  • Zahn T, Roskies RZ: Fourier descriptors for plane closed curves. IEEE Trans Comput 1972, 21: 269-281.

    Article  Google Scholar 

  • Zhang D, Lu G: Shape-based image retrieval using generic Fourier descriptor. Signal Process Image Commun 2002, 17: 825-848.

    Article  Google Scholar 

  • Zhang D, Lu G: A comparative study of curvature scale space and Fourier descriptors for shape based image retrieval. J Vis Commun Image Represents 2003, 14(1):39-57.

    Article  Google Scholar 

  • Zhang D, Lu G: Review of shape representation and description techniques. Pattern Recogn 2004, 37(1):1-19.

    Article  Google Scholar 

  • Zhang D, Lu G: Study and evaluation of different fourier methods for image retrieval. Image Vis Comput 2005, 23(11):33-49.

    Article  Google Scholar 

  • Zhang H, Malik J: Learning a discriminative classifier using shape context distances. IEEE Conf on Comput Visi Pattern Recogn 2003, 1: 242-247.

    Google Scholar 

  • Zhang G, Ma ZM, Tong Q, He Y, Zhao T: Shape Feature Extraction using Fourier Descriptors with Brightness in Content-Based Medical Image Retrieval. International Conference on Intelligent Information Hiding and Multimedia Signal Processing; 2008:71-74.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Satyanarayana Chittipothula.

Additional information

Competing interests

PSVG areas of interest include Object Oriented Technologies, Information Retrieval, Algorithms, Computer Networks, and Image Processing. PDM areas of interests are Experimental soft condensed matter, Liquid crystals-design and characterization, Theory of spectroscopy, chemistry – design of supermolecules, Face Recognition, Image Processing, and optical character recognition. SCH area of interest includes Image processing, Database Management Systems, Speech Recognition, Pattern recognition and network security.

Authors’ contributions

RMM carried out the shape context studies, participated in the development of the proposed shape descriptor and drafted the manuscript. PSVG participated in the design of the study with the shape toning process, and drafted the manuscript. PDM participated in the design of the proposed shape descriptor with Fourier Transformation, and drafted the manuscript. SCH participated in the design of the study with feature extraction, and drafted the manuscript. All authors read and approved the final manuscript.

Authors’ original submitted files for images

Rights and permissions

Open Access  This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made.

The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

To view a copy of this licence, visit https://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Madireddy, R.M., Gottumukkala, P.S.V., Murthy, P.D. et al. A modified shape context method for shape based object retrieval. SpringerPlus 3, 674 (2014). https://doi.org/10.1186/2193-1801-3-674

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/2193-1801-3-674

Keywords