Local Detectors and Descriptors for Object Class Recognition

Full Text (PDF, 288KB), PP.12-18

Views: 0 Downloads: 0


Faten A. Khalifa 1,* Noura A. Semary 1 Hatem M. El-Sayed 1 Mohiy M. Hadhoud 1

1. Faculty of Computers and Information, Menofia University, Menofia, Egypt

* Corresponding author.

DOI: https://doi.org/10.5815/ijisa.2015.10.02

Received: 10 Feb. 2015 / Revised: 5 Jun. 2015 / Accepted: 14 Jul. 2015 / Published: 8 Sep. 2015

Index Terms

Local feature detectors, Local feature descriptors, Binary descriptors, RANSAC, Object recognition, Augmented reality, Digilog book


Local feature detection and description are widely used for object recognition such as augmented reality applications. There have been a number of evaluations and comparisons between feature detectors and descriptors and between their different implementations. Those evaluations are carried out on random sets of image structures. However, feature detectors and descriptors respond differently depending on the image structure. In this paper, we evaluate the overall performance of the most efficient detectors and descriptors in terms of speed and efficiency. The evaluation is carried out on a set of images of different object classes and structures with different geometric and photometric deformations. This evaluation would be useful for detecting the most suitable detector and descriptor for a particular object recognition application. Moreover, multi-object applications such as digilog books could change the detector and descriptor used based on the current object. From the results, it has been observed that some detectors perform better with certain object classes. Differences in performance of the descriptors vary with different image structures.

Cite This Paper

Faten A. Khalifa, Noura A. Semary, Hatem M. El-Sayed, Mohiy M. Hadhoud,"Local Detectors and Descriptors for Object Class Recognition", International Journal of Intelligent Systems and Applications(IJISA), vol.7, no.10, pp.12-18, 2015. DOI:10.5815/ijisa.2015.10.02


[1]D. G. Lowe, “Object recognition from local scale-invariant features,” The Proceedings of the Seventh IEEE International Conference on Computer Vision, vol. 2, pp. 1150-1157, 1999.
[2]A. Yilmaz, O. Javed, and M. Shah, “Object tracking: A survey,” ACM Comput. Surv., vol. 38, no. 4, pp. 1-45, 2006.
[3]M. Brown, and D. G. Lowe, “Unsupervised 3D object recognition and reconstruction in unordered datasets,” Proceedings of the international conference on 3D digital imaging and modeling, pp. 56-63, 2005.
[4]N. Snavely, S. M. Seitz, and R. Szeliski, “Skeletal graphs for efficient structure from motion,” Proc. Computer Vision and Pattern Recognition, 2008.
[5]S. Se, D. Lowe, and J. Little, “Mobile robot localization and mapping with uncertainty using scale-invariant visual landmarks,” International Journal of Robotics Research, vol. 21, no. 8, pp. 735- 758, August 2002.
[6]T. Tuytelaars, and K. Mikolajczyk, “Local invariant feature detectors: a survey,” Found. Trends. Comput. Graph. Vis., vol. 3, no. 3, pp. 177-280, 2008.
[7]D. G. Lowe, “Distinctive image features from scale-invariant keypoints,” International Journal of Computer Vision, vol. 60, no. 2, pp. 91-110, 2004.
[8]H. Bay, A. Ess, T. Tuytelaars, and L. V. Gool, “Speeded-up robust features (SURF),” Computer Vision and Image Understanding, vol. 110, no. 3, pp. 346-359, 2008.
[9]E. Rublee, V. Rabaud, K. Konolige, and G. Bradski, “ORB: an efficient alternative to SIFT or SURF,” Proceedings of the IEEE International Conference on Computer Vision, pp. 2564-2571, 2011.
[10]M. Calonder, V. Lepetit, C. Strecha, and P. Fua, “BRIEF: binary robust independent elementary features,” Proceedings of the 11th European Conference on Computer Vision, pp. 778-792, 2010.
[11]E. Rosten, and T. Drummond, “Machine learning for high-speed corner detection,” Proceedings of the 9th European Conference on Computer Vision, pp. 430-443, 2006.
[12]S. Leutenegger, M. Chli, and R. Y. Siegwart, “BRISK: binary robust invariant scalable keypoints,” Proceedings of the International Conference on Computer Vision, pp. 2548-2555, 2011.
[13]K. Mikolajczyk, et al., “A Comparison of Affine Region Detectors,” International Journal of Computer Vision, vol. 65, no. 1-2, pp. 43-72, 2005.
[14]K. Mikolajczyk, and C. Schmid, “A performance evaluation of local descriptors,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 27, no. 10, pp. 1615-1630, 2005.
[15]L. Juan, and O. Gwun, “A comparison of SIFT, PCA-SIFT and SURF,” International Journal of Image Processing, vol. 3, no. 4, pp. 143-152, 2009.
[16]O. Miksik, and K. Mikolajczyk, “Evaluation of local detectors and descriptors for fast feature matching,” Proceedings of the International Conference on Pattern Recognition, pp. 2681-2684, 2012.
[17]D. Gossow, P. Decker, and D. Paulus, “An evaluation of open source SURF implementations,” J. Ruiz-del-Solar, E. Chown, and P. Pl?ger, ed., RoboCup 2010: Robot Soccer World Cup XIV, Springer, pp. 169-179, 2011.
[18]J. Bauer, N. Sünderhauf, and P. Protzel, “Comparing several implementations of two recently published feature detectors,” Proceedings of the International Conference on Intelligent and Autonomous Systems, 2007.
[19]J. Lankinen, V. Kangas, and J.-K. Kamarainen, “A comparison of local feature detectors and descriptors for visual object categorization by intra-class repeatability and matching,” International Conference in Pattern Recognition, 2012.
[20]M. Tounsi, I. Moalla, A. M. Alimi, and F. Lebourgeois, “A comparative study of local descriptors for Arabic character recognition on mobile devices,” Seventh International Conference on Machine Vision, vol. 9445, 2015.
[21]J. Figat, T. Kornuta, and W. Kasprzak, “Performance evaluation of binary descriptors of local features,” ICCVG, LNCS, vol. 8671, pp. 187-194, 2014.
[22]M.A. Fischler, and R.C. Bolles, “Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography,” Communications of the ACM, vol. 24, no 6, pp. 381-395, 1981.
[23]L. L. Almeida, M.S. Paiva, F. A. Silva, and A. O. Artero, “Super-resolution image created from a sequence of images with application of character recognition,” International journal of Intelligent Systems and Applications, vol. 6, no 1, pp. 11-19, 2013.
[24]J. Deng, et al., “Imagenet: a large-scale hierarchical image database,” IEEE Conference on Computer Vision and Pattern Recognition, pp. 248-255, 2009.
[25]G. Bradski, “The opencv library,” Doctor Dobbs Journal, vol. 25, no. 11, pp. 120-126, 2000.