A survey on state-of-the-art techniques in content-based image retrieval
Liu Ying; Fan Jiulun
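Abstract:
Content-based image retrieval (CBIR) has been a research focus in the field of image retrieval in recent years. Its development has moved from indexing images by their numerical features, to introducing image semantic learning so that CBIR comes closer to human semantics and is easier for users to query, to the current fusion of multiple kinds of image information to further improve retrieval efficiency. Building on a review of the existing literature and drawing on research results from recent years, this paper discusses the latest technical developments in CBIR, analyzes its development trends, and points out several directions for future research.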
Keywords: image retrieval; image semantic learning; image feature extraction; information fusion
Foundation: Supported by the Shaanxi Province "Hundred Talents Program"
Author: Liu Ying; Fan Jiulun
DOI: 10.13682/j.issn.2095-6533.2012.02.022