Interactive hypergraph visual analytics for exploring large and complex image collections

Abstract

Analyzing unannotated large complex image collections in domains like forensics, accident investigation, or social media analysis involves interpreting complex, overlapping relationships among images: images may belong to multiple content- or context-based groupings simultaneously. Domain experts, like forensic investigators, accident investigators, investigative journalists, and social media analysts require a way to make well informed, high-impact decisions, while not necessarily being specialists in analyzing such collections. Traditional clustering assigns images to a single cluster, not representing overlapping relationships, while supervised classification and multi-label classification require annotations and often rely on generic pre-trained models that do not capture domain specific semantics of complex real-world image collections. Hypergraphs effectively capture overlapping relationships, but construction from raw, unannotated image data and translating their complexity into information and insights for domain experts, remain challenging. We propose an interactive visual analytics approach specifically designed for constructing, exploring, and analyzing hypergraphs. Core contributions include: (1) a framework for constructing and evaluating hypergraphs from raw image data, (2) CoverEdge Similarity (CES), a scalable measure for comparing constructed hypergraphs with ground truth, (3) scalable visual analytics integrating coordinated spatial, grid, and matrix visualization, and (4) practical domain insights from evaluation with real-life image collections. To determine which construction algorithm can create meaningful hypergraphs, we designed and validated a similarity measure to evaluate constructed hypergraphs against ground truth. Across annotated benchmark collections, our TEMI-adaptation as construction method performed best overall, compared to others like fuzzy c-means, and produced overlaps that were qualitatively useful for analysis. A qualitative think-aloud study with eight domain experts on real-life accident investigation image collections containing several thousand to tens of thousands of images suggests that the system supports iterative exploration and search, with participants completing most tasks within minutes. A video demo is available in the supplemental materials.

Keywords

visual analytics hypergraph construction complex image collection hypergraph evaluation

Get full access to this article

View all access options for this article.

References

Grötschla

Lanzendörfer

Calzavara

, et al. AEye: a visualization tool for image datasets. In: IEEE visualization and visual analytics, 2024. https://doi.org/10.1109/VIS55277.2024.00064

Wang

, et al. iGraph: a graph-based technique for visual analytics of image and text collections. Proc SPIE 2015; 9397: 939708. https://doi.org/10.1117/12.2074198

Wang

Aboagye

, et al. Visual analytics for efficient image exploration and user-guided image captioning. IEEE Trans Vis Comput Graph 2024; 30(6): 2875–2887. https://doi.org/10.1109/TVCG.2024.3388514

Worring

Koelma

. Insight in image collections by multimedia pivot tables. In: Proceedings of the 5th ACM on international conference on multimedia retrieval, 2015, pp.291–298. https://doi.org/10.1145/2671188.2749312

de Rooij

van Wijk

Worring

. Mediatable: interactive categorization of multimedia collections. IEEE Comput Graph Appl 2010; 30(5): 42–51. https://doi.org/10.1109/MCG.2010.66

Perez-Messina

Ceneda

Miksch

. Guided visual analytics for image selection in time and space. IEEE Trans Vis Comput Graph 2024; 30(1): 66–75. https://doi.org/10.1109/TVCG.2023.3326572

Bäuerle

van Onzenoodt

Jönsson

, et al. Semantic hierarchical exploration of large image datasets. In: EuroVis 2023 - Short Papers, 2023, pp.103–107. https://doi.org/10.2312/evs.20231051

Khaleel

Idris

Tavanapong

, et al. Visactive: visual-concept-based active learning for image classification under class imbalance. ACM Trans Multimed Comput Commun Appl 2024; 20(3): 1–21. https://doi.org/10.1145/3617999

Yang

MacEachren

Mitra

, et al. Visually-enabled active deep learning for (geo) text and image classification: a review. ISPRS Int J Geo Inf 2018; 7(2): 65. https://doi.org/10.3390/ijgi7020065

10.

Battiston

Cencetti

Iacopini

, et al. Higher-order representations of networks. Phys Rep 2020; 874: 1–92. https://doi.org/10.1016/j.physrep.2020.05.004

11.

Berge

. Hypergraphs: combinatorics of finite sets. Elsevier, 1984. Vol. 45.

12.

Surana

Chen

Rajapakse

. Hypergraph similarity measures. IEEE Trans Netw Sci Eng 2023; 10(2): 658–674. https://doi.org/10.1109/TNSE.2022.3217185

13.

Zahálka

Worring

. Towards interactive, intelligent, and integrated multimedia analytics. In: 2014 IEEE conference on visual analytics science and technology (VAST), 2014, pp.3–12. IEEE. https://doi.org/10.1109/VAST.2014.7042476

14.

Gisolf

Geradts

Worring

. Search and explore strategies for interactive analysis of real-life image collections with unknown and unique categories. In: MultiMedia modeling: 27th international conference, MMM 2021, Proceedings, Lecture Notes in Computer Science, Prague, Czech Republic, 22–24 June 2021, pp.244–255, vol. 12573. Springer. https://doi.org/10.1007/978-3-030-67835-7\_21

15.

Fischer

Arya

Streeb

, et al. Visual analytics for temporal hypergraph model exploration. IEEE Trans Vis Comput Graph 2021; 27(2): 550–560. https://doi.org/10.1109/TVCG.2020.3030408

16.

Huang

Elhoseiny

Elgammal

, et al. Learning hypergraph-regularized attribute predictors. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), 2015, pp.409–417. https://doi.org/10.1109/CVPR.2015.7298638

17.

Aksoy

Arendt

Jenkins

, et al. High performance hypergraph analytics of domain name system relationships. In: Proceedings of the HICSS symposium on cybersecurity big data analytics, 2019.

18.

Fang

Sang

, et al. Topic-sensitive influencer mining in interest-based social media networks via hypergraph learning. IEEE Trans Multimedia 2014; 16(3): 796–812. https://doi.org/10.1109/tmm.2014.2298216

19.

Gao

Munsell

, et al. Identifying high order brain connectome biomarkers via learning on hypergraph. Mach Learn Med Imaging 2016; 10019: 1–9. https://doi.org/10.1007/978-3-319-47157-0_1

20.

Franzese

Groce

Murali

, et al. Hypergraph-based connectivity measures for signaling pathway topologies. PLoS Comput Biol 2019; 15(10): 1–26. https://doi.org/10.1371/journal.pcbi.1007384

21.

Gao

Zhang

Lin

, et al. Hypergraph learning: methods and practices. IEEE Trans Pattern Anal Mach Intell 2022; 44(5): 2548–2566. https://doi.org/10.1109/TPAMI.2020.3039374

22.

Ridnik

Ben-Baruch

Zamir

, et al. Asymmetric loss for multi-label classification. In: Proceedings of the IEEE/CVF international conference on computer vision (ICCV), 2021, pp.82–91. https://doi.org/10.1109/ICCV48922.2021.00015

23.

Bogatinovski

Todorovski

Džeroski

, et al. Comprehensive comparative study of multi-label classification methods. Expert Syst Appl 2022; 203: 117215. https://doi.org/10.1016/j.eswa.2022.117215

24.

Han

Chen

, et al. A survey of multi-label classification based on supervised and semi-supervised learning. Int J Mach Learn Cybern 2023; 14: 697–724. https://doi.org/10.1007/s13042-022-01658-9

25.

You

Zhang

Wang

, et al. Attentionxml: label tree-based attention-aware deep model for high-performance extreme multi-label text classification. In: Advances in neural information processing systems 32 (NeurIPS 2019), 2019, pp.5820–5830. https://doi.org/10.48550/arXiv.1811.01727

26.

Liu

Wang

Shen

, et al. The emerging trends of multi-label learning. IEEE Trans Pattern Anal Mach Intell 2022; 44(11): 7955–7974. https://doi.org/10.1109/TPAMI.2021.3119334

27.

Lanchantin

Wang

Ordonez

, et al. General multi-label image classification with transformers. In: Proceedings of the 2021 IEEE/CVF conference on computer vision and pattern recognition (CVPR), 2021, pp.16473–16483. https://doi.org/10.1109/CVPR46437.2021.01621

28.

Sovatzidi

Vasilakakis

Iakovidis

. Towards the interpretation of multi-label image classification using transformers and fuzzy cognitive maps. In: Proceedings of the 2023 IEEE international conference on fuzzy systems (FUZZ), 2023, pp.1–7. https://doi.org/10.1109/FUZZ52849.2023.10309713

29.

Zhu

. Residual attention: a simple but effective method for multi-label recognition. In: Proceedings of the 2021 IEEE/CVF international conference on computer vision (ICCV), 2021, pp.184–193. https://doi.org/10.1109/ICCV48922.2021.00025

30.

Yasunaga

Aghajanyan

Shi

, et al. Retrieval-augmented multimodal language modeling. In: Proceedings of the 40th international conference on machine learning, proceedings of machine learning research, 2023, pp.40506–40526, vol. 202. https://doi.org/10.48550/arXiv.2211.12561

31.

Zhao

Chen

Wang

, et al. Retrieving multimodal information for augmented generation: a survey. In: Findings of the Association for Computational Linguistics: EMNLP 2023, 2023, pp.4736–4756. Association for Computational Linguistics. https://doi.org/10.18653/v1/2023.findings-emnlp.314

32.

Askari

. Fuzzy C-means clustering algorithm for data with unequal cluster sizes and contaminated with noise and outliers: review and development. Expert Syst Appl 2021; 165: 113856. https://doi.org/10.1016/j.eswa.2020.113856

33.

Krishnapuram

Keller

. The possibilistic C-means algorithm: insights and recommendations. IEEE Trans Fuzzy Syst 1996; 4(3): 385–393. https://doi.org/10.1109/91.531779

34.

Gailhard

Tartaglione

Naviner

, et al. HYGENE: a diffusion-based hypergraph generation method, 2024. https://doi.org/10.48550/arXiv.2408.16457.2408.16457

35.

Lin

Yan

Liu

, et al. Automatic hypergraph generation for enhancing recommendation with sparse optimization. IEEE Trans Multimedia 2024; 26: 5680–5693. https://doi.org/10.1109/TMM.2023.3338083

36.

Gao

Wang

Tao

, et al. 3-d object retrieval and recognition with hypergraph analysis. IEEE Trans Image Process 2012; 21(9): 4290–4303. https://doi.org/10.1109/TIP.2012.2199502

37.

Yang

Wang

Yang

, et al. Deep learning approaches for similarity computation: a survey. IEEE Trans Knowl Data Eng 2024; 36(12): 7893–7912. https://doi.org/10.1109/TKDE.2024.3422484

38.

Warrens

van der Hoef

. Understanding the adjusted rand index and other partition comparison indices based on counting object pairs. J Classif 2022; 39(3): 487–509. https://doi.org/10.1007/s00357-022-09413-z

39.

Collins

Penn

Carpendale

. Bubble sets: revealing set relations with isocontours over existing visualizations. IEEE Trans Vis Comput Graph 2009; 15(6): 1009–1016. https://doi.org/10.1109/TVCG.2009.122

40.

Dinkla

van Kreveld

Speckmann

, et al. Kelp diagrams: point set membership visualization. Comput Graph Forum 2012; 31: 875–884. https://doi.org/10.1111/j.1467-8659.2012.03080.x

41.

Meulemans

Riche

Speckmann

, et al. Kelpfusion: a hybrid set visualization technique. IEEE Trans Vis Comput Graph 2013; 19(11): 1846–1858. https://doi.org/10.1109/TVCG.2013.76

42.

Oliver

Zhang

. Scalable hypergraph visualization. IEEE Trans Vis Comput Graph 2024; 30(1): 595–605. https://doi.org/10.1109/TVCG.2023.3326599

43.

Jacobsen

Wallinger

Kobourov

, et al. Metrosets: visualizing sets as metro maps. IEEE Trans Vis Comput Graph 2021; 27(2): 1257–1267. https://doi.org/10.1109/TVCG.2020.3030475

44.

Valdivia

Buono

Plaisant

, et al. Analyzing dynamic hypergraphs with parallel aggregated ordered hypergraph visualization. IEEE Trans Vis Comput Graph 2021; 27(1): 1–13. https://doi.org/10.1109/TVCG.2019.2933196

45.

Valdivia

Buono

Plaisant

, et al. Using dynamic hypergraphs to reveal the evolution of the business network of a 17th century french woman merchant. In: Proceedings of the VIS4DH Workshop, 2018, pp.1–5.

46.

Agarwal

Beck

. Set streams: visual exploration of dynamic overlapping sets. Comput Graph Forum 2020; 39: 383–391. https://doi.org/10.1111/cgf.13988

47.

Peña-Araya

Xue

Pietriga

, et al. Hyperstorylines: interactively untangling dynamic hypergraphs. Inf Vis 2022; 21(1): 38–62. https://doi.org/10.1177/14738716211045007

48.

Fischer

Frings

Keim

, et al. Towards a survey on static and dynamic hypergraph visualizations. In: 2021 IEEE Visualization Conference (VIS), 2021, pp.81–85. IEEE. https://doi.org/10.1109/VIS49827.2021.9623305

49.

McDaid

Greene

Hurley

. Normalized mutual information to evaluate overlapping community finding algorithms, 2011. https://doi.org/10.48550/arXiv.1110.2515.1110.2515

50.

Amelio

Pizzuti

. Correction for closeness: adjusting normalized mutual information measure for clustering comparison. Comput Intell 2017; 33: 579–601. https://doi.org/10.1111/coin.12100

51.

McInnes

Healy

Melville

. UMAP: uniform manifold approximation and projection for dimension reduction, 2018. https://doi.org/10.48550/arXiv.1802.03426

52.

Keim

Zhang

. Solving problems with visual analytics: challenges and applications. In: Proceedings of the 11th international conference on knowledge management and knowledge technologies, 2011. https://doi.org/10.1145/2024288.2024290

53.

Wiedenbeck

. The use of icons and labels in an end user application program: an empirical study of learning and retention. Behav Inf Technol 1999; 18(2): 68–82. https://doi.org/10.1080/014492999119129

54.

Wah

Branson

Welinder

, et al. Caltech-UCSD birds-200-2011 (CUB-200-2011). Technical Report CNS-TR-2011-001, California Institute of Technology, 2011.

55.

Zhu

Wang

, et al. MLRSNet: a multi-label high spatial resolution remote sensing dataset for semantic scene understanding. ISPRS J Photogramm Remote Sens 2020; 169: 337–350. https://doi.org/10.1016/j.isprsjprs.2020.09.020

56.

Yang

. DSEG660: Multi-label image classification, https://kaggle.com/competitions/hbku2019 (2019, accessed 27 March 2025).

57.

Dutch Safety Board. Investigation Crash MH17, 17 July 2014, 2015. https://www.onderzoeksraad.nl/en/onderzoek/2049/investigation-crash-mh17-17-july-2014

58.

Erdős

Rényi

. On random graphs. I. Publ Math Debrecen 1959; 6(3-4): 290–297. https://doi.org/10.5486/PMD.1959.6.3-4.12

59.

Liu

Lin

, et al. Swin transformer v2: scaling up capacity and resolution. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (CVPR), 2022, pp.11999–12009. https://doi.org/10.1109/CVPR52688.2022.01170

60.

Adaloglou

Michels

Kalisch

, et al. Exploring the limits of deep image clustering using pretrained models. In: Proceedings of the 34th British Machine Vision Conference (BMVC), 2023. https://papers.bmvc2023.org/0297.pdf

Supplementary Material

Please find the following supplemental material available below.

For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.

For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.

3.24 MB

0.00 MB