C&SA | Contents

Volume 61 >>> № 3 MAY — JUNE 2025

-->

DOI 10.34229/KCA2522-9664.25.3.13

UDC 004.318

O. Palagin¹, M. Petrenko², V. Kaverinskiy³, K. Malakhov⁴

¹ V.M. Glushkov Institute of Cybernetics, National Academy of Sciences of Ukraine, Kyiv, Ukraine

palagin_a@ukr.net

² V.M. Glushkov Institute of Cybernetics, National Academy of Sciences of Ukraine, Kyiv, Ukraine

petrng@ukr.net

³ V.M. Glushkov Institute of Cybernetics, National Academy of Sciences of Ukraine, Kyiv, Ukraine

insamhlaithe@gmail.com

⁴ V.M. Glushkov Institute of Cybernetics, National Academy of Sciences of Ukraine, Kyiv, Ukraine

k.malakhov@incyb.kiev.ua

A METHOD FOR ENHANCING THE EFFICIENCY
OF RDF/XML-STRUCTURE PROCESSING
IN THE APACHE JENA SEMANTIC WEB FRAMEWORK

Abstract. This study examines the impact of partitioning large-scale OWL-ontologies (RDF/XML-structures) and parallel query execution on the performance of complex SPARQL queries. The experimental results indicate that ontology partitioning, particularly for queries with long execution times, can significantly reduce query processing duration. For medium- and long-execution-time queries, a combination of ontology partitioning and parallel execution yields a performance improvement of up to 45% compared to sequential execution. Conversely, for short-execution-time queries, ontology partitioning may introduce additional delays, which can be partially mitigated through parallel processing. The study also finds that partitioning an ontology into more than 7вЂ“10 segments does not yield further performance gains, rendering excessive fragmentation an inefficient approach. The article underscores the importance of eliminating redundant constraints in queries, particularly those concerning hierarchical relationships between parent and descendant classes within the ontology. Optimizing or removing these constraints can significantly enhance query execution speed. Furthermore, a formal model is presented to theoretically describe the effects of ontology partitioning and parallel query execution on processing time. Additionally, the study establishes formal criteria for determining the impact of these techniques on different types of queries.

Keywords: Ontology engineering, Semantic Web, Apache Jena Semantic Web Framework, OWL, OWL-ontology, RDF/XML-structure, RDF/XML, SPARQL, Natural Language Dialogue System.

full text

REFERENCES

1. Nacer H., Aissani D. Semantic web services: Standards, applications, challenges and solutions. Journal of Network and Computer Applications. 2014. Vol. 44. P. 134вЂ“151. https://doi.org/10.1016/.

2. Ristoski P., Paulheim H. Semantic Web in data mining and knowledge discovery: A comprehensive survey. Journal of Web Semantics. 2016. Vol. 36. P. 1вЂ“22. https://doi.org/ 10.1016/.

3. DuCharme B. Learning SPARQL: Querying and updating with SPARQL 1.1. Beijing; Cambridge; Farnham; Kln; Sebastopol; Tokyo: OвЂ™Reilly Inc., 2013. 386 p.

4. SPARQL 1.1 Overview. https://www.w3.org/TR/.

5. Chebanyuk O. Software reuse approach based on review and analysis of reuse risks from projects uploaded to GitHub. In: Computer Science and Education in Computer Science. Zlateva T., Tuparov G. (Eds.). CSECS 2023. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering. 2023. Vol. 514. P. 144вЂ“155. https://doi.org/10.1007/.

6. Chebanyuk O. Investigation of drawbacks of the software development artifacts reuse approaches based on semantic analysis. In: Advances in Computer Science for Engineering and Education VI. Hu Z., Dychka I., He M. (Eds.). ICCSEEA 2023. Lecture Notes on Data Engineering and Communications Technologies. 2023. Vol. 181. P. 514вЂ“523. https://doi.org/10.1007/.

7. Chebanyuk O.V. An approach of text to model transformation of software models. Proc. 13th International Conference on Evaluation of Novel Approaches to Software Engineering ENASE (23-24 March 2018, Funchal, Madeira, Portugal). Funchal, 2018. P. 432вЂ“439. https://doi.org/10.5220/0006804504320439.

8. Opanasenko V.M., Fazilov S.K., Mirzaev O.N., Kakharov S.S. ugli. An ensemble approach to face recognition in access control systems. Journal of Mobile Multimedia. 2024. Vol. 20, Iss. 3. P. 749вЂ“768. https://doi.org/10.13052/jmm1550-4646.20310.

9. Apache Jena Semantic Web Framework. https://github.com/apache/jena. 2025.

10. Saleem M., Hasnain A., Ngonga Ngomo A.-C. LargeRDFBench: A billion triples benchmark for SPARQL endpoint federation. Journal of Web Semantics. 2018. Vol. 48. P. 85вЂ“125. https://doi.org/10.1016/.

11. Saleem M., SzГЎrnyas G., Conrads F., Bukhari S.A.C., Mehmood Q., Ngonga Ngomo A.-C. How representative is a SPARQL benchmark? An analysis of RDF triplestore benchmarks. Proc. WWWвЂ™19: The World Wide Web Conference (13вЂ“17 May 2019, San Francisco, CA, USA). San Francisco, 2019. P. 1623вЂ“1633. https://doi.org/10.1145/.

12. Katib A., Slavov V., Rao P. RIQ: Fast processing of SPARQL queries on RDF quadruples. Journal of Web Semantics. 2016. Vol. 37вЂ“38. P. 90вЂ“111. https://doi.org/10.1016/.

13. Guo Y., Pan Z., Heflin J. LUBM: A benchmark for OWL knowledge base systems. Journal of Web Semantics. 2005. Vol. 3, Iss. 2вЂ“3. P. 158вЂ“182. https://doi.org/10.1016/.

14. Li W., Wang S., Wu S., Gu Z., Tian Y. Performance benchmark on semantic web repositories for spatially explicit knowledge graph applications. Computers, Environment and Urban Systems. 2022. Vol. 98. Article number 101884. https://doi.org/10.1016/.

15. Palagin O.V., Kaverinskiy V.V., Petrenko M.G., Malakhov K.S. Digital health systems: Ontology-based universal dialog service for hybrid E-rehabilitation activities support. Proc. 2023 IEEE 12th International Conference on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications (IDAACS) (07вЂ“09 September 2023, Dortmund, Germany). Dortmund, 2023. P. 84вЂ“89. https://doi.org/10.1109/.

16. Litvin A., Palagin O.V., Kaverinskiy V.V., Malakhov K.S. Ontology-driven development of dialogue systems. SACJ. 2023. Vol. 35, N 1. P. 37вЂ“62. https://doi.org/10.18489/.

17. Kaverinsky V.V., Malakhov K.S. Natural language-driven dialogue systems for support in physical medicine and rehabilitation. South African Computer Journal. 2023. Vol. 35, N 2. P. 119вЂ“126. https://doi.org/10.18489/.

18. Bakhshi M., Nematbakhsh M., Mohsenzadeh M., Rahmani A.M. Data-driven construction of SPARQL queries by approximate question graph alignment in question answering over knowledge graphs. Expert Systems with Applications. 2020. Vol. 146. Article number 113205. https://doi.org/10.1016/.

19. Almendros-JimР‘nez J.M., Becerra-TerР›n A. Discovery and diagnosis of wrong SPARQL queries with ontology and constraint reasoning. Expert Systems with Applications. 2021. Vol. 165. Article number 113772. https://doi.org/10.1016/.

20. Mosser M., Pieressa F., Reutte J.L., Soto A., D. Querying APIs with SPARQL. Information Systems. 2022. Vol. 105. Article number 101650. https://doi.org/10.1016/.

21. Naveen Kumar V., Ashok Kumar P.S. An efficient and scalable SPARQL query processing framework for big data using MapReduce and hybrid optimum load balancing. Data & Knowledge Engineering. 2023. Vol. 148. Article number 102239. https://doi.org/10.1016/.

22. Malakhov K.S., Vakulenko D., Kaverinskiy V.V. EBSCO articles dataset (domain knowledge: rehabilitation medicine) + JSON of every article. 2023. https://zenodo.org/record/ 8308214, https://doi.org/10.5281/ZENODO.8308214"target=_blank> https://doi.org/10.5281/ZENODO.8308214.

23. Palagin O.V., Kaverinskiy V.V., Litvin A., Malakhov K.S. OntoChatGPT information system: Ontology-driven structured prompts for ChatGPT meta-learning. IJC. 2023. Vol. 22, Iss. 2. P. 170вЂ“183. https://doi.org/10.47839/.

24. Litvin A.A., Velychko V.Yu., Kaverinskiy V.V. A new approach to automatic ontology generation from the natural language texts with complex inflection structures in the dialogue systems development. Proc. 13th International Scientific and Practical Conference from Programming UkrPROGвЂ™2022 (11вЂ“12 October 2022, Kyiv, Ukraine). Kyiv, 2022. P. 172вЂ“185. https://ceur-ws.org/.

25. Nakajima H., Miura J. Combining ontological knowledge and large language model for user-friendly service robots. arXiv:2410.16804v1 [cs.RO] 22 Oct 2024. https://doi.org/10.48550/.

26. GPT-4 Technical Report. arXiv:2303.08774v6 [cs.CL] 4 Mar 2024. https://doi.org/10.48550/.

27. OpenAI o1 System Card. 2024. https://cdn.openai.com/o1-system-card.pdf.

28. QwQ: Reflect Deeply on the Boundaries of the Unknown. 2025. https://qwenlm.github.io/ blog/.

29. DeepSeek-AI. DeepSeek-R1: Incentivizing reasoning capability in LLMs via reinforcement learning. arXiv:2501.12948v1 [cs.CL] 22 Jan 2025. https://doi.org/10.48550/.

UDC 004.318

A METHOD FOR ENHANCING THE EFFICIENCY OF RDF/XML-STRUCTURE PROCESSING IN THE APACHE JENA SEMANTIC WEB FRAMEWORK

REFERENCES

A METHOD FOR ENHANCING THE EFFICIENCY
OF RDF/XML-STRUCTURE PROCESSING
IN THE APACHE JENA SEMANTIC WEB FRAMEWORK