Iranian Journal of Information Processing Management (22518231)39(2)pp. 565-598
Information fusion is a vital process in information management systems; it merges information from multiple sources to provide a more comprehensive and accurate view of a specialized domain. Structured information management systems offer several methods for aggregating, consolidating, and fusing information, but these approaches have not yet yielded a clear pattern. This research aims to explain the concept and appropriate models of information fusion in information management systems and their use in relationship-based databases such as thesauruses. The research is conceptual, with an analytical approach. The research population consisted of texts and outputs in the field of information fusion, and the data were collected by the library method. The findings showed that information fusion models fall into four general categories: models based on information flow, on workflow and activity, on the roles and functions of entities, and on understanding concepts. The Omnibus model (workflow and activity) and Endsley's model (roles and functions of entities), selected with reference to the basic JDL model, are suitable for relationship-based information management systems such as thesauruses and ontologies, and can be used with some changes. The main disadvantage of the reviewed information fusion models is that they pay no attention to the characteristics of specific information structures such as thesauruses.
Further shortcomings include: disregarding the role of expert users in decision-based information systems whose task is to extract specialized information; ignoring collective participation as a means of solving complex problems, such as finding and resolving similarities in a mass of information; not describing the functions required to solve these problems; not managing the effects of system or user errors or restoring fused and consolidated information; and not implementing the methods with operational examples in such structures. In general, any model of information fusion in thesaurus-based information management systems should consider the general processes of finding similarities, examining similarities, aggregating parameters and fusing information, and managing the effectiveness of subsystems. Unlike traditional information fusion methods, which focus on merging data, semantic information fusion emphasizes fusing the related knowledge and concepts stored in thesauruses. Therefore, it is suggested that new models of integration and fusion pay attention to semantic approaches. © 2024 Iranian Research Institute for Scientific Information and Documentation. All rights reserved.
Iranian Journal of Information Processing Management (22518231)39(1)pp. 241-264
This study aimed to evaluate the Philosophy Thesaurus of the Research Center for Islamic Documents and Information Management against the ISO 25964 standard (Parts I and II) and to examine its interoperability with other vocabulary control systems (the ASFA Thesaurus and Persian Subject Headings). The research was applied in terms of purpose and a descriptive survey in terms of data collection. The research population comprised the terms of the Philosophy Thesaurus, from which 375 terms were selected using Morgan’s table and random sampling. For the interoperability section, the population was the first six levels of the Philosophy Thesaurus, comprising 85 terms. Data were collected by direct observation, using a researcher-made checklist based on ISO 25964. The results showed that the Philosophy Thesaurus complies with ISO 25964 (Part I) at a rate of 75.52%. The highest and lowest levels of compliance were for the ‘software management’ component (79.34%) and the ‘semantic relations’ component (66.66%), respectively. The interoperability of the Philosophy Thesaurus with the ASFA Thesaurus according to ISO 25964 (Part II) showed 24.88% mapping, the highest of which was exact equivalence mapping (49.41%). In addition, 42.47% of the terms can be mapped automatically. Interoperability with Persian Subject Headings was 89.41%, and the highest rate was again for exact equivalence mapping (47.06%). In addition, 30% of the selected terms of the Philosophy Thesaurus can be mapped automatically to terms of Persian Subject Headings. Interoperability between thesauruses is an economical and efficient way to avoid the high costs of producing, compiling, and expanding a thesaurus.
Also, adherence to interoperability rules in accordance with the standards is the basis for data integration, and it allows the user to search efficiently, with improved precision and recall, regardless of the time, place, and type of database. © 2023 Iranian Research Institute for Scientific Information and Documentation. All rights reserved.
Knowledge Organization (09437444)48(5)pp. 345-356
This study aims to assess the localization of Schema.org for manuscript description in the Iranian-Islamic information context using documentary and qualitative content analysis. Schema.org introduces schemas for different Web content objects so as to generate structured data. Given that the structure of Schema.org is ontological, the study investigated how the manuscript types inherit properties from their parent types, as well as the localization and description of properties specific to manuscripts in the Iranian-Islamic information context, in order to improve their indexability and semantic visibility in Web search engines. The proposed properties specific to the manuscript type and the six properties proposed for addition to the “CreativeWork” type are found to be consistent with other schema properties. In turn, these properties localize the existing schema for the manuscript type, making it compatible with the Iranian-Islamic information context. This schema is also applicable to centers with published records on the Web; if those records are marked up with these properties, their indexability and semantic visibility in Web search engines increase accordingly. The generation of structured data in the Web environment through this schema is deemed to promote the concept of the Semantic Web and to make data and knowledge retrieval easier. © 2021, International Society for Knowledge Organization. All rights reserved.
Iranian Journal of Information Processing Management (22518231)34(4)pp. 1755-1786
The purpose of this conceptual research was to explain, through an analytical approach, the capabilities, semantic platform, and viewpoint of Schema.org for processing and organizing web content objects (data entities). Data were collected through documentary analysis. The research population included texts and studies related to "structured data" and "Schema.org". A total of 43 sources, as well as the official Schema.org website, were selected for analysis using purposive sampling. The results showed that Schema.org is a common vocabulary used to describe and mark up web content objects and to create structured data for better processing and organization. It has a defined structure and semantic platform. Its structure resembles an ontology for naming the types and properties of content objects, the relationships between types and properties, and the means of describing these properties and relationships. Its semantic platform is supported by semantic markup formats such as microformats, microdata, RDFa 1.1, and JSON-LD. The results also showed that there are three major approaches to processing and organizing content objects in Schema.org: the ontological, context-oriented, and nesting approaches. Overall, the existence of different approaches in Schema.org reflects a comprehensive view of Web content objects while paying attention to improving interoperability with search engines. Also, the production of structured data with such schemata is an important contribution to the realization of the semantic web, or web of data. © 2019 Iranian Research Institute for Scientific Information and Documentation. All rights reserved.
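As an illustration of the markup described in this abstract, the following minimal sketch builds a JSON-LD description of a content object in Python. It is not taken from the study: the title, author name, and the helper `to_json_ld` are hypothetical placeholders. The types `Book` and `Person` and the properties `name`, `author`, and `inLanguage` are existing Schema.org terms. The sketch touches two of the three approaches named above: `Book` uses `name` (defined on `Thing`) and `author` (defined on `CreativeWork`) through type inheritance, and the `Person` entity is nested inside the `Book` entity.

```python
import json

# Hypothetical example data: the title and author values are placeholders.
book = {
    "@context": "https://schema.org",   # tells consumers which vocabulary is used
    "@type": "Book",                     # Book is a subtype of CreativeWork
    "name": "An Example Title",          # property defined on Thing
    "inLanguage": "fa",                  # property defined on CreativeWork
    "author": {                          # nesting: one entity embedded in another
        "@type": "Person",
        "name": "Example Author",
    },
}


def to_json_ld(entity: dict) -> str:
    """Serialize an entity dictionary as a JSON-LD string."""
    return json.dumps(entity, ensure_ascii=False, indent=2)


print(to_json_ld(book))
```

On a web page, a string like this would typically be placed inside a `<script type="application/ld+json">` element.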
Iranian Journal of Information Processing Management (22518231)33(4)pp. 1793-1822
Ontology is a useful tool both for organizing resources and for knowledge representation. With the development of semantic web technologies, it has become necessary to expedite the process of building ontologies. The aim of the study is to explain the state of ontology-building methodologies, the design of a conceptual model of scientometrics, and the steps of constructing an ontology named ScientometricsOnt. The study is applied research. The research population comprises Persian-language books, articles, glossaries, thesauruses, dissertations, and research projects in the field of scientometrics. Terms were collected by searching internal databases and related resources. A domain analysis approach was used to create the conceptual model of scientometrics and to explain its relationships and individuals. The reliability and validity of the conceptual model were confirmed by experts in the field of scientometrics. The ontology was built with Protégé 5 software. The methodology was the OAsys method (Bermejo, 2007), with some changes. The results showed that the scientometrics ontology comprises eleven major classes, 20 relationships, and 100 individuals. This ontology can be a very useful tool for better knowledge representation in this field. It can also serve as a basis for extending and developing terms and future concepts in the field. © 2018 Iranian Research Institute for Scientific Information and Documentation. All rights reserved.