Abstract
In this paper we present the rationale adopted for the integration of the knowledge level of D
Keywords
Introduction
In this work we present a novel version of the D
The D
This work is organized as follows: in the first sections we sketch (by referring to previous work for additional details) the main elements inspiring our system and its theoretical bases as well as its overall architecture; in Section 5 we describe the improvements related to the automatic generation of common-sense knowledge; in Section 7 we show how our hybrid system for conceptual categorization was integrated into SOAR and LIDA, and finally we describe the evaluation experiments and elaborate on the future works.
Types of Conceptual Representations
In Cognitive Science different theories about how humans organise, reason and retrieve conceptual information have been proposed. The oldest one, known as “classical”, states that concepts can be simply represented in terms of sets of necessary and sufficient conditions. In the mid ’70s of the last Century, however, Rosch’s experimental results demonstrated its inadequacy for ordinary –or common sense– concepts, that cannot be described in terms of necessary and sufficient traits [57].
In particular, Rosch’s results showed that ordinary concepts are organized in our mind in terms of prototypes. Since then different theories of concepts have been proposed to explain different representational and reasoning aspects concerning the problem of typicality. We recall here the prototype theory and the exemplars theory. 2 According to the prototype view, knowledge about categories is stored in terms of prototypes, i.e., in terms of some representation of the most typical instance of the category. In this view, the concept bird should coincide with the representation of a typical bird (e.g., a robin). According to the exemplar view, a given category is mentally represented as a set of specific exemplars explicitly stored in memory: the mental representation of the concept bird is a set containing the representations of (some of) the birds we encountered during our past experience. Although these approaches have been largely considered as competing ones, several results (starting from the work of Malt [42]) suggested that human subjects may use, in different occasions, different representations to retrieve and categorize concepts. Such experimental evidences led to the development of the so called “heterogeneous hypothesis” about the nature of concepts, hypothesizing that different types of conceptual representations coexist at the same time: prototypes, exemplars, classical representations, and so on [41].
This hypothesis has been recently extended within the field of knowledge representation applied to biologically inspired cognitive architectures: a novel approach to concept representation has been proposed, considering concepts as “heterogeneous proxytypes” [31]. In this view, conceptual structures in natural and artificial cognitive systems and architectures are assumed to be composed by heterogeneous representations (or bodies of knowledge) referring to the same conceptual entity. Each body of knowledge provides specific types of information and specific access and reasoning procedures to the concept it is referred to. Such heterogeneous representations are proxytypes [56], in the sense that they can be contextually activated by external stimuli, coming from the environment, and ‘go proxy’ in working memory (a sort of temporary buffer available in human and artificial memory structures), for their reference category. The proxyfication may be then the result of activities such as concept identification, recognition, retrieval, and so forth. The different types of conceptual representations hypothesized to coexist in the heterogeneous proxytypes approach are typicality-based representations of a given concept (i.e., prototypes and exemplars-based representations), as well as representations in terms of necessary and/or sufficient conditions. As an example of such type of representational hypothesis let us consider the ordinary concept of water. The classical component will contain the information that water is exactly the chemical substance whose formula is H2O, that is the substance whose molecules have two hydrogen atoms with a covalent bond to the single oxygen atom. On the other hand, the prototypical facet of the concept will grasp that water usually occurs in liquid state, and is a colourless, odourless and tasteless fluid. The exemplar-based representations grasp information on individuals, such as a given instance of water presenting, for example, an unusual water color (e.g., red-water). According to the heterogeneous proxytypes approach, the activation in working memory of such conceptual structures is context-dependent: if an agent perceives an instance of water in a typical scenario –for instance, in a bottle– the only type of conceptual knowledge that will be activated and proxyfied will be the prototypical knowledge associated to that concept (and not, for example, the classical or exemplar-based information associated to the same conceptual entity).
A Dual Process Conceptual Categorization
From a reasoning perspective the heterogeneous hypothesis assumes that the retrieval of the above mentioned representations is driven by different process types. In particular, prototype and exemplar-based retrieval is based on a fast and approximate kind of categorization, and benefits from common-sense information associated to concepts. 3 On the other hand, the retrieval of classical representation of concepts is featured by explicit rule following, and makes no use of common-sense information. These two differing categorization strategies have been widely studied in psychology of reasoning in the frame of the dual process theory, that postulates the co-existence of two different types of cognitive systems [15, 25]. The systems of the first type (type 1) are automatic, associative, parallel and fast. The systems of the second type (type 2) are more recent, conscious, sequential and slow, and featured by explicit rule following. We assume that both systems can be composed in turn by many sub-systems and processes.
The System
The D
Following the hypotheses in [18, 37] the two sorts of reasoning processes interact, since Type 1 processes are executed first and their results are then refined by Type 2 processes. In the implemented system the typical representational and reasoning functions are assigned to the System 1 (hereafter S1), which executes processes of Type 1, and is associated to the Conceptual Spaces framework [21]. The reasoning functions herein are implemented as similarity calculations in a metric space. On the other hand, the classical representational and reasoning functions are assigned to the System 2 (hereafter S2) to execute processes of Type 2, and are associated to a standard symbolic based ontological representation (in our case the OpenCyc ontology [29] was used).
In Section 6 we briefly describe the categorization pipeline of the system by presenting the dynamics of the interaction between S1 and S2 processes. In the following we introduce the two representational and reasoning frameworks used in our system, by focusing on i) how typicality information (including both prototypes and exemplars) and their corresponding non monotonic reasoning procedures can be encoded through conceptual spaces; and on ii) how classical information can be naturally encoded in terms of formal ontologies.
Conceptual spaces (CSs) are a representational framework where knowledge is represented as a set of quality dimensions, and where a geometrical or topological structure is associated to each quality dimension. Instances can be represented as points in a multidimensional space, and their similarity can be computed as the intervening distance between each two points, based on some suitable metrics (such as Euclidean and Manhattan distance, or standard cosine similarity). 4 In this setting, concepts correspond to convex regions, and regions with different geometrical properties correspond to different sorts of concepts [20, 21]. Prototypes have a natural geometrical interpretation in conceptual spaces, in that they correspond to the geometrical centre of a convex region. This can be thought of as a centroid, that is the mean position of all the points in all dimensions. This representation also allows us, given a convex region, to associate each point to a certain centrality degree, that can be interpreted as a measure of its typicality [22]. This framework has been used also to encode the exemplars, represented as points in the multidimensional space. Conceptual spaces can be also used to compute the proximity between any two entities, and between entities and prototypes. Concepts, in this framework, are characterized in terms of domains [20, 21]. Typical domain examples are color, size, shape, texture. In turn, domain information can be specified along some dimensions, e.g., regarding color domain, relevant dimensions are hue, chromaticity, and brightness. 5 On the other hand, the representation of the classical information related to a given concept is demanded to classical ontological formalizations. In this setting, formal ontologies provide the characterization of concepts in terms of necessary and sufficient conditions (if these conditions exists: as mentioned, most common sense concepts cannot be characterized in these terms). Additionally, the ontological representations are used by the 𝒮2 component (as mentioned, in our implementation it is grounded on the OpenCyc ontology).
Figure 1 shows an example of the heterogeneous representation for the concept dog. In this example, the exemplar and prototype-based representations make use of non classical (or typical) information and, as mentioned, are represented by using the framework of the conceptual spaces. Namely, the prototypical representation grasps information such as that dogs are usually conceptualized as domestic animals, with typically four legs, a tail etc.; the exemplar-based representations grasp information on individuals. For example, in Fig. 1 it is represented the individual of Lessie, which is a particular exemplar of dog with white and brown fur and with a less domestic attitude w.r.t. the prototypical dog (e.g. its typical location is lawn). Both sorts of representations activate Type 1 processes. On the other hand, the classical body of knowledge is filled with necessary and sufficient information to characterize the concept (representing, for example, the taxonomic information that a dog is a mammal and a carnivore), and activates Type 2 processes. This body of knowledge is represented with standard ontological formalisms and is grounded on OpenCyc. For the sake of readability the information in Figure 1 is visualized with a uniform format, even though the different representations are actually encoded in different formalisms.

Heterogeneous representation of the dog concept in the hybrid knowledge base.
With respect to the previous versions of the system, one of the main advances of the current version of D
In this Section we provide an overview of the methods used for the automatic population of conceptual spaces starting from linguistic resources. A complete description of this module can be found in [35]. The portion of the system performing this task is called
Semantic Extraction. In the semantic extraction step, we access the ConceptNet node associated with t and scan its incoming and outgoing edges: in so doing we retrieve the related terms. The list of 12 relations that are presently considered –out of the 57 relations available in ConceptNet– is provided in Table 1. Since ConceptNet does not provide any anchoring mechanism to associate its terms to meaning identifiers (BabelSynset IDs), it is necessary to determine which edges are relevant for the concept associated to t. In other words, when we access the ConceptNet page for t, we find not only the edges regarding t with a given sense, but all the edges regarding t in any possible meaning. To select only (and possibly all) the edges that concern the sense c
t
, we introduce the notion of relevance. To give an intuition of this process, terms found in ConceptNet are relevant (and thus retained) either if they exhibit a heavy weight in the NASARI vector corresponding to the considered concept, or if they share at least some terms with the NASARI vector. Finally, relevant terms are disambiguated and added to the bag of concepts C [34, pp. 438–41]. For example, given in input the pair 〈bank, c
bank
〉, where c
bank
= 00008364
The list of the considered ConceptNet relations
Semantic Matching. The semantic matching step consists in generating a new exemplar ex in the CS representation, and in filling it with the information previously extracted. An exemplar is basically a list of sets of BabelSynset IDs, where each set corresponds to a quality dimension; it is named and identified in accordance with the seed term t and its meaning c t . The system adopts a set of quality dimensions that has been designed to meet the representational requirements posed by a set of sentences and cross-domain concepts collected in the frame of an interdisciplinary research project, aimed at investigating the neuroscientific bases of lexical processing. The selected quality dimensions aim at representing perceptually salient features (such as size, shape and color), with commonsense knowledge (e.g., partOf, hasPart, function) and taxonomic information (class, family). Some of these dimensions were borrowed from (the most frequent ones in) ConceptNet, whilst in other cases it was necessary to undertake an ad-hoc approach, as illustrated below. Without loss of generality, the considered set of dimensions can be extended or refined to describe some specific domain, for example by devising further dimensions to represent physical dimensions.
The process of assigning a certain value to a quality dimension is called dimension anchoring, and its implementation differs according to the way quality dimensions are filled: every quality dimension can be filled either based on ConceptNet or on a dictionary. In the former case (ConceptNet-driven approach) the process of extracting values to fill d leverages the set of edges; in the latter case (dictionary-driven approach) we exploit the dictionary associated with the quality dimension d. Additionally, every quality dimension can be metric or not (the whole picture is provided in Table 2). For metric quality dimensions we devised a set of translation maps (e.g., in the case of color, we directly translate the red color into its L*a*b color space: 〈53, 80, 67〉).
List of the considered quality dimensions; the last two columns indicate respectively whether each dimension is filled in a dictionary-driven (DD) or in a ConceptNet-driven (CND) way
In the conceptual spaces framework metrical values are fundamental to be able to compute forms of common-sense reasoning by exploiting the distances between exemplars in the resulting geometrical framework. After the new exemplar ex is filled with the values extracted through the above mentioned procedure, we translate the values of the metric quality dimensions by exploiting the related translation maps. Translation maps have been devised to map the extracted values onto the corresponding set of metric values in the conceptual space. For example, the locomotion dimension is used to account for the type of movement (1:swim, 2:dig, 3:crawl, 4:walk, 5:run, 6:roll, 7:jump, 8:fly). In the conceptual space representation the above mentioned values are translated into a numerically ordered scale such that the distance between indexes of values mirrors the semantic distance between the different types of locomotion: e.g., in this setting “dig” and “crawl” are assumed to be closer than “swim” and “fly” [7].
In order to build an actual conceptual space, the
The semantic extraction step ended up with 516 success cases, where the bag-of-concepts C contained at least one extracted concept. The 76 failures were caused by the lack of the ConceptNet node for the input term, rather than by the extraction of irrelevant concepts. Further 30 input terms were dropped because their final bag-of-concepts did not contain any suitable value to fill the exemplars. This led to a total of 486 correctly extracted exemplars, and to filling overall 2, 388 dimensions (on average 4.9 per exemplar).
Categorization Pipeline of the System
In this Section we briefly recall, for the sake of self-containedness, how the whole categorization pipeline of D
Mapping Conceptual Spaces and Ontological Representation
A relevant issue we face is aligning knowledge resources based on different sorts of representational formalisms. Under an architectural perspective, 𝒮1 and 𝒮2 rely on knowledge bases encoded in different ways, which need to be connected and mapped onto a shared and uniform representation of meaning in order to allow the SOAR and LIDA layers to operate them (see Section 7).
The heterogeneous proxytypes approach, in particular, requires the existence of co-referring representational structures to account for conceptual knowledge. Such co-reference relies on the fact that all the different bodies of knowledge are assumed to semantically point to the main reference conceptual container. In our system, such a container has been automatically provided with a WordNet synset ID. In addition, also the pointing representations, containing the different types of conceptual knowledge, have been equipped with the same WordNet synset ID referred to their corresponding concept. The anchoring mechanism between such heterogeneous representations follows two different ways. The first one corresponds to the mapping between the concept and its related ontological component. This mapping is provided in the OpenCyc ontology, which is sometimes equipped with the information regarding the corresponding WordNet synset ID. On the other hand, the anchoring between the conceptual space representations and the corresponding general concept is obtained thanks to the connection obtained with the BabelNet linguistic resource via the

Resources involved for the grounding of heterogeneous representations via WordNet.
The current system has been integrated with two additional widely known cognitive architectures 9 : SOAR [26] and LIDA [16]. SOAR is one of the most mature cognitive architectures and has been used by many researchers worldwide during the last 30-years. One of the main themes in SOAR is that all cognitive tasks can be represented by problem spaces that are searched by production rules grouped into operators. These production rules are fired in parallel to produce reasoning cycles. From a representational perspective, SOAR exploits symbolic representations of knowledge (called chunks) and uses pattern matching and spreading activation to select relevant knowledge elements. The LIDA architecture, on the other hand, is a partial implementation of the LIDA cognitive model [17] and employs a variety of modules that are designed using a variety of computational mechanisms drawn from AI. A peculiarity of this architecture resides in its detailed workflow determining the interactions between automatic (sub-conscious) processes and controlled (conscious) ones in the memory systems of its agent. Such model is explicitly grounded on the Global Workspace Theory[4, 5].
The rationale underlying such integration efforts was to investigate whether the outlined approach is compatible with architectures implementing different cognitive theories of mind, so to be able to argue that it can be considered as a framework general enough for representation and reasoning on conceptual information.
One main difference between the two architectures is that LIDA is considered a hybrid architecture, while SOAR, on the other hand, is entirely symbolic. In particular, LIDA employs both symbolic and subsymbolic representational elements, and, more specifically, it employs the subsymbolic activation of symbolic representational chunks; SOAR adheres to the Newell and Simon’s physical symbol system hypothesis [51] which states that symbolic processing is a necessary and sufficient condition for intelligent behavior [50]. Both architectures, however, are not natively dual-process based. Therefore in both cases, the dual mechanisms of reasoning needed to be explicitly designed and instantiated within an existing general framework. In addition, none of the architectures addresses the problem concerning the representation of (and the reasoning on) common-sense knowledge components, such as prototypes and exemplars (and the related reasoning strategies). In SOAR this problem arises despite the fact that the chunks can be represented as a sort of frame-like structures containing some common-sense (e.g., prototypical) information. In fact, the main problem of this architecture w.r.t. the heterogeneity assumption, relies on the fact that it does not specify how the typical conceptual components (that can eventually be represented in terms of frame-like slots) and the corresponding non monotonic-reasoning strategy, can interact with a possibly conflicting representational and reasoning procedures characterizing a different conceptualisation of the same conceptual entity
10
. In short it assumes, like most of the symbolic-oriented CAs, the availability of a monolithic conceptual structure (e.g., a frame-like prototype or a classical concept) without specifying how such information can be integrated and harmonized with other knowledge components to form the whole knowledge spectrum characterizing a given concept. In LIDA, on the other hand, despite many kinds of approximate comparisons and similarity-based reasoning (e.g., in tasks such as categorization), are, in theory, possible to execute, the peculiarity concerning prototype or exemplars based representations (along with the the design of the interaction between their different reasoning strategies) is not provided. In this sense, the integration of D
As for the previous integrations [39, 40], for both architectures we focused on the Declarative Memory (also named Semantic Memory), and Working Memory modules, and on the corresponding retrieval mechanisms. Besides, the dual process strategies of concept categorization have been integrated into SOAR and LIDA processes and connected to the retrieval request executed in the Working Memory.
Figure 3 gives a very general overview of the rationale behind the described integration of D

General overview of the D
It is worth-noting that, in the literature, there are efforts proposed to extend the Declarative Memories of CAs in order to enable more complex and cognitively inspired forms of reasoning. To this class of works belongs that one proposed by [53] aiming at extending the knowledge layer of ACT-R with external ontological content related to the event modelling or that one by Salvucci [58], enriching the knowledge model of the Declarative Memory (DM) of ACT-R with a world-level knowledge base such as DBpedia (i.e. the semantic version of Wikipedia represented in terms of ontological formalisms) and a previous one proposed in [6] presenting an integration of the ACT-R Declarative and Procedural Memory with the Cyc ontology [29]. The main problematic aspect concerning the extension of the DM with such wide-coverage integrated ontological resources, however, is that the underlying formalisms of such frameworks only allow to represent conceptual information in terms of symbolic structures. As a consequence, they encounter the standard problems affecting this class of representations in dealing with the representation of common-sense knowledge components, mostly absent in such resources (for a more detailed and extended discussion we refer the interested reader to [34]). In this sense, they provide an integration only with the 𝒮2 classical conceptual component. This aspect represents a problem since all these integrated symbolic systems do not represent at all the typical information associated to a given concept. As we will see in more detail in Section 8, this phenomenon prevents the type of common-sense conceptual retrieval based on typical traits that, on the other hand, represent one of the main contributions resulting by our integrations.
Concepts are represented in SOAR as empty chunks (that is, chunks having no associated information, except for the WordNet synset ID and a human readable name), referred to by the external bodies of knowledge (prototypes and exemplars) acting like semantic pointers. Here we have integrated the hybrid knowledge base directly into the semantic memory or SMEM of SOAR, that is equivalent to the Declarative Memory in LIDA and ACT-R. This sort of memory is accessed through two dedicated working memory channels, called
LIDA Integration
The integration at the representational and reasoning level in LIDA followed the same rationale as indicated for SOAR. In particular, we adapted the following modules of the architecture to integrate the structure assumed in D the Sensory Module, receiving input from the external environment and implementing the IE step; the Perceptual Associative Memory (PAM), that receives the encoded linguistic stimulus by the Sensory Module and sends its instantiation to a working memory buffer called Workspace; the Declarative Memory, that is queried via a cue-based retrieval by the Workspace.
The dual process based categorization mechanisms have been implemented based on the following procedure: every request is encoded in the Workspace working memory as a particular type of instance (instance chunk). The dimensions and values of every instance chunk are filled by the PAM module with the information extracted from the linguistic description. Such process is arranged as a series of rounds, each producing a query, by using the cue retrieval mechanism provided by the architecture, to the implicit 𝒮1 component and to the explicit 𝒮2 module. As indicated in Section 6, these mechanisms require to handle different types of responses returned by the dual systems. Such responses involve different parts of the memory structure of LIDA. In particular, once the the chunk request is built, a retrieval request is executed on the 𝒮1 knowledge base, with the aim at retrieving an exemplar or a prototype-based representation. The obtained 𝒮1 result is then proxyfied and temporarily stored in a buffer of the LIDA working memory (the so called Workspace). Afterward, a second request is sent to the Declarative Memory in order to check, as previously illustrated, the results of the 𝒮1 with the external 𝒮2 knowledge base represented by the Cyc ontology.
An important aspect that is modelled in LIDA regards the fact that, while the consistency check of the second request to 𝒮2 is performed, the temporary result obtained by the 𝒮1 categorization process is broadcasted to the entire system. By using the LIDA’s architectural terminology, this means that the 𝒮1 result stored in the working memory buffer (and waiting for the 𝒮2 response) is also sent to the to the Global Workspace module of the architecture. This additional step allows the system to make immediately available the 𝒮1 output to the remaining modules of the architectures without waiting for the slow 𝒮2 result. This process enables the LIDA agents to perform other tasks in real time with the available information. In case the response of 𝒮2 results inconsistent with the previous 𝒮1 result (and if the task for which the categorization was requested is still in focus), then the architecture will be available to broadcast to the Global Workspace a revised answer.
Evaluation
By following the suggestions presented in [52] we tested our integrated categorization system in a conceptual categorization task very similar to the psychological test known as “Word Reasoning”. For human subjects, the Word Reasoning task consists in identifying a concept based on one to three clues. The testee might be told “You can see through it” as a first clue; “It is square and you can open it”, and so on. The processing required by a Word Reasoning items goes beyond retrieval because the testee has to integrate the clues and choose among alternative hypotheses. In addition, such task can be seen as a common-sense reasoning one,known to be still on of the grand challenges of AI [46], since the answer to ths kind of queries require to resort to a “common knowledge about the world that is possessed by every schoolchild that has the methods for making obvious inferences from this knowledge” [11]. Unfortunately, as reported by [52], the standard specific questions provided for this task in the Wechsler Preschool and Primary Scale of Intelligence are proprietary. Nonetheless, the general structure of each sentence is public, so that we have re-used a dataset composed of 111 linguistic descriptions (corresponding to very simple riddles) designed by a team of linguists and neuroscientist in the frame of a research project investigating neural correlates of lexical processing. Such descriptions exhibit a structure similar to that of the Word Reasoning task: on average, no more than 3 cues are present in each riddle. The descriptions were given in input to the implemented system. An example of such descriptions is “The mice hunter with whiskers and long tail”, where the expected category to be retrieved was cat, and in particular its representation corresponding to the “prototype of cat”; conversely, a description such as “The felin mice hunter without fur” was expected to lead as answer to “exemplar of canadian-sphynx”. The expected categorical targets represent a gold standard, since they correspond to the results provided by 45 human subjects in a psychological experimentation. The present experimentation extends that presented by [39] in several ways: firstly, the simulation of the categorization processes is now performed on two additional, integrated, cognitive architectures; secondly, it considers an extended knowledge base; finally, more than half of the 𝒮1 knowledge base has been automatically extracted, as described in Section 5. We designed a twofold experimentation where our system results were compared with both the human responses, and with the results obtained by other systems: Wolfram Alpha, and two general-purpose search engines (Google and Bing) used in question-answering mode [24]. In particular, for the search engines we compared our system results with the first 10 answers they returned for each riddle/query.
11
We then manually evaluated the content of each page in order to assess whether the resulting document was associated to the expected category (i.e., expected w.r.t. the human answers). We tested the whole pipeline, where the salient information is extracted by starting from the linguistic description, the corresponding representation is retrieved, proxyfied and loaded in working memory, in both the architectures, according to the dual process approach. The information extraction of the linguistic input is not implemented in the two cognitive architectures, but it relies on the CoreNLP Stanford Parser [43], which is used to convert the textual description into a chunk request. Our evaluation records two distinct metrics: Concept-categorization accuracy (CC- Proxyfication accuracy (P-
Results and Discussion
The integrated system shows good results for the detection of the expected concepts, compared to both human and artificial systems responses. These figures are reported in Table 3-a. Table 3-b reports the detailed errors committed in the proxyfication phase. Provided that proxyfication errors occur only when the concept has been correctly categorized, three kinds of errors were recorded: an exemplar returned in place of an expected prototype (column Ex-Proto); a prototype returned in place of an expected exemplar (column Proto-Ex), or a wrong exemplar retrieved in place of a correct one (e.g., an individual of polar_bear in place of an individual of asian_black_bear, column Ex-Ex).
Compared to human response the vast majority of errors of our system are due to the confusion between exemplars and prototypes. In particular, in the 28.2% of the considered stimuli an exemplar-proxyfied representation has been returned by the system in spite of the expected prototype. This unexpected error is due to the heuristics —proper to human cognition [44] and implemented by the categorization algorithm — that favors the results of the exemplars-based categorization w.r.t. their prototypical counterpart. While this heuristics is helpful in most cases, the analysis of such proxyfication errors points out that the interaction of such common-sense reasoning mechanisms deserves additional clarification in the theoretical and experimental psychological literature. Such heuristics, in fact, resulted counterintuitive and not efficacious for the correct categorization of general descriptions.
The other systems suffer from different problems: in particular, Google committed the majority (18.9%) of errors in favoring prototypes over expected exemplars, whilst the predominant error in Bing was due to the retrieval of a wrong exemplar (Ex-Ex error). The results of Wolfram Alpha, finally, were not surprising since this system is able to answer classical scientific-oriented queries, while it is not yet equipped to deal with common-sense knowledge and reasoning. This aspect, in fact, still represents one of the more challenging aspects in the current knowledge-based AI systems [30] affecting, as reported in [12], also systems such as IBM Watson. 12
Additional empirical results supporting the feasibility of the overall dual-process based framework are provided in [34]. Such results, that we mention here for the sake of completeness, show that the 𝒮1-𝒮2 component adopted in D
This paper has proposed, as a unifying representational and reasoning framework for artificial agents, a cognitively-inspired conceptual architecture implemented in a system which has now been integrated into SOAR and LIDA. The proposed framework has shown a good deal of compatibility with such general cognitive architectures making different assumptions about the structures and processes of human cognition (and of human-like cognition in AI systems). Although there is room for both refining the framework and tuning the implemented system, the results obtained in a task of common-sense conceptual categorization are encouraging when compared to both human and artificial responses. As above mentioned, we stress that, with respect to our previous work, the extensions described in this paper enable us to fully collocate the employed representational framework adopted in D
As a mid term goal, we plan to integrate the proposed representational and reasoning framework into further general cognitive architectures (e.g., in LEABRA [54] or SPAUN [8, 14]), based on still different representational and reasoning assumptions w.r.t. SOAR and LIDA. Should also these integrations be feasible, we would be allowed to reinforce our claim that the proposed representational and reasoning framework can be used as a reference for the knowledge level of different cognitive agents, thus providing a sort of interlingua for heterogeneous architectures. In particular, in a multi-agent setting, the provided framework can be seen as a communication layer. Such a layer is suited i) to extend the knowledge stored in the Long Term Memory of the individual agents; and ii) to provide a more advanced —shared across the architectures— set of reasoning procedures to query, retrieve and reason on conceptual knowledge coupling standard and common-sense reasoning procedures. Such procedures contribute to fill the gap between the existing cognitive architectures and the categorization heuristics used by human cognition and not previously available (or only partially available) in those systems [38].
Another strength of the proposed approach regards the possibility, for diverse cognitive agents, to interpret and process the shared knowledge level, meantime maintaining the specificities and the constraints proper to each architecture.
Footnotes
It is worth-noting that, in this setting, the provided integration should not be seen as a mere implementation fact. On the contrary, the implemented integration of our framework with the software instantiations of the involved cognitive architecture follows a previous, and more complex, integration developed between the abstract models of cognition assumed by each architecture and our system. As a consequence, the resulting integration is provided by respecting the architectural constraints of each of these systems. This means, in other words, that our work provides evidence that the adopted representational framework is cognitively compliant with such diverse agent architectures.
If we have to categorize a stimulus with the following features: “it has fur, woofs and wags its tail”, the result of a prototype-based categorization would be dog, since these cues are associated to the prototype of dog. Prototype-based reasoning, however, is not the only type of reasoning based on typicality. In fact, if an exemplar corresponding to the stimulus being categorized is available too, it is acknowledged that humans use to classify it by evaluating its similarity w.r.t. the exemplar, rather than w.r.t. the prototype associated to the underlying concepts [
]. This type of common sense categorization is known in literature as exemplars-based categorization.
In the present setting, distances are computed in a multi-dimensional space that can be thought of as a vectorial model.
All mentioned resources are either built on top or directly linked to WordNet (WN), which is a lexical database for the English language [
]. Rather than organizing terms alphabetically (like ordinary dictionaries, where senses are possibly scattered) WN groups terms into synonyms sets called synsets, that are equipped with short definitions and usage examples. Such sets are represented as the nodes of a large semantic network, where the intervening edges represent a number of semantic relations among synset elements (such as hyponymy, hypernymy, antonymy, meronymy, holonymy).
The pairs term-concept fed to the
].
The morphological information computed from input sentences has been used to devise a simple finite-state automaton describing the structure of the input sentences (more on the input descriptions in Section 8). This approach would not scale to handle more complex sentences. We defer to future work the adoption of richer language models. Despite these limitation, however, it allowed us to complete the automatization of the software pipeline going all throughout from the simple linguistic input description used for the evaluation (that will be described later) to its final conceptual categorization.
The term “cognitive architecture” was introduced by Allen Newell and his colleagues in their work on unified theories of cognition [25]. One of the main reasons justifying the introduction of such systems in the AI and Computational Cognitive Science fields was the goal of reaching human level intelligence in a general setting (on the role of CAs for general intelligent systems see also [
]).
A classical example of the described situation is the following: let us think to the case of WHALE. A prototypical conceptualization would classify whales as FISH (since a whale share many typical traits with fishes). On the other hand, a classical conceptualization would classify whales as MAMMAL.
We considered the first 10 results returned by the search engines, and not only the first one, since we are aware that these systems are not standard Question-Answering Systems. However, the underlying purpose of such systems can be plausibly ascribed to the area of Question-Answering, that is, the search engines try to answer to queries, that express a question/information need and looks for an answer. This fact is corroborated by evidence in literature, showing that over more than 80% of Web queries are informational in nature [24], and by recent works in QA and in Semantic Textual Similarity [1, 48]. In particular, in these settings, a relevant trend is based on adopting IR techniques that —rather than focussing on the generation of direct answers— are aimed, as the search engines, at finding text excerpts that contain the answer within large collections of documents [
].
A comparison with the Watson system for common-sense queries represents a mid-term evaluation goal. We recently (in Spring/Summer 2017) inspected the possibility of querying the Watson system by using its Open Services
. Unfortunately, the available services do not provide the possibility of querying the Watson Knowledge Base. Currently, in fact, every user can create his/her own KB collecting a set of documents and then answer queries on the internal representation extracted from that documents. Despite this is an interesting feature, in our opinion a comparison with an ad-hoc built Knowledge Base would be unfair and rather difficult to actually implement.
For the reasons explaining why the framework of the Conceptual Spaces can be considered in Cognitive Architectures an efficacious, intermediate, representational level connecting symbolic and sub-symbolic representations and what are the advantages offered by such framework with respect to some of the main problems affecting the representational level of CAs we refer to [10,
].
Acknowledgements
We thank Francesco Ghigo and Francesco Caronte for their help with the implementation and the anonymous reviewers for their insightful comments.
