In this work we exploit a nonmonotonic Description Logic of typicality as a tool for the generation and the exploration of novel creative concepts. Our logic, called TCL, is able to deal with the phenomenon of prototypical concept combination, which has been shown to be problematic to be modelled in formalisms like fuzzy logic. The logic TCL is the result of combining three main ingredients: (i) the logic of typicality , that allows one to describe typical properties of a class by means of a typicality operator, (ii) the distributed semantics of probabilistic Description Logics, that allows one to provide a probability distribution over worlds/scenarios, and (iii) established heuristics used by humans for concept composition coming from the field of cognitive semantics. Our approach could be useful in many applicative scenarios, ranging from video games to the creation of new movie characters.
The automatic generation of novel concepts, obtained by coupling the typical knowledge of pre-existing ones, is an important trait of human creativity. Dealing with this problem requires, from an AI perspective, the harmonization of two conflicting requirements that are hardly accommodated in symbolic systems: the need of a syntactic compositionality (typical of logical systems) and one concerning the exhibition of typicality effects [16]. According to a well-known argument [23], in fact, prototypical concepts are not compositional. The argument runs as follows: consider a concept like pet fish. It results from the composition of the concept pet and of the concept fish. However, the prototype of pet fish cannot result from the composition of the prototypes of a pet and a fish: e.g. a typical pet is furry and warm, a typical fish is grayish, but a typical pet fish is neither furry and warm nor grayish (typically, it is red).
In this work we provide a framework able to account for this type of human-like concept combination in the scenario of typical concept invention. We exploit a nonmonotonic Description Logic (from now on DL) of typicality called TCL (typical compositional logic), that has been already shown able to capture well established examples in the literature of cognitive science concerning concept combination, in particular we have shown that the logic TCL is able to model the above mentioned pet fish composition (i.e. the guppy effect), the conjunction fallacy problem (also known as the Linda problem) and the phenomena of metaphorical concept composition. We remind the interested reader to [18, 19].
This logic combines three main ingredients. The first one relies on the DL of typicality introduced in [7, 8]. In this logic, “typical” properties can be directly specified by means of a “typicality” operator T enriching the underlying DL, and a TBox can contain inclusions of the form T (C) ⊑ D to represent that “typical Cs are also Ds”
1
. As a difference with standard DLs, in the logic one can consistently express exceptions and reason about defeasible inheritance as well. For instance, a knowledge base can consistently express that “normally, athletes are fit”, whereas “sumo wrestlers usually are not fit” by T (Athlete) ⊑ Fit and T (SumoWrestler) ⊑ ¬ Fit, given that SumoWrestler ⊑ Athlete. The semantics of the T operator is characterized by the properties of rational logic [13], recognized as the core properties of nonmonotonic reasoning. is characterized by a minimal model semantics corresponding to an extension to DLs of a notion of rational closure as defined in [13] for propositional logic: the idea is to adopt a preference relation among models, where intuitively a model is preferred to another one if it contains less exceptional elements, as well as a notion of minimal entailment restricted to models that are minimal with respect to such preference relation. As a consequence, T inherits well-established properties like specificity and irrelevance: in the example, the logic allows us to infer that, normally, an athlete who is also bald is fit, i.e. T (Athlete ⊓ Bald) ⊑ Fit (being bald is irrelevant with respect to being fit) and, if one knows that Hiroyuki is a typical sumo wrestler, to infer that he is not fit, giving preference to the most specific information.
As a second ingredient, we consider a distributed semantics similar to the one of probabilistic DLs known as DISPONTE [27], allowing one to label axioms with degrees representing probabilities, but restricted to typicality inclusions. The basic idea is to label inclusions T (C) ⊑ D with a real number between 0.5 and 1, representing its probability
2
: such a number represents the probability of finding elements of C being also D, then we impose that it is at least 50% since we are only considering typicality properties. We assume that each axiom is independent from each other. The resulting knowledge base defines a probability distribution over scenarios: roughly speaking, a scenario is obtained by choosing, for each typicality inclusion, whether it is considered as true of false. In a slight extension of the above example, we could have the need of representing that both typicality inclusions about athletes and sumo wrestlers have a probability of 80%, whereas we also believe that athletes are usually young with a higher probability of 95%, with the following KB: (1) SumoWrestler ⊑ Athlete; (2) 0.8: : T (Athlete) ⊑ Fit; (3) 0.8: : T (SumoWrestler) ⊑ ¬ Fit; (4) 0.95: : T (Athlete) ⊑ YoungPerson. We consider eight different scenarios, representing all possible combinations of typicality inclusion: as an example, {((2) , 1) , ((3) , 0) , ((4) , 1)} represents the scenario in which (2) and (4) hold, whereas (3) does not. We equip each scenario with a probability depending on those of the involved inclusions, then we restrict reasoning to scenarios whose probabilities belong to a given and fixed range.
As a third element of the proposed formalization we employ a method inspired by cognitive semantics [10] for the identification of a dominance effect between the concepts to be combined: for every combination, we distinguish a HEAD, representing the stronger element of the combination, and a MODIFIER. The basic idea is: given a KB and two concepts CH (HEAD) and CM (MODIFIER) occurring in it, we consider only some scenarios in order to define a revised knowledge base, enriched by typical properties of the combined concept C ⊑ CH ⊓ CM (the heuristics for the scenario selections are detailed in Section 2, Definition 7).
In this work, we show that the proposed logic TCL is able to tackle the problem of composing prototypical concepts (by creating a novel, plausible, conceptual prototype) and we exploit the proposed formalism as a tool for the generation of novel creative concepts, that could be useful in many applicative scenarios. We also outline that the reasoning complexity of TCL is ExpTime-complete, as in the standard , witnessing that the proposed approach is promising since it remains in the same complexity class of the underlying logic.
The plan of the paper is as follows. In Section 2 we first describe the logic TCL presented in [19], then in Section 3 we introduce COCOS, a tool implementing reasoning services in such a logic. Section 4 describes some applications of the proposed approach. We conclude in Section 5 with a discussion on related approaches and some pointers to future works. This paper extends and revises preliminary results presented in [17].
A logic for concept combination
The nonmonotonic Description Logic TCL combines the semantics based on the rational closure of [7] with the probabilistic DISPONTE semantics [27]. By taking inspiration from [21], we consider two types of properties associated to a given concept: rigid and typical. Rigid properties are those that hold under any circumstance, e.g. C ⊑ D (all Cs are Ds). Typical properties are represented by inclusions equipped by a probability. Additionally, we employ a cognitive heuristic for the identification of a dominance effect between the concepts to be combined, distinguishing between HEAD and MODIFIER
3
.
The language of TCL extends the basic DL by typicality inclusions of the form T (C) ⊑ D equipped by a real number p ∈ (0.5, 1) representing its probability, whose meaning is that “normally, Cs are also Ds with probability p”.
Definition 1 (Language of TCL). We consider an alphabet C of concept names, R of role names, and O of individual constants. Given A ∈ C and R ∈ R, we define:
C, D : = A ∣ ⊤ ∣ ⊥ ∣ ¬ C ∣ C ⊓ C ∣ C ⊔ C ∣ ∀ R . C ∣ ∃ R . C We define a knowledge base where:
is a finite set of rigid properties of the form C ⊑ D;
is a finite set of typicality properties of the form p: : T (C) ⊑ D, where is the probability of the inclusion;
is the ABox, i.e. a finite set of formulas of the form either C (a) or R (a, b), where a, b ∈ O.
It is worth noticing that we avoid typicality inclusions with degree 1. Indeed, an inclusion 1: : T (C) ⊑ D would mean that it is a rigid property, that we represent with . Also, observe that we only allow typicality inclusions equipped with probabilities p > 0.5. Indeed, the very notion of typicality derives from the one of probability distribution, in particular typical properties attributed to entities are those characterizing the majority of instances involved. Moreover, in our effort of integrating two different semantics – DISPONTE and typicality logic – the choice of having probabilities higher than 0.5 for typicality inclusions seems to be the only compliant with both formalisms. In fact, despite the DISPONTE semantics allows one to assign also low probabilities/degrees of belief to standard inclusions, in the logic TCL it would be misleading to also allow low probabilities for typicality inclusions. Please, note that this is not a limitation of the expressivity of the logic TCL: we can in fact represent properties not holding for typical members of a category, for instance if one need to represent that typical students are not married, we can have that 0.8: : T (Student) ⊑ ¬ Married.
Following from the DISPONTE semantics, each axiom is independent from each others. This allows us to deal with conflicting typical properties equipped with different probabilities.
A model of TCL extends standard models by a preference relation among domain elements as in the logic of typicality [7]. In this respect, x < y means that x is “more normal” than y, and that the typical members of a concept C are the minimal elements of C with respect to this relation. An element is a typical instance of some concept C if and there is no element in more typical than x.
Definition 2 (Model). A model is any structure
where:
is a non empty set of items called the domain;
< is an irreflexive, transitive, well-founded and modular (for all x, y, z in , if x < y then either x < z or z < y) relation over ;
is the extension function that maps each concept C to , and each role R to . For concepts of , is defined as usual. For the T operator, we have
where .
A model can be equivalently defined by postulating the existence of a function , where assigns a finite rank to each domain element [7]: the rank of x is the length of the longest chain x0 < … < x from x to a minimal x0, i.e. such that there is no x′ such that x′ < x0. The rank function and < can be defined from each other by letting x < y if and only if .
Definition 3 (Model satisfying a KB). Let be a KB. Given a model , we assume that is extended to assign a domain element of to each individual constant a of O. We say that:
satisfies if, for all , we have ;
satisfies if, for all , we have that , i.e. ;
satisfies if, for each assertion , if F = C (a) then , otherwise if F = R (a, b) then .
Even if the typicality operator T itself is nonmonotonic (i.e. T (C) ⊑ E does not imply T (C ⊓ D) ⊑ E), what is inferred from a KB can still be inferred from any KB’ with KB ⊆ KB’, i.e. the resulting logic is monotonic. In order to perform useful nonmonotonic inferences, in [7] the authors have strengthened the above semantics by restricting entailment to a class of minimal models. Intuitively, the idea is to restrict entailment to models that minimize the atypical instances of a concept. The resulting logic corresponds to a notion of rational closure on top of . Such a notion is a natural extension of the rational closure construction provided in [13] for the propositional logic. This nonmonotonic semantics relies on minimal rational models that minimize the rank of domain elements. Informally, given two models of KB, one in which a given domain element x has rank 2 (because for instance z < y < x), and another in which it has rank 1 (because only y < x), we prefer the latter, as in this model the element x is assumed to be “more typical” than in the former. Query entailment is then restricted to minimal canonical models. The intuition is that a canonical model contains all the individuals that enjoy properties that are consistent with KB. This is needed when reasoning about the rank of the concepts: it is important to have them all represented. A query F is minimally entailed from a KB if it holds in all minimal canonical models of KB. In [7] it is shown that query entailment in the nonmonotonic is in ExpTime.
Definition 4 (Entailment). Let be a KB and let F be either C ⊑ D (C could be T (C′)) or C (a) or R (a, b). We say that F follows from if, for all minimal satisfying , then also satisfies F.
Let us now define the notion of scenario of the composition of concepts. Intuitively, a scenario is a knowledge base obtained by adding to all rigid properties in and to all ABox facts in only some typicality properties. More in detail, we define an atomic choice on each typicality inclusion, then we define a selection as a set of atomic choices in order to select which typicality inclusions have to be considered in a scenario.
Definition 5 (Atomic choice). Given , where we define (Ei, ki) an atomic choice for some i ∈ {1, 2, …, n}, where ki ∈ {0, 1}.
Definition 6 (Selection). Given , where and a set of atomic choices ν, we say that ν is a selection if, for each Ei, one decision is taken, i.e. either (Ei, 0) ∈ν and (Ei, 1) ∉ν or (Ei, 1) ∈ν and (Ei, 0) ∉ν for i = 1, 2, …, n. The probability of ν is .
Definition 7 (Scenario). Given , where and given a selection σ, we define a scenario We also define the probability of a scenario wσ as the probability of the corresponding selection, i.e. P (wσ) = P (σ). Last, we say that a scenario is consistent when it admits a model in the logic TCL.
Selected scenarios
We denote with the set of all scenarios for a knowledge base . It immediately follows that the probability of a scenario P (wσ) is a probability distribution over scenarios, that is to say .
Given a KB and given two concepts CH and CM occurring in , our logic allows one to define the compound concept C as the combination of the HEAD CH and the MODIFIER CM, where C ⊑ CH ⊓ CM and the typical properties of the form T (C) ⊑ D to ascribe to the concept C are obtained in the set of scenarios that:
are consistent;
are not trivial, in the sense that the scenarios considering all typical properties of the HEAD that can be consistently ascribed to C are discarded;
are those giving preference to the typical properties of the HEAD CH (with respect to those of the MODIFIER CM) with the highest probability. Notice that, in case of conflicting typical properties like D and ¬D, given two scenarios w1 and w2, both belonging to the set of consistent scenarios with the highest probability and such that an inclusion p1: : T (CH) ⊑ D belongs to w1 whereas p2: : T (CM) ⊑ ¬ D belongs to w2, the scenario w2 is discarded in favor of w1.
In order to select the desired scenarios we apply points 1, 2, and 3 above to blocks of scenarios with the same probability, in decreasing order starting from the highest one. More in detail, we first discard all the inconsistent scenarios, then we consider the remaining (consistent) ones in decreasing order by their probabilities. We then consider the blocks of scenarios with the same probability, and we proceed as follows:
we discard those considered as trivial, consistently inheriting all (or most of) the typical properties from the starting concepts to be combined;
among the remaining ones, we discard those inheriting typical properties from the MODIFIER in conflict with typical properties inherited from the HEAD in another scenario of the same block (i.e., with the same probability);
if the set of scenarios of the current block is empty, i.e. all the scenarios have been discarded either because trivial or because preferring the MODIFIER, we repeat the procedure by considering the block of scenarios, all having the immediately lower probability.
The set of scenarios not discarded in the current block are those selected by the logic TCL as the result of the procedure.
The knowledge base obtained as the result of combining concepts CH and CM into the compound concept C is called C-revised knowledge base:
for all D such that T (C) ⊑ D is inferred from w. The probability p is defined as follows: if D is a typical property inherited either from the HEAD (or from both the HEAD and the MODIFIER), then p corresponds to the probability of such inclusion of the HEAD in the initial knowledge base, i.e. ; otherwise, p corresponds to the probability of such inclusion of a MODIFIER in the initial knowledge base, i.e. . Notice that, since the C-revised knowledge base is still in the language of the TCL logic, we can iteratively repeat the same procedure in order to combine not only atomic concepts, but also compound concepts. Examples and details of such procedure are described in [18].
In [19] we have shown that reasoning in TCL in the revised knowledge is ExpTime-complete, obtaining that it remains in the same complexity class of standard . For the completeness, let n be the size of KB, then the number of typicality inclusions is O (n). It is straightforward to observe that we have an exponential number of different scenarios, for each one we need to check whether the resulting KB is consistent in which is ExpTime-complete. Hardness immediately follows from the fact that TCL extends . It is worth noticing that, even if reasoning in the logic TCL remains in the same complexity class of , the generation of all 2n scenarios could be very expensive in case of large knowledge bases. Optimization techniques like those in [1, 2] could be used in order to limit this problem.
COCOS: a typicality based concept combination system
In this section we describe COCOS [20], a first implementation of a reasoning machinery for the logic TCL, generating scenarios and choosing the selected one(s) according to Section 2.1. The current version of the system is implemented in Python and exploits the translation of an knowledge base into standard introduced in [7] and adopted by the system RAT-OWL [9]. COCOS makes use of the library owlready2
4
that allows one to rely on the services of efficient DL reasoners, e.g. the HermiT reasoner.
Our system relies on the procedures developed for the logic TCL and recalled in the previous section. COCOS allows one: i) to include the logical descriptions of the concepts to be combined; ii) to select which among the concepts has to be intended as HEAD and as MODIFIER(s); iii) to choose, via a slider, how many typical properties we want to inherit in the scenarios that will be selected by TCL. In addition to presenting the selected scenario with typical properties of the combined concept, COCOS also allows the users to select alternative scenarios by means of a slider, ranging from more trivial to more surprising ones.
Some pictures of the graphical interface of COCOS.
As mentioned, COCOS represents a very preliminary attempt to implement reasoning services for the logic TCL. We are currently working on a more mature version by adopting techniques in [1,2, 1,2] in order to improve its efficiency.
Artificial prototypes composition and concept invention
In this section we exploit the logic TCL to show both how it allows to automatically generate novel, plausible, prototypical concepts by composing two initial prototypes and how it can be used as a generative tool in the field of computational creativity (with applications in the so called creative industry). In detail, we first show how our logic can model the generation of a quite complex concept recently introduced in the field of narratology, i.e. that one of the Anti-Hero (a role invented by narratologists to generate new story lines), by combining the typical properties of the concepts Hero and Villain. Of course, the specific domain of the example is not relevant here; our goal is showing how TCL can model this kind of prototypical concept composition (a crucial aspect of human concept invention) that, on the other hand, has been proven to be problematic for other kinds of logics (e.g fuzzy logic, [11, 23]). We then show how the same machinery can be used as a creative support tool to generate new characters, for instance for a video game or a movie. In particular, we first show how the logic TCL is able to generate a concept like Batman as the combination of the prototypes of Man and Bat, then we exploit it to generate an entirely new fictional character that we assume to be composed by combining the narrative prototypes of Batman and of an Homeric Hero, as well as to create a new type of villain as the result of the combination with a chair.
For the examples we have exploited COCOS introduced in Section 2. The current version of the system along with the files for the following examples are available at http://di.unito.it/cocos.
The Anti Hero
We will take into account the concepts of Hero, AntiHero and Villain extracted by the common sense descriptions coming from the TvTropes repository (https://tvtropes.org). In such online repository, typical descriptions of character roles are provided. They can be useful for practitioners of the narrative field in order to design their own character according to the main assets presented in such schemas. In particular, Tropes can be seen as devices and conventions that a writer can reasonably rely on as being present in the audience members’ minds and expectations. Regarding the Hero, TvTropes identifies the following relevant representative features: e.g. the fact that it is characterized by his/her fights against the Villain of a story, the fact that his/her actions are necessarily guided by general goals to be achieved in the interest of the collectivity, the fact that they fight against the Villain in a fair way and so on. Examples of such Trope are: Superman, Flash Gordon, Captain America, etc. The AntiHero, on the other hand, is described as characterized by the fact of sharing most of its typical traits with the Hero (e.g. the fact that it is the protagonist of a plot fighting against the Villain of the story); however, his/her moves are not guided by a general spirit of sacrifice for the collectivity but, rather, they are usually based on some personal motivations that, incidentally and/or indirectly, coincide with the needs of the collectivity. Furthermore the AntiHero may also act in a not fair way in order to achieve the desired goal. A classical example of such trope is Batman, whose moves are guided by his desire of revenge. Finally the Villain is represented as a classic negative role in a plot and is characterized as the main opponent of the protagonist/Hero. In addition to this classical contraposition, TvTropes also reports some physical elements characterizing such role from a visual point of view. For example: the characters of this Trope are usually physically endowed with some demoniac cues (e.g. they have the “eyes of fire”). Finally, they are guided by negative moral values. Examples of such role can be easily taken from the classical literature to the modern comics. Some representative exemplars are Cruella de Vil in Disney’s filmic saga or Voldemort in Harry Potter.
Let us now exploit our logic TCL in order to define a prototype of AntiHero. First of all, we define a knowledge base describing both rigid and typical properties of concepts Hero and Villain, then we rely on the logic TCL in order to formalize an AntiHero-revised knowledge base.
Let be a KB, where the ABox is empty. Concerning rigid properties, let be as follows:
We make use of the logic TCL in order to build the compound concept AntiHero as the result of the combination of concepts Hero and Villain. Differently from what the natural language seems to suggest, we consider this compound concept by assuming that the HEAD is Villain (since the AntiHero shares more typical traits with this concept than with the Hero concept). In COCOS, such knowledg base is easily encoded in a file containing:
the choice about the HEAD concept and the MODIFIER one. This is provided by a key-value expression, in this case we just need to add the strings:
Head Concept Name: Villain
Modifier Concept Name: Hero
the set of rigid and typical properties, expressed by pairs
head,concept_name
modifier,concept_name
and by triples
T(head),concept_name,prob
T(modifier),concept_name,prob
respectively. In the example, for instance T1, R3 and T6 are encoded as follows:
T(modifier),protagonist,0.95#T1
head,withnegativemoralvalues#R3
T(head),impulsive,0.8 #T6
First of all,e we have that the compound concepts inherits all the rigid properties of both its components (if not contradictory), therefore in the logic TCL we have that:
AntiHero ⊑ ∃ hasOpponent . Villain
AntiHero ⊑ ∀ fightsFor . PersonalGoal
AntiHero ⊑ WithNegativeMoralValues
For the typical properties, we consider all the 27 = 256 different scenarios obtained from all possible selections about inclusion in . Some of them are inconsistent, namely those including either axiom T2 or axiom T3, since they would ascribe typical properties in contrast with inherited rigid properties of (ii) and (iii): rigid properties impose that an anti hero has negative moral values, and all his goals are personal, therefore he is an atypical hero in those respects (T2 states that typical heroes fights also for some collective goals, whereas T3 states that normally heroes have positive moral values). Also scenarios containing both axioms T4 and T5 are inconsistent, due to the fact that the concepts AngelicIconicity and DemoniacIconicity are disjoint (formalized by R6).
Let us consider the remaining, consistent scenarios: the one having the highest probability considers all the typical properties of both concepts by excluding only AngelicIconicity, that is to say the one with the lowest probability between the two typical properties in conflict. In TCL this scenario is discarded since it is the most trivial one. When we consider scenarios less trivial, i.e., more surprising scenarios (we analyze scenarios in decreasing order of probability), we discard the scenario with probability 0.13%, which includes T4, associated to the MODIFIER, rather than T5, associated to the HEAD, since it does not satisfy the HEAD/MODIFIER heuristics, allowing one to conclude, in a counter intuitive way, that typical anti heroes have an angelic iconicity rather than a demoniac one.
Next scenarios, sharing the same probability (0.09%), are as follows:
0.95: : T (Hero) ⊑ Protagonist (T1)
0.75: : T (Villain) ⊑ DemoniacIconicity (T5)
0.8: : T (Villain) ⊑ Impulsive (T6)
0.95: : T (Hero) ⊑ Protagonist (T1)
0.8: : T (Villain) ⊑ Impulsive (T6)
0.75: : T (Villain) ⊑ Protagonist (T7)
According to the logic TCL, both are adequate and represent the outcome of the whole heuristic procedures adopted in TCL. Probably, in this case, it could be more useful to opt for the first solution allowing to inherit a further typical property (i.e. DemoniacIconicity) for the generated prototypical Anti-Hero. However, we remain agnostic about the selection of the final options provided by TCL. This choice can be plausibly left to human decision makers and based on their own goals.
The resulting, alternative Hero ⊓ Villain-revised knowledge bases are as follows:
0.95: : T (Hero ⊓ Villain) ⊑ Protagonist
0.75 : : T (Hero ⊓ Villain) ⊑ DemoniacIconicity
0.8: : T (Hero ⊓ Villain) ⊑ Impulsive
0.75: : T (Hero ⊓ Villain) ⊑ Protagonist
0.8: : T (Hero ⊓ Villain) ⊑ Impulsive
Notice that, in the second case, the inclusion T (Hero ⊓ Villain) ⊑ Protagonist is inherited from both the HEAD and the MODIFIER, therefore the probability in the revised knowledge base corresponds to the one coming from the HEAD (0.75).
COCOS ends its computation by providing the output of Fig. 2, where tuples of 0 and 1 represent selections generating scenarios and the selected one is scenario numbered 19 at the end.
An excerpt of the output of COCOS running the example of the AntiHero.
A final element that is worth noticing in TCL is the following: in our logic, adding a new inclusion, e.g. T (AntiHero) ⊑ Brave, would not be problematic. That is to say that our formalism is able to tackle the phenomenon of prototypical attributes emergence for the new compound concept, widely described in the Cognitive Science literature [10].
Batman
In this section we use the logic TCL to generate a character that is considered a typical Anti-Hero in the narratological commununity: i.e. Batman. In this case, we assume that this character results from the combination of the concepts Man and Bat, where Man is the HEAD and Bat is the MODIFIER.
First of all, let us consider the following list of prototypical descriptions for the concept Man (the list is, of course, plausible but not exhaustive):
Man ⊑ Male (R1)
Man ⊑ Human (R2)
Man ⊑ Mammal (R3)
0.7: : T (Man) ⊑ HeteroSexual (T1)
0.85: : T (Man) ⊑ SportLover (T2)
0.6: : T (Man) ⊑ ¬ AcuteVoice (T3)
0.8: : T (Man) ⊑ Omnivore (T4)
Let us now formalize in the logic TCL the prototype of a bat, as follows:
Bat ⊑ Mammal (R4)
Bat ⊑ Animal (R5)
0.9: : T (Bat) ⊑ Fly (T5)
0.95: : T (Bat) ⊑ Nocturnal (T6)
0.9: : T (Bat) ⊑ ¬ Omnivore (T7)
0.9: : T (Bat) ⊑ LivesInCave (T8)
We have exploited COCOS [20] in order to generate all 28 = 256 different scenarios. Excluding the inconsistent scenarios, let us consider those remaining in descending order of probability. The first scenario, whose probability is 4.95%, includes typicality inclusions T1, T2, T3, T5, T6, T7, and T8, namely all typical properties of the MODIFIER are inherited. However, this scenario is discarded since a typical property of the MODIFIER (¬Onnivore) is preferred to a conflict one in the HEAD (Onnivore). The same for subsequent scenario including T1, T2, T5, T6, T7, and T8, with probability 3.297%. The following scenario, excluding only T7, has probability 2.198%, but it is considered as trivial (all typicality properties of HEAD are inherited) and therefore discarded. The next scenario has probability 2.12% and includes T2, T3, T5, T6, T7, and T8. As for the most probable scenario, here again ¬Onnivore of T7 in the MODIFIER is preferred to the conflicting Onnivore of T4 in the HEAD: again, this scenario is therefore discarded.
In TCL the selected scenario, whose probability is 1.47% is the next one, including T1, T2, T4, T5, T6, and T8 and it describes the following Man ⊓ Bat-revised knowledge base:
0.7: : T (Man ⊓ Bat) ⊑ HeteroSexual (T1)
0.85: : T (Man ⊓ Bat) ⊑ SportLover (T2)
0.8: : T (Man ⊓ Bat) ⊑ Omnivore (T4)
0.9: : T (Man ⊓ Bat) ⊑ Fly (T5)
0.95: : T (Man ⊓ Bat) ⊑ Nocturnal (T6)
0.9: : T (Man ⊓ Bat) ⊑ LivesInCave (T8)
This kind of revised knowledge base is perfectly compliant with that one generated in the previous section about the Anti-Hero. Also in this case, adding new inclusions which are typical of an Anti-Hero such as:
0.95: : T (Man ⊓ Bat) ⊑ Protagonist (i)
0.8: : T (Man ⊓ Bat) ⊑ Impulsive (ii)
0.75: : T (Man ⊓ Bat) ⊑ Brave (iii)
would not be problematic. It is also true that, at this stage, the obtained description is also formally compliant with the both the descriptions of Hero and Villain. However, as soon as data about different instances of the new character class come in, e.g. by knowing that (Man ⊓ Bat) (michaelKeaton), we can assume that it will become evident that the following facts will hold:
∃hasOpponent . Villain (michaelKeaton)
∀fightsFor . PersonalGoal (michaelKeaton)
WithNegativeMoralValues (michaelKeaton)
and therefore that correct narrative class of the novel generated character can only be the Anti-Hero one: indeed, it can be assumed that Michael Keaton is a typical Batman, therefore inheriting all the typical properties of the revised knowledge base. It can be observed that this does not hold in case we consider another individual, call it Diomedes, being a Hero and a Batman: in this case, the only consistent definition leads to assume that Diomedes in an atypical Batman.
In the next subsection we show how TCL can be used to invent novel concepts by iteratively applying the heuristics devised in TCL.
Iterative character generation: Combining Batman with an Homeric Hero
A C-revised knowledge base in the logic TCL is still in the language of the TCL logic. This allows us to iteratively repeat the same procedure in order to combine not only atomic concepts, but also compound concepts.
In this section we show how to generate a further new character by exploiting this feature, in particular we handle the concept combination of C-revised knowledge bases in order to combine the concept Batman, obtained in Section 4.2 as the combination of Man and Bat, and the prototype of an HomericHero.
For the sake of readability, let us recall the knowledge base concerning Batman:
Batman ⊑ Male (R1)
Batman ⊑ Human (R2)
Batman ⊑ Mammal (R3)
Batman ⊑ Animal (R4)
0.7: : T (Batman) ⊑ HeteroSexual (T1)
0.85: : T (Batman) ⊑ SportLover (T2)
0.8: : T (Batman) ⊑ Omnivore (T3)
0.9: : T (Batman) ⊑ Fly (T5)
0.95: : T (Batman) ⊑ Nocturnal (T4)
0.9: : T (Batman) ⊑ LivesInCave (T6)
Concerning the Homeric hero, we can consider the following knowledge base:
We have considered the alternative of having BatMan as the HEAD and HomericHero as the MODIFIER and vice-versa and obtained the following revised knowledge bases. The Batman ⊓ HomericHero-revised knowledge base, where Batman is the HEAD, is as follows:
It is worth noticing that in the revised knowledge base, the typical batman homeric hero is nocturnal, thus contradicting the typical property of one of the superclasses (i.e. HomericHero). This datum does not constitute a problem since the logic TCL is based on the nonmonotonic logic which allows one to handle this kind of exceptions by the nonmonotonicity of the operator T (C ⊑ D does not imply that T (C) ⊑ T (D), then we can have a property P holding for elements of T (C) whereas ¬P holds for elements of T (D)). Notice also that the combined concept Batman ⊓ HomericHero inherits the typical property of being male from the MODIFIER of the combination, i.e. 0.9: : T (Batman ⊓ HomericHero) ⊑ Male from T7. This inclusion is allowed and it is not problematic because it is not in conflict with the rigid property R1 stating that all batmans are males.
The HomericHero ⊓ Batman-revised knowledge base, where HomericHero is the HEAD, is as follows:
As a difference with the Batman ⊓ HomericHero-revised knowledge base, the typical property of being Nocturnal is no longer inherited by the combined concept. It is worth noticing that, even if the typical property ¬Nocturnal would have been inherited, this would not be problematic, again by the nonmonotonicity of the underlying logic .
The Villain Chair
Let us assume to generate a novel concept obtained as the combination of concepts Villain (as HEAD)
5
and Chair (as MODIFIER). Let be as follows:
We consider the 512 scenarios, from which we discard the inconsistent ones, namely those including T5: indeed, since R2 imposes that villains are animate, in the underlying we conclude that Villain ⊓ Chair ⊑ Animate, therefore all scenarios including T5, imposing that T (Villain ⊓ Chair) ⊑ ¬ Animate are consistent only if there are no villain chairs. As in the previous examples, we also discard trivial scenarios containing all the inclusions related to the HEAD. The first suitable scenarios are those have probability 0.23% and contain all typical properties coming from the MODIFIER and three out of four typical properties coming from the HEAD. Such scenarios define two alternative revised knowledge bases: one containing T2 and not T3, the other one containing T3 and not T2. These scenarios are the preferred ones selected by the logic TCL.
However, in this application setting, we could imagine to use our framework as a creativity support tool and thus considering alternative - more surprising - scenarios by adding additional constraints. For example, we could impose that the compound concept should inherit exactly six typical properties. In this case, we would get that the scenario having the highest probability (0.16%) is the one including all the typical properties of the HEAD, namely T1, T2, T3 and T4, and two out of four typical properties of the MODIFIER, namely T6 and T8. Due to its triviality, this scenario is discarded, in favor of the following more creative scenarios, with probability 0.13%, obtained by excluding T7 of the MODIFIER and one out of four typical properties of the HEAD:
0.95 : : T (Villain ⊓ Chair) ⊑ ∃ hasComponent . Back
0.8: : T (Villain ⊓ Chair) ⊑ Comfortable
0.7: : T (Villain ⊓ Chair) ⊑ Inflammable
0.9: : T (Villain ⊓ Chair) ⊑ DemoniacIconicity
0.75: : T (Villain ⊓ Chair) ⊑ Protagonist
0.8: : T (Villain ⊓ Chair) ⊑ Impulsive
0.95 : : T (Villain ⊓ Chair) ⊑ ∃ hasComponent . Back
0.8: : T (Villain ⊓ Chair) ⊑ Comfortable
0.7: : T (Villain ⊓ Chair) ⊑ Inflammable
Related and future works
In this work we have considered a nonmonotonic Description Logic TCL, extending the DL of typicality with a DISPONTE semantics, in order to deal with the generation of novel creative concepts. This logic enjoys good computational properties, since entailment in it remains ExpTime as the underlying monotonic , and is able to take into account the concept combination of prototypical properties. To this aim, the logic TCL allows one to have inclusions of the form p: : T (C) ⊑ D, representing that, with a probability p, typical Cs are also Ds. Then, several different scenarios – having different probabilities – are described by including or not such inclusions, and prototypical properties of combinations of concepts are obtained by restricting reasoning services to scenarios having suitable probabilities, excluding “trivial” ones with the highest probabilities.
In AI, several approaches have been proposed to deal with the problem of prototypical concept composition in a human-like fashion. The authors of [14] present a detailed analysis of the limits of the set-theoretic approaches, the fuzzy logics (whose limitations was already shown in [28]), the vector-space models and quantum probability approaches proposed to model this phenomenon. In addition, they propose to use hierarchical conceptual spaces [5] to model the phenomenon in a way that accurately reflects how humans exploit their creativity in conjunctive concept combination. While we agree with the authors with the comments moved to the described approaches, we showed that our logic can equally model, in a cognitively compliant-way, the composition of prototypes by using a nonmonotonic formalism whose complexity remains in the same class of standard monotonic . Other attempts similar to the one proposed here concerns the conceptual blending: a task where the obtained concept is entirely novel and has no strong association with the two base concepts (while in concept combination the compound concept is always a subset of the base concepts, for details see [22]). In [3] the authors propose a mechanism for conceptual blending based on the DL . They construct the generic space of two concepts by introducing an upward refinement operator that is used for finding common generalizations of concepts. However, differently from us, what they call prototypes are expressed in the standard monotonic DL, which does not allow one to reason about typicality and defeasible inheritance. More recently, a different approach is proposed in [4], where the authors see the problem of concept blending as a nonmonotonic search problem and propose to use Answer Set Programming (ASP) to deal with this search problem in a nonmonotonic way. In a related work [25], the author extends the logic of typicality by means of probabilities equipping typicality inclusions of the form T (C) ⊑ pD, whose intuitive meaning is that, “normally, Cs are Ds and we have a probability of 1 - p of having exceptional Cs not being Ds”. Probabilities of exceptions are then used in order to reason about plausible scenarios, obtained by selecting only some typicality assumptions and whose probabilities belong to a given and fixed range. As a difference with the logic TCL, all typicality assumptions are systematically taken into account: as a consequence, one cannot exploit such a DL for capturing compositionality, since it is not possible to block inheritance of prototypical properties in concept combination. The logic TCL extends the work of [25] in that it does not systematically take into account all typicality assumptions. As a consequence, TCL blocks inheritance of prototypical properties in concept combination. The same criticism applies also to the approach proposed in [24], where is extended by inclusions of the form T (C) ⊑ dD, where d is a degree of expectedness, used to define a preference relation among extended ABoxes: entailment of queries is then restricted to ABoxes that are minimal with respect to such preference relations and that represent surprising scenarios. Also in this case, however, the resulting logic does not allow us to define scenarios containing only some inclusions, since all of them are systematically considered. Similarly, probabilistic DLs [26] themselves cannot be employed as a framework for dealing with the combination of concepts, since these logics are not able to represent and reason about prototypical properties.
In future research we aim at extending our approach to more expressive DLs, such as those underlying the standard OWL language. Starting from the work of [6], applying the logic with the typicality operator and the rational closure to , we intend to study whether and how TCL could provide an alternative solution to the problem of the “all or nothing” behavior of rational closure with respect to property inheritance. Other applications field of the proposed approach other employments can be considered. For example, we plan to use our logic for the development of a goal-directed and automatic generation of novel knowledge in a cognitive artificial agent, starting from an initial commonsense knowledge base. This approach will require to enrich the knowledge processing mechanisms of current cognitive agents [15] since it does not assume that the only way to process and reason on new knowledge is via an external knowledge injection. On the contrary, it assumes that a mechanism for the automatic and creative re-framing, or re-formulation, of the available knowledge. Such procedure, in particular, can be used to extend the mechanisms of universal subgoaling in cognitive architectures like SOAR [12].
In this work we have also described COCOS, a tool for combining concepts in the logic TCL. As mentioned, COCOS represents a very preliminary attempt to implement reasoning services for the logic TCL, and a more mature version, exploiting optimization techniques in [1, 2] in order to improve its efficiency, is currently under development.
Footnotes
In this paper we use the term properties as intended in classical logic. In this respect, in Description Logics, for an inclusion relation of the form either C ⊑ D or T (C) ⊑ D, we say that D is a property of the concept C (as we will see later, either a rigid or a typical one).
Probabilities for typicality inclusions are assumed to come from an application domain as in any probabilistic formal framework (probabilities in probabilistic extensions of logic programs, degrees of belief in fuzzy logics). Here, we focus on the proposal of the formalism itself, therefore the machinery for obtaining probabilities from an application domain will not be discussed.
Here we assume that some methods for the automatic assignment of the HEAD/MODIFER pairs are/may be available and focus on the discussion of the reasoning part.
Please, note that with respect to Section 4.1 we have slightly enriched the description of Villain.
Acknowledgments
This work has been partially supported by the project “METALLIC #2: METodi di prova per il ragionamento Automatico per Logiche non-cLassIChe #2”, Progetto di Ricerca INdAM GNCS 2019.
References
1.
AlbertiM., BellodiE., CotaG., RiguzziF. and ZeseR.. cplint on SWISH: Probabilistic logical inference with a web browser, Intelligenza Artificiale11(1) (2017), 47–64.
2.
BellodiE., LammaE., RiguzziF., ZeseR. and CotaG.. A web system for reasoning with probabilistic OWL, Journal of Software: Practice and Experience47(1) (2017), 125–142.
3.
ConfalonieriR., SchorlemmerM., KutzO., PeñalozaR., PlazaE. and EppeM.. Conceptual blending in EL++. In
LenzeriniMaurizio and PeñalozaRafael, editors, Proceedings of the 29th International Workshop on Description Logics, Cape Town, South Africa, April 22-25, 2016., (2016), volume 1577 of CEUR Workshop Proceedings. CEURWS.org.
4.
EppeM., MacleanE., ConfalonieriR., KutzO., SchorlemmerM., PlazaE. and KühnbergerK-U.. A computational framework for conceptual blending, Artificial Intelligence256 (2018), 105–129.
5.
GärdenforsP., The Geometry of Meaning: Semantics Based on Conceptual Spaces. MIT Press, 2014.
6.
GiordanoL., GliozziV., OlivettiN. and PozzatoG.L.. Rational closure in SHIQ. In DL 2014, 27th International Workshop on Description Logics, volume 1193 of CEUR Workshop Proceedings (2014), pp. 543–555. CEUR-WS.org.
7.
GiordanoL., GliozziV., OlivettiN. and PozzatoG.L.. Semantic characterization of Rational Closure: From Propositional Logic to Description Logics, Artificial Intelligence226 (2015), 1–33.
8.
GiordanoL., GliozziV., OlivettiN. and PozzatoG. Luca. Minimal model semantics and rational closure in description logics. In
EiterThomas,
GlimmBirte,
KazakovYevgeny, and KrötzschMarkus, editors, Informal Proceedings of the 26th International Workshop on Description Logics, Ulm, Germany, July 23 - 26, 2013, volume 1014 of CEUR Workshop Proceedings (2013), pp. 168–180. CEUR-WS.org.
9.
GiordanoL., GliozziV., PozzatoG. Luca and
RenzulliR.. An efficient reasoner for description logics of typicality and rational closure. In
ArtaleAlessandro,
GlimmBirte, and KontchakovRoman, editors, Proceedings of the 30th International Workshop on Description Logics, Montpellier, France, July 18-21, 2017, volume 1879 of CEUR Workshop Proceedings. (2017), CEUR-WS.org.
10.
HamptonJ.A.. Inheritance of attributes in natural concept conjunctions, Memory & Cognition15(1) (1987), 55–71.
11.
HamptonJ.A.. Conceptual combinations and fuzzy logic, Concepts and fuzzy logic209, 2011.
12.
LairdJ., RosenbloomP. and NewellA.. Universal subgoaling and chunking: The automatic generation and learning of goal hierarchies, volume 11, Springer Science & Business Media, 2012.
13.
LehmannD. and MagidorM.. What does a conditional knowledge base entail?Artificial Intelligence55(1) (1992), 1–60.
14.
LewisM. and LawryJ.. Hierarchical conceptual spaces for concept combination, Artificial Intelligence237 (2016), 204–227.
15.
LietoA., LebiereC. and OltramariA.. The knowledge level in cognitive architectures: Current limitations and possible developments, Cognitive Systems Research48 (2018), 39–55.
16.
LietoA.. Non classical concept representation and reasoning in formal ontologies. PhD thesis, Universita degli studi di Salerno, 2012.
17.
LietoA. and PozzatoG. Luca. Creative concept generation by combining description logic of typicality, probabilities and cognitive heuristics. In
GhidiniChiara,
MagniniBernardo,
PasseriniAndrea, and TraversoPaolo, editors, AI*IA 2018 - Advances in Artificial Intelligence - XVIIth International Conference of the Italian Association for Artificial Intelligence, Trento, Italy, November 20-23, 2018, Proceedings, volume 11298 of Lecture Notes in Computer Science (2018), pp. 183–196. Springer.
18.
LietoA. and PozzatoG. Luca. A description logic framework for commonsense conceptual combination integrating typicality, probabilities and cognitive heuristics, arXiv preprint arXiv:1811.02366, 2018.
19.
LietoA. and PozzatoG. Luca. A description logic of typicality for conceptual combination. In
CeciMichelangelo,
JapkowiczNathalie,
LiuJiming,
PapadopoulosGeorge A., and RasZbigniew W., editors, Foundations of Intelligent Systems - 24th International Symposium, ISMIS 2018, Limassol, Cyprus, October 29-31, 2018, Proceedings, volume 11177 of Lecture Notes in Computer Science (2018), pp. 189–199. Springer.
20.
LietoA., PozzatoG. Luca and
ValeseA.. COCOS: A typicality based concept combination system. In
FelliPaolo and MontaliMarco, editors, Proceedings of the 33rd Italian Conference on Computational Logic, Bolzano, Italy, September 20-22, 2018., volume 2214 of CEUR Workshop Proceedings (2018), pp. 55–59. CEUR-WS.org.
21.
LietoA., RadicioniD.P. and RhoV.. Dual peccs: A cognitive system for conceptual representation and categorization, Journal of Experimental & Theoretical Artificial Intelligence29(2) (2017), 433–452.
22.
NagaiY. and TauraT.. Formal description of concept-synthesizing process for creative design, Design computing and cognition’06 (2006), pp. 443–460.
23.
OshersonD.N. and SmithE.E.. On the adequacy of prototype theory as a theory of concepts, Cognition9(1) (1981), 35–58.
24.
PozzatoG. Luca. Reasoning about plausible scenarios in description logics of typicality, Intelligenza Artificiale11(1) (2017), 25–45.
25.
PozzatoG. Luca. Reasoning in description logics with typicalities and probabilities of exceptions. In
AntonucciAlessandro,
CholvyLaurence, and PapiniOdile, editors, Symbolic and Quantitative Approaches to Reasoning with Uncertainty - 14th European Conference, ECSQARU 2017, Lugano, Switzerland, July 10-14, 2017, Proceedings, volume 10369 of Lecture Notes in Computer Science (2017), pp. 409–420. Springer.
26.
RiguzziF., BellodiE., LammaE. and ZeseR.. Probabilistic description logics under the distribution semantics, Semantic Web6(5) (2015), 477–501.
27.
RiguzziF., BellodiE., LammaE. and ZeseR.. Reasoning with probabilistic ontologies. In
YangQiang and WooldridgeMichael, editors, Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, IJCAI 2015, Buenos Aires, Argentina, July 25-31, 2015 (2015), pp. 4310–4316. AAAI Press.
28.
SmithE.E. and OshersonD.N.. Conceptual combination with prototype concepts, Cognitive science8(4) (1984), 337–361.