5

I am just beginning to study basic modal logic as described here (up to page 5 so far). My first impression -- I'm sure it can't be right -- is that it is just different symbols for the same concepts in ordinary predicate logic. Instead of $\forall x: P(x)$, we write $\square P$ (quantifying over a domain of discourse corresponding the set of all "possible worlds.") Instead of $\exists x: P(x)$, we write $\diamond P$. It also seems that the other logical connectors are the same as those in propositional logic. What can we do in this basic modal logic that we cannot do in predicate logic or vice versa?

EDIT: No need to reinvent the wheel. See Standard Translation (from modal to FOL) at https://en.wikipedia.org/wiki/Standard_translation

FOLLOW-UP: Using these Standard Translations, I was able to formally derive a number of "axioms" of modal logic (some said to be controversial at wiki). Theorems 1-5, make no use of any restrictions on the accessibility relation R. The remainder variously make use of reflexive, symmetry and transitive properties on R.

  • I assume you read the abstract. This http://www.cs.tau.ac.il/~annaz/teaching/TAU_winter08/Seminar/modalea.pdf has some Powerpoint slides with motivating examples. – John Douma Oct 29 '18 at 19:32
  • @MauroALLEGRANZA Note that you're really comparing modal logic with propositional logic, which isn't what this question is about. – Noah Schweber Oct 29 '18 at 19:55
  • Re: your most recent edit, it seems you've taken symmetry as an axiom. Why? There are lots of important modal logics which aren't symmetric, even of the frame variety - e.g. the connection between modal logic and forcing uses non-symmetric frames since symmetric frames would be utterly useless for describing increasing states of information. (Also, the wiki article looks terrible; I wouldn't use it as a source for anything.) – Noah Schweber Nov 01 '18 at 14:11
  • 2
    Reflexivity and transitivity are less objectionable (in my mind) but there are still important situations where we don't want either. And each of these is certainly not true of all Kripke frames in general. (And even worse, there are modal logics of interest which don't play well with Kripke frames in the first place!) Basically, I think you're doing some injustice to modal logic in terms of your axiomatization: you're restricting attention to a very very narrow modal logic (specifically, I think you've built S$5$). – Noah Schweber Nov 01 '18 at 14:14
  • 1
    For example, in temporal logic we certainly don't want symmetry! And we might not want reflexivity either, if we interpret accessibility as moving strictly forward in time. Finally, the provability logic GL actively objects to reflexivity, by way of the Lob axiom. As for a nontransitive example, anytime the accessibility relation corresponds to something like "is close to," we probably don't want to assume transitivity since "close to close to $x$" doesn't necessarily imply "close to $x$." – Noah Schweber Nov 01 '18 at 14:20
  • @NoahSchweber Yes I did assume that R was symmetric, but it was only used in 2 out of 10 proofs at my link. Likewise, transitivity. I didn't know it was an issue, but thanks for pointing it out. I have only been studying modal logic for a week or so. – Dan Christensen Nov 01 '18 at 14:40
  • @DanChristensen Although it's huge, I strongly recommend the Chagrov/Zakharyaschev book; it really gives a great feel for the sheer variety of modal logics. – Noah Schweber Nov 01 '18 at 14:42
  • @NoahSchweber Is there a most commonly used form modal logic? If you could only present one to a class, what would it be and why? – Dan Christensen Nov 01 '18 at 14:47
  • @DanChristensen "Is there a most commonly used form modal logic?" I can't imagine so - the subject is so broad. If I were teaching this, though, I'd focus on the pure logic of Kripke frames (since they're the easiest semantics for the modal language), that is, the set of sentences validated by all Kripke frames. If memory serves, this is called "K" (although I have to run and don't have time to check this; I'll correct myself later if it turns out I'm misremembering). And then I'd end by describing how different classes of frames are appropriate for different purposes. (cont'd) – Noah Schweber Nov 01 '18 at 14:50
  • E.g. it's easy to motivate the idea of temporality, and then argue against symmetry; but it's also easy to motivate the idea of "plausible," and then symmetry's fairly, well, plausible. But part of the goal would be to convince the students that there isn't a single best perspective, so trying to focus too much in this respect would definitely not be my goal. – Noah Schweber Nov 01 '18 at 14:51
  • @j4nbur53 As an exercise, I chose a selection of axioms from the wiki article at https://en.wikipedia.org/wiki/Accessibility_relation Maybe you should write the author and explain why he/she should also include the converses. – Dan Christensen Nov 01 '18 at 15:21
  • 1
    Well the wikipedia article starts with "This article needs attention from an expert in logic. The specific problem is: inaccurate, partly wrong, presentation. WikiProject Logic may be able to help recruit an expert. (November 2016)", so what do you expect? –  Nov 01 '18 at 15:22
  • @j4nbur53 Here is your big chance at wiki fame and glory. Go for it. – Dan Christensen Nov 01 '18 at 15:24
  • Suit yourself, but thousands of readers will supposedly get the wrong impression about these axioms. – Dan Christensen Nov 01 '18 at 15:26
  • When I read $\implies$, I don't assume it means $\iff$. Weird, eh? – Dan Christensen Nov 01 '18 at 15:32
  • It seems obvious now that modal logic is just a subfield of classical, first-order logic. Albeit, an interesting subfield with many applications. – Dan Christensen Nov 01 '18 at 15:39
  • I could some very good, thought provoking answers. I thought readers would like to know how I put them to use. – Dan Christensen Nov 01 '18 at 15:47
  • @DanChristensen "It seems obvious now that modal logic is just a subfield of classical, first-order logic." Well, again, the validity issue takes us beyond first-order logic: there are modally-characterizable classes of frames which are not elementary classes as per j4n's answer or the last bit of my answer. So it's not quite that simple. I think the right way to view it is that modal logic is (one aspect of) the study of non-truth-functional operations in logic, which has various aspects, many of which are subsumed by first-order logic. – Noah Schweber Nov 01 '18 at 16:12
  • @NoahSchweber Can all formal proofs in modal logic be translated into formal proofs in FOL + set theory? – Dan Christensen Nov 01 '18 at 19:35

3 Answers3

17

Your impression is right, but missing the point in some sense: modal logic is strictly less powerful than first-order logic, and this is one of the reasons it is so important in various contexts (especially applications of logic in computer science)! The reason is that there is a fundamental "power-versus-tameness" tradeoff implicit in any choice of logic, and we often prefer the latter to the former. Modal logic should be thought of as a particularly well-behaved fragment of first-order logic: we're often interested in decidable (or similarly nice) fragments of first-order logic in applications, and modal logic and its variants provide a wide swath of examples of such logics.

Note that this is reflected in the history of modal logic: it long predated first-order logic, and was an expansion of propositional logic by adding modal operators. After first-order logic burst on the scene, we came to understand modal logic as an intermediate logic, and that's the perspective I'm describing here since I think it matches more the perspective you're adopting.

This paper by Vardi is a useful source in this regard. In particular, the following passage from page $2$ is quite relevant:

  • "There are two main computational problems associated with modal logic. The first problem is checking if a given formula is true in a given state of a given Kripke structure. This problem is known as the model-checking problem. The second problem is checking if a given formula is true in all states of all Kripke structures. This problem is known as the validity problem. Both problems are decidable. The model-checking problem can be solved in linear time, while the validity problem is PSPACE-complete. This is rather surprising when one considers the fact that modal logic, in spite of its apparent propositional syntax, is essentially a first-order logic, since the necessity and possibility modalities quantify over the set of possible worlds, and model checking and validity for first-order logic are computationally hard problems. Furthermore, the undecidability of first-order logic is very robust. Only very restricted fragments of first-order logic are decidable ..."

Vardi goes on to talk about types of tameness, specifically focusing on two ways of generating tame fragments of first-order logic - bounding the quantifiers and limiting the number of variables - and then argues that modal logic really represents a third, and extremely robust, kind of tameness. At this point we move beyond the focus of this specific question; the point I want to make is that modal logic is not a strengthening of first-order logic, but rather quite the opposite, and that for many applications this is actually a good thing.

The last section of Chagrov and Zakharyaschev's book is also relevant, and in general I strongly recommend that book: it's quite dense, but has a huge wealth of material.

It's worth noting that the idea of looking at "tame" fragments of "wild" logics appears all over the place, e.g. the analysis of monadic second-order logic (contra full second-order logic) and the decomposition of $\mathcal{L}_{\omega_1\omega}$ into well-behaved countable sublogics. Coming from a more set- or model-theoretic background, it may seem odd at first to apply the same idea to first-order logic because of how "primal" it is, but it is in fact a very rich line of research.


To add a bit of detail, here's the translation of modal logic into first-order logic (well, for Kripke frames anyways; I'll leave generalizations as exercises):

Given a Kripke frame $\mathcal{K}=(W,\leadsto,\models_\mathcal{K})$ ($W$ = worlds, $\leadsto$ = accessibility relation, $\models_\mathcal{K}$ = valuation) in a propositional language $\Sigma=\{p_i\}_{i\in I}$, our corresponding language $\Sigma_\mathcal{K}$ consists of a unary predicate $P_i$ for each $i\in I$ and a binary relation symbol $R$, and our corresponding structure $M_\mathcal{K}$ has domain $W$, interprets $P_i$ as $\{w\in W: w\models_\mathcal{K} p_i\}$, and interprets $R$ as $\{(u,v)\in W^2: u\leadsto v\}$.

(Note that this is not quite what you've described: you've described the "local" version, where we focus on a single world in $\mathcal{K}$.)

Now, for every modal sentence $s$ in the language $\Sigma$, we get a first-order formula $\varphi_s(x)$ in one free variable saying that $s$ holds at $x$ in the sense of $\mathcal{K}$; meanwhile, the formula $\psi_s\equiv\forall x(\varphi_s(x))$ says of course that $s$ holds throughout $\mathcal{K}$. The key point here is:

There are first-order sentences not coming from modal sentences in this, or any reasonable, way.

For example, consider something like "$\forall x\exists y(R(x,y)\vee R(y,x))$." How precisely can you express this modally, in any sense?


EDIT: That said, there are aspects of modal logic which take it beyond first-order. (I previously said a bit about this in comments, but I think now it's a good idea to put it in the answer body.) In particular, we say that a frame (without a chosen valuation) validates a given modal sentence if every valuation makes that sentence true at every world. Each modal sentence $\varphi$ then defines a class of frames $V(\varphi)$. For example, $$(\Diamond\Diamond p)\implies(\Diamond p)$$ is validated exactly in the transitive frames.

Viewing (sans valuation) frames as directed graphs, we can ask whether every "modal validation" class $V(\varphi)$ is an elementary class. The answer turns out to be no, the easiest example in my opinion being the Lob axiom $$\lambda\equiv\Box(\Box (p)\rightarrow p)\rightarrow \Box(p).$$ (Proof: it's not hard to show that $V(\lambda)$ is the set of transitive, converse well-founded frames, that is, those transitive frames not admitting any infinite sequence of worlds each of which sees the next. Now use the compactness theorem.)

See also j4n bur53's answer, and this paper of Thomason. Note that when people say that a given modal sentence has no first-order equivalent, or isn't first-order expressible, they're talking about validity.

PW_246
  • 1,256
Noah Schweber
  • 245,398
3

An example of a modal logic, that cannot be modelled via first order logic, is the McKinsey axiom. This axiom reads as follows:

$$M: \quad \Box ~ \Diamond ~ \phi \, \rightarrow \, \Diamond ~ \Box ~\phi$$

This modal logic is not mentioned in Joel McCance script, because the script only deals with couple of first order definable modal logics.

See also:
Is there a more useful formulation of the
frame condition for the McKinsey axiom?

https://math.stackexchange.com/a/1279608/4414

  • 1
    +1. To explain how this doesn't contradict what I wrote in my answer: what's really being said here is "the class of Kripke frames validating $M$ is not first-order." The axiom $M$ itself is easily translated into first-order logic in the sense described in my answer, but my answer is talking "post-valuation" whereas validity is talking about possible valuations. Incidentally, the Lob axiom $\Box(\Box p\rightarrow p)\rightarrow\Box p$ also gives an example of a sentence whose validating class isn't first-order definable. – Noah Schweber Oct 30 '18 at 02:33
  • According to van Benthem you cannot translate M to first order. Since it is $Π_1^1$. See for yourself: https://staff.science.uva.nl/j.vanbenthem/docs/CT.pdf , in premisse position, as an axiom M, you cannot turn a $\forall P(...) \vDash ...$ into a $\forall P( ... \vDash ...)$, it is equivalent to $\exists P (... \vDash ...)$. There is no problem with the translation in consequence position. –  Oct 30 '18 at 03:17
  • Again, the issue is what you mean when you say "translate $M$ to first-order." The thing that isn't expressible in first-order is the statement "this frame validates $M$." But we can easily write a sentence which holds of (the first-order structure corresponding to) a frame with valuation if and only if every world in the frame satisfies "$\Box\Diamond\varphi\implies \Diamond\Box\varphi$" (where $\varphi$ is a given propositional atom) with respect to that valuation. – Noah Schweber Oct 30 '18 at 03:24
  • Well from the context of my post M should act as an axiom. I don't know anything about your post. So I guess I cannot answer, I didn't go through your post, since I knew the McKinsey axiom from elsewhere. –  Oct 30 '18 at 03:25
  • The $\Pi^1_1$-ness comes from the fact that when we talk about validation, we're quantifying over all possible valuations, and this amounts to treating the propositional atom as an arbitrary subset of the set of worlds, and universally quantifying over it. – Noah Schweber Oct 30 '18 at 03:25
  • No the $Π_1^1$--ness comes from the axiom itself, and because as an axiom it is in premisse position. Before the consequence relation $\vDash$. So the higher order forall quantifier has negative polarity. Thats how I read van Bentham. I might be wrong though. –  Oct 30 '18 at 03:27
  • "Well from the context of my post M should act as an axiom" Yes - the point is that the "naive" interpretation the OP has in mind, that I go into in more detail in my answer, and that is meant when people describe modal logic as being a fragment of first-order logic, is different from the idea of modal sentences axiomatizing classes of frames. That's the clarifying point I wanted to make: the reason your answer doesn't contradict my answer is that we're talking about different things (modal sentences describing frames in your case, versus modal sentences describing frames with valuations). – Noah Schweber Oct 30 '18 at 03:28
  • You are indeed misreading van Bentham - the point is that for any modal sentence $\theta$, the statement "$\theta$ is validated by [frame]" is $\Pi^1_1$ (the point being that such a statement says "for every valuation, [first-order stuff]"). – Noah Schweber Oct 30 '18 at 03:31
  • I'm not fishing for compliments or bragging, and I upvoted your post. I just think it's important to clarify the relationship between your answer and my answer, since it's easy to misread them as contradicting each other. – Noah Schweber Oct 30 '18 at 03:32
  • Why should they contradict each other? Don't worry. Also don't worry about van Bentham, he explains when the $Π_1^1$ falls apart, there is some ultrafilter condition, etc.. etc.. So prima facie the statement has $Π_1^1$, but all the statememts in Joel McCance script, fall apart. Basically some simply by satisfying ∃P(...⊨...), it seems, the so called substitution trick. –  Oct 30 '18 at 03:34
  • Using the above mentioned standard translations (see my edit), if we assume $W={ x,y}, R(x,y), P(x), P(x), \neg P(y)$ and that $R$ is reflexive, symmetric and transitive, then $\square \diamond P \to \diamond \square P$ can be shown to be false. (See http://www.dcproof.com/ModalAxiomM.htm ) How widely applicable can it be? – Dan Christensen Nov 01 '18 at 14:01
  • You do kind of naive statistic sampling, and not mathematical reasoning. Missing the point of modal logic. There are also other sample points, for example a model where it is true: https://gist.github.com/jburse/fac800b585306e4b9d91d8331dc7db44#gistcomment-2748190 So what? –  Nov 01 '18 at 14:04
1

You were asking for correspondence. But your DC Proof results only show "valid if" and not "valid iff". So they only show sufficient conditions, but not necessary conditions. This is a little pointless. For example you have proved:

$$reflexiv+transitive \vdash "\Diamond P \Rightarrow \Box \Diamond P\,Axiom"$$

But now there is a problem that reflexiv+transitive cannot act as a replacement and hence correspondence of $"\Diamond P \Rightarrow \Box \Diamond P \,Axiom"$. It might give false positive. For example reflexiv+transitive gives:

$$reflexiv+transitive \vdash q \Rightarrow \Diamond q$$

But we do not have:

$$"\Diamond P => \Box \Diamond P\,Axiom" \nvdash q \Rightarrow \Diamond q$$

Proof: That reflexiv+transitive gives $q \Rightarrow \Diamond q$, is a consequence that the later is equivalent to $[] \neg p \Rightarrow \neg p$, and hence can use the correspondence for reflexivity. But on the other hand that the later doesn't hold we only need to show at least one counter model.

enter image description here

Model drawn and evaluated with:
https://rkirsling.github.io/modallogic/

You find a couple of "valid iff" proofs here:
http://www2.math.uu.se/~hedin/TillLog/LectureNotesAL.pdf

  • I chose several of the so-called axioms in modal logic given at wiki. Most of them did not require any additional assumptions about the accessibility relation. For the others, I arbitrarily picked one or more of the following to make them work and to illustrate how modal logic works when translated into FOL: reflexivity, symmetry, transitivity. References to these properties seem to show up often in the literature of modal logic. I left the converses of those theorems as an exercise just for you, Jan. You seem to have a lot of time on your hands these days. ;^) – Dan Christensen Nov 02 '18 at 03:38
  • The axiom ◊P=>◻◊P is not equivalent to reflexiv+transitive. It is pointless to prove reflexiv+transitive |- ◊P=>◻◊P. You can easily find counter models with this tool here: https://rkirsling.github.io/modallogic/ And you find a couple of "valid iff" proofs here: http://www2.math.uu.se/~hedin/TillLog/LectureNotesAL.pdf –  Nov 02 '18 at 10:35