Type systems |
---|
General concepts |
Major categories |
|
Minor categories |
See also |
In computer science and logic, a dependent type is a type whose definition depends on a value. It is an overlapping feature of type theory and type systems. In intuitionistic type theory, dependent types are used to encode logic's quantifiers like "for all" and "there exists". In functional programming languages like Agda, ATS, Coq, F*, Epigram, and Idris, dependent types help reduce bugs by enabling the programmer to assign types that further restrain the set of possible implementations.
Two common examples of dependent types are dependent functions and dependent pairs. The return type of a dependent function may depend on the value (not just type) of one of its arguments. For instance, a function that takes a positive integer [math]\displaystyle{ n }[/math] may return an array of length [math]\displaystyle{ n }[/math], where the array length is part of the type of the array. (Note that this is different from polymorphism and generic programming, both of which include the type as an argument.) A dependent pair may have a second value the type of which depends on the first value. Sticking with the array example, a dependent pair may be used to pair an array with its length in a type-safe way.
Dependent types add complexity to a type system. Deciding the equality of dependent types in a program may require computations. If arbitrary values are allowed in dependent types, then deciding type equality may involve deciding whether two arbitrary programs produce the same result; hence the decidability of type checking may depend on the given type theory's semantics of equality, that is, whether the type theory is intensional or extensional.[1]
In 1934, Haskell Curry noticed that the types used in typed lambda calculus, and in its combinatory logic counterpart, followed the same pattern as axioms in propositional logic. Going further, for every proof in the logic, there was a matching function (term) in the programming language. One of Curry's examples was the correspondence between simply typed lambda calculus and intuitionistic logic.[2]
Predicate logic is an extension of propositional logic, adding quantifiers. Howard and de Bruijn extended lambda calculus to match this more powerful logic by creating types for dependent functions, which correspond to "for all", and dependent pairs, which correspond to "there exists".[3]
(Because of this and other work by Howard, propositions-as-types is known as the Curry–Howard correspondence.)
Loosely speaking, dependent types are similar to the type of an indexed family of sets. More formally, given a type [math]\displaystyle{ A:\mathcal{U} }[/math] in a universe of types [math]\displaystyle{ \mathcal{U} }[/math], one may have a family of types [math]\displaystyle{ B:A\to\mathcal{U} }[/math], which assigns to each term [math]\displaystyle{ a:A }[/math] a type [math]\displaystyle{ B(a):\mathcal{U} }[/math]. We say that the type B(a) varies with a.
A function whose type of return value varies with its argument (i.e. there is no fixed codomain) is a dependent function and the type of this function is called dependent product type, pi-type (Π type) or dependent function type.[4] From a family of types [math]\displaystyle{ B: A \to \mathcal{U} }[/math] we may construct the type of dependent functions [math]\displaystyle{ \prod_{x : A} B(x) }[/math], whose terms are functions that take a term [math]\displaystyle{ a : A }[/math] and return a term in [math]\displaystyle{ B(a) }[/math]. For this example, the dependent function type is typically written as [math]\displaystyle{ \prod_{x:A} B(x), }[/math] [math]\displaystyle{ \prod_{x:A} B(x), }[/math] or [math]\displaystyle{ \prod {(x:A)} B(x) }[/math].
If [math]\displaystyle{ B:A\to\mathcal{U} }[/math] is a constant function, the corresponding dependent product type is equivalent to an ordinary function type. That is, [math]\displaystyle{ \prod_{x:A}B }[/math] is judgmentally equal to [math]\displaystyle{ A\to B }[/math] when B does not depend on x.
The name 'Π-type' comes from the idea that these may be viewed as a Cartesian product of types. Π-types can also be understood as models of universal quantifiers.
For example, if we write [math]\displaystyle{ \operatorname{Vec}(\mathbb{R},n) }[/math] for n-tuples of real numbers, then [math]\displaystyle{ \prod_{n:\mathbb{N}} \operatorname{Vec}(\mathbb{R},n) }[/math] would be the type of a function which, given a natural number n, returns a tuple of real numbers of size n. The usual function space arises as a special case when the range type does not actually depend on the input. E.g. [math]\displaystyle{ \prod_{n:\mathbb{N}} {\mathbb{R}} }[/math] is the type of functions from natural numbers to the real numbers, which is written as [math]\displaystyle{ \mathbb{N}\to\mathbb{R} }[/math] in typed lambda calculus.
For a more concrete example, taking A to be the type of unsigned integers from 0 to 255 (the ones that fit into 8 bits or 1 byte) and B(a) = Xa for a : A, then [math]\displaystyle{ \prod_{x:A} B(x) }[/math] devolves into the product of X0 × X1 × X2 × ... × X253 × X254 × X255.
The dual of the dependent product type is the dependent pair type, dependent sum type, sigma-type, or (confusingly) dependent product type.[4] Sigma-types can also be understood as existential quantifiers. Continuing the above example, if, in the universe of types [math]\displaystyle{ \mathcal{U} }[/math], there is a type [math]\displaystyle{ A:\mathcal{U} }[/math] and a family of types [math]\displaystyle{ B:A\to\mathcal{U} }[/math], then there is a dependent pair type [math]\displaystyle{ \sum_{x:A} B(x) }[/math]. (The alternative notations are similar to that of Π types.)
The dependent pair type captures the idea of an ordered pair where the type of the second term is dependent on the value of the first. If [math]\displaystyle{ (a,b):\sum_{x:A} B(x), }[/math] then [math]\displaystyle{ a:A }[/math] and [math]\displaystyle{ b:B(a) }[/math]. If B is a constant function, then the dependent pair type becomes (is judgementally equal to) the product type, that is, an ordinary Cartesian product [math]\displaystyle{ A\times B }[/math].[full citation needed]
For a more concrete example, taking A to again be type of unsigned integers from 0 to 255, and B(a) to again be equal to Xa for 256 more arbitrary Xa, then [math]\displaystyle{ \sum_{x:A} B(x) }[/math] devolves into the sum X0 + X1 + X2 + ... + X253 + X254 + X255.
Let [math]\displaystyle{ A:\mathcal{U} }[/math] be some type, and let [math]\displaystyle{ B:A\to\mathcal{U} }[/math]. By the Curry–Howard correspondence, B can be interpreted as a logical predicate on terms of A. For a given [math]\displaystyle{ a:A }[/math], whether the type B(a) is inhabited indicates whether a satisfies this predicate. The correspondence can be extended to existential quantification and dependent pairs: the proposition [math]\displaystyle{ \exists{a}{\in}A\,B(a) }[/math] is true if and only if the type [math]\displaystyle{ \sum_{a:A}B(a) }[/math] is inhabited.
For example, [math]\displaystyle{ m:\mathbb{N} }[/math] is less than or equal to [math]\displaystyle{ n:\mathbb{N} }[/math] if and only if there exists another natural number [math]\displaystyle{ k:\mathbb{N} }[/math] such that m + k = n. In logic, this statement is codified by existential quantification:
[math]\displaystyle{ m\le n \iff \exists{k}{\in}\mathbb{N}\,m+k=n. }[/math]
This proposition corresponds to the dependent pair type:
[math]\displaystyle{ \sum_{k:\mathbb{N}} m+k=n. }[/math]
That is, a proof of the statement that m is less than or equal to n is a pair that contains both a non-negative number k, which is the difference between m and n, and a proof of the equality m + k = n.
Henk Barendregt developed the lambda cube as a means of classifying type systems along three axes. The eight corners of the resulting cube-shaped diagram each correspond to a type system, with simply typed lambda calculus in the least expressive corner, and calculus of constructions in the most expressive. The three axes of the cube correspond to three different augmentations of the simply typed lambda calculus: the addition of dependent types, the addition of polymorphism, and the addition of higher kinded type constructors (functions from types to types, for example). The lambda cube is generalized further by pure type systems.
The system [math]\displaystyle{ \lambda \Pi }[/math] of pure first order dependent types, corresponding to the logical framework LF, is obtained by generalising the function space type of the simply typed lambda calculus to the dependent product type.
The system [math]\displaystyle{ \lambda \Pi 2 }[/math] of second order dependent types is obtained from [math]\displaystyle{ \lambda \Pi }[/math] by allowing quantification over type constructors. In this theory the dependent product operator subsumes both the [math]\displaystyle{ \to }[/math] operator of simply typed lambda calculus and the [math]\displaystyle{ \forall }[/math] binder of System F.
The higher order system [math]\displaystyle{ \lambda \Pi \omega }[/math] extends [math]\displaystyle{ \lambda \Pi 2 }[/math] to all four forms of abstraction from the lambda cube: functions from terms to terms, types to types, terms to types and types to terms. The system corresponds to the calculus of constructions whose derivative, the calculus of inductive constructions is the underlying system of the Coq proof assistant.
The Curry–Howard correspondence implies that types can be constructed that express arbitrarily complex mathematical properties. If the user can supply a constructive proof that a type is inhabited (i.e., that a value of that type exists) then a compiler can check the proof and convert it into executable computer code that computes the value by carrying out the construction. The proof checking feature makes dependently typed languages closely related to proof assistants. The code-generation aspect provides a powerful approach to formal program verification and proof-carrying code, since the code is derived directly from a mechanically verified mathematical proof.
Language | Actively developed | Paradigm[lower-alpha 1] | Tactics | Proof terms | Termination checking | Types can depend on[lower-alpha 2] | Universes | Proof irrelevance | Program extraction | Extraction erases irrelevant terms |
---|---|---|---|---|---|---|---|---|---|---|
Ada 2012 | Yes[5] | Imperative | Yes[6] | No | ? | Any term[lower-alpha 3] | ? | ? | Ada | ? |
Agda | Yes[7] | Purely functional | Few/limited[lower-alpha 4] | Yes | Yes (optional) | Any term | Yes (optional)[lower-alpha 5] | Proof-irrelevant arguments[9] Proof-irrelevant propositions[10] | Haskell, JavaScript | Yes[9] |
ATS | Yes[11] | Functional / imperative | No[12] | Yes | Yes | Static terms[13] | ? | Yes | Yes | Yes |
Cayenne | No | Purely functional | No | Yes | No | Any term | No | No | ? | ? |
Gallina (Coq) |
Yes[14] | Purely functional | Yes | Yes | Yes | Any term | Yes[lower-alpha 6] | Yes[15] | Haskell, Scheme and OCaml | Yes |
Dependent ML | No[lower-alpha 7] | ? | ? | Yes | ? | Natural numbers | ? | ? | ? | ? |
F* | Yes[16] | Functional and imperative | Yes[17] | Yes | Yes (optional) | Any pure term | Yes | Yes | OCaml, F#, and C | Yes |
Guru | No[18] | Purely functional[19] | hypjoin[20] | Yes[19] | Yes | Any term | No | Yes | Carraway | Yes |
Idris | Yes[21] | Purely functional[22] | Yes[23] | Yes | Yes (optional) | Any term | Yes | No | Yes | Yes[23] |
Lean | Yes | Purely functional | Yes | Yes | Yes | Any term | Yes | Yes | Yes | Yes |
Matita | Yes[24] | Purely functional | Yes | Yes | Yes | Any term | Yes | Yes | OCaml | Yes |
NuPRL | Yes | Purely functional | Yes | Yes | Yes | Any term | Yes | ? | Yes | ? |
PVS | Yes | ? | Yes | ? | ? | ? | ? | ? | ? | ? |
Sage | No[lower-alpha 8] | Purely functional | No | No | No | ? | No | ? | ? | ? |
Twelf | Yes | Logic programming | ? | Yes | Yes (optional) | Any (LF) term | No | No | ? | ? |
Original source: https://en.wikipedia.org/wiki/Dependent type.
Read more |