Paradigm | Multi-paradigm: functional, imperative, object-oriented |
---|---|
Designed by | Walter Bright, Andrei Alexandrescu (since 2007) |
Developer | D Language Foundation |
First appeared | 8 December 2001[1] |
Typing discipline | Inferred, static, strong |
OS | FreeBSD, Linux, macOS, Windows |
License | Boost[2][3][4] |
Filename extensions | .d[5][6] |
Website | dlang |
Major implementations | |
DMD (reference implementation), GCC,
GDC, LDC, SDC | |
Influenced by | |
BASIC,[7] C, C++, C#, Eiffel,[8] Java, Python | |
Influenced | |
Genie, MiniD, Qore, Swift,[9] Vala, C++11, C++14, C++17, C++20, Go, C#, and others. | |
|
D, also known as dlang, is a multi-paradigm system programming language created by Walter Bright at Digital Mars and released in 2001. Andrei Alexandrescu joined the design and development effort in 2007. Though it originated as a re-engineering of C++, D is now a very different language drawing inspiration from other high-level programming languages, notably Java, Python, Ruby, C#, and Eiffel.
The D language reference describes it as follows:
D is a general-purpose systems programming language with a C-like syntax that compiles to native code. It is statically typed and supports both automatic (garbage collected) and manual memory management. D programs are structured as modules that can be compiled separately and linked with external libraries to create native libraries or executables.
D is not source compatible with C and C++ source code in general. However, any code that is legal in both C and D should behave in the same way.
Like C++, D has closures, anonymous functions, compile-time function execution, ranges, built-in container iteration concepts, and type inference. Unlike C++, D also implements design by contract, modules, garbage collection, first class arrays, array slicing, nested functions and lazy evaluation. D uses Java-style single inheritance with interfaces and mixins rather than C++-style multiple inheritance. On the other hand, D's declaration, statement and expression syntax closely matches that of C++.
D is a systems programming language. Like C++, D supports low-level programming including inline assembler, which typifies the differences between D and application languages like Java and C#. Inline assembler lets programmers enter machine-specific assembly code within standard D code, a method used by system programmers to access the low-level features of the processor needed to run programs that interface directly with the underlying hardware, such as operating systems and device drivers, as well as writing high-performance code (i.e. using vector extensions, SIMD) that is hard to generate by the compiler automatically.
D supports function overloading and operator overloading. Symbols (functions, variables, classes) can be declared in any order - forward declarations are not required.
In D, text character strings are arrays of characters, and arrays in D are bounds-checked.[10] D has first class types for complex and imaginary numbers.[11]
D supports five main programming paradigms:
Imperative programming in D is almost identical to that in C. Functions, data, statements, declarations and expressions work just as they do in C, and the C runtime library may be accessed directly. On the other hand, some notable differences between D and C in the area of imperative programming include D's foreach
loop construct, which allows looping over a collection, and nested functions, which are functions that are declared inside another and may access the enclosing function's local variables.
import std.stdio; void main() { int multiplier = 10; int scaled(int x) { return x * multiplier; } foreach (i; 0 .. 10) { writefln("Hello, world %d! scaled = %d", i, scaled(i)); } }
Object-oriented programming in D is based on a single inheritance hierarchy, with all classes derived from class Object. D does not support multiple inheritance; instead, it uses Java-style interfaces, which are comparable to C++'s pure abstract classes, and mixins, which separates common functionality from the inheritance hierarchy. D also allows the defining of static and final (non-virtual) methods in interfaces.
Interfaces and inheritance in D support covariant types for return types of overridden methods.
D supports type forwarding, as well as optional custom dynamic dispatch.
Classes (and interfaces) in D can contain invariants which are automatically checked before and after entry to public methods, in accordance with the design by contract methodology.
Many aspects of classes (and structs) can be introspected automatically at compile time (a form of reflection using type traits
) and at run time (RTTI / TypeInfo
), to facilitate generic code or automatic code generation (usually using compile-time techniques).
D supports functional programming features such as function literals, closures, recursively-immutable objects and the use of higher-order functions. There are two syntaxes for anonymous functions, including a multiple-statement form and a "shorthand" single-expression notation:[12]
int function(int) g; g = (x) { return x * x; }; // longhand g = (x) => x * x; // shorthand
There are two built-in types for function literals, function
, which is simply a pointer to a stack-allocated function, and delegate
, which also includes a pointer to the relevant stack frame, the surrounding ‘environment’, which contains the current local variables. Type inference may be used with an anonymous function, in which case the compiler creates a delegate
unless it can prove that an environment pointer is not necessary. Likewise, to implement a closure, the compiler places enclosed local variables on the heap only if necessary (for example, if a closure is returned by another function, and exits that function's scope). When using type inference, the compiler will also add attributes such as pure
and nothrow
to a function's type, if it can prove that they apply.
Other functional features such as currying and common higher-order functions such as map, filter, and reduce are available through the standard library modules std.functional
and std.algorithm
.
import std.stdio, std.algorithm, std.range; void main() { int[] a1 = [0, 1, 2, 3, 4, 5, 6, 7, 8, 9]; int[] a2 = [6, 7, 8, 9]; // must be immutable to allow access from inside a pure function immutable pivot = 5; int mySum(int a, int b) pure nothrow // pure function { if (b <= pivot) // ref to enclosing-scope return a + b; else return a; } // passing a delegate (closure) auto result = reduce!mySum(chain(a1, a2)); writeln("Result: ", result); // Result: 15 // passing a delegate literal result = reduce!((a, b) => (b <= pivot) ? a + b : a)(chain(a1, a2)); writeln("Result: ", result); // Result: 15 }
Alternatively, the above function compositions can be expressed using Uniform function call syntax (UFCS) for more natural left-to-right reading:
auto result = a1.chain(a2).reduce!mySum(); writeln("Result: ", result); result = a1.chain(a2).reduce!((a, b) => (b <= pivot) ? a + b : a)(); writeln("Result: ", result);
Parallel programming concepts are implemented in the library, and do not require extra support from the compiler. However the D type system and compiler ensure that data sharing can be detected and managed transparently.
import std.stdio : writeln; import std.range : iota; import std.parallelism : parallel; void main() { foreach (i; iota(11).parallel) { // The body of the foreach loop is executed in parallel for each i writeln("processing ", i); } }
iota(11).parallel
is equivalent to std.parallelism.parallel(iota(11))
by using UFCS.
The same module also supports taskPool
which can be used for dynamic creation of parallel tasks, as well as map-filter-reduce and fold style operations on ranges (and arrays), which is useful when combined with functional operations. std.algorithm.map
returns a lazily evaluated range rather than an array. This way, the elements are computed by each worker task in parallel automatically.
import std.stdio : writeln; import std.algorithm : map; import std.range : iota; import std.parallelism : taskPool; /* On Intel i7-3930X and gdc 9.3.0: * 5140ms using std.algorithm.reduce * 888ms using std.parallelism.taskPool.reduce * On AMD Threadripper 2950X, and gdc 9.3.0: * 2864ms using std.algorithm.reduce * 95ms using std.parallelism.taskPool.reduce */ void main() { auto nums = iota(1.0, 1_000_000_000.0); auto x = taskPool.reduce!"a + b"( 0.0, map!"1.0 / (a * a)"(nums) ); writeln("Sum: ", x); }
Concurrency is fully implemented in the library, and it does not require support from the compiler. Alternative implementations and methodologies of writing concurrent code are possible. The use of D typing system does help ensure memory safety.
import std.stdio, std.concurrency, std.variant; void foo() { bool cont = true; while (cont) { receive( // Delegates are used to match the message type. (int msg) => writeln("int received: ", msg), (Tid sender) { cont = false; sender.send(-1); }, (Variant v) => writeln("huh?") // Variant matches any type ); } } void main() { auto tid = spawn(&foo); // spawn a new thread running foo() foreach (i; 0 .. 10) tid.send(i); // send some integers tid.send(1.0f); // send a float tid.send("hello"); // send a string tid.send(thisTid); // send a struct (Tid) receive((int x) => writeln("Main thread received message: ", x)); }
Metaprogramming is supported through templates, compile-time function execution, tuples, and string mixins. The following examples demonstrate some of D's compile-time features.
Templates in D can be written in a more imperative style compared to the C++ functional style for templates. This is a regular function that calculates the factorial of a number:
ulong factorial(ulong n) { if (n < 2) return 1; else return n * factorial(n-1); }
Here, the use of static if
, D's compile-time conditional construct, is demonstrated to construct a template that performs the same calculation using code that is similar to that of the function above:
template Factorial(ulong n) { static if (n < 2) enum Factorial = 1; else enum Factorial = n * Factorial!(n-1); }
In the following two examples, the template and function defined above are used to compute factorials. The types of constants need not be specified explicitly as the compiler infers their types from the right-hand sides of assignments:
enum fact_7 = Factorial!(7);
This is an example of compile-time function execution (CTFE). Ordinary functions may be used in constant, compile-time expressions provided they meet certain criteria:
enum fact_9 = factorial(9);
The std.string.format
function performs printf
-like data formatting (also at compile-time, through CTFE), and the "msg" pragma displays the result at compile time:
import std.string : format; pragma(msg, format("7! = %s", fact_7)); pragma(msg, format("9! = %s", fact_9));
String mixins, combined with compile-time function execution, allow for the generation of D code using string operations at compile time. This can be used to parse domain-specific languages, which will be compiled as part of the program:
import FooToD; // hypothetical module which contains a function that parses Foo source code // and returns equivalent D code void main() { mixin(fooToD(import("example.foo"))); }
Memory is usually managed with garbage collection, but specific objects may be finalized immediately when they go out of scope. This is what the majority of programs and libraries written in D use.
In case more control over memory layout and better performance is needed, explicit memory management is possible using the overloaded operator new
, by calling C's malloc and free directly, or implementing custom allocator schemes (i.e. on stack with fallback, RAII style allocation, reference counting, shared reference counting). Garbage collection can be controlled: programmers may add and exclude memory ranges from being observed by the collector, can disable and enable the collector and force either a generational or full collection cycle.[13] The manual gives many examples of how to implement different highly optimized memory management schemes for when garbage collection is inadequate in a program.[14]
In functions, struct
instances are by default allocated on the stack, while class
instances by default allocated on the heap (with only reference to the class instance being on the stack). However this can be changed for classes, for example using standard library template std.typecons.scoped
, or by using new
for structs and assigning to a pointer instead of a value-based variable.[15]
In functions, static arrays (of known size) are allocated on the stack. For dynamic arrays, one can use the core.stdc.stdlib.alloca
function (similar to alloca
in C), to allocate memory on the stack. The returned pointer can be used (recast) into a (typed) dynamic array, by means of a slice (however resizing array, including appending must be avoided; and for obvious reasons they must not be returned from the function).[15]
A scope
keyword can be used both to annotate parts of code, but also variables and classes/structs, to indicate they should be destroyed (destructor called) immediately on scope exit. Whatever the memory is deallocated also depends on implementation and class-vs-struct differences.[16]
std.experimental.allocator
contains a modular and composable allocator templates, to create custom high performance allocators for special use cases.[17]
SafeD[18]
is the name given to the subset of D that can be guaranteed to be memory safe (no writes to memory that has not been allocated or that has been recycled). Functions marked @safe
are checked at compile time to ensure that they do not use any features that could result in corruption of memory, such as pointer arithmetic and unchecked casts, and any other functions called must also be marked as @safe
or @trusted
. Functions can be marked @trusted
for the cases where the compiler cannot distinguish between safe use of a feature that is disabled in SafeD and a potential case of memory corruption.[19]
Initially under the banners of DIP1000[20] and DIP25[21] (now part of the language specification[22]), D provides protections against certain ill-formed constructions involving the lifetimes of data.
The current mechanisms in place primarily deal with function parameters and stack memory however it is a stated ambition of the leadership of the programming language to provide a more thorough treatment of lifetimes within the D programming language[23] (influenced by ideas from Rust programming language).
Within @safe code, the lifetime of an assignment involving a reference type is checked to ensure that the lifetime of the assignee is longer than that of the assigned.
For example:
@safe void test() { int tmp = 0; // #1 int* rad; // #2 rad = &tmp; // If the order of the declarations of #1 and #2 is reversed, this fails. { int bad = 45; // The lifetime of "bad" only extends to the scope in which it is defined. *rad = bad; // This is valid. rad = &bad; // The lifetime of rad is longer than bad, hence this is not valid. } }
When applied to function parameter which are either of pointer type or references, the keywords return and scope constrain the lifetime and use of that parameter.
The language standard dictates the following behaviour:[24]
Storage Class | Behaviour (and constraints to) of a parameter with the storage class |
---|---|
scope | References in the parameter cannot be escaped. Ignored for parameters with no references |
return | Parameter may be returned or copied to the first parameter, but otherwise does not escape from the function. Such copies are required not to outlive the argument(s) they were derived from. Ignored for parameters with no references |
An annotated example is given below.
@safe: int* gp; void thorin(scope int*); void gloin(int*); int* balin(return scope int* p, scope int* q, int* r) { gp = p; // Error, p escapes to global variable gp. gp = q; // Error, q escapes to global variable gp. gp = r; // OK. thorin(p); // OK, p does not escape thorin(). thorin(q); // OK. thorin(r); // OK. gloin(p); // Error, p escapes gloin(). gloin(q); // Error, q escapes gloin(). gloin(r); // OK that r escapes gloin(). return p; // OK. return q; // Error, cannot return 'scope' q. return r; // OK. }
C's application binary interface (ABI) is supported, as well as all of C's fundamental and derived types, enabling direct access to existing C code and libraries. D bindings are available for many popular C libraries. Additionally, C's standard library is part of standard D.
On Microsoft Windows, D can access Component Object Model (COM) code.
As long as memory management is properly taken care of, many other languages can be mixed with D in a single binary. For example, the GDC compiler allows to link and intermix C, C++, and other supported language codes such as Objective-C. D code (functions) can also be marked as using C, C++, Pascal ABIs, and thus be passed to the libraries written in these languages as callbacks. Similarly data can be interchanged between the codes written in these languages in both ways. This usually restricts use to primitive types, pointers, some forms of arrays, unions, structs, and only some types of function pointers.
Because many other programming languages often provide the C API for writing extensions or running the interpreter of the languages, D can interface directly with these languages as well, using standard C bindings (with a thin D interface file). For example, there are bi-directional bindings for languages like Python,[25] Lua[26][27] and other languages, often using compile-time code generation and compile-time type reflection methods.
For D code marked as extern(C++)
, the following features are specified:
C++ namespaces are used via the syntax extern(C++, namespace)
where namespace is the name of the C++ namespace.
The C++ side
#include <iostream> using namespace std; class Base { public: virtual void print3i(int a, int b, int c) = 0; }; class Derived : public Base { public: int field; Derived(int field) : field(field) {} void print3i(int a, int b, int c) { cout << "a = " << a << endl; cout << "b = " << b << endl; cout << "c = " << c << endl; } int mul(int factor); }; int Derived::mul(int factor) { return field * factor; } Derived *createInstance(int i) { return new Derived(i); } void deleteInstance(Derived *&d) { delete d; d = 0; }
The D side
extern(C++) { abstract class Base { void print3i(int a, int b, int c); } class Derived : Base { int field; @disable this(); override void print3i(int a, int b, int c); final int mul(int factor); } Derived createInstance(int i); void deleteInstance(ref Derived d); } void main() { import std.stdio; auto d1 = createInstance(5); writeln(d1.field); writeln(d1.mul(4)); Base b1 = d1; b1.print3i(1, 2, 3); deleteInstance(d1); assert(d1 is null); auto d2 = createInstance(42); writeln(d2.field); deleteInstance(d2); assert(d2 is null); }
The D programming language has an official subset known as "Better C".[28] This subset forbids access to D features requiring use of runtime libraries other than that of C.
Enabled via the compiler flags "-betterC" on DMD and LDC, and "-fno-druntime" on GDC, Better C may only call into D code compiled under the same flag (and linked code other than D) but code compiled without the Better C option may call into code compiled with it: this will, however, lead to slightly different behaviours due to differences in how C and D handle asserts.
core.thread
)core.sync
Walter Bright started working on a new language in 1999. D was first released in December 2001[1] and reached version 1.0 in January 2007.[29] The first version of the language (D1) concentrated on the imperative, object oriented and metaprogramming paradigms,[30] similar to C++.
Some members of the D community dissatisfied with Phobos, D's official runtime and standard library, created an alternative runtime and standard library named Tango. The first public Tango announcement came within days of D 1.0's release.[31] Tango adopted a different programming style, embracing OOP and high modularity. Being a community-led project, Tango was more open to contributions, which allowed it to progress faster than the official standard library. At that time, Tango and Phobos were incompatible due to different runtime support APIs (the garbage collector, threading support, etc.). This made it impossible to use both libraries in the same project. The existence of two libraries, both widely in use, has led to significant dispute due to some packages using Phobos and others using Tango.[32]
In June 2007, the first version of D2 was released.[33] The beginning of D2's development signaled D1's stabilization. The first version of the language has been placed in maintenance, only receiving corrections and implementation bugfixes. D2 introduced breaking changes to the language, beginning with its first experimental const system. D2 later added numerous other language features, such as closures, purity, and support for the functional and concurrent programming paradigms. D2 also solved standard library problems by separating the runtime from the standard library. The completion of a D2 Tango port was announced in February 2012.[34]
The release of Andrei Alexandrescu's book The D Programming Language on 12 June 2010, marked the stabilization of D2, which today is commonly referred to as just "D".
In January 2011, D development moved from a bugtracker / patch-submission basis to GitHub. This has led to a significant increase in contributions to the compiler, runtime and standard library.[35]
In December 2011, Andrei Alexandrescu announced that D1, the first version of the language, would be discontinued on 31 December 2012.[36] The final D1 release, D v1.076, was on 31 December 2012.[37]
Code for the official D compiler, the Digital Mars D compiler by Walter Bright, was originally released under a custom license, qualifying as source available but not conforming to the Open Source Definition.[38] In 2014, the compiler front-end was re-licensed as open source under the Boost Software License.[2] This re-licensed code excluded the back-end, which had been partially developed at Symantec. On 7 April 2017, the whole compiler was made available under the Boost license after Symantec gave permission to re-license the back-end, too.[3][39][40][41] On 21 June 2017, the D Language was accepted for inclusion in GCC.[42]
Most current D implementations compile directly into machine code.
Production ready compilers:
Toy and proof-of-concept compilers:
Using above compilers and toolchains, it is possible to compile D programs to target many different architectures, including IA-32, amd64, AArch64, PowerPC, MIPS64, DEC Alpha, Motorola m68k, SPARC, s390, WebAssembly. The primary supported operating systems are Windows and Linux, but various compilers also support Mac OS X, FreeBSD, NetBSD, AIX, Solaris/OpenSolaris and Android, either as a host or target, or both. WebAssembly target (supported via LDC and LLVM) can operate in any WebAssembly environment, like modern web browser (Google Chrome, Mozilla Firefox, Microsoft Edge, Apple Safari), or dedicated Wasm virtual machines.
Editors and integrated development environments (IDEs) supporting syntax highlighting and partial code completion for the language include SlickEdit, Emacs, vim, SciTE, Smultron, Zeus,[54] and Geany among others.[55]
Open source D IDEs for Windows exist, some written in D, such as Poseidon,[67] D-IDE,[68] and Entice Designer.[69]
D applications can be debugged using any C/C++ debugger, like GDB or WinDbg, although support for various D-specific language features is extremely limited. On Windows, D programs can be debugged using Ddbg, or Microsoft debugging tools (WinDBG and Visual Studio), after having converted the debug information using cv2pdb. The ZeroBUGS debugger for Linux has experimental support for the D language. Ddbg can be used with various IDEs or from the command line; ZeroBUGS has its own graphical user interface (GUI).
DustMite is a tool for minimizing D source code, useful when finding compiler or tests issues.[70]
dub is a popular package and build manager for D applications and libraries, and is often integrated into IDE support.[71]
This example program prints its command line arguments. The main
function is the entry point of a D program, and args
is an array of strings representing the command line arguments. A string
in D is an array of characters, represented by immutable(char)[]
.
import std.stdio: writefln; void main(string[] args) { foreach (i, arg; args) writefln("args[%d] = '%s'", i, arg); }
The foreach
statement can iterate over any collection. In this case, it is producing a sequence of indexes (i
) and values (arg
) from the array args
. The index i
and the value arg
have their types inferred from the type of the array args
.
The following shows several D capabilities and D design trade-offs in a short program. It iterates over the lines of a text file named words.txt
, which contains a different word on each line, and prints all the words that are anagrams of other words.
import std.stdio, std.algorithm, std.range, std.string; void main() { dstring[] [dstring] signature2words; foreach (dchar[] w; lines(File("words.txt"))) { w = w.chomp().toLower(); immutable signature = w.dup.sort().release().idup; signature2words[signature] ~= w.idup; } foreach (words; signature2words) { if (words.length > 1) { writeln(words.join(" ")); } } }
signature2words
is a built-in associative array that maps dstring (32-bit / char) keys to arrays of dstrings. It is similar to defaultdict(list)
in Python.lines(File())
yields lines lazily, with the newline. It has to then be copied with idup
to obtain a string to be used for the associative array values (the idup
property of arrays returns an immutable duplicate of the array, which is required since the dstring
type is actually immutable(dchar)[]
). Built-in associative arrays require immutable keys.~=
operator appends a new dstring to the values of the associate dynamic array.toLower
, join
and chomp
are string functions that D allows the use of with a method syntax. The name of such functions are often similar to Python string methods. The toLower
converts a string to lower case, join(" ")
joins an array of strings into a single string using a single space as separator, and chomp
removes a newline from the end of the string if one is present. The w.dup.sort().release().idup
is more readable, but equivalent to release(sort(w.dup)).idup
for example. This feature is called UFCS (Uniform Function Call Syntax), and allows extending any built-in or third party package types with method-like functionality. The style of writing code like this is often referenced as pipeline (especially when the objects used are lazily computed, for example iterators / ranges) or Fluent interface.sort
is an std.algorithm function that sorts the array in place, creating a unique signature for words that are anagrams of each other. The release()
method on the return value of sort()
is handy to keep the code as a single expression.foreach
iterates on the values of the associative array, it is able to infer the type of words
.signature
is assigned to an immutable variable, its type is inferred.dchar[]
is used instead of normal UTF-8 char[]
otherwise sort()
refuses to sort it. There are more efficient ways to write this program using just UTF-8.Notable organisations that use the D programming language for projects include Facebook,[72] eBay,[73] and Netflix.[74]
D has been successfully used for AAA games,[75] language interpreters, virtual machines,[76][77] an operating system kernel,[78] GPU programming,[79] web development,[80][81] numerical analysis,[82] GUI applications,[83][84] a passenger information system,[85] machine learning,[86] text processing, web and application servers and research.
The notorious North Korean hacking group known as Lazarus exploited CVE-2021-44228, aka "Log4Shell," to deploy three malware families written in DLang.[87]
Original source: https://en.wikipedia.org/wiki/D (programming language).
Read more |