Python (programming language)
About this schools Wikipedia selection
SOS believes education gives a better chance in life to children in the developing world too. Do you want to know about sponsoring? See www.sponsorachild.org.uk
Paradigm(s) | multi-paradigm: object-oriented, imperative, functional, reflective |
---|---|
Appeared in | 1991 |
Designed by | Guido van Rossum |
Developer | Python Software Foundation |
Stable release | 3.1.3/ November 27, 2010 2.7.1/ November 27, 2010 |
Preview release | 3.2 rc 1/ January 16, 2011 |
Typing discipline | duck, dynamic, strong |
Major implementations | CPython, IronPython, Jython, Python for S60, PyPy, Unladen Swallow |
Dialects | Cython, RPython, Stackless Python |
Influenced by | ABC, ALGOL 68, C, C++, Haskell, Icon, Java, Lisp, Modula-3, Perl |
Influenced | Boo, Cobra, D, Falcon, Groovy, JavaScript, Ruby |
OS | Cross-platform |
License | Python Software Foundation License |
Usual filename extensions | .py, .pyw, .pyc, .pyo, .pyd |
|
Python is an interpreted, general-purpose high-level programming language whose design philosophy emphasizes code readability. Python aims to combine "remarkable power with very clear syntax", and its standard library is large and comprehensive. Its use of indentation for block delimiters is unique among popular programming languages.
Python supports multiple programming paradigms, primarily but not limited to object oriented, imperative and, to a lesser extent, functional programming styles. It features a fully dynamic type system and automatic memory management, similar to that of Scheme, Ruby, Perl, and Tcl. Like other dynamic languages, Python is often used as a scripting language, but is also used in a wide range of non-scripting contexts.
The reference implementation of Python ( CPython) is free and open source software and has a community-based development model, as do all or nearly all of its alternative implementations. CPython is managed by the non-profit Python Software Foundation.
Python interpreters are available for many operating systems, and Python programs can be packaged into stand-alone executable code for many systems using various tools.
History
Python was conceived in the late 1980s and its implementation was started in December 1989 by Guido van Rossum at CWI in the Netherlands as a successor to the ABC programming language (itself inspired by SETL) capable of exception handling and interfacing with the Amoeba operating system. Van Rossum is Python's principal author, and his continuing central role in deciding the direction of Python is reflected in the title given to him by the Python community, Benevolent Dictator for Life (BDFL).
Python 2.0 was released on 16 October 2000, with many major new features including a full garbage collector and support for Unicode. However, the most important change was to the development process itself, with a shift to a more transparent and community-backed process. Python 3.0, a major, backwards-incompatible release, was released on 3 December 2008 after a long period of testing. Many of its major features have been backported to the backwards-compatible Python 2.6 and 2.7. Python was the winner of the TIOBE Programming Language Award of 2010 for having the greatest programming language market share gain over the course of that year (+1.81%).
Programming philosophy
Python is a multi-paradigm programming language. Rather than forcing programmers to adopt a particular style of programming, it permits several styles: object-oriented programming and structured programming are fully supported, and there are a number of language features which support functional programming and aspect-oriented programming (including by metaprogramming and by magic methods). Many other paradigms are supported using extensions, such as pyDBC and Contracts for Python which allow Design by Contract.
Python uses dynamic typing and a combination of reference counting and a cycle-detecting garbage collector for memory management. An important feature of Python is dynamic name resolution ( late binding), which binds method and variable names during program execution.
Rather than requiring all desired functionality to be built into the language's core, Python was designed to be highly extensible. New built-in modules can be easily written in C, C++ or Cython. Python can also be used as an extension language for existing modules and applications that need a programmable interface. This design of a small core language with a large standard library and an easily extensible interpreter was intended by Van Rossum from the very start because of his frustrations with ABC (which espoused the opposite mindset).
The design of Python offers only limited support for functional programming in the Lisp tradition. However, Python's design philosophy exhibits significant similarities to those of minimalistic Lisp-family languages, such as Scheme. The standard library has two modules (itertools and functools) that implement proven functional tools borrowed from Haskell and Standard ML.
While offering choice in coding methodology, the Python philosophy rejects exuberant syntax, such as in Perl, in favour of a sparser, less-cluttered grammar. Python's developers expressly promote a particular "culture" or ideology based on what they want the language to be, favoring language forms they see as "beautiful", "explicit" and "simple". As Alex Martelli put it in his Python Cookbook (2nd ed., p. 230): "To describe something as clever is NOT considered a compliment in the Python culture." Python's philosophy rejects the Perl " there is more than one way to do it" approach to language design in favour of "there should be one—and preferably only one—obvious way to do it".
Python's developers eschew premature optimization, and moreover, reject patches to non-critical parts of CPython which would offer a marginal increase in speed at the cost of clarity. Python is sometimes described as "slow". However, by the Pareto principle, most problems and sections of programs are not speed critical. When speed is a problem, Python programmers tend to try using a JIT compiler such as Psyco, rewriting the time-critical functions in "closer to the metal" languages such as C, or by translating (a dialect of) Python code to C code using tools like Cython.
The core philosophy of the language is summarized by the document "PEP 20 (The Zen of Python)".
Name and neologisms
An important goal of the Python developers is making Python fun to use. This is reflected in the origin of the name (based on the television series Monty Python's Flying Circus), in the common practice of using Monty Python references in example code, and in an occasionally playful approach to tutorials and reference materials. For example, the metasyntactic variables often used in Python literature are spam and eggs, instead of the traditional foo and bar.
A common neologism in the Python community is pythonic, which can have a wide range of meanings related to program style. To say that a piece of code is pythonic is to say that it uses Python idioms well, that it is natural or shows fluency in the language. Likewise, to say of an interface or language feature that it is pythonic is to say that it works well with Python idioms, that its use meshes well with the rest of the language.
In contrast, a mark of unpythonic code is that it attempts to write C++ (or Lisp, Perl, or Java) code in Python—that is, provides a rough transcription rather than an idiomatic translation of forms from another language. The concept of pythonicity is tightly bound to Python's minimalist philosophy of readability and avoiding the "there's more than one way to do it" approach. Unreadable code or incomprehensible idioms are unpythonic.
Users and admirers of Python—most especially those considered knowledgeable or experienced—are often referred to as Pythonists, Pythonistas, and Pythoneers.
The prefix Py can be used to show that something is related to Python. Examples of the use of this prefix in names of Python applications or libraries include Pygame, a binding of SDL to Python (commonly used to create games); PyS60, an implementation for the Symbian Series 60 Operating System; PyQt and PyGTK, which bind Qt and GTK, respectively, to Python; and PyPy, a Python implementation written in Python. The prefix is also used outside of naming software packages: the major Python conference is named PyCon.
Usage
Python is often used as a scripting language for web applications, e.g. via mod_wsgi for the Apache web server. With Web Server Gateway Interface, a standard API has been developed to facilitate these applications. Web application frameworks like Django, Pylons, TurboGears, web2py, Flask and Zope support developers in the design and maintenance of complex applications. Libraries like NumPy, SciPy and Matplotlib allow Python to be used effectively in scientific computing.
Python has been successfully embedded in a number of software products as a scripting language, including in finite element method software such as Abaqus, 3D animation packages such as Maya, MotionBuilder, Softimage, Cinema 4D, BodyPaint 3D, modo and Blender and 2D imaging programs like GIMP, Inkscape, Scribus and Paint Shop Pro. GNU GDB uses Python as a pretty printer to show complex structures such as C++ containers. ESRI is now promoting Python as the best choice for writing scripts in ArcGIS. It has even been used in several video games and has been adopted as one of the two available scripting languages in Google Docs.
For many operating systems, Python is a standard component; it ships with most Linux distributions, NetBSD, OpenBSD and with Mac OS X and can be used from the terminal. A number of Linux distributions use installers written in Python: Ubuntu uses the Ubiquity installer, while Red Hat Linux and Fedora use the Anaconda installer. Gentoo Linux uses Python in its package management system, Portage and the standard tool to access it, emerge. Pardus uses it for administration and during system boot.
Python has also seen extensive use in the information security industry, including exploit development.
Among the users of Python are YouTube and the original BitTorrent client. Large organizations that make use of Python include Google, Yahoo!, CERN, NASA and ITA. Most of the Sugar software for the One Laptop per Child XO, now developed at Sugar Labs, is written in Python.
Syntax and semantics
Python was intended to be a highly readable language. It is designed to have an uncluttered visual layout, frequently using English keywords where other languages use punctuation. Python requires less boilerplate than traditional manifestly typed structured languages such as C or Pascal, and has a smaller number of syntactic exceptions and special cases than either of these. For a detailed description of the differences between 2.x and 3.x versions, see History of Python.
Indentation
Python uses whitespace indentation, rather than curly braces or keywords, to delimit blocks (a feature also known as the off-side rule). An increase in indentation comes after certain statements; a decrease in indentation signifies the end of the current block.
Statements and control flow
Python's statements include (among others):
- The
if
statement, which conditionally executes a block of code, along withelse
andelif
(a contraction of else-if). - The
for
statement, which iterates over an iterable object, capturing each element to a local variable for use by the attached block. - The
while
statement, which executes a block of code as long as its condition is true. - The
try
statement, which allows exceptions raised in its attached code block to be caught and handled byexcept
clauses; it also ensures that clean-up code in afinally
block will always be run regardless of how the block exits. - The
class
statement, which executes a block of code and attaches its local namespace to a class, for use in object-oriented programming. - The
def
statement, which defines a function or method. - The
with
statement (from Python 2.6), which encloses a code block within a context manager (for example, acquiring a lock before the block of code is run, and releasing the lock afterwards). - The
pass
statement, which serves as a NOP and can be used in place of a code block. - The
assert
statement, used during debugging to check for conditions that ought to apply. - The
yield
statement, which returns a value from a generator function. (From Python 2.5,yield
is also an operator. This form is used to implement coroutines -- see below.)
Each statement has its own semantics: for example, the def
statement does not execute its block immediately, unlike most other statements.
CPython does not support first-class continuations, and according to Guido van Rossum it never will. However, better support for coroutine-like functionality is provided in 2.5, by extending Python's generators. Prior to 2.5, generators were lazy iterators; information was passed unidirectionally out of the generator. As of Python 2.5, it is possible to pass information back into a generator function.
Expressions
Python expressions are similar to languages such as C and Java. Some important notes:
- In Python 2, the / operator on integers does integer division, i.e. it truncates the result to an integer. In Python 3, however, the result of / is always a floating-point value, and a new operator // is introduced to do integer division. In Python 2, this behaviour can be enabled using the statement from __future__ import division or type-cast float(x) / float(y)
- In Python, == compares by value, in contrast to Java, where it compares by reference (value comparisons in Java use the equals method). To compare by reference in Python, use the is operator.
- Python uses the words
and
,or
,not
for its boolean operators rather than the symbolic&&
,||
,!
. - Python has an important type of expression known as a list comprehension. Recent versions of Python have extended list comprehensions into a more general expression known as a generator expression.
- Anonymous functions are implemented using lambda expressions; however, these are limited in that the body can only be a single expression.
- Conditional expressions in Python are written as y if x else z (different in order of operands from the ?: operator common to many other languages).
- Python makes a distinction between lists and tuples. Lists are written as [1, 2, 3], are mutable, and cannot be used as the keys of dictionaries (dictionary keys must be immutable in Python). Tuples are written as (1, 2, 3), are immutable and thus can be used as the keys of dictionaries. The parentheses around the tuple are optional in some contexts. Tuples can appear on the left side of an equal sign; hence an expression like x, y = y, x can be used to swap two variables.
- Python 2 has a "string format" operator %. This functions analogous to printf expressions in C, e.g. "foo=%s bar=%d" % ("blah", 2) evaluates to "foo=blah bar=2". In Python 3 this has been replaced by the format method in the string class, e.g. "foo={0} bar={1}".format("blah", 2).
- Python has various kinds of strings.
- Either single or double quotes can be used to quote strings. Unlike in Unix shell languages, Perl or Perl-influenced languages such as Ruby or Groovy, single quotes and double quotes function identically, i.e. there is no string interpolation of $foo expressions.
- There are also multi-line strings, which begin and end with a series of three single or double quotes and function like here documents in shell languages, Perl and Ruby.
- Finally, all of the previously-mentioned string types come in " raw" varieties (denoted by placing a literal r before the opening quote), which do no backslash-interpolation and hence are very useful for regular expressions or Windows-style paths; compare "@-quoting" in C#.
- Python has slice expressions on lists, denoted as ...[left:right] or ...[left:right:stride]. For example, if the variable nums is assigned the list [1, 3, 5, 7, 8, 13, 20], then the following expressions will evaluate True:
- nums[2:5] == [5, 7, 8], i.e. the slice goes up to, but not including, the right index.
- nums[1:] == [3, 5, 7, 8, 13, 20], i.e. all elements but the first, because an omitted right index means "the end".
- nums[:-3] == [1, 3, 5, 7], i.e. an omitted left index means "the start", and a negative index (either left or right) counts from the end.
- nums[:] makes a copy of the list. This means nums == nums[:] is true but nums is nums[:] is false. Changes to the copy will not affect the original.
- nums[1:5:2] == [3, 7], i.e. if three numbers are given, the third is called the "stride", indicating in this case that every second element will be selected.
In Python, a distinction between expressions and statements is rigidly enforced, in contrast to languages such as Common Lisp, Scheme or Ruby. This leads to some duplication of functionality, e.g.
- list comprehensions vs. "for" loops
- conditional expressions vs. "if" blocks
- The eval() vs. exec() builtins (in Python 2, exec is a statement declarator); eval() is for expressions, exec() is for statements.
Statements cannot be a part of an expression and so list and other comprehensions or lambda expressions, all being expressions, cannot contain statements. A particular case of this is that an assignment statement such as 'a =1' cannot form part of the conditional expression of a conditional statement; this has the advantage of avoiding a classic C error of mistaking an assignment token '=', for an equality operator '==' which would remain valid C in if (c = 1) { ... } but if c = 1: ... is invalid Python code.
Methods
Methods on objects are functions attached to the object's class; the syntax instance.method(argument)
is, for normal methods and functions, syntactic sugar for Class.method(instance, argument)
. Python methods have an explicit self
parameter to access instance data, in contrast to the implicit self in some other object-oriented programming languages (for example, Java, C++ or Ruby).
Typing
Python uses duck typing and has typed objects but untyped variable names. Type constraints are not checked at compile time; rather, operations on an object may fail, signifying that the given object is not of a suitable type. Despite being dynamically typed, Python is strongly typed, forbidding operations that are not well-defined (for example, adding a number to a string) rather than silently attempting to make sense of them.
Python allows programmers to define their own types using classes, which are most often used for object-oriented programming. New instances of classes are constructed by calling the class (for example, SpamClass()
or EggsClass()
), and the classes themselves are instances of the metaclass type
(itself an instance of itself), allowing metaprogramming and reflection.
Prior to version 3.0, Python had two kinds of classes: "old-style" and "new-style". Old-style classes were eliminated in Python 3.0, making all classes new-style. In versions between 2.2 and 3.0, both kinds of classes could be used. The syntax of both styles is the same, the difference being whether the class object
is inherited from, directly or indirectly (all new-style classes inherit from object
and are instances of type
).
Here is a summary of Python's built-in types:
Type | Description | Syntax example |
---|---|---|
str |
An immutable sequence of characters. In Python 2, strings are a sequence of characters. Unicode strings need to be declared by prefixing with the letter u. In Python 3 strings are Unicode by default. | 'Wikipedia' "Wikipedia" """Spanning |
bytes |
An immutable sequence of bytes | b'Some ASCII' b"Some ASCII" |
list |
Mutable, can contain mixed types | [4.0, 'string', True] |
tuple |
Immutable, can contain mixed types | (4.0, 'string', True) |
set , frozenset |
Unordered, contains no duplicates. A frozenset is immutable. |
{4.0, 'string', True} frozenset([4.0, 'string', True]) |
dict |
A mutable group of key and value pairs | {'key1': 1.0, 3: False} |
int |
An immutable fixed precision number of unlimited magnitude | 42 |
float |
An immutable floating point number (system-defined precision) | 3.1415927 |
complex |
An immutable complex number with real number and imaginary parts | 3+2.7j |
bool |
An immutable truth value | True False |
Mathematics
Python defines the modulus operator so that the result of a % b
is in the open interval [0,b)
, where b
is a positive integer. (When b
is negative, the result lies in the interval (b,0]). This is the usual way of defining the modulus operation in mathematics. However, this consequently affects how integer division is defined. To maintain the validity of the equation b * (a // b) + a % b = a
, Integer division is defined to round towards minus infinity. Therefore 7 // 3
is 2, but (−7) // 3
is −3. This is different from many programming languages, where the result of integer division rounds towards zero and the modulus operator is consequently defined in a way which can return negative numbers.
Python provides a round
function for rounding floats to integers. Version 2.6.1 and lower use round-away-from-zero: round(0.5)
is 1.0, round(-0.5)
is -1.0. Version 3.0 and higher use round-to-even: round(1.5)
is 2.0, round(2.5)
is 2.0. The Decimal
type/class in module decimal
(since version 2.4) provides exact numerical representation and several rounding modes.
Python allows boolean expressions with multiple equality relations in a manner that is consistent with general usage in mathematics. For example, the expression a < b < c
tests whether a
is less than b
and b
is less than c
. C-derived languages interpret this expression differently: in C, the expression would first evaluate a < b
, resulting in 0 or 1, and that result would then be compared with c
.
Implementations
CPython
The mainstream Python implementation, known as CPython, is written in C meeting the C89 standard. CPython compiles Python programs into intermediate bytecode, which are then executed by the virtual machine. It is distributed with a large standard library written in a mixture of C and Python. CPython ships in versions for many platforms, including Microsoft Windows and most modern Unix-like systems. CPython was intended from almost its very conception to be cross-platform; its use and development on esoteric platforms such as Amoeba, alongside more conventional ones like Unix and Mac OS, has greatly helped in this regard.
Stackless Python is a significant fork of CPython that implements microthreads; it does not use the C memory stack. It can be expected to run on approximately the same platforms that CPython runs on.
Google started a project called Unladen Swallow in 2009 with the aims of increasing the speed of the Python interpreter by 5 times and improving its multithreading ability to scale to thousands of cores.
Alternative implementations
Jython compiles the Python program into Java byte code, which can then be executed by every Java Virtual Machine implementation. This also enables the use of Java class library functions from the Python program. IronPython follows a similar approach in order to run Python programs on the .NET Common Language Runtime. PyPy is an experimental self-hosting implementation of Python, written in Python, that can output several types of bytecode, object code and intermediate languages. There also exist compilers to high-level object languages, with either unrestricted Python, a restricted subset of Python, or a language similar to Python as the source language. PyPy is of this type, compiling RPython to several languages; other examples include Pyjamas compiling to JavaScript; Shed Skin compiling to C++; and Cython and Pyrex compiling to C.
In 2005 Nokia released a Python interpreter for the Series 60 mobile phones called PyS60. It includes many of the modules from the CPython implementations and some additional modules for integration with the Symbian operating system. This project has been kept up to date to run on all variants of the S60 platform and there are several third party modules available. The Nokia N900 also supports Python with gtk windows libraries, with the feature that programs can be both written and run on the device itself. There is also a Python interpreter for Windows CE devices (including Pocket PC). It is called PythonCE. There are additional tools available for easy application and GUI development.
The PyMite virtual machine began in 2000 and made its first public appearance at PyCon 2003. PyMite was folded into Python-on-a-Chip in 2009. Python-on-a-Chip (p14p) is a project to develop a reduced Python virtual machine (codenamed PyMite) that runs a significant subset of the Python language on microcontrollers without an OS in as little as 4KB of RAM.
Around 2004, the Pyastra project created a specialized translator and assembler that targets very resource-constrained microcontrollers.
ChinesePython (中蟒) is a Python programming language using Chinese language lexicon. Besides reserved words and variable names, most data type operations can be coded in Chinese as well.
Interpretational semantics
Most Python implementations (including CPython) can function as a command line interpreter, for which the user enters statements sequentially and receives the results immediately. In short, Python acts as a shell. While the semantics of the other modes of execution (bytecode compilation, or compilation to native code) preserve the sequential semantics, they offer a speed boost at the cost of interactivity, so they are usually only used outside of a command-line interaction (e.g., when importing a module).
Other shells add capabilities beyond those in the basic interpreter, including IDLE and IPython. While generally following the visual style of the Python shell, they implement features like auto-completion, retention of session state, and syntax highlighting.
Some implementations can compile not only to bytecode, but can turn Python code into machine code. So far, this has only been done for restricted subsets of Python. PyPy takes this approach, naming its restricted compilable version of Python RPython.
Psyco is a specialising just in time compiler that integrates with CPython and transforms bytecode to machine code at runtime. The produced code is specialised for certain data types and is faster than standard Python code. Psyco is compatible with all Python code, not only a subset.
Development
Python development is conducted largely through the Python Enhancement Proposal (PEP) process. PEPs are standardized design documents providing general information related to Python, including proposals, descriptions, design rationales, and explanations for language features. Outstanding PEPs are reviewed and commented upon by Van Rossum, the Python project's Benevolent Dictator for Life (leader / language architect). CPython's developers also communicate over a mailing list, python-dev, which is the primary forum for discussion about the language's development; specific issues are discussed in the Roundup bug tracker maintained at python.org. Development takes place at the self-hosted svn.python.org.
CPython's public releases come in three types, distinguished by which part of the version number is incremented:
- backwards-incompatible versions, where code is expected to break and must be manually ported. The first part of the version number is incremented. These releases happen infrequently—for example, version 3.0 was released 8 years after 2.0.
- major or 'feature' releases, which are largely compatible but introduce new features. The second part of the version number is incremented. These releases are scheduled to occur roughly every 18 months, and each major version is supported by bugfixes for several years after its release.
- bugfix releases, which introduce no new features but fix bugs. The third and final part of the version number is incremented. These releases are made whenever a sufficient number of bugs have been fixed upstream since the last release, or roughly every 3 months. Security vulnerabilities are also patched in bugfix releases.
A number of alpha, beta, and release-candidates are also released as previews and for testing before the final release is made. Although there is a rough schedule for each release, this is often pushed back if the code is not ready. The development team monitor the state of the code by running the large unit test suite during development, and using the BuildBot continuous integration system.
Standard library
Python has a large standard library, commonly cited as one of Python's greatest strengths, providing pre-written tools suited to many tasks. This is deliberate and has been described as a "batteries included" Python philosophy. The modules of the standard library can be augmented with custom modules written in either C or Python. Boost C++ Libraries includes a library, Boost.Python, to enable interoperability between C++ and Python. Because of the wide variety of tools provided by the standard library, combined with the ability to use a lower-level language such as C and C++, which is already capable of interfacing between other libraries, Python can be a powerful glue language between languages and tools.
The standard library is particularly well tailored to writing Internet-facing applications, with a large number of standard formats and protocols (such as MIME and HTTP) already supported. Modules for creating graphical user interfaces, connecting to relational databases, arithmetic with arbitrary precision decimals, manipulating regular expressions, and doing unit testing are also included.
Some parts of the standard library are covered by specifications (for example, the WSGI implementation wsgiref
follows PEP 333), but the majority of the modules are not. They are specified by their code, internal documentation, and test suite (if supplied). However, because most of the standard library is cross-platform Python code, there are only a few modules that must be altered or completely rewritten by alternative implementations.
The standard library is not essential to run Python or embed Python within an application. Blender 2.49 for instance omits most of the standard library.
For software testing, the standard library provides the unittest
and doctest
modules.
Influences on other languages
Python's design and philosophy have influenced several programming languages, including:
- Pyrex and its derivative Cython are code translators that are targeted at writing fast C extensions for the CPython interpreter. The language is mostly Python with syntax extensions for C and C++ features. Both languages produce compilable C code as output.
- Boo uses indentation, a similar syntax, and a similar object model. However, Boo uses static typing and is closely integrated with the .NET framework.
- Cobra uses indentation and a similar syntax. Cobra's "Acknowledgements" document lists Python first among languages that influenced it. However, Cobra directly supports design-by-contract, unit tests and optional static typing.
- ECMAScript borrowed iterators, generators, and list comprehensions from Python.
- Go is described as incorporating the "development speed of working in a dynamic language like Python".
- Groovy was motivated by the desire to bring the Python design philosophy to Java.
- OCaml has an optional syntax, called twt (The Whitespace Thing), inspired by Python and Haskell.
Python's development practices have also been emulated by other languages. The practice of requiring a document describing the rationale for, and issues surrounding, a change to the language (in Python's case, a PEP) is also used in Tcl and Erlang because of Python's influence.