WIKIBOOKS
DISPONIBILI
?????????
ART
- Great Painters
BUSINESS&LAW
- Accounting
- Fundamentals of Law
- Marketing
- Shorthand
CARS
- Concept Cars
GAMES&SPORT
- Videogames
- The World of Sports
COMPUTER TECHNOLOGY
- Blogs
- Free Software
- Google
- My Computer
- PHP Language and Applications
- Wikipedia
- Windows Vista
EDUCATION
- Education
LITERATURE
- Masterpieces of English Literature
LINGUISTICS
- American English
- English Dictionaries
- The English Language
MEDICINE
- Medical Emergencies
- The Theory of Memory
MUSIC&DANCE
- The Beatles
- Dances
- Microphones
- Musical Notation
- Music Instruments
SCIENCE
- Batteries
- Nanotechnology
LIFESTYLE
- Cosmetics
- Diets
- Vegetarianism and Veganism
TRADITIONS
- Christmas Traditions
NATURE
- Animals
- Fruits And Vegetables

ARTICLES IN THE BOOK

A Dictionary of Americanisms
A Dictionary of the English Language
A Greek-English Lexicon
A Latin Dictionary
American and British English spelling differences
Anagram dictionary
Answers.com
Babel Fish
Babylon Ltd
Bank of English
Basic English
Bilingual dictionary
Black's Law Dictionary
Brewer's Dictionary of Irish Phrase and Fable
Brewer's Dictionary of Phrase and Fable
British National Corpus
Bryson's Dictionary of Troublesome Words
Canadian Oxford Dictionary
Centre for Lexicography
Chambers Dictionary
COBUILD
Collaborative International Dictionary of English
Concise Oxford Dictionary
Corpus linguistics
Defining vocabulary
Definition
Descriptionary
DICT
Dictionary
Dictionary of American English
Dictionary of American Regional English
Dictionary of National Biography
Dictionary of Received Ideas
Dictionary of the Scots Language
Dord
Dorland's Medical Dictionary
Easton's Bible Dictionary
Electronic dictionary
Encyclopedic dictionary
English language
Etymological dictionary
Etymology
FrameNet
Franklin Electronic Publishers
Freedict
Free On-line Dictionary of Computing
Free On-line Dictionary of Philosophy
Gazetteer
Gloss
Glossary
Glyph
Gnome-dictionary
Grady Ward
Grammar
HarperCollins
Harvard Dictionary of Music
Headword
Idiom dictionary
Imperial Dictionary
Interglot
James Murray
Jargon File
KMLE Medical Dictionary
Law dictionary
Legal lexicography
Lemma
LEO
Lexeme
Lexicographic error
Lexicographic information cost
Lexicography
Lexicon
Lexicon technicum
Lexigraf
Linguistic Data Consortium
List of online dictionaries
Logos Dictionary
Longman
LSP dictionary
Macquarie Dictionary
Main Page
Maximizing dictionary
Medical dictionary
Merriam-Webster
Merriam-Webster%27s Geographical Dictionary
Minimizing dictionary
Moby Project
Moby Thesaurus
Monolingual learner's dictionary
Multi-field dictionary
New Oxford American Dictionary
New Oxford Dictionary of English
Noah Webster
Official Scrabble Players Dictionary
OmniDictionary
OneLook
Online Etymology Dictionary
Oxford Advanced Learner%27s Dictionary
Oxford Classical Dictionary
Oxford Dictionary of Byzantium
Oxford Dictionary of English Etymology
Oxford Dictionary of World Religions
Oxford English Corpus
Oxford English Dictionary
Oxford spelling
Oxford University Press
Project Gutenberg
Pronunciation
Pseudodictionary
Quotations
Random House Dictionary of the English Language
Reference.com
Rhyming dictionary
Roger's Profanisaurus
Roget's Thesaurus
Samuel Johnson
Shorter Oxford English Dictionary
Single-field dictionary
Slang dictionary
Specialised lexicography
Specialized dictionary
Spelling
StarDict
Sub-field dictionary
Synonyms
Table Alphabeticall
The Century Dictionary
The Computer Contradictionary
The Devil's Dictionary
The Devil's Dictionary X
TheFreeDictionary.com
The Oxford Dictionary of Philosophy
The Oxford Dictionary of Quotations
Thesaurus
The Surgeon of Crowthorne
Translation dictionary
Urban Dictionary
Vines Expository Dictionary
Webster's Dictionary
Webster's New World Dictionary
Wikipedia
Wiktionary
William Whitaker's Words
WordNet
World Book Dictionary
Xrefer

CONDIZIONI DI USO DI QUESTO SITO
L'utente può utilizzare il nostro sito solo se comprende e accetta quanto segue:

Le risorse linguistiche gratuite presentate in questo sito si possono utilizzare esclusivamente per uso personale e non commerciale con tassativa esclusione di ogni condivisione comunque effettuata. Tutti i diritti sono riservati. La riproduzione anche parziale è vietata senza autorizzazione scritta.
Il nome del sito EnglishGratis è esclusivamente un marchio e un nome di dominio internet che fa riferimento alla disponibilità sul sito di un numero molto elevato di risorse gratuite e non implica dunque alcuna promessa di gratuità relativamente a prodotti e servizi nostri o di terze parti pubblicizzati a mezzo banner e link, o contrassegnati chiaramente come prodotti a pagamento (anche ma non solo con la menzione "Annuncio pubblicitario"), o comunque menzionati nelle pagine del sito ma non disponibili sulle pagine pubbliche, non protette da password, del sito stesso.
La pubblicità di terze parti è in questo momento affidata al servizio Google AdSense che sceglie secondo automatismi di carattere algoritmico gli annunci di terze parti che compariranno sul nostro sito e sui quali non abbiamo alcun modo di influire. Non siamo quindi responsabili del contenuto di questi annunci e delle eventuali affermazioni o promesse che in essi vengono fatte!
L'utente, inoltre, accetta di tenerci indenni da qualsiasi tipo di responsabilità per l'uso - ed eventuali conseguenze di esso - degli esercizi e delle informazioni linguistiche e grammaticali contenute sul siti. Le risposte grammaticali sono infatti improntate ad un criterio di praticità e pragmaticità più che ad una completezza ed esaustività che finirebbe per frastornare, per l'eccesso di informazione fornita, il nostro utente. La segnalazione di eventuali errori è gradita e darà luogo ad una immediata rettifica.

ENGLISHGRATIS.COM è un sito personale di
Roberto Casiraghi e Crystal Jones
email: robertocasiraghi at iol punto it

Roberto Casiraghi INFORMATIVA SULLA PRIVACY Crystal Jones

Siti amici: Lonweb • Daisy Stories • English4Life • Scuolitalia
Sito segnalato da INGLESE.IT

ENGLISH DICTIONARIES
This article is from:
http://en.wikipedia.org/wiki/Corpus_linguistics

All text is available under the terms of the GNU Free Documentation License: http://en.wikipedia.org/wiki/Wikipedia:Text_of_the_GNU_Free_Documentation_License

Corpus linguistics

From Wikipedia, the free encyclopedia

Corpus linguistics is the study of language as expressed in samples (corpora) or "real world" text. This method represents a digestive approach to deriving a set of abstract rules by which a natural language is governed or else relates to another language. Originally done by hand, corpora are largely derived by an automated process, which is corrected. The core of a corpus is the derivation of a set of Part-of-speech tags, representing a formal overview of the various types of words and word-relationships in a given language.

Computational methods had once been viewed as a holy grail of linguistic research, which would ultimately manifest a ruleset for natural language processing and machine translation at a high level. Such has not been the case, and since the cognitive revolution, cognitive linguistics has been largely critical of many claimed practical uses for corpora. However, as computation capacity and speed have increased, the use of corpora to study language and term relationships en masse has gained some respectability.

The corpus approach runs counter to Noam Chomsky's view that real language is riddled with performance-related errors, thus requiring careful analysis of small speech samples obtained in a highly controlled laboratory setting. Corpus linguistics does away with Chomsky's competence/performance split; adherents believe that reliable language analysis best occurs on field-collected samples, in natural contexts and with minimal experimental interference.^{[citation needed]}

History

A landmark in modern corpus linguistics was the publication by Henry Kucera and Nelson Francis of Computational Analysis of Present-Day American English in 1967, a work based on the analysis of the Brown Corpus, a carefully compiled selection of current American English, totalling about a million words drawn from a wide variety of sources. Kucera and Francis subjected it to a variety of computational analyses, from which they compiled a rich and variegated opus, combining elements of linguistics, language teaching, psychology, statistics, and sociology. A further key publication was Randolph Quirk's 'Towards a description of English Usage' (1960, Transactions of the Philological Society, 40-61) in which he introduced The Survey of English Usage.

Shortly thereafter Boston publisher Houghton-Mifflin approached Kucera to supply a million word, three-line citation base for its new American Heritage Dictionary, the first dictionary to be compiled using corpus linguistics. The AHD made the innovative step of combining prescriptive elements (how language should be used) with descriptive information (how it actually is used).

Other publishers followed suit. The British publisher Collins' COBUILD dictionaries, designed for users learning English as a foreign language, were compiled using the Bank of English.

The Brown Corpus has also spawned a number of similarly structured corpora: the LOB Corpus (1960s British English), Kolhapur (Indian English), Wellington (New Zealand English), ACE (Australian English), the Frown Corpus (early 1990s American English), and the FLOB Corpus (1990s British English). Other corpora represent many languages, varieties and modes, and include The British National Corpus, a 100 million word collection of a range of spoken and written texts, created in the 1990s by a consortium of publishers, universities (Oxford and Lancaster) and the British Library. There is a project underway to create an American National Corpus.

References

Journals

There are several international peer-reviewed journals dedicated to corpus linguistics, for example, Corpora, Corpus Linguistics and Linguistic Theory, ICAME Journal and the International Journal of Corpus Linguistics.

Book Series

Book series in this field include Language and Computers, Studies in Corpus Linguistics and English Corpus Linguistics

Other

Biber, Douglas, Susan Conrad, Randi Reppen Corpus Linguistics, Investigating Language Structure and Use, Cambridge: Cambridge UP, 1998. ISBN 0-521-49957-7

External links

Bookmarks for Corpus-based Linguists: very comprehensive site with categorized and annotated links to language corpora, software, references, etc.
Corpora discussion list
Manuel Barbera's overview site
Przemek Kaszubski's list of references
Corpus4u Community
McEnery and Wilson's Corpus Linguistics Page
Research and Development Unit for English Studies
The Centre for Corpus Linguistics at Birmingham University
Gateway to Corpus Linguistics on the Internet: an annotated guide to corpus resources on the web
Biomedical corpora
Linguistic Data Consortium, currently the premier distributor of corpora
Stefan Th. Gries's Corpus Linguistics with R list

Retrieved from "http://en.wikipedia.org/wiki/Corpus_linguistics"

Categories: Articles with unsourced statements since February 2007 | All articles with unsourced statements | Discourse analysis | Corpus linguistics