The edited volume Sequences in Language and Text is the first collection of original research in the area of the quantitative analysis of sequentially organized linguistic data.
Linguistic sequences are extremely useful textual structures in almost all areas of Language Technology. Character and word n-grams are by far the most successful features in text classification tasks such as authorship identification, text categorization, genre classification, sentiment analysis etc. Furthermore character linguistic sequences are the basis for linguistic modeling and subsequent applications such as speech recognition, language identification etc.
In addition to the above language technology oriented research, the present volume aims to give insight to the theoretical value of linguistic sequences. Sequences in texts can be produced by a number of different factors, either external to the linguistic system or by its own grammatical structure. This volume hosts contributions which will analyze linguistic sequences using quantitative methods under the synergetic theoretical framework that can explain their role in the linguistic system.