| Title | Classifying Standard Linguistic Processing Functionalities based on Fundamental Data Operation Types | 
  
  | Authors | Yoshihiko Hayashi and Chiharu Narawa | 
  
  | Abstract | It is often argued that a set of standard linguistic processing functionalities should be identified, with each of them given a formal specification. We would benefit from the formal specifications; for example, the semi-automated composition of a complex language processing workflow could be enabled in due time. This paper extracts a standard set of linguistic processing functionalities and tries to classify them formally. To do this, we first investigated prominent types of language Web services/linguistic processors by surveying a Web-based language service infrastructure and published NLP toolkits. We next induced a set of standard linguistic processing functionalities by carefully investigating each of the linguistic processor types. The standard linguistic processing functionalities were then characterized by the input/output data types, as well as the required data operation types, which were also derived from the investigation. As a result, we came up with an ontological depiction that classifies linguistic processors and linguistic processing functionalities with respect to the fundamental data operation types. We argue that such an ontological depiction can explicitly describe the functional aspects of a linguistic processing functionality. | 
  
  | Topics | LR Infrastructures and Architectures, Web Services, Other | 
  
  | Full paper  | Classifying Standard Linguistic Processing Functionalities based on Fundamental Data Operation Types | 
  
  | Bibtex | @InProceedings{HAYASHI12.863, author =  {Yoshihiko Hayashi and Chiharu Narawa},
 title =  {Classifying Standard Linguistic Processing Functionalities based on Fundamental Data Operation Types},
 booktitle =  {Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)},
 year =  {2012},
 month =  {may},
 date =  {23-25},
 address =  {Istanbul, Turkey},
 editor =  {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Mehmet Uğur Doğan and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis},
 publisher =  {European Language Resources Association (ELRA)},
 isbn =  {978-2-9517408-7-7},
 language =  {english}
 }
 |