<HTML>
<HEAD>
<TITLE>LREC 2000 - Paper 82 summary</title>
<SCRIPT LANGUAGE="JavaScript" TYPE="text/javascript">
<!--
// preload images:
 if(document.images)
  {
  hom_d= new Image(100,20);   hom_d.src="../eikones/hom_d.gif";
  pap_g=new Image(100,20);    pap_g.src="../eikones/pap_g.gif";  
  pap_d=new Image(100,20);    pap_d.src="../eikones/pap_d.gif";  
  pap_l=new Image(100,20);    pap_l.src="../eikones/pap_l.gif";  
  hom_l=new Image(100,20);    hom_l.src="../eikones/hom_l.gif";
  aut_d=new Image(100,20);     aut_d.src="../eikones/aut_d.gif";
  aut_l=new Image(100,20);    aut_l.src="../eikones/aut_l.gif";
  Key_d=new Image(100,20);    Key_d.src="../eikones/Key_d.gif";
  Key_l=new Image(100,20);   Key_l.src="../eikones/Key_l.gif";
  ses_d=new Image(100,20);    ses_d.src="../eikones/ses_d.gif";
  ses_l=new Image(100,20);   ses_l.src="../eikones/ses_l.gif";
  abs_l=new Image(100,20);   abs_l.src="../eikones/abs_l.gif";
  abs_d=new Image(100,20);    abs_d.src="../eikones/abs_d.gif";
  aut_l=new Image(100,20);    aut_l.src="../eikones/aut_l.gif";
}

function changimg(imgName,imgObjName)
 {
  if (document.images)
   {
   document.images[imgName].src=eval(imgObjName+".src");
   }
 }
//-->
</SCRIPT>

</HEAD>
<BODY marginwidth="0" marginheight="0" leftmargin="0" topmargin="0" rightmargin="0"  background="../eikones/fonto.jpg">
<TABLE align="center" border="0" width="100%" cellspacing="0" cellpadding="0" >
<TR>
<TD height="50" valign="center" colspan="7" bgcolor="#003163"><font face="Arial" size="4" color="#ffffff"><b>LREC 2000</b> 2<sup>nd</sup>
      International Conference on Language Resources &amp; Evaluation</font></TD>
</TR>
 <tr bgcolor="#003162">
 <td width="100" valign="center"><A href="../../default.htm" onmouseout="changimg('home','hom_d')" onmouseover="changimg('home','hom_l')"><IMG border="0" height="20" name="home" src="../eikones/hom_d.gif" width="100"></A></td>
 <TD width="100"><A href="../session.htm" onmouseout="changimg('sessions','ses_d')" onmouseover="changimg('sessions','ses_l')"><IMG border="0" height="20" name="sessions" src="../eikones/ses_d.gif" width="100"></A></TD>
 <TD width="100"><A href="../paper.htm" onmouseout="changimg('papers','pap_d')" onmouseover="changimg('papers','pap_l')"><IMG border="0" height="20" name="papers" src="../eikones/pap_d.gif" width="100"></a></TD>
 <TD width="100"><A href="../abstract.htm" onmouseout="changimg('abstracts','abs_d')" onmouseover="changimg('abstracts','abs_l')"><IMG border="0" height="20"  name="abstracts" src="../eikones/abs_d.gif" width="100"></A></TD>
 <TD width="100"><A href="../author.htm" onmouseout="changimg('authors','aut_d')" onmouseover="changimg('authors','aut_l')"><IMG border="0" height="20"  name="authors" src="../eikones/aut_d.gif" width="100"></a></TD>
 <TD width="100"><A href="../keyword.htm" onmouseout="changimg('keywords','Key_d')" onmouseover="changimg('keywords','Key_l')"><IMG border="0" height="20" name="keywords" src="../eikones/Key_d.gif" width="100"></A></TD>
<td width="1000">&nbsp;</td>
 </tr>
 </TABLE>
<BLOCKQUOTE style="MARGIN-RIGHT: 0px">
  <P><A href="81.htm">Previous Paper</A>&nbsp;&nbsp; <A href="84.htm">Next Paper</A></P></BLOCKQUOTE>
  <center>
<TABLE width="95%" Align="center" Border="1" bordercolor="#669999" cellspacing="1">
    <tr>
      <td width="15%" height="40"><b>Title</b></font></td>
      <td width="85%" height="40"><font color="#990033" size="4">Shallow Parsing and Functional Structure in Italian Corpora</font></td>
    </tr>
    <tr>
      <td height="40"><b>Authors</b></td>
      <td height="40"><font color="#006600">Delmonte Rodolfo</font> (Ca' Garzoni-Moro, San Marco 3417, Università ''Ca Foscari'', 30124 - VENEZIA, Tel. 39-41-2578464/52/19, E-mail: delmont@unive.it, Website: http//byron.cgm.unive.it)</td>
    </tr>
    <tr>
      <td height="40"><b>Keywords</b></td>
      <td height="40">&nbsp;</td>
    </tr>
      <tr>
      <td height="40"><b>Session</b></td>
      <td height="40">Session WO2 - Treebanks</td>
    </tr>
     <tr>
      <td height="40"><b>Full Paper</b></td>
            <td height="40"><a href="../../ps/82.ps" target="newps" type="application/postscript">82.ps</a>, <a href="../../pdf/82.pdf" target="newpdf" type="application/pdf">82.pdf</a></td>
    </tr>
      <tr>
      <td height="40"><b>Abstract</b></td>
             <td height="40">In this paper we argue in favour of an integration between  statistically and syntactically based parsing by presenting data from  a study of a 500,000 word corpus of Italian. Most papers present  approaches on tagging which are statistically based. None of the  statistically based analyses, however, produce an accuracy level  comparable to the one obtained by means of linguistic rules [1]. Of  course their data are strictly referred to English, with the  exception of [2, 3, 4]. As to Italian, we argue that purely  statistically based approaches are inefficient basically due to great  sparsity of tag distribution - 50% or less of unambiguous tags when  punctuation is subtracted from the total count. In addition, the  level of homography is also very high: readings per word are 1.7  compared to 1.07 computed for English by [2] with a similar tagset. The current work includes a syntactic shallow parser and a ATN-like  grammatical function assigner that automatically classifies  previously manually verified tagged corpora. In a preliminary  experiment we made with automatic tagger, we obtained 99,97% accuracy  in the training set and 99,03% in the test set using combined  approaches: data derived from statistical tagging is well below 95%  even when referred to the training set, and the same applies to  syntactic tagging. As to the shallow parser and GF-assigner we shall report on a first preliminary experiment on a manually verified  subset made of 10,000 words.</td>
    </tr>
  </table><br>
  </center>
</BODY>
</html>