untitled
<OAI-PMH schemaLocation=http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd> <responseDate>2018-01-15T15:42:55Z</responseDate> <request identifier=oai:HAL:hal-00328998v1 verb=GetRecord metadataPrefix=oai_dc>http://api.archives-ouvertes.fr/oai/hal/</request> <GetRecord> <record> <header> <identifier>oai:HAL:hal-00328998v1</identifier> <datestamp>2018-01-11</datestamp> <setSpec>type:COMM</setSpec> <setSpec>subject:info</setSpec> <setSpec>collection:CNRS</setSpec> <setSpec>collection:ENST</setSpec> <setSpec>collection:INSTITUT-TELECOM</setSpec> <setSpec>collection:TELECOM-PARISTECH</setSpec> <setSpec>collection:PARISTECH</setSpec> <setSpec>collection:UNIV-AG</setSpec> <setSpec>collection:BNRMI</setSpec> </header> <metadata><dc> <publisher>HAL CCSD</publisher> <title lang=en>Proper Names Extraction from Fax Images Combining Textual and Image Features</title> <creator>Likforman-Sulem, Laurence</creator> <creator>Vaillant, Pascal</creator> <creator>Yvon, François</creator> <contributor>Laboratoire Traitement et Communication de l'Information (LTCI) ; Télécom ParisTech - Institut Mines-Télécom [Paris] - Centre National de la Recherche Scientifique (CNRS)</contributor> <contributor>Groupe de Recherche en Informatique et Mathématiques Appliquées Antilles-Guyane (GRIMAAG) ; Université des Antilles et de la Guyane (UAG)</contributor> <description>5 pages, 2 figures, composed with Microsoft Word. Proceedings of the Seventh International Conference on Document Analysis and Recognition (ICDAR 2003). Edinburgh, Écosse, 3-6 août 2003. Également disponible à l'URL : http://www.cse.salford.ac.uk/prima/ICDAR2003/Papers/0100_563_likforman_l.pdf</description> <description>International audience</description> <source>Proceedings of the Seventh International Conference on Document Analysis and Recognition (ICDAR 2003)</source> <source>ICDAR 2003 : Seventh International Conference on Document Analysis and Recognition</source> <coverage>Edinburgh, United Kingdom</coverage> <contributor>Apostolos Antonacopoulos</contributor> <publisher>IEEE Computer Society</publisher> <identifier>hal-00328998</identifier> <identifier>https://hal.archives-ouvertes.fr/hal-00328998</identifier> <source>https://hal.archives-ouvertes.fr/hal-00328998</source> <source>Apostolos Antonacopoulos. ICDAR 2003 : Seventh International Conference on Document Analysis and Recognition, Aug 2003, Edinburgh, United Kingdom. IEEE Computer Society, 1, ISBN 0-7695-1960-1, p. 545-549, 2003, 〈10.1109/ICDAR.2003.1227724〉</source> <identifier>DOI : 10.1109/ICDAR.2003.1227724</identifier> <relation>info:eu-repo/semantics/altIdentifier/doi/10.1109/ICDAR.2003.1227724</relation> <language>en</language> <subject lang=en>document analysis</subject> <subject lang=en>pattern recognition</subject> <subject lang=en>named entity</subject> <subject lang=en>multimodality</subject> <subject lang=en>text image</subject> <subject>ACM H.3.1; I.5.4</subject> <subject>[INFO.INFO-TT] Computer Science [cs]/Document and Text Processing</subject> <type>info:eu-repo/semantics/conferenceObject</type> <type>Conference papers</type> <description lang=en>In the frame of a Unified Messaging System, a crucial task of the system is to provide the user with key information on every message received, like keywords reflecting the object of the message, or the name of the sender. However, in the case of facsimiles, this information is not as easy to detect as in the case of e-mails, since no standard headers are defined. The aim of the present work is to identify and extract a specific information (the name of the sender) from a fax cover page. For this purpose, methods based on image document analysis (OCR recognition, physical blocks selection), and text analysis methods (optimised dictionary lookup, local grammar rules), are implemented to work in parallel. The fusion of their results brings a more accurate guess than any of the methods would achieve separately.</description> <date>2003-08</date> </dc> </metadata> </record> </GetRecord> </OAI-PMH>