ISO INTERNATIONAL STANDARD 24611 First edition 2012-11-01 Language resource management - Morpho-syntactic annotation framework (MAF) Gestion des ressources langagieres - Cadre d'annotation morphosyntaxique (MAF) Reference number ISO 24611:2012(E) @ISO 2012 by IHS under lic Not for Resale ISO 24611:2012(E) COPYRIGHTPROTECTEDDOCUMENT @ISO2012 All rights reserved. Unless otherwise specified, no part of this publication may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying and microfilm, without permission in writing from either isO at the address below or IsO's memberbody in the country of the requester. ISO copyright office Case postale 56. CH-1211 Geneva 20 Tel. + 4122749 01 11 Fax + 41 22 749 09 47 E-mail
[email protected] Web www.iso.org Published in Switzerland @ ISO 2012 - All rights reserved py IHS unde permitted without license from IHS Not for Resale ISO 24611:2012(E) Contents Page Foreword Introduction. 1 Scope 2 Normative references.. 3 Terms and definitions. 4 The MAF meta-model. 4.1 Overview... 4.2 MAF Meta-model 5 Segmenting with tokens .. 5.1 .6 5.2 Formaldescription:<token> 5.3 Embedding notation.... 5.4 Alternate representation for TEl based documents .8 5.5 9 5.6 Informative attributes.. 5.7 Completing the inline token notation ... 10 5.7.1 5.7.2 Overlapping tokens .. 11 6 Word-forms as linguistic units... 11 6.1 Formal description: <wordForm> 12 6.2 Token attachment....... 12 6.2.1 One token; one word-form ... 6.2.2 Several contiguous tokens; one word-form ... 12 6.2.3 Several discontinuous tokens; one word-form.... 13 6.2.4 Zero token,; one word.form.... 13 6.2.5 One token; several word-forms ... 14 6.3 Referring to lexical entries ... 14 6.4 Compoundword-forms. 15 6.5 Identification of word.forms within a TEl.comp.liant document...... 7 Morpho-syntactic content...... 7.1 General.. ..18 7.2 Using feature structures. 18 7.3 Compact morpho-syntactic tags 18 7.4 FSRlibraries.. 7.5 Designing tagsets.. 20 7.6 Formal description: <tagset> 22 8 Handling ambiguities .. 22 8.1 Word-form content ambiguities. 8.2 LexicalAmbiguities... 23 8.3 Structural ambiguities... 23 8.3.1 Structural ambiguities with word-forms 23 8.3.2 Structural ambiguities with tokens..... 8.4 Simplified structuring variants .. .24 8.4.1 Non-ambiguous linear representation, 24 8.4.2 Mixed linear and lattice representation. .25 8.5 Expandingthesimplifiedvariants. 26 8.5.1 Separating tokens and word-forms. 26 8.5.2 Wrapping into local lattices... 26 Copyrght International OrganizaionStandardizalionghts reserved ili ted without license from IHS Not for Resale
ISO 24611 2012 Language resource management — Morpho-syntactic annotation framewo
文档预览
中文文档
68 页
50 下载
1000 浏览
0 评论
309 收藏
3.0分
温馨提示:本文档共68页,可预览 3 页,如浏览全部内容或当前文档出现乱码,可开通会员下载原始文档
本文档由 人生无常 于 2024-08-31 13:18:52上传分享