首页 > Quantitative Evaluation of Machine Translation Systems Sentence Level 1 Universidade de Lis

Quantitative Evaluation of Machine Translation Systems Sentence Level 1 Universidade de Lis

发布时间：来源：文档文库

小中大

字号：

手机查看

QuantitativeEvaluationofMachineTranslationSystems:SentenceLevel
PalmiraMarrafa1andAntónioRibeiro2
UniversidadedeLisboaFaculdadedeLetras
GroupofLexicalandGrammaticalKnowledge
Computation(CLULAvenida5deOutubro,85–5ºP–1050–050Lisboa,PortugalPalmira.Marrafa@netcabo.pt
Abstract
Thispaperreportsthefirstresultsofanon-goingresearchonevaluationofMachineTranslationquality.ThestartingpointforthisworkwastheframeworkofISLE(theInternationalStandardsforLanguageEngineering,whichprovidesaclassificationforevaluationofMachineTranslation.Inordertomakeaquantitativeevaluationoftranslationquality,wepursueamoreconsistent,fine-grainedandcomprehensiveclassificationofpossibletranslationerrorsandweproposemetricsforsentencelevelerrors,specificallylexicalandsyntacticerrors.
MachineTranslationevaluation,translationqualitymetrics

1
UniversidadeNovadeLisboaFaculdadedeCiênciaseTecnologiaDepartamentodeInformática
QuintadaTorreMontedaCaparica
P–2829–516Caparica,Portugal
ambar@di.fct.unl.pt
2
Keywords
Introduction
MuchworkhasbeendoneonevaluationofMachineTranslationinthelasttenyears(see,forexample,Balkan,1991;Arnoldetal.,1993;Vasconcellos,1994;Whiteetal.,1994;EAGLES,1996;WhiteandO’Connell,1996;White,forthcoming.AcommongoalhasbeenthedesignofevaluationtechniquesinordertoreachamoreobjectiveevaluationofMachineTranslationqualitysystems.
However,theevaluationofMachineTranslationhasbeensubjectivetoagreatextent.ISLE(theInternationalStandardsforLanguageEngineeringaimsatreducingsubjectivityinthisdomain.ItprovidesaclassificationofinternalandexternalcharacteristicsofMachineTranslationsystemstobeevaluatedinconformitywiththeISO/IEC9126standard(ISO1991,whichconcernsqualitycharacteristicsofsoftwareproducts.Itassumestheneedofaquantitativeevaluationleadingtodefinitionofmetrics.
However,thatclassificationisnotfine-grainedenoughtoevaluatethequalityofmachinetranslatedtextsregardingthepossibletypesoftranslationerrors.Thus,inthiswork,weproposeamoreconsistent,fine-grainedandcomprehensiveclassificationattheindividualsentencelevel.Ourclassificationtakesintoaccounttheinternalstructureoflexicalunitsandsyntacticconstituents.Moreover,weproposemetricstomakeanobjectivequantitativeevaluation.Thesemetricsarebasedonthenumberoferrorsfoundandthetotalnumberofpossibleerrors.Thestructuralcomplexityofthepossibleerrorsisalsoconsideredinthemetrics.
WeselectedsomepertinentcharacteristicsfromtheISLEclassificationtomeasurethequalityofsentenceleveltranslations,concerninglexicalandsyntacticerrors,includingcollocations,fixedandsemi-fixedexpressionsforlexicalevaluation.Asforsyntacticerrors,webuiltatypologyoferrors.
OurmethodologywasmotivatedbyEnglish,FrenchandPortugueseparalleltextsfromtheEuropeanParliamentsessionsandalsobytranslationsobtainedfromtwocommercialMachineTranslationsystems.
Inthenextsection,wepresentamotivationfortherefinementofthetaxonomywithsomeexamples.Afterthat,wesummarisetheclassificationanddefinethemetricsusedfortheevaluation.Inthefollowingsection,wediscusssomepreviouswork.Finally,wepresenttheconclusionsandthefuturework.
Motivation
ISO(theInternationalOrganisationforStandardisationandIEC(theInternationalElectrotechnicalCommissionaretheinstitutionswhichdevelopinternationalstandards.Asforevaluation,animportantstandardistheISO/IEC9126(ISO1991.Thisstandarddistinguishesbetweeninternalcharacteristicswhichpertaintotheinternalworkingsandstructureofthesoftwareandexternalcharacteristicswhicharethecharacteristicswhichcanbeobservedwhenthesystemisinoperation.
TheISLEClassificationFrameworkforEvaluationofMachineTranslation1providesaclassificationoftheinternalandtheexternalcharacteristicsofMachineTranslationsystemstobeevaluatedinconformitywiththeISO/IEC9126standard.
AimingtoanalyseMachineTranslationsystemsfromauser’spointofview,wefocussedontheexternalcharacteristics.WetooktheISLEclassificationasastartingpointforthisevaluation.
IdeallyanevaluationofaMachineTranslationsystemqualityshouldcoverallthedifferentparametersliabletobeconsideredinatranslation.However,thisisatoocomplextasktobedoneinthisearlystageofourwork.Thus,wedecidedtofocusonthesentencelevel.
1http://issco-www.unige.ch/staff/andrei/islemteval2/
mainclassification.html

Theevaluationofthisleveldealswithfunctionality,inparticularaccuracy,accordingtotheISLEclassification:

《Quantitative Evaluation of Machine Translation Systems Sentence Level 1 Universidade de Lis.doc》

将本文的Word文档下载到电脑，方便收藏和打印

推荐度：

点击下载文档

文档为doc格式

相

关

案

例

Quantitative Evaluation of Machine Translation Systems Sentence Level 1 Universidade de Lis

相关推荐

推荐内容