国际化本地化测试I18NandI10N.ppt
《国际化本地化测试I18NandI10N.ppt》由会员分享,可在线阅读,更多相关《国际化本地化测试I18NandI10N.ppt(84页珍藏版)》请在三一文库上搜索。
1、Internationalization&LocalizationCopyright 2000,SAS Institute Inc.All rights reserved.Jane Xue,GTD2,China R&D,APRD June 23,2005Objectives of the PresentationWhatThisPresentationISItprovidesdefinitionsofterminology.ItgivesbackgroundinformationaboutinternationalizationandlocalizationItidentifiesthehig
2、h-riskareasforNLStesting.ItgivesI18NandI10Nchecklist.WhatThisPresentationIsNOTItisnotanin-depthdiscussionofallpossibleaspects.Itdoesnotprovidesolutionstoveryspecificproblems.Questions about internationalization and localizationQ1:IonlyprovideEnglishsoftware,soIneednotdointernationalization?A1:No,eve
3、nifyoudidntprovidelocalizedsoftware,youstillneedtointernationalizeyoursoftwaretoguaranteeitrunsuccessfullyonNon-EnglishOS.Q2:Localizationissimple,youonlyneedtotranslatetheUItolocallanguage?A2:No,Translationisonlypartoflocalization.Q3:LocalizationtestingonlytotesttheUItranslationiscorrectornot?A3:No,
4、wedoallunitetestaswellassystemtesttoguaranteeallfunctionworkscorrectlyinDBCSOS.Part I:Basic conceptsInternationalization&LocalizationLocale&EncodingCharactersSets&FontTranscoding&TranslateTableBackgroundofUnicodeNLS TerminologyInternationalization(I18N):Internationalizationabbreviatedasi18n,becauset
5、hereare18lettersbetweentheiandthen.I18Nistheprocessofdesigninganapplicationsothatthefeaturedesignandcodedesigndonotmakeassumptionsbasedonasinglelanguageorlocale.Internationalizationalsoassumesthatthesourcecodebasesimplifiesthecreationofdifferentlanguageeditionsofaprogram.Onegoalofinternationalizatio
6、nistoensurethatinternationalconventions(includingrulesforsortingstringsandforformattingdates,times,numbers,andcurrencies)aresupported.Anothergoalistodesigntheproductinsuchawaythatuserswillhaveaconsistentlook,feel,andfunctionalityacrossdifferentlanguageeditionsofaproduct.NLS Terminology(Cont)Localiza
7、tion(L10n):Localizationabbreviatedasl10n,becausethereare10lettersbetweenthelandthen.Localizationistheprocessofadaptingsoftwareforaparticulargeographicalregion(orlocale).Translationoftheuserinterface,systemmessagesanddocumentationisalargepartlocalization,butnotallofit.NLS Terminology(Cont)Locale:Loca
8、leismorethanlanguage.Alocalereflectsthelocalconventions,language,andcultureforaparticulargeographicalregion.Forexample,PortugueseisspokeninBrazilaswellasinPortugal,buttheculturesaredifferent.Therefore,theyareconsideredtwodifferentlocales.Additionally,acountrymayhavemorethanoneofficiallanguage.Forexa
9、mple,Canadahastwoofficiallanguages:EnglishandFrench.Thus,Canadahastwodistinctlocales:Canadian-EnglishandCanadian-French.Somelocale-dependentcategoriesincludetheformattingofdatesandthedisplayformatformonetaryvalues.NLS Terminology(Cont)CharacterSet:Therepertoireofcharactersandsymbolsthatareusedbyalan
10、guageorgroupoflanguages.EverycharactersetincludesNationalcharacters(acharacterspecifictoaparticularnationorgroupofnations)PunctuationmarksEnglishalphabetNumbersControlcharacters(neededbythecomputer)NLS Terminology(Cont)Encoding:Anencodingmapseachcharacterinacharactersettoauniquenumericrepresentation
11、Aparticularcharactercanhavedifferentnumericrepresentationsindifferentencodings.Forexampletheisrepresentedas91intheDanishEBCDICencodingandas197intheWindowsLatin1encoding.NLS Terminology(Cont)Single-bytecharacterset(SBCS):Acharacterencodingwhereeachcharacterisrepresentedbyonebyte.Single-bytecharacter
12、setsarelimitedto256characters.GroupsoflanguagesarerepresentedinaSBCScharacterset.ExamplesofanSBCScharactersetaretheASCIILatin1andLatin2charactersets,whichrepresentthecharactersofWesternandCentralEurope,respectively.Double-bytecharacterset(DBCS):AtSAS,werefertotheAsiancharactersets(Japanese,Korean,Si
13、mplifiedChinese,andTraditionalChinese)asdouble-bytecharactersets(DBCS).DBCSisamisnomersinceindividualcharacterscantakeoneortwobytestorepresent.Amoreappropriatetermismulti-bytecharacterset(MBCS),whichisusedasasynonymforDBCS.IttakesaspecialbuildoftheSASSystemtohandleDBCSuserdata.NLS Terminology(Cont)U
14、nicode:Amultilingualcharactersetwhichcanrepresentcharactersfrommorethanonelanguagegroup.JavalanguageuseUnicodeencodings.TheMVAsystemuseseitheranSBCSencodingoraDBCSencoding.UCS2:Afixed-length,2-byteformofUnicode.UTF16:Aformof2-byteUnicodethatmakesallowancesforsomecharacterstobe4byteswide.A4-bytechara
15、cterinUTF16iscalledasurrogatepair.UTF8:Avarying-lengthformofUnicodewhereU.S.charactersareallrepresentedin1byte.Othercharacterswilltakefrom2to4bytestorepresent.UCS4:Afixed-length,4-byteformofUnicode.NLS Terminology(Cont)Transcoding:Theprocessofmovingdatafromoneencodingtoanother.Forexample,theLatin1en
16、codingforthe$is24hex.TheDanishEBCDICencodingfor$is67hex.IfdataisproperlytranscodedfromASCIILatin1toDanishEBCDIC(CP1142),thevalue24isconvertedtoa67.TranscodingcantakeastringfromASCIItoEBCDICorfromoneDBCSencodingtoanother.Trantab(translatetable):Atableusedtotranscodedatafromoneencodingtoanother.SASlan
17、guageelementsthatcontrollocalevaluesandencodingpropertiesautomaticallyinvoketheappropriatetranslationtable.Translationtablesarespecifictotheoperatingenvironment.NLS Terminology(Cont)Font:Fontdefinesthegraphicalrepresentationofacharacter.Thereisa1:nrelationshipbetweenacharacteranditsrepresentationind
18、ifferentfonts.Fontisasetofprintableordisplayabletextcharactersinaspecificstyleandsize.Thetypedesignforasetoffontsisthetypefaceandvariationsofthisdesignformthetypefacefamily.Inpractice,fontandtypeface(printfont)areoftenusedwithoutmuchprecision,sometimesinterchangeably.Quote“Insteadofwastingyourtimeto
19、teachFrenchtoyourcomputers,youdbetterteachEnglishtoallyourpersonnel.Amanagerofauniversitycomputingcentre(inthestoneageofcomputing).I18n goalsOnegoalofinternationalizationistoensurethatinternationalconventions(includingrulesforsortingstringsandforformattingdates,times,numbers,andcurrencies)aresupport
20、ed.Anothergoalistodesigntheproductinsuchawaythatuserswillhaveaconsistentlook,feel,andfunctionalityacrossdifferentlanguageeditionsofaproduct.I18n challengesSoftwareneedstofunctionproperlyineveryglobalmarket.Applicationsneedtoconformlocalconventions.Dataneedtobeprocessedsuccessfullyinnativelanguagesan
21、denvironments.Themeaningofcharactersmustberetained,acrossenvironments,andregardlessoftheencodingused.Languages and EncodingsDifferentlanguagesusedifferentcharactersinwriting.Veryfewuseonlythe26charactersAthroughZoftheLatinalphabet.Acharacterisrepresentedinacomputerbyassociatingitwithanumber,orbinary
22、code.Thesenumbersareuniquewithinaset.Thewholesetofsuchmappingsformsa“characterset“or“encoding”.Languages and Encodings(Cont)Hence,allcharacter-baseddatathatisstored,transmittedorprocessedisinanencoding.Manyencodingshavebeendevelopedtoaddresstherequirementsofdifferentlanguagesandcomputingenvironments
23、Problems with data transferAlthoughitiseasytotransferdatawithinanenvironmentusingthesameencoding(characterset)Itremainsdifficulttotransferanddisplaydatawithinenvironmentsusingdifferentencodings(charactersets).Problems with data transfer(Cont)Encodingsystemsconflictwithoneanother.Thatis,twoencodings
24、canusedifferentpositionsforthesamecharacter,orusethesamecodepositionfortwodifferentcharacters.Yetwheneverdataispassedbetweendifferentencodingsorplatforms,thatdatarunstheriskofcorruption.Quote“Inhindsight,whatwehavedoneisinventedacomputerCommunicationsTowerofBabel”.EdwinHart,ASCIIandEBCDICCharacterSe
25、tandCodeIssuesinSystemsApplicationArchitecture,1989.UnicodeUnicodeischangingallthat!Unicodebasicallyprovidesauniquepositionforeverycharacter,nomatterwhattheplatform,nomatterwhattheprogram,nomatterwhatthelanguage.UnicodeUnicodeisa16-bitencodingthatcanrepresentmorethan65,000characters.Itsolvestwomajor
- 配套讲稿:
如PPT文件的首页显示word图标,表示该PPT已包含配套word讲稿。双击word图标可打开word文档。
- 特殊限制:
部分文档作品中含有的国旗、国徽等图片,仅作为作品整体效果示例展示,禁止商用。设计者仅对作品中独创性部分享有著作权。
- 关 键 词:
- 国际化 本地化 测试 I18NandI10N
