语料库研究方法概述课件

上传人:202****8-1 文档编号:241551844 上传时间:2024-07-03 格式:PPT 页数:47 大小:336.79KB
返回 下载 相关 举报
语料库研究方法概述课件_第1页
第1页 / 共47页
语料库研究方法概述课件_第2页
第2页 / 共47页
语料库研究方法概述课件_第3页
第3页 / 共47页
点击查看更多>>
资源描述
2012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述选题、设计与方法Put it altogether李文中李文中中国外语教育研究中心中国外语教育研究中心2012选题、设计与方法Put it altogether李文中12012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述语料库不是人学的,正则表达式不是女人学的。语料库不是人学的,22012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述 Corpus-driven is basically corpus based.32012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述Any corpus-based research is necessarily driven by corpus data.Any corpus-based research is n42012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述目标:通过语料库分析和研究:验证假设、直觉获得新发现建立新的假设构建新的理论验证已有的发现解决难题目标:通过语料库分析和研究:52012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述创新:新创新:新62012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述基于语料库方法是一种验证程序语料库驱动方法是一种发现程序语料库研究方法概述课件72012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述理据:任何感知都是推断Any perception is but inferencing.理据:任何感知都是推断82012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述world ofrealityworld of textEinstein GulfUnbridgeableworld ofworld of Einstein Gulf92012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述眼耳鼻舌身意色声香味触法学问思辨行文本眼 耳 鼻 舌 身 意色 声 香 味 触 102012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述1.基本步骤:2.确定题目3.提出问题4.确定总体和样本5.选择工具6.处理数据7.描述结果:分类、总结特征(description)8.解释结果:观察、描述、解释(explanation)9.解读结果:意义、价值、应用(interpretation)基本步骤:11Identifying a problemSome thing or phenomenon:out of expectationIncongruentNeed a solutionpuzzlingIdentifying a problemSome thin12Reading to be better informedWhat has been done as contributionWhat has been left undoneWhat has been done wrongNever count someone elses money.Reading to be better informedW13Formulating research questionsNaming:what isClassificatory:How are they interrelated(patterned)?Explanatory:to what extent do they co-occur?Predictive:What will happen if?Never ask a question to which you already know the answer;never ask how to questionFormulating research questions14Finding a methodPopulation SampleSamplingFinding a methodPopulation 15P(population)S(Sample)R(Result)I(Interpretation)SamplingvalidityreliabilityValidity Generalizability PSRISamplingreliabilityValidit16IFP SS RR ITHENI PIF172012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述Descriptive researchsingle texttext vs.textpeople vs.textDescriptive research182012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述Research questions1.How many different word forms are used in the text?How many running words are used?What is their distribution?2.To what extent can the level of difficulty of the text be computed on the basis of the graded wordlists?3.How many different word classes are used?What is the number of each word class?Research questions192012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述MethodTo answer RQ 1,generate a wordlist of the given text and observe:The number of typesThe number of tokensthe type/token ratio(TTR)If the text is very large,standardize the TTR the types and their frequency cumulative percentageMethod202012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述To answer RQ 2,compute the wordlist against a batch of graded wordlists,and observe:How many types on Level 1,2,and 3 lists are used in the text?And what is their percentage?What about their tokens?How many types that are not on any list are used in the text?Summarize their features.To answer RQ 2,compute the wo212012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述To answer RQ 3,retrieve each word class from the POS tagged text,and sort them on frequency in decreasing orderRetrieve all the nouns,verbs,and adjectivesSort the listTo answer RQ 3,retrieve each 222012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述Instruments Use Antconc 3.0 to generate the wordlist;Use Range to compare and contrast the wordlist against a batch of graded wordlists;Use PowerGrep to retrieve the word class from the POS tagged text;Instruments 232012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述Explanatory researchinterrelationship between wordsIR between phraseologiesIR between genresExplanatory research242012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述Research on relationship:shapedirectionstrengthResearch on relationship:252012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述Research questionsWhat are the words that are unique to the text in terms of its subject matter?To what extent are these words related to the subject/topic of the text?What patterns of relationships exist among the key words?Research questions262012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述MethodCompare&contrast the wordlist(of the observed text or corpus)against the wordlist of the reference text or corpus(larger);Observe and group the words within a classification framework;Method272012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述InstrumentsAntconc 3.0Other applicationsLiterary analysisAutomatic summarizationInstruments282012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述Research on word usesObjectives:Observe the collocates of a word;Study its patterns of uses;Study its meanings associated with its patterns of uses;Study the semantic prosody of its meaningResearch on word uses292012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述Research questions1.What words collocate with the Search Word?What is the strength of the collocability?2.What is the pattern of the SW?And what is its semantic preference?3.What is the semantic prosody of the pattern?Research questions302012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述MethodSearch the word(KW,SW,or Node Word)as KWIC;Observe its collocates and their word classes;Observe the meaning that is associated with the pattern;Observe its semantic prosody;Method312012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述InstrumentsAntconc 3.0ConcordanceSort:Level 1,Level 2,Level 3Frequency countCollocatesSort Sort POS tagsInstruments322012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述Research on chunksObjectives:To retrieve the multiword sequences;To examine the internal structure of such sequences;To obtain the sequences unique to a specific text;Research on chunks332012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述Research questionsWhat multiword sequences(in terms of n-gram)are found in the given text?How are these sequences structured in terms of lexical grammatical pattern?How is the message conveyed associated with the overall structure of the sequences?Research questions342012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述MethodSegment the text and generate a batch of lists of multiword sequences(of various lengths);Observe the structure of the retrieved n-grams and examine their regularities;Study the semantic and pragmatic features;Method352012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述InstrumentsKfngramWordsmith Tools v3.0PowerGrep 3.5Instruments362012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述Research on parallel textsObjectives:To observe how the source text was translated into the target text;To observe the probability of the translation units and corresponding units found in the text;To study the dynamics of the translation of a given community;Research on parallel texts372012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述Research questionsWhat translation units can be found in the parallel texts?What corresponding units can be found in the texts?What are the distributional features of the corresponding units of the identical genre;Research questions382012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述MethodObserve the parallel texts(or construct the corpora);Retrieve the units focused;Examine the context of such units;Observe their patterns;Method392012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述InstrumentsParallel textsParallel concordancerCorresponding units databaseInstruments402012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述Contrastive StudiesObjectives:to do genre analysisto determine the difficulty level of a textto observe how a text is biased against its readers in terms of event presentation and evaluationto compare a batch of wordlists with the graded wordlistsContrastive Studies412012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述Research questions:What genre features can be identified for a particular genre text,in terms of lexis,patterns,and textual organization?How difficult is a text?Is it a proper input text for the students at their present competence?Research questions:422012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述What contrastive features are displayed in the point of view,tone,judgement of the text?How do these features affect the readers?How to determine the vocabulary size or lexical density of a text(a batch of texts)?What contrastive features are 432012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述Instruments:Keyword analysis;concordance analysisWordlist comparison&contrastCritical discourse analysisInstruments:442012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述Other corpus-driven,corpus-based,or corpus-informed researchUse of corpora in language teachingContrastive Interlanguage AnalysisMultiword sequencesParallel textsGenre®ister analysisDiscourse analysisLiterary analysisCognitive linguistic studiesOther corpus-driven,corpus-ba452012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述Where the corpus is strong:Synchronic and diachronicSyntagmatic and paradigmaticQualitative and quantitativeEmpirical and intuitiveProbability&CentralityWhere the corpus is weak:Mental processesPsychological realityPossibility absenceWhere the corpus is strong:462012 语料库与外语研究研修班语料库研究方法概述语料库研究方法概述语料库研究方法概述课件47
展开阅读全文
相关资源
相关搜索

最新文档


当前位置:首页 > 办公文档 > 教学培训


copyright@ 2023-2025  zhuangpeitu.com 装配图网版权所有   联系电话:18123376007

备案号:ICP2024067431-1 川公网安备51140202000466号


本站为文档C2C交易模式,即用户上传的文档直接被用户下载,本站只是中间服务平台,本站所有文档下载所得的收益归上传人(含作者)所有。装配图网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。若文档所含内容侵犯了您的版权或隐私,请立即通知装配图网,我们立即给予删除!