Introduction to Bioinformatics and Internet Resources
蔡少正
Institute of Physiology, College of Medicine, National Cheng Kung University
Bioinformatics Center, National Cheng Kung University
NCKU BINFO 2004

Definition
Bioinformatics uses computer-assisted data management systems to collect, integrate, and analyze large volumes of biological sequences or information from genomic, proteomic, or drug-screening databases.

The development of bioinformatics
Sequence information in the GenBank database is growing rapidly.
[Figure: GenBank size, # bp in billions (axis ticks 1, 2, 3), 1982-2000]

Bioinformatics and the Human Genome Project
- Sequence data storage: (6*10^6 sequence files / one genome) * 6 * 200 KB / sequence file
- Sequence information organization: BLAST, FASTA
- Sequence information analysis: assembly, repetitive sequence
- Sequence functional annotation: genome annotation, molecular modeling

The development of bioinformatics
- A new science that spans molecular biology, information engineering, statistics, and other disciplines
- Computational biology: the science of processing biological (sequence) data
- A science that holds unlimited treasure: sequence acquisition, structure analysis, function prediction, simulation and validation

The development of bioinformatics
The post-genome era: the age of functional genomics has arrived.
Next-stage goals once the genes have been fully decoded: assemble a chromosome map with no gaps and raise sequence accuracy to 99.99%.
- Gene annotation
- Gene prediction
- Protein function research (proteomics)
- Single nucleotide polymorphisms (SNPs)

Functional Genomics
Identify all genes and understand their functions.
[Figure: Different technologies and resources into a proteomic process]

Types of databases: by nature
- LITERATURE DATABASES
- NUCLEOTIDE DATABASES
- PROTEIN DATABASES
- ENZYME DATABASES
- OTHER TYPE DATABASES

Types of databases: by function
Primary databases
- Examples: GenBank, EST database
- Content: preliminary or uncurated sequence information
- Practical value: relatively low
Value-added databases
- Examples: repetitive sequence databases, signal transduction factor databases (SMART database)
- Content: only the useful part of the information is extracted, then classified and stored
- Practical value: high added value
Alignment, computation, and curation turn primary data into value-added data.
SMART: Simple Modular Architecture Research Tool

NCBI

Internet resources vs. packaged software
Pros and cons of Internet resources
Advantages:
- Databases are updated frequently
- Interfaces are easier to learn and simple to operate
Disadvantages:
- Functions are usually specialized
- Converting between data formats is difficult
- Default parameter values cannot be adjusted
- Output is hard to interpret

Internet resources vs. packaged software
Pros and cons of packaged software
Advantages:
- Default parameter values are easy to adjust
- Analysis functions are complete and diverse
- Data formats are unified, so no conversion is needed between programs
- Results can be output as graphics files
Disadvantages:
- Databases are updated less frequently
- Commands are numerous and hard to learn

Biological resources on the Internet
Search Engine

Biological resources on the Internet
Databases
Literature
- PubMed, Hint, SeqAnalRef, SRS
Sequence
- DNA: GenBank/EMBL/DDBJ, UniGene, GDB
- Protein: PDB, PIR, PROSITE, SWISS-PROT
Structure
- BioMagResBank, SCOP, MMDB

Biological resources on the Internet
Software (DNA annotation)
Nucleic Acid Conformation
- DNA, RNA secondary structure
Translation
- start and stop codon, codon usage table
ORF Finder
- promoter, 5' and 3' UTR, intron and exon
Gene Function Prediction
- motif and pattern search

Biological resources on the Internet
Software (Protein annotation)
- Identification and characterization
- DNA-Protein
- Similarity searches
- Pattern and profile searches
- Post-translational modification prediction
- Primary structure analysis
- Secondary structure prediction
- Tertiary structure
- Transmembrane regions detection
- Alignment

Biological resources on the Internet
Courses
- Institute of Biochemistry, Yang-Ming University
- Department of Life Science, Tsing Hua University
- Academia Sinica Life Science Library
- National Health Research Institutes bioinformatics courses
- College of Medicine, National Cheng Kung University

Bioinformatics Package Tools
GCG
- The Wisconsin Package Accelrys Biocomputational Research
GenoMax v3.3, InforMax
LSI(TM), Lion Bioscience

Bioinformatics Package Tools
LaserGene
- DNAStar, Inc.
MacVector
- Oxford Molecular Group, Inc.
DiscoveryStudio Gene
- Oxford Molecular Group, Inc.
Vector NTI
- Informax, Inc.

Literature
Bibliography and Reference Databases
- Medline
- MIM (Mendelian Inheritance in Man)
- Taxonomy
- Genetic code

MIM (Mendelian Inheritance in Man)
Browsing OMIM

Nucleotide sequence databases
EMBL/GenBank/DDBJ
- Contain every individually submitted primary sequence
REFSEQ
- Provides non-redundant curated data representing knowledge of known genes
ENSEMBL
- Annotated genomic contig sequence

The International Nucleotide Sequence Database Collaboration (INSD)
The INSD consists of DDBJ (Japan), GenBank (USA) and the EMBL (UK) Nucleotide Sequence Database. The three databases exchange new and updated data on a daily basis to achieve optimal synchronisation.

GenBank

Protein sequence databases
Swiss-Prot
- A curated protein sequence database which strives to provide a high level of annotation.
SpTrEMBL
- A database supplementing the Swiss-Prot Protein Sequence Data Bank. TrEMBL contains the translations of all coding sequences (CDS) in EMBL.
PIR
- Identification and analysis of protein sequences and their corresponding coding sequences
REFSEQP
- Protein information from REFSEQ

SPTrEMBL
[Diagram: primary sequences can be analyzed further with analysis tools, giving a sequence database with analyzed annotation (special patterns found)]

Nucleotide related databases
Special pattern of nucleotide sequence
- REBASE
Genome structure
- CPGISLAND, ENSEMBLCPG, LOCUSLINK, MOUSE2HUMAN
Transcription factor binding site
- TESS: TFCLASS, TFCELL, TFFACTOR, TFMATRIX, TFSITE
Gene structure
- EPD, TFGENE, UTR, UTRSITE, EMBLALIGN
Integration of gene expression pattern
- UNIGENE, UNILIB

Locuslink

PDB

Two-dimensional electrophoresis database
2D Gel

Predicting human genes
- Using genes from other species
- Using the number of genes on a given chromosome
- Using sequences in EST databases
- Traditional methods

CGAP
http://cgap.nci.nih.gov/

Bioinformatics platforms completed so far:
- Human genome sequence search system (Human genome BLAST server)
- Sequence search systems for 34 microbial genomes
- Variant gene database (HGV Database)
- Single nucleotide polymorphism (SNP) value-added database, with a web interface for data entry and search
- Expressed sequence analysis system (EST analysis system)
- Southern region bioinformatics education course registration system
NCKU Bioinformatics Center
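
Example: scanning for open reading frames
The DNA annotation software listed earlier includes translation tools built around start and stop codons, and an ORF Finder. The slides do not describe how any of these tools work internally, so the following is only a minimal Python sketch of the general idea. It assumes the standard start codon ATG and the stop codons TAA, TAG, and TGA, and it scans the forward strand only. The function name find_orfs and the min_codons parameter are hypothetical names introduced for illustration.

```python
# Illustrative sketch only: find_orfs and min_codons are hypothetical names,
# not part of any tool named in the slides.
START = "ATG"
STOPS = {"TAA", "TAG", "TGA"}


def find_orfs(seq, min_codons=2):
    """Return (start, end, frame) tuples for ORFs on the forward strand.

    start is the 0-based index of the ATG, end is the index just past the
    stop codon, and frame is 0, 1, or 2. min_codons counts the codons from
    the ATG up to, but not including, the stop codon.
    """
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = None  # index of the currently open ATG, if any
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == START:
                start = i
            elif start is not None and codon in STOPS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3, frame))
                start = None  # close the ORF and look for the next ATG
    return orfs


if __name__ == "__main__":
    print(find_orfs("CCATGAAATGA"))
```

A real ORF finder would typically also scan the reverse complement and allow alternative genetic codes; both are left out here to keep the sketch short.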