Additional file 8 BLAST results for Unigenes in Additional file 6 A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. RID: UPECPWN401N Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_10027 Length=492 Score E Sequences producing significant alignments: (Bits) Value ref|XP_002315491.1| predicted protein [Populus trichocarpa] >... 175 6e-52 emb|CBI22961.3| unnamed protein product [Vitis vinifera] 181 9e-52 ref|XP_003528629.1| PREDICTED: uncharacterized protein LOC100... 179 5e-51 ref|XP_003550543.1| PREDICTED: uncharacterized protein LOC100... 177 1e-50 ref|NP_851004.1| tetratricopeptide repeat domain-containing p... 177 2e-50 ALIGNMENTS >ref|XP_002315491.1| predicted protein [Populus trichocarpa] gb|EEF01662.1| predicted protein [Populus trichocarpa] Length=222 Score = 175 bits (443), Expect = 6e-52 Identities = 87/104 (84%), Positives = 94/104 (90%), Gaps = 0/104 (0%) Frame = -3 Query 490 DAIKWLSWAVVLLEKAGDADGTVEVLSSRASCYKEVGEYKKAVADCTKVLEQNGKNVAVL 311 DAIKWLSWAVVLLEK GD T+EVLS+RASCYKEVGEYKKAVADC+KVLE + NV+VL Sbjct 118 DAIKWLSWAVVLLEKTGDKASTMEVLSTRASCYKEVGEYKKAVADCSKVLEHDDANVSVL 177 Query 310 VQRALLYESMEKYKLGAEDLRTVMNIDPGNRVARSTVHRLTKMA 179 VQRALLYESMEKY+LGAEDLR V+ IDP NRVARSTVHRLTKMA Sbjct 178 VQRALLYESMEKYRLGAEDLRVVLKIDPANRVARSTVHRLTKMA 221 >emb|CBI22961.3| unnamed protein product [Vitis vinifera] Length=451 Score = 181 bits (458), Expect = 9e-52 Identities = 89/105 (85%), Positives = 98/105 (93%), Gaps = 0/105 (0%) Frame = -3 Query 490 DAIKWLSWAVVLLEKAGDADGTVEVLSSRASCYKEVGEYKKAVADCTKVLEQNGKNVAVL 311 DAIKWLSWAVVLLEKAGD GT+EVL+ RASCYKEVGEYKKAVADC+KVLE + KNV+VL Sbjct 347 DAIKWLSWAVVLLEKAGDDAGTMEVLTCRASCYKEVGEYKKAVADCSKVLEHDEKNVSVL 406 Query 310 
VQRALLYESMEKYKLGAEDLRTVMNIDPGNRVARSTVHRLTKMAS 176 VQRALLYES+EKYKLGAEDLRTV+ DPGNRVARST+HRLTKMA+ Sbjct 407 VQRALLYESIEKYKLGAEDLRTVLKFDPGNRVARSTIHRLTKMAA 451 >ref|XP_003528629.1| PREDICTED: uncharacterized protein LOC100799789 [Glycine max] Length=478 Score = 179 bits (454), Expect = 5e-51 Identities = 88/104 (85%), Positives = 96/104 (92%), Gaps = 0/104 (0%) Frame = -3 Query 490 DAIKWLSWAVVLLEKAGDADGTVEVLSSRASCYKEVGEYKKAVADCTKVLEQNGKNVAVL 311 DAIKWLSWAV+LL+KAGD+ TVEVLS RASCYKEVGEYKKAVADCTKVLE + NV+VL Sbjct 374 DAIKWLSWAVILLQKAGDSAATVEVLSCRASCYKEVGEYKKAVADCTKVLENDETNVSVL 433 Query 310 VQRALLYESMEKYKLGAEDLRTVMNIDPGNRVARSTVHRLTKMA 179 VQRALLYESMEKY+LGAEDLRTV+ IDPGNR+ARSTVHRL KMA Sbjct 434 VQRALLYESMEKYRLGAEDLRTVLKIDPGNRIARSTVHRLAKMA 477 >ref|XP_003550543.1| PREDICTED: uncharacterized protein LOC100800725 [Glycine max] Length=442 Score = 177 bits (450), Expect = 1e-50 Identities = 89/104 (86%), Positives = 95/104 (91%), Gaps = 0/104 (0%) Frame = -3 Query 490 DAIKWLSWAVVLLEKAGDADGTVEVLSSRASCYKEVGEYKKAVADCTKVLEQNGKNVAVL 311 DAIKWLSWAVVLLEKAGD+ T EVLSSRASCYKEVGEYKKAVADCTKVLE + NV+VL Sbjct 338 DAIKWLSWAVVLLEKAGDSATTGEVLSSRASCYKEVGEYKKAVADCTKVLENDETNVSVL 397 Query 310 VQRALLYESMEKYKLGAEDLRTVMNIDPGNRVARSTVHRLTKMA 179 VQRALLYESMEKY+LGAEDLRTV+ IDPGNR+AR TVHRL KMA Sbjct 398 VQRALLYESMEKYRLGAEDLRTVLKIDPGNRIARGTVHRLAKMA 441 >ref|NP_851004.1| tetratricopeptide repeat domain-containing protein [Arabidopsis thaliana] gb|AAS49048.1| At3g16760 [Arabidopsis thaliana] gb|AAT71974.1| At3g16760 [Arabidopsis thaliana] dbj|BAF00191.1| hypothetical protein [Arabidopsis thaliana] gb|AEE75862.1| tetratricopeptide repeat domain-containing protein [Arabidopsis thaliana] Length=475 Score = 177 bits (450), Expect = 2e-50 Identities = 85/104 (82%), Positives = 96/104 (92%), Gaps = 0/104 (0%) Frame = -3 Query 490 DAIKWLSWAVVLLEKAGDADGTVEVLSSRASCYKEVGEYKKAVADCTKVLEQNGKNVAVL 311 DAIKWLSWAV+L+++AGD G+ EVLS+RASCYKEVGEYKKAVADCTKVL+ + KNV +L Sbjct 371 
DAIKWLSWAVILMDRAGDEAGSAEVLSTRASCYKEVGEYKKAVADCTKVLDHDKKNVTIL 430 Query 310 VQRALLYESMEKYKLGAEDLRTVMNIDPGNRVARSTVHRLTKMA 179 VQRALLYESMEKYKLGAEDLR V+ IDPGNR+ARSTVHRLTKMA Sbjct 431 VQRALLYESMEKYKLGAEDLRMVLKIDPGNRIARSTVHRLTKMA 474 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 1,855,251,573 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 276337787 Number of extensions: 6330351 Number of successful extensions: 19991 Number of sequences better than 1e-10: 1 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 19864 Number of HSP's successfully gapped: 1 Length of query: 492 Length of database: 6150218869 Length adjustment: 125 Effective length of query: 367 Effective length of database: 3910333369 Effective search space: 152503001391 Effective search space used: 152503001391 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 171 (70.5 bits)
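Note on the statistics footer: the figures reported for the first query (TrVeIntMedtrGB1_10027) can be cross-checked arithmetically. The sketch below is not part of the BLAST output. It assumes the nucleotide query was searched as a translation (the alignments carry a Frame field), so the query length in residues is taken as the nucleotide length divided by three. The function name `effective_search_space` is hypothetical. Under that assumption the reported "Effective search space" is reproduced exactly, while the reported "Effective length of query: 367" equals 492 - 125 in nucleotide units. The last check shows that the value 1,855,251,573 printed as "Number of letters in database" equals 6,150,218,869 modulo 2^32, which is consistent with (but does not prove) a 32-bit wraparound in that summary field.

```python
def effective_search_space(query_nt_len, db_letters, db_seqs, length_adjustment):
    """Recompute the effective lengths and search space from the footer values.

    Assumption (not stated in the report): the query is translated, so its
    length in residues is query_nt_len // 3.
    """
    # Effective query length in residues after the length adjustment.
    eff_query = query_nt_len // 3 - length_adjustment
    # Each database sequence is shortened by the same length adjustment.
    eff_db = db_letters - db_seqs * length_adjustment
    return eff_query, eff_db, eff_query * eff_db


# Footer values of the first report: query length 492, adjustment 125.
eff_q, eff_db, space = effective_search_space(492, 6_150_218_869, 17_919_084, 125)
print(eff_q, eff_db, space)  # 39 3910333369 152503001391

# The database size reduced modulo 2**32.
wrapped = 6_150_218_869 % 2**32
print(wrapped)  # 1855251573
```

The same function reproduces the footer of the second report (query length 931, length adjustment 140), giving an effective database length of 3641547109 and a search space of 619063008530.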
A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. RID: UPE1P5Z101N Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_1010 Length=931 Score E Sequences producing significant alignments: (Bits) Value ref|NP_566697.1| cysteine-rich repeat secretory protein 38 [A... 177 2e-50 dbj|BAJ34612.1| unnamed protein product [Thellungiella haloph... 176 6e-50 ref|XP_002883339.1| hypothetical protein ARALYDRAFT_479720 [A... 175 2e-49 ref|XP_003597035.1| Cysteine-rich repeat secretory protein [M... 173 7e-49 ref|XP_002515716.1| DUF26 domain-containing protein 2 precurs... 169 2e-47 ALIGNMENTS >ref|NP_566697.1| cysteine-rich repeat secretory protein 38 [Arabidopsis thaliana] sp|Q9LRJ9.1|CRR38_ARATH RecName: Full=Cysteine-rich repeat secretory protein 38; Flags: Precursor dbj|BAB01391.1| unnamed protein product [Arabidopsis thaliana] gb|AAK59407.1| unknown protein [Arabidopsis thaliana] gb|ABD91491.1| At3g22060 [Arabidopsis thaliana] gb|AEE76585.1| cysteine-rich repeat secretory protein 38 [Arabidopsis thaliana] Length=252 Score = 177 bits (449), Expect = 2e-50 Identities = 100/229 (44%), Positives = 130/229 (57%), Gaps = 9/229 (4%) Frame = -3 Query 908 SQGSSLLNHVCTNTSQNYTKDTTYATNLKYLIDDLYVKTPQNYSFGQGSYGEGPNKVYGL 729 SQ ++ L H C++ ++T + Y +NL L L K P F S G PN V GL Sbjct 29 SQNNAFLFHKCSDIEGSFTSKSLYESNLNNLFSQLSYKVPST-GFAASSTGNTPNNVNGL 87 Query 728 AQCRGDVSPDDCDYCLYDARCKITQRCYKRSA-VVWYDNCQLKYSDVNFLGKIDKENDFV 552 A CRGD S DC CL A ++ QRC A +VWYDNC +KYS NF GKID EN F Sbjct 88 ALCRGDASSSDCRSCLETAIPELRQRCPNNKAGIVWYDNCLVKYSSTNFFGKIDFENRFY 147 Query 551 MINVKNVSRTEKEIFRDTSIGLLTDLSKQASDKANKYMFAGGEKLFDDKKNVTIHGMVLC 372 + NVKNVS + F + LLT+L+K+A+ + N+ +FA GEK K ++G+V C Sbjct 148 
LYNVKNVS--DPSTFNSQTKALLTELTKKATTRDNQKLFATGEKNIGKNK---LYGLVQC 202 Query 371 TQDLSYDDCKTCLDGVVKELPIYKGTIYSVGARVVGASCTVRYETYVFL 225 T+DL CK CL+G++ ELP G RVVG SC RYE Y F+ Sbjct 203 TRDLKSITCKACLNGIIGELP--NCCDGKEGGRVVGGSCNFRYEIYPFV 249 >dbj|BAJ34612.1| unnamed protein product [Thellungiella halophila] Length=255 Score = 176 bits (446), Expect = 6e-50 Identities = 97/229 (42%), Positives = 132/229 (58%), Gaps = 9/229 (4%) Frame = -3 Query 908 SQGSSLLNHVCTNTSQNYTKDTTYATNLKYLIDDLYVKTPQNYSFGQGSYGEGPNKVYGL 729 SQ ++ L H C++ N+T + Y +NL L + + P + F S G P+ V GL Sbjct 32 SQNNAFLYHKCSDIEGNFTSKSPYESNLDSLFRRISYRVPSS-GFAASSAGNSPDNVNGL 90 Query 728 AQCRGDVSPDDCDYCLYDARCKITQRCYKRSA-VVWYDNCQLKYSDVNFLGKIDKENDFV 552 A CRGD S DC CL A ++ QRC A ++WYDNC +KYS NF GKID EN F Sbjct 91 ALCRGDASSSDCGSCLATAIPELRQRCPNNKAGIIWYDNCLVKYSSTNFFGKIDYENRFY 150 Query 551 MINVKNVSRTEKEIFRDTSIGLLTDLSKQASDKANKYMFAGGEKLFDDKKNVTIHGMVLC 372 + NV NVS + F + LLT+L+++A+ N+ +FA GEK + KK ++G+V C Sbjct 151 LYNVNNVS--DPASFNTQTKALLTELTQKATTGDNQKLFATGEKNLEKKK---LYGLVQC 205 Query 371 TQDLSYDDCKTCLDGVVKELPIYKGTIYSVGARVVGASCTVRYETYVFL 225 T+DL + CK CLDG++ ELP G RVVG SC RYE Y F+ Sbjct 206 TRDLRRESCKACLDGIIGELP--NCCDGKEGGRVVGGSCNFRYEIYPFV 252 >ref|XP_002883339.1| hypothetical protein ARALYDRAFT_479720 [Arabidopsis lyrata subsp. lyrata] gb|EFH59598.1| hypothetical protein ARALYDRAFT_479720 [Arabidopsis lyrata subsp. 
lyrata] Length=252 Score = 175 bits (443), Expect = 2e-49 Identities = 100/230 (43%), Positives = 130/230 (57%), Gaps = 9/230 (4%) Frame = -3 Query 908 SQGSSLLNHVCTNTSQNYTKDTTYATNLKYLIDDLYVKTPQNYSFGQGSYGEGPNKVYGL 729 SQ ++ L H C++ ++T + Y +NL L L K P F S G P+ V GL Sbjct 29 SQNNAFLFHKCSDIEGSFTSKSPYESNLNNLFPQLSYKVPST-GFATSSAGITPDNVNGL 87 Query 728 AQCRGDVSPDDCDYCLYDARCKITQRCYKRSA-VVWYDNCQLKYSDVNFLGKIDKENDFV 552 A CRGD S DC CL A +I QRC A ++WYDNC +KYS NF GKID EN F Sbjct 88 ALCRGDASSSDCSSCLATAIPEIRQRCPSNKAGIIWYDNCLVKYSSTNFFGKIDFENRFY 147 Query 551 MINVKNVSRTEKEIFRDTSIGLLTDLSKQASDKANKYMFAGGEKLFDDKKNVTIHGMVLC 372 + NV NVS + F + LLT L+K+A+ N+ +FA GEK KK ++G+V C Sbjct 148 LYNVNNVS--DPSTFNTQTKALLTKLTKKATTGDNQKLFATGEKNIGMKK---LYGLVQC 202 Query 371 TQDLSYDDCKTCLDGVVKELPIYKGTIYSVGARVVGASCTVRYETYVFLN 222 T+DL + CK CL+G++ ELP G RVVG SC RYE Y F+N Sbjct 203 TRDLKSEACKACLNGIIGELP--NCCDGKEGGRVVGGSCNFRYEIYPFVN 250 >ref|XP_003597035.1| Cysteine-rich repeat secretory protein [Medicago truncatula] gb|AES67286.1| Cysteine-rich repeat secretory protein [Medicago truncatula] Length=246 Score = 173 bits (438), Expect = 7e-49 Identities = 104/240 (43%), Positives = 140/240 (58%), Gaps = 16/240 (7%) Frame = -3 Query 926 LIPTTISQGSSLLNHVCTNTSQNYTKDTTYATNLKYLIDDLYVKTPQNYSFGQGSYGEGP 747 LI TT+ G+ L H+C+ TS+N+T + Y +NLK LI+ L KTP FG GS Sbjct 19 LIQTTL--GTDPLFHICS-TSENFTAHSPYESNLKTLINSLIYKTPST-GFGSGSIDLTQ 74 Query 746 ---NKVYGLAQCRGDVSPDDCDYCLYDARCKITQRC-YKRSAVVWYDNCQLKYSDVNFLG 579 K YGLA CRGDVS +C C+ A +I C YK+ A++WYDNC KY D +F G Sbjct 75 YQNQKAYGLALCRGDVSTSECKTCVSQATKEILNVCPYKKGAIIWYDNCMFKYLDNDFFG 134 Query 578 KIDKENDFVMINVKNVSRTEKEIFRDTSIGLLTDLSKQASDKANKYMFAGGEKLFDDKKN 399 KID N F ++NV+NVS K F + + LL+ L+ +AS N ++A GE + + Sbjct 135 KIDNTNKFALLNVQNVSDPIK--FNNMTNDLLSFLANEAS--MNHKLYATGELKIGESER 190 Query 398 VTIHGMVLCTQDLSYDDCKTCLDGVVKELPIYKGTIYSVGARVVGASCTVRYETYVFLND 219 V +G+ CT+D+S DCK CLDG + ELP G RVVG SC +RYE Y F+ + Sbjct 191 V--YGLTQCTRDISSVDCKKCLDGAISELP--NCCDGKKGGRVVGGSCNIRYEIYPFVRE 246 
>ref|XP_002515716.1| DUF26 domain-containing protein 2 precursor, putative [Ricinus communis] gb|EEF46663.1| DUF26 domain-containing protein 2 precursor, putative [Ricinus communis] Length=240 Score = 169 bits (428), Expect = 2e-47 Identities = 97/237 (41%), Positives = 137/237 (58%), Gaps = 12/237 (5%) Frame = -3 Query 929 LLIPTTISQGSSLLNHVCTNTSQNYTKDTTYATNLKYLIDDLYVKTPQNYSFGQGSYGEG 750 LL+ T G L H C++ S+N+T + Y +NL L +LY + P+ FG GS G Sbjct 14 LLVQTVY--GDDPLFHFCSS-SENFTANGPYESNLNKLAGNLYFQVPKE-GFGFGSSGRD 69 Query 749 PNKVYGLAQCRGDVSPDDCDYCLYDARCKITQRC-YKRSAVVWYDNCQLKYSDVNFLGKI 573 P++ YGLA CRGDVS DC C+ +A +I +RC ++A++WYDNC KYSD + G+I Sbjct 70 PDQAYGLALCRGDVSSSDCKTCVVEASSEIRKRCPTNKAAIIWYDNCLYKYSDKKYFGQI 129 Query 572 DKENDFVMINVKNVSRTEKEIFRDTSIGLLTDLSKQASDKANKYMFAGGEKLFDDKKNVT 393 D N F M NV+ V+ + + F + LL++L+ QA + Y +KL K Sbjct 130 DNRNKFYMWNVRVVNDSAE--FNQKTKELLSELASQAYVTSKLYATGESDKLGKSNK--- 184 Query 392 IHGMVLCTQDLSYDDCKTCLDGVVKELPIYKGTIYSVGARVVGASCTVRYETYVFLN 222 ++G+V CT+DLS DC+ CLDG++ ELP G RVV SC RYE Y F+N Sbjct 185 LYGLVQCTRDLSSGDCRKCLDGIITELP--SCCDGKEGGRVVSGSCNFRYEIYPFVN 239 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 471020381 Number of extensions: 9864198 Number of successful extensions: 26830 Number of sequences better than 1e-10: 62 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 26544 Number of HSP's successfully gapped: 63 Length of query: 931 Length of database: 6150218869 Length adjustment: 140 Effective length of query: 791 Effective length of database: 3641547109 Effective search space: 619063008530 Effective search space used: 619063008530 T: 
12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 176 (72.4 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362

A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

RID: UPE1VWB5012

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
17,919,084 sequences; 6,150,218,869 total letters

Query= TrVeIntMedtrGB1_1041
Length=926

                                                                  Score   E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_002518985.1| conserved hypothetical protein [Ricinus c...  174     1e-46
ref|XP_003534287.1| PREDICTED: inactive poly [ADP-ribose] pol...  174     2e-46
emb|CAN61759.1| hypothetical protein VITISV_006105 [Vitis vin...  173     4e-46
ref|XP_003516978.1| PREDICTED: inactive poly [ADP-ribose] pol...  164     5e-43
ref|XP_002890975.1| hypothetical protein ARALYDRAFT_473413 [A...
164 5e-43 ALIGNMENTS >ref|XP_002518985.1| conserved hypothetical protein [Ricinus communis] gb|EEF43518.1| conserved hypothetical protein [Ricinus communis] Length=536 Score = 174 bits (440), Expect = 1e-46 Identities = 97/183 (53%), Positives = 120/183 (66%), Gaps = 5/183 (3%) Frame = +3 Query 24 SLSCFDVDENDTRHMVLCCVIMGNVEPLRCGSDQFLPNSEEFDSGVDRLENPNLYVVWNM 203 S+ DVDEN RH+V C VIMG +E ++ GS Q P+SE FDSGVD L+NP YVVWNM Sbjct 355 SVKFCDVDENGVRHIVFCRVIMGKMELVQPGSTQSHPSSENFDSGVDDLQNPGQYVVWNM 414 Query 204 NMNSHIYAECVVSFRATSVVEEP--VFGKEKNVGIPQFSSYGEPQAQALGESVEAKSPNS 377 NMN+HIY E +VSF+ + E +F + S G+ +Q S ++P S Sbjct 415 NMNTHIYPEFIVSFKVSLNAEGDMLLFFVNSQTRLESGGSLGKASSQG---SSNTRTPKS 471 Query 378 PWMPFPVLFAAIADKVTPESMYLVRTNYALFKNKKMSRDAFVKNLRLIVGDNLLKSAITS 557 P+MPFPVLFAAI +KV E M LV T+Y F+ KMSR F+K+LRLIVGD LLKS ITS Sbjct 472 PFMPFPVLFAAIRNKVPSEQMKLVLTDYKQFQANKMSRGDFIKSLRLIVGDALLKSTITS 531 Query 558 LQS 566 LQS Sbjct 532 LQS 534 >ref|XP_003534287.1| PREDICTED: inactive poly [ADP-ribose] polymerase RCD1-like [Glycine max] Length=583 Score = 174 bits (440), Expect = 2e-46 Identities = 99/191 (52%), Positives = 125/191 (65%), Gaps = 15/191 (8%) Frame = +3 Query 39 DVDENDTRHMVLCCVIMGNVEPLRCGSDQFLPNSEEFDSGVDRLENPNLYVVWNMNMNSH 218 DVDEN RH+ LC VIMGN+E LR G+DQF P+S E+D+GVD +E P YVVWNMNMN+H Sbjct 379 DVDENGVRHLALCRVIMGNMEILRPGTDQFHPSSCEYDNGVDAIECPQYYVVWNMNMNTH 438 Query 219 IYAECVVSFRATSVVEEPVFGKE-KNV-GI------------PQFSSYGEPQAQALGESV 356 IY E VVSF+ +S E G E KNV G+ + S+ +A ++ S Sbjct 439 IYPEFVVSFKVSSDAEGHFCGSEGKNVSGVNTACDGPHGLLNSESSTVDNGKAPSMVSST 498 Query 357 EAKSPNSPWMPFPVLFAAIADKVTPESMYLVRTNYALFKNKKMSRDAFVKNLRLIVGDNL 536 K P SPWMPFPVL AI D+V P M +++T Y F++K +SRD FVK LRLIVGD L Sbjct 499 -PKVPKSPWMPFPVLLDAIRDQVPPTGMDVIKTYYEQFRSKHISRDDFVKMLRLIVGDGL 557 Query 537 LKSAITSLQSK 569 L++ IT+LQ K Sbjct 558 LRTTITNLQYK 568 >emb|CAN61759.1| hypothetical protein VITISV_006105 [Vitis vinifera] Length=604 Score = 173 bits (439), Expect = 4e-46 Identities = 103/212 (49%), Positives = 127/212 
(60%), Gaps = 30/212 (14%) Frame = +3 Query 24 SLSCFDVDENDTRHMVLCCVIMGNVEPLRCGSDQFLPNSEEFDSGVDRLENPNLYVVWNM 203 S++ DVDEN +H+VLC VIMGN+E + GS Q P+SE FDSGVD L+NP Y++WNM Sbjct 366 SVNYCDVDENGVQHIVLCRVIMGNMELVHPGSGQCHPSSENFDSGVDDLQNPKHYIIWNM 425 Query 204 NMNSHIYAECVVSFRATSVV--EEPVFGKEKN-----------------------VGIPQ 308 NMN+HIY E VVSF+ +S V E + G E N VG+ Sbjct 426 NMNTHIYPEYVVSFKVSSRVGAEGYLIGNESNYDISGVTTCQGQSQGHSKLGLHPVGLXN 485 Query 309 FSSYGEPQAQALGESVEAKS-----PNSPWMPFPVLFAAIADKVTPESMYLVRTNYALFK 473 S ++LG++ S P SPWMPFP+LFAAI+ KV + M LV T Y LF+ Sbjct 486 DSHPTPSLGRSLGKATTLGSSTLRVPKSPWMPFPMLFAAISKKVPLKDMQLVDTQYELFR 545 Query 474 NKKMSRDAFVKNLRLIVGDNLLKSAITSLQSK 569 KK+SR FVK LRLIVGD LLKS IT LQ K Sbjct 546 RKKISRADFVKKLRLIVGDTLLKSTITHLQCK 577 >ref|XP_003516978.1| PREDICTED: inactive poly [ADP-ribose] polymerase RCD1-like [Glycine max] Length=583 Score = 164 bits (416), Expect = 5e-43 Identities = 94/191 (49%), Positives = 124/191 (65%), Gaps = 15/191 (8%) Frame = +3 Query 39 DVDENDTRHMVLCCVIMGNVEPLRCGSDQFLPNSEEFDSGVDRLENPNLYVVWNMNMNSH 218 DVDEN RH+ LC VIMGN+E L+ G+ QF P+S E+D+GVD +E P YVVWNMNMN+H Sbjct 379 DVDENGVRHLALCRVIMGNMEILQPGTGQFHPSSCEYDNGVDSIECPRYYVVWNMNMNTH 438 Query 219 IYAECVVSFRATSVVEEPVFGKE-KNV-GI------------PQFSSYGEPQAQALGESV 356 IY E VVSF+ +S E G E KNV G+ + S+ +A ++ S Sbjct 439 IYPEFVVSFKVSSDAEGHFCGSEGKNVSGVNSACQGPHGLLHSESSTVDNGKAPSMVAST 498 Query 357 EAKSPNSPWMPFPVLFAAIADKVTPESMYLVRTNYALFKNKKMSRDAFVKNLRLIVGDNL 536 K P SPWMPFP+L AI ++V P+ M +++ Y F++K +SRD FVK LRLIVGD L Sbjct 499 -PKVPKSPWMPFPLLLDAIRNQVPPKGMDVIKIYYEQFRSKHISRDDFVKMLRLIVGDGL 557 Query 537 LKSAITSLQSK 569 L++ IT+LQ K Sbjct 558 LRTTITNLQFK 568 >ref|XP_002890975.1| hypothetical protein ARALYDRAFT_473413 [Arabidopsis lyrata subsp. lyrata] gb|EFH67234.1| hypothetical protein ARALYDRAFT_473413 [Arabidopsis lyrata subsp. 
lyrata] Length=588 Score = 164 bits (416), Expect = 5e-43 Identities = 90/192 (47%), Positives = 121/192 (63%), Gaps = 17/192 (9%) Frame = +3 Query 39 DVDENDTRHMVLCCVIMGNVEPLRCGSDQFLPNSEEFDSGVDRLENPNLYVVWNMNMNSH 218 DVDEN R+MVLC VIMGN+E LR QF EE+D+GVD +ENP Y+VWN+NMN+H Sbjct 378 DVDENGVRYMVLCRVIMGNMELLRGDKAQFFSGGEEYDNGVDDIENPKNYIVWNINMNTH 437 Query 219 IYAECVVSFRATSV--VEEPVFGKEKNVGI---------PQFSSY----GEPQAQALGES 353 I+ E VV F+ +++ E + K N G+ PQ S G A ++G S Sbjct 438 IFPEFVVRFKLSNLPNAEGNLIAKRDNSGVTLEGPKNPPPQVESNHGAGGSGSANSVGSS 497 Query 354 VEAKSPNSPWMPFPVLFAAIADKVTPESMYLVRTNYALFKNKKMSRDAFVKNLRLIVGDN 533 P SPWMPFP LFAAI+ KV + M L+ +Y ++KKM+R FV+ LR+IVGD+ Sbjct 498 --TTRPKSPWMPFPTLFAAISHKVAEKDMSLINADYQQLRDKKMTRAEFVRKLRVIVGDD 555 Query 534 LLKSAITSLQSK 569 LL+S IT+LQ++ Sbjct 556 LLRSTITTLQNQ 567 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 450260976 Number of extensions: 9268664 Number of successful extensions: 19694 Number of sequences better than 1e-10: 4 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 19677 Number of HSP's successfully gapped: 4 Length of query: 926 Length of database: 6150218869 Length adjustment: 140 Effective length of query: 786 Effective length of database: 3641547109 Effective search space: 611779914312 Effective search space used: 611779914312 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 176 (72.4 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362A. 
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. RID: UPECVPUY016 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_10651 Length=476 Score E Sequences producing significant alignments: (Bits) Value ref|XP_002469870.1| candidate beta-glucosidase from glycoside... 72.0 3e-12 gb|ABK24629.1| unknown [Picea sitchensis] 68.6 4e-11 ALIGNMENTS >ref|XP_002469870.1| candidate beta-glucosidase from glycoside hydrolase family 1 [Postia placenta Mad-698-R] gb|EED84949.1| candidate beta-glucosidase from glycoside hydrolase family 1 [Postia placenta Mad-698-R] Length=480 Score = 72.0 bits (175), Expect = 3e-12 Identities = 29/39 (74%), Positives = 32/39 (82%), Gaps = 0/39 (0%) Frame = -3 Query 474 SNEDGRGPSIWDAFCRAPGNICDGSNADVAVDQYHRYKE 358 +NE GRGPSIWD FC+ PGNI DGSN D+A D YHRYKE Sbjct 23 ANEGGRGPSIWDTFCKVPGNIRDGSNGDIATDSYHRYKE 61 >gb|ABK24629.1| unknown [Picea sitchensis] Length=477 Score = 68.6 bits (166), Expect = 4e-11 Identities = 29/39 (74%), Positives = 32/39 (82%), Gaps = 0/39 (0%) Frame = -3 Query 474 SNEDGRGPSIWDAFCRAPGNICDGSNADVAVDQYHRYKE 358 + E G+GPSIWD+F R PG I DGSN DVAVDQYHRYKE Sbjct 34 AKEGGKGPSIWDSFSRTPGKILDGSNGDVAVDQYHRYKE 72 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 272888281 Number of extensions: 6189810 Number of successful extensions: 12786 Number of sequences better than 1e-10: 0 Number of 
HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 12722
Number of HSP's successfully gapped: 0
Length of query: 476
Length of database: 6150218869
Length adjustment: 120
Effective length of query: 356
Effective length of database: 3999928789
Effective search space: 151997293982
Effective search space used: 151997293982
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 171 (70.5 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362

A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

RID: UPDY6ESF01S

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
17,919,084 sequences; 6,150,218,869 total letters

Query= TrVeIntMedtrGB1_107
Length=1296

                                                                  Score   E
Sequences producing significant alignments:                       (Bits)  Value

emb|CAH59737.1| hypothetical protein [Plantago major]             119     2e-28
ref|XP_003554330.1| PREDICTED: uncharacterized protein LOC100...  92.8    1e-18
ref|NP_001241600.1| uncharacterized protein LOC100804857 [Gly...  92.4    2e-18
ref|XP_002274395.1| PREDICTED: uncharacterized protein LOC100...  87.8    1e-16
ref|XP_003537192.1| PREDICTED: uncharacterized protein LOC100...
74.3 3e-12 ALIGNMENTS >emb|CAH59737.1| hypothetical protein [Plantago major] Length=196 Score = 119 bits (298), Expect = 2e-28 Identities = 82/199 (41%), Positives = 110/199 (55%), Gaps = 16/199 (8%) Frame = -2 Query 1076 MEVFIPYKTSHCPETCISDGSSATD-PHNLIQIKDDLSMND-LKTCLSELMNVEEAGISS 903 MEV +P TS IS S +D P N + + D LKT + + +E+ IS Sbjct 1 MEVCVPCNTS--AGEYISKNSGFSDGPLNQPHCNSRIPLVDCLKTSSPQHLEIEDDEISP 58 Query 902 GGFNLAPEKYYIST---------EVRESEGKDSEKCLSKSATFQPPCGPKLSEDVFVGEK 750 G + + + +S + ES S KCL KSATF P PKLS + G + Sbjct 59 GCYLASDKTCEVSLNEGNGGRELDKSESNASSSTKCLDKSATFPPCRDPKLSTERLCGRR 118 Query 749 RNHEEHLTHEVSEGNGSASTFNTCNTQSTSLPVHLKPISAMKGSREKQQGKPLPKKLSVT 570 + H + T V+E N A + N C ++ SLP K +S++KGSREKQ G P PKKLSV+ Sbjct 119 KKHTKR-TDAVAEKNDMAESPNRCPSRCNSLPTPSKLVSSLKGSREKQ-GMP-PKKLSVS 175 Query 569 WAPDVYDPIPTSVSHVPSD 513 WAPDVYDP+PTSVSHVP++ Sbjct 176 WAPDVYDPVPTSVSHVPNN 194 >ref|XP_003554330.1| PREDICTED: uncharacterized protein LOC100783136 [Glycine max] Length=254 Score = 92.8 bits (229), Expect = 1e-18 Identities = 93/278 (33%), Positives = 128/278 (46%), Gaps = 57/278 (21%) Frame = -2 Query 983 IKDDLSMNDLKTCLSELMNVEEAGIS------SGGFNLAPEKYYISTEVRESEGKDSEKC 822 + DD +++L +E + +++A S + +N+ EK + +++ E + C Sbjct 10 VLDDNFISNLGRTFNESLQIQDAHKSVLASEENDIYNVNEEK--LCEAMKDQETNINMTC 67 Query 821 LSKSATFQPPC-------GPKLSEDVFVGEKRNHEEHLTHEVSEGNGSASTFNTCNTQST 663 L KSATF P K ++ G H H T+ +S Sbjct 68 LKKSATFPIPNTMLPSSPSDKEADTSVTGPLNEHSAHQTYSLS----------------V 111 Query 662 SLPVHLKPISAMKGSREKQQGKPLPKKLSVTWAPDVYDPIPTSVSH-VPSDKNQRYRnkk 486 S P LK ISAMKGSREK G + KL+V WA DVYDPIPT +SH V S+K Q+ Sbjct 112 SPPAPLKLISAMKGSREKHGGSQV--KLNVKWASDVYDPIPTLLSHTVRSNKKQQ----- 164 Query 485 ygkynkhkssgkssrgskskdskkqgrkssGAGSNKLKQPFHL--DSGVVLREPPQVG-- 318 + + K K Q SS GSNK KQ L SG+ + Sbjct 165 ---------KSRKKKPEKKNGKKGQKGNSSRGGSNKDKQVRKLGGTSGLCYKSMDSCDKV 215 Query 317 -----PLDYNNVGSQDPFCGSSFLKESVTKLHYPVAEA 219 LD V SQD +CG+SFLK+SVT++HY VAEA Sbjct 216 LGASTELDALEVRSQDSYCGTSFLKKSVTEVHYSVAEA 253 
>ref|NP_001241600.1| uncharacterized protein LOC100804857 [Glycine max] gb|ACU17939.1| unknown [Glycine max] Length=257 Score = 92.4 bits (228), Expect = 2e-18 Identities = 95/273 (35%), Positives = 136/273 (50%), Gaps = 47/273 (17%) Frame = -2 Query 983 IKDDLSMNDLKTCLSELMNVEEAGIS------SGGFNLAPEKYYISTEVRESEGKDSEKC 822 + DD +++L SE +++++A S S +N+ EK I +++ K + C Sbjct 13 VHDDNFISNLGRTFSESLHIQDAQKSLLASEGSDIYNVNEEK--ICKAMKDQATKVNMAC 70 Query 821 LSKSATFQPPCG--PKLSEDVFVGEKRNHEEHLTHEVSEGNGSASTFNTCNTQSTSLPVH 648 L KSATF P P S D + + +T + E + + T++ +S SLP Sbjct 71 LKKSATFPIPNTMLPSSSSD------KEADTSVTEPLYE-HSAHQTYS----RSVSLPAP 119 Query 647 LKPISAMKGSREKQQGKPLPKKLSVTWAPDVYDPIPTSVSH-VPSDKNQRYRnkkygkyn 471 LK I A+KGSREK G + KL+V WA DVYDP+PT +SH V S+K Q+ Sbjct 120 LKLIPAIKGSREKHGGSQV--KLNVKWAADVYDPVPTLLSHTVRSNKKQQ---------- 167 Query 470 khkssgkssrgskskdskkqgrkssGAGSNKLKQPFHL--DSGVVLREPPQ-------VG 318 + + K K Q SS GS+K KQ L SG+ + Sbjct 168 ----KSRKKKPEKKNGKKGQKGNSSRGGSSKDKQFRKLGGTSGLCYKSMDSCDKVLGVAT 223 Query 317 PLDYNNVGSQDPFCGSSFLKESVTKLHYPVAEA 219 LD +V SQD +CG+SFLK+SVT+LHY VAEA Sbjct 224 ELDALDVRSQDSYCGTSFLKKSVTELHYSVAEA 256 >ref|XP_002274395.1| PREDICTED: uncharacterized protein LOC100248230 isoform 3 [Vitis vinifera] ref|XP_002274341.1| PREDICTED: uncharacterized protein LOC100248230 isoform 1 [Vitis vinifera] emb|CAN77855.1| hypothetical protein VITISV_037693 [Vitis vinifera] emb|CBI32610.3| unnamed protein product [Vitis vinifera] Length=293 Score = 87.8 bits (216), Expect = 1e-16 Identities = 90/257 (35%), Positives = 125/257 (49%), Gaps = 34/257 (13%) Frame = -2 Query 944 LSELMN--VEEAGI----SSGGFNLAPEKYYISTEVRESEGKDSEKCLSKSATFQPPCGP 783 + EL N +E++G+ S+ N+ E + ES + +K K ATF P Sbjct 57 VKELANSCIEQSGMDVISSTEAENICGE---LKVRQTESPTRSYQKSFCKCATF--PSSG 111 Query 782 KLSEDVFVGEKRNHEEHLTHEVSEGNGSASTFNTCNTQSTSLPVHLKPISAMKGSREKQQ 603 K S GE+ + + E N S + + +++ SLP LK +SAMKGSR+K+ Sbjct 112 KTSLAGSSGEEDGNPDATLQE----NYSLKSLSPDISRTASLPTPLKLVSAMKGSRDKE- 166 Query 602 
GKPLPKKLSVTWAPDVYDPIPTSVSHV--------PSDKNQRYRnkkygkynkhkssgks 447 G PL KKL+VTWAPDVYDP PT VSH S N++ K S+GK Sbjct 167 GIPL-KKLNVTWAPDVYDPPPTIVSHTVRNCKKQQQSKNNRKNGKHKQKGKAARGSNGKD 225 Query 446 srgskskdskkqgrkssGAGSNKLKQPFHLDSGVVLREPPQVGPLDYNNVGSQDPFCGSS 267 + + S +KL +DS RE L+ VGS D +CGSS Sbjct 226 KKQLRKIVGSGDRGFRSFEVRDKLIASNFIDSS---RE------LEDFEVGSPDGYCGSS 276 Query 266 FLKESVTKLHYPVAEAT 216 FL++SV K+H+ VAEAT Sbjct 277 FLRKSVAKVHFSVAEAT 293 >ref|XP_003537192.1| PREDICTED: uncharacterized protein LOC100812338 [Glycine max] Length=239 Score = 74.3 bits (181), Expect = 3e-12 Identities = 77/271 (28%), Positives = 120/271 (44%), Gaps = 43/271 (16%) Frame = -2 Query 1004 DPHNLIQIKDDLSMNDLKTCLSELMNVEEA--------GISSGGFNLAPEKYYISTEVRE 849 D + + D ++ L+ SE +++ +A G G ++ E++E Sbjct 2 DTRSPVNAVHDNIISKLEVTFSESLHIHDAQNSEHASEGDHIGNCDVGERNLREGFELQE 61 Query 848 SEGKDSEKCLSKSATFQPPCGPKLSEDVFVGEKRNHEEHLTHEVSEGNGSASTFNTCNTQ 669 + K KCL + +TF P D+ + + EE S S +C Sbjct 62 T--KLEIKCLKECSTFPYP-------DMMLPSSSSDEE--ADASSPSKQSPRQNYSC--- 107 Query 668 STSLPVHLKPISAMKGSREKQQGKPLPKKLSVTWAPDVYDPIPTSVSHVPSDKNQRYRnk 489 S SLP K +SAMKGSREK++G + KL+V WAPDVYDP+PT +SH +K Q+ Sbjct 108 SVSLPAPRKLVSAMKGSREKERGSQM--KLTVKWAPDVYDPVPTLLSHTVKNKKQQ---- 161 Query 488 kygkynkhkssgkssrgskskdskkqgrkssGAGSNKLKQPFHLDSGVVLREPPQVGPLD 309 + + K K Q S GS+K KQ +V Sbjct 162 ----------KPRIKKSEKKNGKKGQKVSYSKRGSSKDKQ----YRNRWFYSHDEVFEAS 207 Query 308 YNNVGSQDPFCGSS-FLKESVTKLHYPVAEA 219 +N + D +CG+S +L+ S+TK+H+ + EA Sbjct 208 SDNAANHDSYCGTSYYLETSLTKVHWSIGEA 238 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 682390971 Number of extensions: 14727534 Number 
of successful extensions: 32988
Number of sequences better than 1e-10: 0
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 32979
Number of HSP's successfully gapped: 0
Length of query: 1296
Length of database: 6150218869
Length adjustment: 144
Effective length of query: 1152
Effective length of database: 3569870773
Effective search space: 1028122782624
Effective search space used: 1028122782624
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 178 (73.2 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362

A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

RID: UPEDH95R013

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
17,919,084 sequences; 6,150,218,869 total letters

Query= TrVeIntMedtrGB1_11016
Length=467

                                                                  Score   E
Sequences producing significant alignments:                       (Bits)  Value

ref|XP_002530622.1| chaperone protein DNAj, putative [Ricinus...  113     1e-28
ref|XP_003551367.1| PREDICTED: chaperone protein dnaJ 11, chl...  107     1e-26
ref|XP_003544555.1| PREDICTED: chaperone protein dnaJ 11, chl...  104     1e-25
ref|XP_002300579.1| predicted protein [Populus trichocarpa] >...  102     2e-25
ref|NP_187939.1| chaperone DnaJ-domain containing protein [Ar...
99.4 2e-23 ALIGNMENTS >ref|XP_002530622.1| chaperone protein DNAj, putative [Ricinus communis] gb|EEF31765.1| chaperone protein DNAj, putative [Ricinus communis] Length=168 Score = 113 bits (282), Expect = 1e-28 Identities = 59/98 (60%), Positives = 71/98 (72%), Gaps = 3/98 (3%) Frame = -1 Query 437 NLYEVLRVRRNASGTEIKTAYRTLAKLYHPDAASRFMDSAAGGRDFIEIRNAYATLSDPE 258 +LYE+LR++R AS EIKTAYR+LAKLYHPDAA R D GRDF+EI NAY TLSDP Sbjct 72 SLYEILRIKRTASLMEIKTAYRSLAKLYHPDAAVR-EDVETDGRDFMEIHNAYETLSDPA 130 Query 257 ARSAYDNNLEVSLQRLRLRSTTGSRGRFYPTRNWETDQ 144 AR+ YD +L+ + + R R G G +YPTR WETDQ Sbjct 131 ARALYDLSLDAASR--RRRPAVGFTGGYYPTRRWETDQ 166 >ref|XP_003551367.1| PREDICTED: chaperone protein dnaJ 11, chloroplastic-like [Glycine max] Length=151 Score = 107 bits (267), Expect = 1e-26 Identities = 64/113 (57%), Positives = 75/113 (66%), Gaps = 8/113 (7%) Frame = -1 Query 458 APAVDNR----NLYEVLRVRRNASGTEIKTAYRTLAKLYHPDAASRFMDSAAGGRDFIEI 291 A AVD + +LYEVLRV R+AS TEIK+AYR+LAKLYHPDAA + G DFI++ Sbjct 37 AEAVDTQRPAASLYEVLRVERDASPTEIKSAYRSLAKLYHPDAAVQRSPETDGDGDFIQL 96 Query 290 RNAYATLSDPEARSAYDNNLEVSL-QRLRLRSTTGSRGR---FYPTRNWETDQ 144 RNAY TLSDP AR+ YD L + R R ST+ SR FY TR WETDQ Sbjct 97 RNAYETLSDPSARAMYDRTLAAAHGGRHRRFSTSLSRNHSSAFYTTRRWETDQ 149 >ref|XP_003544555.1| PREDICTED: chaperone protein dnaJ 11, chloroplastic-like [Glycine max] Length=142 Score = 104 bits (260), Expect = 1e-25 Identities = 59/113 (52%), Positives = 76/113 (67%), Gaps = 10/113 (9%) Frame = -1 Query 458 APAVDNR---NLYEVLRVRRNASGTEIKTAYRTLAKLYHPDAASRFMDSAAGGRDFIEIR 288 A A+D+R +LYEVLR+++NAS EIK+AYR LAK+YHPD+A R S + RDFIEI Sbjct 30 ATAIDSRRAASLYEVLRIKQNASAVEIKSAYRNLAKVYHPDSALR--RSESDERDFIEIH 87 Query 287 NAYATLSDPEARSAYDNNLEVSLQRLR-----LRSTTGSRGRFYPTRNWETDQ 144 +AY TLSDP AR+ YD +L + R + + GS G +Y TR WETDQ Sbjct 88 DAYETLSDPSARALYDLSLMAARDDNRSFSSLVAAPNGSSGFYYQTRKWETDQ 140 >ref|XP_002300579.1| predicted protein [Populus trichocarpa] gb|EEE85384.1| predicted protein [Populus trichocarpa] Length=99 Score = 102 bits (255), 
Expect = 2e-25 Identities = 54/98 (55%), Positives = 64/98 (65%), Gaps = 1/98 (1%) Frame = -1 Query 437 NLYEVLRVRRNASGTEIKTAYRTLAKLYHPDAASRFMDSAAGGRDFIEIRNAYATLSDPE 258 +LYE+LRV AS EIKTAYR+LAK+YHPDA D + G DFIEI NAY TLSDP Sbjct 1 SLYEILRVNPTASQVEIKTAYRSLAKVYHPDAMLDRDDEPSEGVDFIEIHNAYETLSDPA 60 Query 257 ARSAYDNNLEVSLQRLRLRSTTGSRGRFYPTRNWETDQ 144 AR+ YD +L + + R G G +Y TR WETDQ Sbjct 61 ARAVYDMSLSAAARDF-YRRAVGYSGGYYTTRRWETDQ 97 >ref|NP_187939.1| chaperone DnaJ-domain containing protein [Arabidopsis thaliana] dbj|BAB02800.1| DnaJ-like protein [Arabidopsis thaliana] gb|AAM64632.1| DnaJ protein, putative [Arabidopsis thaliana] gb|AAP88343.1| At3g13310 [Arabidopsis thaliana] gb|AEE75332.1| chaperone DnaJ-domain containing protein [Arabidopsis thaliana] Length=157 Score = 99.4 bits (246), Expect = 2e-23 Identities = 58/111 (52%), Positives = 71/111 (64%), Gaps = 10/111 (9%) Frame = -1 Query 467 PAFAPAVDNR--NLYEVLRVRRNASGTEIKTAYRTLAKLYHPDAASRFMDSAAGGRDFIE 294 PA +V R +LYE+L+V AS TEIKTAYR+LAK+YHPDA S + GRDF+E Sbjct 52 PAVTESVRRRVSSLYELLKVNETASLTEIKTAYRSLAKVYHPDA------SESDGRDFME 105 Query 293 IRNAYATLSDPEARSAYDNNLEVSLQRLRLRSTTGSRGRFY-PTRNWETDQ 144 I AYATL+DP R+ YD+ L V +R+ G GR Y TR WETDQ Sbjct 106 IHKAYATLADPTTRAIYDSTLRVPRRRVH-AGAMGRSGRVYATTRRWETDQ 155 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 236168012 Number of extensions: 4973328 Number of successful extensions: 13151 Number of sequences better than 1e-10: 1 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 12733 Number of HSP's successfully gapped: 1 Length of query: 467 Length of database: 6150218869 Length 
adjustment: 117 Effective length of query: 350 Effective length of database: 4053686041 Effective search space: 154040069558 Effective search space used: 154040069558 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 171 (70.5 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. RID: UPEDUEFD016 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 18,076,563 sequences; 6,200,364,692 total letters Query= TrVeIntMedtrGB1_11358 Length=459 Score E Sequences producing significant alignments: (Bits) Value ref|XP_003531875.1| PREDICTED: uncharacterized protein LOC100... 142 5e-41 ref|XP_002531002.1| conserved hypothetical protein [Ricinus c... 139 1e-39 ref|XP_003552604.1| PREDICTED: uncharacterized protein LOC100... 134 7e-38 ref|XP_002330916.1| predicted protein [Populus trichocarpa] >... 
134 1e-37 gb|ACG30695.1| B12D protein [Zea mays] 131 1e-36 ALIGNMENTS >ref|XP_003531875.1| PREDICTED: uncharacterized protein LOC100527287 [Glycine max] gb|ACU16358.1| unknown [Glycine max] Length=86 Score = 142 bits (358), Expect = 5e-41 Identities = 64/85 (75%), Positives = 76/85 (89%), Gaps = 0/85 (0%) Frame = +1 Query 25 MRRWVKPEVYPLIAAMSFVTGLCVFQLSRNVISNPDVRVNKAHRSTAVLENQEEGEKYAQ 204 M RW+KPEVYPL+AAM+FVTG+CVFQL+RNV+ NPDVR+NK RS AVLEN+EEGEKYA+ Sbjct 1 MGRWMKPEVYPLLAAMTFVTGMCVFQLTRNVLGNPDVRINKTRRSMAVLENREEGEKYAE 60 Query 205 HSLRKFLRTRTPEIMPALNRFFSQN 279 H LRKFLRTR PEIMP +N FFS++ Sbjct 61 HGLRKFLRTRPPEIMPTINHFFSED 85 >ref|XP_002531002.1| conserved hypothetical protein [Ricinus communis] gb|EEF31390.1| conserved hypothetical protein [Ricinus communis] Length=86 Score = 139 bits (349), Expect = 1e-39 Identities = 62/85 (73%), Positives = 75/85 (88%), Gaps = 0/85 (0%) Frame = +1 Query 25 MRRWVKPEVYPLIAAMSFVTGLCVFQLSRNVISNPDVRVNKAHRSTAVLENQEEGEKYAQ 204 M RW+KPEVYPL+AAM+FVT LC FQL+RN+ NPDVR+NKAHR TAVLEN+ EGE+YA+ Sbjct 1 MARWIKPEVYPLMAAMTFVTSLCAFQLTRNMFLNPDVRINKAHRRTAVLENEVEGEQYAE 60 Query 205 HSLRKFLRTRTPEIMPALNRFFSQN 279 H LRKFLRTR PEIMP++N FFS++ Sbjct 61 HGLRKFLRTRPPEIMPSINHFFSED 85 >ref|XP_003552604.1| PREDICTED: uncharacterized protein LOC100785946 [Glycine max] Length=86 Score = 134 bits (337), Expect = 7e-38 Identities = 59/85 (69%), Positives = 74/85 (87%), Gaps = 0/85 (0%) Frame = +1 Query 25 MRRWVKPEVYPLIAAMSFVTGLCVFQLSRNVISNPDVRVNKAHRSTAVLENQEEGEKYAQ 204 M RW+KPEVYPL+AAM+FV+ +CVFQL+RN++ NPDVR+NK RS VL+N+EEGEKYA+ Sbjct 1 MGRWMKPEVYPLLAAMTFVSSMCVFQLTRNMLGNPDVRINKTRRSMPVLDNREEGEKYAE 60 Query 205 HSLRKFLRTRTPEIMPALNRFFSQN 279 H LRKFLRTR PEIMP +N FFS++ Sbjct 61 HGLRKFLRTRPPEIMPTINHFFSED 85 >ref|XP_002330916.1| predicted protein [Populus trichocarpa] gb|EEF10241.1| predicted protein [Populus trichocarpa] Length=86 Score = 134 bits (336), Expect = 1e-37 Identities = 61/85 (72%), Positives = 73/85 (86%), Gaps = 0/85 (0%) Frame = +1 Query 25 
MRRWVKPEVYPLIAAMSFVTGLCVFQLSRNVISNPDVRVNKAHRSTAVLENQEEGEKYAQ 204 M RW+KPEVYPL+AAM+ VT LC+FQL+RNV NPDVRVNKA+R VLEN+EEGE+YA+ Sbjct 1 MGRWMKPEVYPLLAAMTCVTSLCIFQLTRNVFMNPDVRVNKANRGMGVLENKEEGERYAE 60 Query 205 HSLRKFLRTRTPEIMPALNRFFSQN 279 H LRKFLRTR PEIMP +N FFS++ Sbjct 61 HGLRKFLRTRPPEIMPTVNHFFSED 85 >gb|ACG30695.1| B12D protein [Zea mays] Length=94 Score = 131 bits (329), Expect = 1e-36 Identities = 59/90 (66%), Positives = 76/90 (84%), Gaps = 0/90 (0%) Frame = +1 Query 25 MRRWVKPEVYPLIAAMSFVTGLCVFQLSRNVISNPDVRVNKAHRSTAVLENQEEGEKYAQ 204 M RW+KP+VYPLIAAMSFVTG+CVFQL+RNV+ NPDVRV+K R +AVL+N EG++Y+Q Sbjct 1 MGRWLKPDVYPLIAAMSFVTGMCVFQLARNVLMNPDVRVSKTSRQSAVLDNAGEGQRYSQ 60 Query 205 HSLRKFLRTRTPEIMPALNRFFSQNNDN*T 294 H+ R+FL T+ PE+ PALN FFS +N+N T Sbjct 61 HAFRRFLATQRPEVFPALNSFFSDSNNNNT 90 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 213584297 Number of extensions: 4126945 Number of successful extensions: 10139 Number of sequences better than 1e-10: 0 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 10131 Number of HSP's successfully gapped: 0 Length of query: 459 Length of database: 6150218869 Length adjustment: 116 Effective length of query: 343 Effective length of database: 4071605125 Effective search space: 150649389625 Effective search space used: 150649389625 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 171 (70.5 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. RID: UPEDW0HV012 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 18,076,563 sequences; 6,200,364,692 total letters Query= TrVeIntMedtrGB1_11591 Length=452 Score E Sequences producing significant alignments: (Bits) Value gb|ACB12048.1| pathogenesis-related protein [Rehmannia glutin... 92.8 4e-21 gb|ABR10301.1| pathogen-related protein STH-2 [Salvia miltior... 81.3 1e-16 ref|XP_003609710.1| Pathogenesis-related protein [Medicago tr... 74.3 3e-14 ref|XP_003609709.1| Pathogenesis-related protein [Medicago tr... 74.3 6e-14 ref|XP_002271428.2| PREDICTED: pathogenesis-related protein S... 70.9 6e-13 ALIGNMENTS >gb|ACB12048.1| pathogenesis-related protein [Rehmannia glutinosa] Length=154 Score = 92.8 bits (229), Expect = 4e-21 Identities = 50/90 (56%), Positives = 64/90 (71%), Gaps = 0/90 (0%) Frame = -2 Query 451 RIDALDAekgttkfttteGPWLGDKIESVVFDVKFEEVSGGGCTIKIVNEYNTKGDVALK 272 RI+A+D + +K+T EGP LGDKIES+ ++ KFE+ S GGC KIV EY+TKGD+ LK Sbjct 65 RIEAVDIDNQVSKYTVIEGPMLGDKIESIHYEQKFEDSSDGGCVAKIVCEYHTKGDIQLK 124 Query 271 DEDIKEIIDGTKGFYAAAEAYLTANPTVCA 182 +E +K I D GFY +E YL ANP VCA Sbjct 125 EEGVKAINDQALGFYTLSEEYLHANPNVCA 154 >gb|ABR10301.1| pathogen-related protein STH-2 [Salvia miltiorrhiza] Length=160 Score = 81.3 bits (199), Expect = 1e-16 Identities = 43/90 (48%), Positives = 60/90 (67%), Gaps = 0/90 (0%) Frame = -2 Query 451 RIDALDAekgttkfttteGPWLGDKIESVVFDVKFEEVSGGGCTIKIVNEYNTKGDVALK 272 R+D +D EK + K+T EG LGDK+E + +D+KFE+ GGC +K+ +EY+TKG L Sbjct 71 RVDEIDHEKHSIKYTLIEGDMLGDKLEKICYDMKFEDTEDGGCVVKVTSEYHTKGGYELA 130 Query 271 DEDIKEIIDGTKGFYAAAEAYLTANPTVCA 182 DED+K + + G Y + E YL ANP VCA Sbjct 131 DEDLKGAKEQSLGMYKSCEDYLLANPHVCA 160 >ref|XP_003609710.1| Pathogenesis-related protein [Medicago truncatula] 
gb|AES91907.1| Pathogenesis-related protein [Medicago truncatula] Length=160 Score = 74.3 bits (181), Expect = 3e-14 Identities = 42/90 (47%), Positives = 56/90 (62%), Gaps = 0/90 (0%) Frame = -2 Query 451 RIDALDAekgttkfttteGPWLGDKIESVVFDVKFEEVSGGGCTIKIVNEYNTKGDVALK 272 +ID LD E K+T EG LGDK+ES+ ++VKFE + GGC K+ + Y T GD +K Sbjct 71 KIDVLDKENLICKYTMIEGDPLGDKLESIAYEVKFEATNDGGCLCKMASSYKTIGDFDVK 130 Query 271 DEDIKEIIDGTKGFYAAAEAYLTANPTVCA 182 +ED+KE + T G Y E+YL NP V A Sbjct 131 EEDVKEGRESTIGIYEVVESYLLENPQVYA 160 >ref|XP_003609709.1| Pathogenesis-related protein [Medicago truncatula] gb|AES91906.1| Pathogenesis-related protein [Medicago truncatula] Length=229 Score = 74.3 bits (181), Expect = 6e-14 Identities = 42/90 (47%), Positives = 56/90 (62%), Gaps = 0/90 (0%) Frame = -2 Query 451 RIDALDAekgttkfttteGPWLGDKIESVVFDVKFEEVSGGGCTIKIVNEYNTKGDVALK 272 +ID LD E K+T EG LGDK+ES+ ++VKFE + GGC K+ + Y T GD +K Sbjct 140 KIDVLDKENLICKYTMIEGDPLGDKLESIAYEVKFEATNDGGCLCKMASSYKTIGDFDVK 199 Query 271 DEDIKEIIDGTKGFYAAAEAYLTANPTVCA 182 +ED+KE + T G Y E+YL NP V A Sbjct 200 EEDVKEGRESTIGIYEVVESYLLENPQVYA 229 >ref|XP_002271428.2| PREDICTED: pathogenesis-related protein STH-2 [Vitis vinifera] emb|CBI22930.3| unnamed protein product [Vitis vinifera] Length=160 Score = 70.9 bits (172), Expect = 6e-13 Identities = 32/65 (49%), Positives = 45/65 (69%), Gaps = 0/65 (0%) Frame = -2 Query 388 LGDKIESVVFDVKFEEVSGGGCTIKIVNEYNTKGDVALKDEDIKEIIDGTKGFYAAAEAY 209 LGD++ES+V+++KFEE GGC K +EY+TKG+ +K+E I+E + G Y EAY Sbjct 92 LGDQLESIVYEMKFEESGDGGCICKTRSEYHTKGEFEIKEESIREGKEKAMGVYKLVEAY 151 Query 208 LTANP 194 L ANP Sbjct 152 LLANP 156 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 
Number of Sequences: 17919084 Number of Hits to DB: 179470603 Number of extensions: 3135620 Number of successful extensions: 6465 Number of sequences better than 1e-10: 0 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 6463 Number of HSP's successfully gapped: 0 Length of query: 452 Length of database: 6150218869 Length adjustment: 113 Effective length of query: 339 Effective length of database: 4125362377 Effective search space: 152638407949 Effective search space used: 152638407949 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 171 (70.5 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. RID: UPEDYGS6012 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 18,076,563 sequences; 6,200,364,692 total letters Query= TrVeIntMedtrGB1_12004 Length=442 Score E Sequences producing significant alignments: (Bits) Value gb|ACB12048.1| pathogenesis-related protein [Rehmannia glutin... 86.7 8e-19 gb|ABR10301.1| pathogen-related protein STH-2 [Salvia miltior... 80.1 2e-16 ref|XP_003609710.1| Pathogenesis-related protein [Medicago tr... 71.2 4e-13 ref|XP_002271428.2| PREDICTED: pathogenesis-related protein S... 70.9 5e-13 ref|XP_003609709.1| Pathogenesis-related protein [Medicago tr... 
71.2 7e-13 ALIGNMENTS >gb|ACB12048.1| pathogenesis-related protein [Rehmannia glutinosa] Length=154 Score = 86.7 bits (213), Expect = 8e-19 Identities = 41/72 (57%), Positives = 50/72 (69%), Gaps = 0/72 (0%) Frame = +3 Query 3 GAYLGEKIEPVVYDVKFEEVSGGGCTIKIVNEYNTKGDVALKDEDIKEIIDGTKGFYAAA 182 G LG+KIE + Y+ KFE+ S GGC KIV EY+TKGD+ LK+E +K I D GFY + Sbjct 83 GPMLGDKIESIHYEQKFEDSSDGGCVAKIVCEYHTKGDIQLKEEGVKAINDQALGFYTLS 142 Query 183 EAYLTANPTVCA 218 E YL ANP VCA Sbjct 143 EEYLHANPNVCA 154 >gb|ABR10301.1| pathogen-related protein STH-2 [Salvia miltiorrhiza] Length=160 Score = 80.1 bits (196), Expect = 2e-16 Identities = 35/72 (49%), Positives = 48/72 (67%), Gaps = 0/72 (0%) Frame = +3 Query 3 GAYLGEKIEPVVYDVKFEEVSGGGCTIKIVNEYNTKGDVALKDEDIKEIIDGTKGFYAAA 182 G LG+K+E + YD+KFE+ GGC +K+ +EY+TKG L DED+K + + G Y + Sbjct 89 GDMLGDKLEKICYDMKFEDTEDGGCVVKVTSEYHTKGGYELADEDLKGAKEQSLGMYKSC 148 Query 183 EAYLTANPTVCA 218 E YL ANP VCA Sbjct 149 EDYLLANPHVCA 160 >ref|XP_003609710.1| Pathogenesis-related protein [Medicago truncatula] gb|AES91907.1| Pathogenesis-related protein [Medicago truncatula] Length=160 Score = 71.2 bits (173), Expect = 4e-13 Identities = 33/72 (46%), Positives = 45/72 (63%), Gaps = 0/72 (0%) Frame = +3 Query 3 GAYLGEKIEPVVYDVKFEEVSGGGCTIKIVNEYNTKGDVALKDEDIKEIIDGTKGFYAAA 182 G LG+K+E + Y+VKFE + GGC K+ + Y T GD +K+ED+KE + T G Y Sbjct 89 GDPLGDKLESIAYEVKFEATNDGGCLCKMASSYKTIGDFDVKEEDVKEGRESTIGIYEVV 148 Query 183 EAYLTANPTVCA 218 E+YL NP V A Sbjct 149 ESYLLENPQVYA 160 >ref|XP_002271428.2| PREDICTED: pathogenesis-related protein STH-2 [Vitis vinifera] emb|CBI22930.3| unnamed protein product [Vitis vinifera] Length=160 Score = 70.9 bits (172), Expect = 5e-13 Identities = 32/68 (47%), Positives = 45/68 (66%), Gaps = 0/68 (0%) Frame = +3 Query 3 GAYLGEKIEPVVYDVKFEEVSGGGCTIKIVNEYNTKGDVALKDEDIKEIIDGTKGFYAAA 182 G LG+++E +VY++KFEE GGC K +EY+TKG+ +K+E I+E + G Y Sbjct 89 GGVLGDQLESIVYEMKFEESGDGGCICKTRSEYHTKGEFEIKEESIREGKEKAMGVYKLV 148 Query 183 EAYLTANP 206 EAYL ANP Sbjct 149 
EAYLLANP 156 >ref|XP_003609709.1| Pathogenesis-related protein [Medicago truncatula] gb|AES91906.1| Pathogenesis-related protein [Medicago truncatula] Length=229 Score = 71.2 bits (173), Expect = 7e-13 Identities = 33/72 (46%), Positives = 45/72 (63%), Gaps = 0/72 (0%) Frame = +3 Query 3 GAYLGEKIEPVVYDVKFEEVSGGGCTIKIVNEYNTKGDVALKDEDIKEIIDGTKGFYAAA 182 G LG+K+E + Y+VKFE + GGC K+ + Y T GD +K+ED+KE + T G Y Sbjct 158 GDPLGDKLESIAYEVKFEATNDGGCLCKMASSYKTIGDFDVKEEDVKEGRESTIGIYEVV 217 Query 183 EAYLTANPTVCA 218 E+YL NP V A Sbjct 218 ESYLLENPQVYA 229 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 177980684 Number of extensions: 3148074 Number of successful extensions: 7265 Number of sequences better than 1e-10: 0 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 7261 Number of HSP's successfully gapped: 0 Length of query: 442 Length of database: 6150218869 Length adjustment: 110 Effective length of query: 332 Effective length of database: 4179119629 Effective search space: 154627426273 Effective search space used: 154627426273 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 171 (70.5 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. 
RID: UPEE208N016 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 18,076,563 sequences; 6,200,364,692 total letters Query= TrVeIntMedtrGB1_12099 Length=440 Score E Sequences producing significant alignments: (Bits) Value ref|XP_003545085.1| PREDICTED: uncharacterized protein LOC100... 78.2 4e-15 ref|XP_002300574.1| predicted protein [Populus trichocarpa] >... 77.8 6e-15 ref|XP_003518561.1| PREDICTED: uncharacterized protein LOC100... 75.5 4e-14 ref|XP_002863234.1| hypothetical protein ARALYDRAFT_497148 [A... 72.8 3e-13 ref|XP_003634761.1| PREDICTED: uncharacterized protein LOC100... 73.9 8e-13 ALIGNMENTS >ref|XP_003545085.1| PREDICTED: uncharacterized protein LOC100812046 [Glycine max] Length=311 Score = 78.2 bits (191), Expect = 4e-15 Identities = 39/93 (42%), Positives = 54/93 (58%), Gaps = 3/93 (3%) Frame = -3 Query 426 RVPNLHWRFRGNETITVDEFRVQILWDVHDWLYSGNHSGVG--IFVFRIDAsvgelggsg 253 R+ NLHWRFRGNE + V+ F VQI WDVHDWL++ N G+G FVF+ + Sbjct 218 RIMNLHWRFRGNEIVMVNNFPVQIFWDVHDWLFT-NDLGLGPAFFVFKPIFLETTSDFNS 276 Query 252 rveggedlGNGFWDVMTDEPCSVGFCHFLYAWK 154 G+ +++ D + GFCH+LYAW+ Sbjct 277 IECPERGGGSSKHELLEDNSSTQGFCHYLYAWR 309 >ref|XP_002300574.1| predicted protein [Populus trichocarpa] gb|EEE85379.1| predicted protein [Populus trichocarpa] Length=307 Score = 77.8 bits (190), Expect = 6e-15 Identities = 41/95 (43%), Positives = 53/95 (56%), Gaps = 1/95 (1%) Frame = -3 Query 429 IRVPNLHWRFRGNETITVDEFRVQILWDVHDWLYSGNHSGVGIFVFRIDAsvgelggsgr 250 IRV NL+WRFRGNE + VD+ VQI WDVHDWL+SG+ + G+F+ + A G G Sbjct 213 IRVMNLNWRFRGNENMKVDDVGVQIFWDVHDWLFSGSSTSHGLFILKPAAQEGGDDKVGE 272 Query 249 veggedlGNGFWDVMTDEPCSV-GFCHFLYAWKTE 148 G +D + S GF H + AWK E Sbjct 273 GRHCRGNDGGMYDSPKERSSSTPGFFHVINAWKYE 307 >ref|XP_003518561.1| PREDICTED: uncharacterized protein LOC100776841 [Glycine max] Length=311 Score = 75.5 bits (184), Expect = 4e-14 Identities = 38/94 (40%), Positives = 54/94 (57%), Gaps = 3/94 (3%) Frame = -3 
Query 426 RVPNLHWRFRGNETITVDEFRVQILWDVHDWLYSGNHSGVG--IFVFRIDAsvgelggsg 253 R+ NLHWRFRGNE + V+ VQI WDVHDWL++ N G+G FVF+ + Sbjct 218 RIMNLHWRFRGNEILMVNNLPVQIFWDVHDWLFT-NDLGLGPAFFVFKPVFLETTSDSNS 276 Query 252 rveggedlGNGFWDVMTDEPCSVGFCHFLYAWKT 151 G+ +++ + + GFCH+LYAW+T Sbjct 277 IECLERSGGSNKRELLEENSSTQGFCHYLYAWRT 310 >ref|XP_002863234.1| hypothetical protein ARALYDRAFT_497148 [Arabidopsis lyrata subsp. lyrata] gb|EFH39493.1| hypothetical protein ARALYDRAFT_497148 [Arabidopsis lyrata subsp. lyrata] Length=282 Score = 72.8 bits (177), Expect = 3e-13 Identities = 38/94 (40%), Positives = 55/94 (59%), Gaps = 14/94 (15%) Frame = -3 Query 429 IRVPNLHWRFRGNETITVDEFRVQILWDVHDWLYSGNHSGVGIFVFRIDAsvgelggsgr 250 ++V NL W+FRGN+T+ VD+ VQ+ WDV+DWL+S +G G+F+F+ ++ Sbjct 203 VQVRNLQWKFRGNQTVLVDKEPVQVFWDVYDWLFSSPGTGHGLFIFKPES---------- 252 Query 249 veggedlGNGFWDVMTDEPCSVGFCHFLYAWKTE 148 G + NG D + S FC FLYAWK E Sbjct 253 --GESETSNGTKD--SSVSSSSDFCLFLYAWKLE 282 >ref|XP_003634761.1| PREDICTED: uncharacterized protein LOC100852612 [Vitis vinifera] Length=697 Score = 73.9 bits (180), Expect = 8e-13 Identities = 36/92 (39%), Positives = 48/92 (52%), Gaps = 2/92 (2%) Frame = -3 Query 429 IRVPNLHWRFRGNETITVDEFRVQILWDVHDWLYSGNHSGVGIFVFRIDAsvgelggsgr 250 IRV NLHWRFRGNET+ ++ +QI WDVHDWL++ G +F+F+ A Sbjct 213 IRVMNLHWRFRGNETVFLNNLPIQIFWDVHDWLFNSPSLGHALFIFKPGAPEYSSDSDLD 272 Query 249 veggedlGNG--FWDVMTDEPCSVGFCHFLYA 160 G +D + + FCHFLYA Sbjct 273 GTNHSGEAVGSDIYDSLWTTTSAADFCHFLYA 304 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 228184065 Number of extensions: 4528817 Number of successful extensions: 29740 Number of sequences 
better than 1e-10: 2 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 29737 Number of HSP's successfully gapped: 2 Length of query: 440 Length of database: 6150218869 Length adjustment: 110 Effective length of query: 330 Effective length of database: 4179119629 Effective search space: 150448306644 Effective search space used: 150448306644 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 171 (70.5 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. RID: UPE20RD501S Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_1213 Length=899 Score E Sequences producing significant alignments: (Bits) Value dbj|BAF47193.1| embryonic element binding Factor 7 [Daucus ca... 176 4e-49 gb|ADZ28107.1| ethylene response factor 4 [Malus x domestica] 173 3e-48 gb|ADE41144.1| AP2 domain class transcription factor [Malus x... 173 3e-48 gb|ADE41127.1| AP2 domain class transcription factor [Malus x... 166 2e-45 gb|ABB89755.1| putative dehydration-responsive element bindin... 
164 1e-44 ALIGNMENTS >dbj|BAF47193.1| embryonic element binding Factor 7 [Daucus carota] Length=320 Score = 176 bits (445), Expect = 4e-49 Identities = 117/182 (64%), Positives = 133/182 (73%), Gaps = 21/182 (12%) Frame = +3 Query 3 PVKLYRGVRQRHWGKWVAEIRLPKNRTRLWLGTFDTAEdaalaydkaayklRGDYARLNF 182 P KLYRGVRQRHWGKWVAEIRLPKNRTRLWLGTFDTAE+AALAYD+AAYKLRGD+ARLNF Sbjct 153 PTKLYRGVRQRHWGKWVAEIRLPKNRTRLWLGTFDTAEEAALAYDRAAYKLRGDFARLNF 212 Query 183 PHLKHQLNQD-STFKPLQSSVDAKLQAICQNLAKQGTTEKKSVKPCLDSDKQSEIPKVEN 359 PHLK L+Q+ STFKPL S+VDAKLQAICQNL K+ +K + KP + KVE Sbjct 213 PHLK--LDQELSTFKPLHSTVDAKLQAICQNLNKE-PKQKSAKKP---KTSKPAFVKVEE 266 Query 360 ASSDSFNQAVKVEDDVLssskarsaspesDITFHDFSESCFDECGDFFLHKCPSVEIDWA 539 AS+ S + +SPES+I+F DFSE CFDE +F L K PSVEIDW Sbjct 267 ASNGS--------------DLSGGSSPESEISFLDFSEPCFDESENFMLQKFPSVEIDWE 312 Query 540 AL 545 AL Sbjct 313 AL 314 >gb|ADZ28107.1| ethylene response factor 4 [Malus x domestica] Length=323 Score = 173 bits (439), Expect = 3e-48 Identities = 112/192 (58%), Positives = 135/192 (70%), Gaps = 11/192 (6%) Frame = +3 Query 3 PVKLYRGVRQRHWGKWVAEIRLPKNRTRLWLGTFDTAEdaalaydkaayklRGDYARLNF 182 P KLYRGVRQRHWGKWVAEIRLP+NRTRLWLGTFDTAE+AALAYDKAA+KLRGD+ARLNF Sbjct 132 PTKLYRGVRQRHWGKWVAEIRLPRNRTRLWLGTFDTAEEAALAYDKAAFKLRGDFARLNF 191 Query 183 PHLKHQ----LNQDSTFKPLQSSVDAKLQAICQNLAKQGTTEKKSVKPCLDSDKQSEI-P 347 PHL+HQ +KPL SSVDAKLQAICQ+L T + + +PC ++ + + Sbjct 192 PHLRHQGALVCGDFGHYKPLHSSVDAKLQAICQSLGANSTKQGNTGEPCPVAETKPVVSA 251 Query 348 KVENASSDSFNQAVKVEDDVL------ssskarsaspesDITFHDFSESCFDECGDFFLH 509 +E DSF +K E + S S+SPESDITF DFS+S +DE +F L Sbjct 252 PLEAKMDDSFKSELKSESEAFSSSSYSPSRSDESSSPESDITFLDFSDSQWDEAENFGLE 311 Query 510 KCPSVEIDWAAL 545 K PSVEIDW+A+ Sbjct 312 KYPSVEIDWSAI 323 >gb|ADE41144.1| AP2 domain class transcription factor [Malus x domestica] Length=323 Score = 173 bits (439), Expect = 3e-48 Identities = 111/192 (58%), Positives = 134/192 (70%), Gaps = 11/192 (6%) Frame = +3 Query 3 
PVKLYRGVRQRHWGKWVAEIRLPKNRTRLWLGTFDTAEdaalaydkaayklRGDYARLNF 182 P KLYRGVRQRHWGKWVAEIRLP+NRTRLWLGTFDTAE+AALAYDKAA+KLRGD+ARLNF Sbjct 132 PTKLYRGVRQRHWGKWVAEIRLPRNRTRLWLGTFDTAEEAALAYDKAAFKLRGDFARLNF 191 Query 183 PHLKHQ----LNQDSTFKPLQSSVDAKLQAICQNLAKQGTTEKKSVKPCLDSDKQSEI-P 347 PHL+HQ +KPL SSVDAKLQAICQ+L T + + +PC ++ + + Sbjct 192 PHLRHQGALVCGDFGHYKPLHSSVDAKLQAICQSLGANSTKQGNTGEPCPVAETKPVVSA 251 Query 348 KVENASSDSFNQAVKVEDDVL------ssskarsaspesDITFHDFSESCFDECGDFFLH 509 +E DSF +K E + S+SPESDITF DFS+S +DE +F L Sbjct 252 PLEAKMDDSFKSELKSESEAFSSSSYSPFRSDESSSPESDITFLDFSDSQWDEAENFGLE 311 Query 510 KCPSVEIDWAAL 545 K PSVEIDW+A+ Sbjct 312 KYPSVEIDWSAI 323 >gb|ADE41127.1| AP2 domain class transcription factor [Malus x domestica] Length=338 Score = 166 bits (420), Expect = 2e-45 Identities = 112/195 (57%), Positives = 132/195 (68%), Gaps = 14/195 (7%) Frame = +3 Query 3 PVKLYRGVRQRHWGKWVAEIRLPKNRTRLWLGTFDTAEdaalaydkaayklRGDYARLNF 182 P KLYRGVRQRHWGKWVAEIRLP+NRTRLWLGTFDTAE+AALAYDKAA+KLRGD+A LNF Sbjct 131 PTKLYRGVRQRHWGKWVAEIRLPRNRTRLWLGTFDTAEEAALAYDKAAFKLRGDFACLNF 190 Query 183 PHLKHQLNQDS----TFKPLQSSVDAKLQAICQNLAKQGTTEKKSVKPC-LDSDKQSEIP 347 PHLKHQ DS +KPL SSVDAKLQ ICQ+LA T + + +PC + K Sbjct 191 PHLKHQGAHDSGAFGHYKPLHSSVDAKLQEICQSLAANSTKQGNTGEPCSVPETKPMVSA 250 Query 348 KVENASSDSFNQAVKVEDDVL---------ssskarsaspesDITFHDFSESCFDECGDF 500 +E DSF +K E + S S++PE+DITF DFS+S +DE +F Sbjct 251 PLETRMDDSFKTELKSEWEAFSSLSFSPSRSDRSDESSTPETDITFLDFSDSQWDEAENF 310 Query 501 FLHKCPSVEIDWAAL 545 L K PSVEID+ L Sbjct 311 GLEKYPSVEIDFLHL 325 >gb|ABB89755.1| putative dehydration-responsive element binding protein [Broussonetia papyrifera] Length=330 Score = 164 bits (415), Expect = 1e-44 Identities = 115/200 (58%), Positives = 140/200 (70%), Gaps = 19/200 (10%) Frame = +3 Query 3 PVKLYRGVRQRHWGKWVAEIRLPKNRTRLWLGTFDTAEdaalaydkaayklRGDYARLNF 182 P KLYRGVRQRHWGKWVAEIRLPKNRTRLWLGTFDTAE+AALAYDKAAYKLRGD+ARLNF Sbjct 131 PNKLYRGVRQRHWGKWVAEIRLPKNRTRLWLGTFDTAEEAALAYDKAAYKLRGDFARLNF 190 Query 183 
PHLKHQ----LNQDSTFKPLQSSVDAKLQAICQNLA---KQGTTE--------KKSVKPC 317 PHL+H+ + +KPL SSVDAKLQAICQ+LA KQG+ + K ++P Sbjct 191 PHLRHEGAHVSGEFGEYKPLHSSVDAKLQAICQSLANSQKQGSAKEACSEPEVKPVIEPK 250 Query 318 LDSDK----QSEIPKVENASSDSFNQAVKVEDDVLssskarsaspesDITFHDFSESCFD 485 + SD + E+ +SS S + ++ + + S A S+SPESD+T DFS+S +D Sbjct 251 MASDNSPKGELEVSSSSLSSSSSLSLSLSLSSPLSDESSAGSSSPESDVTLLDFSDSHWD 310 Query 486 ECGDFFLHKCPSVEIDWAAL 545 +F L K PSVEIDW AL Sbjct 311 GNENFGLGKYPSVEIDWDAL 330 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 431153293 Number of extensions: 8524744 Number of successful extensions: 19148 Number of sequences better than 1e-10: 6 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 19121 Number of HSP's successfully gapped: 6 Length of query: 899 Length of database: 6150218869 Length adjustment: 140 Effective length of query: 759 Effective length of database: 3641547109 Effective search space: 579005990331 Effective search space used: 579005990331 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 176 (72.4 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. 
RID: UPEES6GZ016 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 18,076,563 sequences; 6,200,364,692 total letters Query= TrVeIntMedtrGB1_12313 Length=436 Score E Sequences producing significant alignments: (Bits) Value gb|ACB12048.1| pathogenesis-related protein [Rehmannia glutin... 80.1 2e-16 ref|XP_003609710.1| Pathogenesis-related protein [Medicago tr... 75.9 8e-15 gb|ABR10301.1| pathogen-related protein STH-2 [Salvia miltior... 75.1 2e-14 ref|XP_003609709.1| Pathogenesis-related protein [Medicago tr... 75.9 2e-14 ref|XP_002271428.2| PREDICTED: pathogenesis-related protein S... 73.2 8e-14 ALIGNMENTS >gb|ACB12048.1| pathogenesis-related protein [Rehmannia glutinosa] Length=154 Score = 80.1 bits (196), Expect = 2e-16 Identities = 42/87 (48%), Positives = 61/87 (70%), Gaps = 0/87 (0%) Frame = +2 Query 2 RIDALDAekgttkftttegPWLGDKIESVVFDVKFEEVSGGGCLIKIANDYNTKGDVALK 181 RI+A+D + +K+T EGP LGDKIES+ ++ KFE+ S GGC+ KI +Y+TKGD+ LK Sbjct 65 RIEAVDIDNQVSKYTVIEGPMLGDKIESIHYEQKFEDSSDGGCVAKIVCEYHTKGDIQLK 124 Query 182 DDDIKEFVERTKAFYDAAEAYLIANPN 262 ++ +K ++ FY +E YL ANPN Sbjct 125 EEGVKAINDQALGFYTLSEEYLHANPN 151 >ref|XP_003609710.1| Pathogenesis-related protein [Medicago truncatula] gb|AES91907.1| Pathogenesis-related protein [Medicago truncatula] Length=160 Score = 75.9 bits (185), Expect = 8e-15 Identities = 41/86 (48%), Positives = 57/86 (66%), Gaps = 0/86 (0%) Frame = +2 Query 2 RIDALDAekgttkftttegPWLGDKIESVVFDVKFEEVSGGGCLIKIANDYNTKGDVALK 181 +ID LD E K+T EG LGDK+ES+ ++VKFE + GGCL K+A+ Y T GD +K Sbjct 71 KIDVLDKENLICKYTMIEGDPLGDKLESIAYEVKFEATNDGGCLCKMASSYKTIGDFDVK 130 Query 182 DDDIKEFVERTKAFYDAAEAYLIANP 259 ++D+KE E T Y+ E+YL+ NP Sbjct 131 EEDVKEGRESTIGIYEVVESYLLENP 156 >gb|ABR10301.1| pathogen-related protein STH-2 [Salvia miltiorrhiza] Length=160 Score = 75.1 bits (183), Expect = 2e-14 Identities = 38/87 (44%), Positives = 60/87 (69%), Gaps = 0/87 (0%) Frame = +2 Query 2 
RIDALDAekgttkftttegPWLGDKIESVVFDVKFEEVSGGGCLIKIANDYNTKGDVALK 181 R+D +D EK + K+T EG LGDK+E + +D+KFE+ GGC++K+ ++Y+TKG L Sbjct 71 RVDEIDHEKHSIKYTLIEGDMLGDKLEKICYDMKFEDTEDGGCVVKVTSEYHTKGGYELA 130 Query 182 DDDIKEFVERTKAFYDAAEAYLIANPN 262 D+D+K E++ Y + E YL+ANP+ Sbjct 131 DEDLKGAKEQSLGMYKSCEDYLLANPH 157 >ref|XP_003609709.1| Pathogenesis-related protein [Medicago truncatula] gb|AES91906.1| Pathogenesis-related protein [Medicago truncatula] Length=229 Score = 75.9 bits (185), Expect = 2e-14 Identities = 41/86 (48%), Positives = 57/86 (66%), Gaps = 0/86 (0%) Frame = +2 Query 2 RIDALDAekgttkftttegPWLGDKIESVVFDVKFEEVSGGGCLIKIANDYNTKGDVALK 181 +ID LD E K+T EG LGDK+ES+ ++VKFE + GGCL K+A+ Y T GD +K Sbjct 140 KIDVLDKENLICKYTMIEGDPLGDKLESIAYEVKFEATNDGGCLCKMASSYKTIGDFDVK 199 Query 182 DDDIKEFVERTKAFYDAAEAYLIANP 259 ++D+KE E T Y+ E+YL+ NP Sbjct 200 EEDVKEGRESTIGIYEVVESYLLENP 225 >ref|XP_002271428.2| PREDICTED: pathogenesis-related protein STH-2 [Vitis vinifera] emb|CBI22930.3| unnamed protein product [Vitis vinifera] Length=160 Score = 73.2 bits (178), Expect = 8e-14 Identities = 30/66 (45%), Positives = 48/66 (73%), Gaps = 0/66 (0%) Frame = +2 Query 65 LGDKIESVVFDVKFEEVSGGGCLIKIANDYNTKGDVALKDDDIKEFVERTKAFYDAAEAY 244 LGD++ES+V+++KFEE GGC+ K ++Y+TKG+ +K++ I+E E+ Y EAY Sbjct 92 LGDQLESIVYEMKFEESGDGGCICKTRSEYHTKGEFEIKEESIREGKEKAMGVYKLVEAY 151 Query 245 LIANPN 262 L+ANP+ Sbjct 152 LLANPD 157 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: May 9, 2012 2:26 PM Number of letters in database: 6,200,364,692 Number of sequences in database: 18,076,563 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 18076563 Number of Hits to DB: 167970995 Number of extensions: 2784139 Number of successful extensions: 7347 Number of sequences better than 1e-10: 0 Number of HSP's better than 1e-10 without gapping: 0 Number 
of HSP's gapped: 7343 Number of HSP's successfully gapped: 0 Length of query: 436 Length of database: 6200364692 Length adjustment: 109 Effective length of query: 327 Effective length of database: 4230019325 Effective search space: 152280695700 Effective search space used: 152280695700 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 171 (70.5 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. RID: UPE2YTBV01S Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_1307 Length=886 Score E Sequences producing significant alignments: (Bits) Value emb|CBI25361.3| unnamed protein product [Vitis vinifera] 307 4e-102 ref|XP_002280352.1| PREDICTED: macro domain-containing protei... 307 1e-101 ref|XP_002319871.1| predicted protein [Populus trichocarpa] >... 302 2e-100 ref|XP_003588477.1| Appr-1-p processing enzyme family protein... 
295 1e-96 gb|AAB87596.2| expressed protein [Arabidopsis thaliana] 293 2e-96 ALIGNMENTS >emb|CBI25361.3| unnamed protein product [Vitis vinifera] Length=190 Score = 307 bits (787), Expect = 4e-102 Identities = 148/183 (81%), Positives = 165/183 (90%), Gaps = 0/183 (0%) Frame = -3 Query 878 ELKLSPSSVLKIQKGDITGWSVDGSSDAIVNPANERMLGGGGADGAIHQAAGPELRAACY 699 +L LSP+S LKIQKGDIT W VDGSSDAIVNPANERMLGGGGADGAIH+AAGPEL AACY Sbjct 7 QLALSPTSSLKIQKGDITKWFVDGSSDAIVNPANERMLGGGGADGAIHRAAGPELVAACY 66 Query 698 KVLEVKPGVRCPTGEARITPGFRLPASHVIHTVGPIYDVDKNPKASLRNAYRNSLKVANE 519 KV EV+PG+RCPTGEARIT GF+LPA+HVIHTVGPIYDVD NP+ASL++AY N L +A E Sbjct 67 KVPEVRPGIRCPTGEARITQGFKLPAAHVIHTVGPIYDVDSNPEASLKSAYANCLSLAKE 126 Query 518 NNIQYIAFTAISCGVYGYPYEEAAMVAISTVKEFAGNIKEVHFVLFSDDIYDVWLKKARE 339 NN+QYIAF AISCGV+GYPY+EAA VAISTVKEF ++KEVHFVLFSDDIY+VWL KA E Sbjct 127 NNVQYIAFPAISCGVFGYPYDEAATVAISTVKEFGKDLKEVHFVLFSDDIYNVWLNKANE 186 Query 338 LLQ 330 LLQ Sbjct 187 LLQ 189 >ref|XP_002280352.1| PREDICTED: macro domain-containing protein VPA0103 [Vitis vinifera] emb|CAN70345.1| hypothetical protein VITISV_012577 [Vitis vinifera] Length=231 Score = 307 bits (787), Expect = 1e-101 Identities = 148/183 (81%), Positives = 165/183 (90%), Gaps = 0/183 (0%) Frame = -3 Query 878 ELKLSPSSVLKIQKGDITGWSVDGSSDAIVNPANERMLGGGGADGAIHQAAGPELRAACY 699 +L LSP+S LKIQKGDIT W VDGSSDAIVNPANERMLGGGGADGAIH+AAGPEL AACY Sbjct 48 QLALSPTSSLKIQKGDITKWFVDGSSDAIVNPANERMLGGGGADGAIHRAAGPELVAACY 107 Query 698 KVLEVKPGVRCPTGEARITPGFRLPASHVIHTVGPIYDVDKNPKASLRNAYRNSLKVANE 519 KV EV+PG+RCPTGEARIT GF+LPA+HVIHTVGPIYDVD NP+ASL++AY N L +A E Sbjct 108 KVPEVRPGIRCPTGEARITQGFKLPAAHVIHTVGPIYDVDSNPEASLKSAYANCLSLAKE 167 Query 518 NNIQYIAFTAISCGVYGYPYEEAAMVAISTVKEFAGNIKEVHFVLFSDDIYDVWLKKARE 339 NN+QYIAF AISCGV+GYPY+EAA VAISTVKEF ++KEVHFVLFSDDIY+VWL KA E Sbjct 168 NNVQYIAFPAISCGVFGYPYDEAATVAISTVKEFGKDLKEVHFVLFSDDIYNVWLNKANE 227 Query 338 LLQ 330 LLQ Sbjct 228 LLQ 230 >ref|XP_002319871.1| predicted protein [Populus trichocarpa] gb|EEE95794.1| predicted 
protein [Populus trichocarpa] Length=180 Score = 302 bits (774), Expect = 2e-100 Identities = 147/175 (84%), Positives = 161/175 (92%), Gaps = 0/175 (0%) Frame = -3 Query 854 VLKIQKGDITGWSVDGSSDAIVNPANERMLGGGGADGAIHQAAGPELRAACYKVLEVKPG 675 +LKI KGDIT WSVDGSSDAIVNPANERMLGGGGADGAIH+AAGP+LR ACY V EV+PG Sbjct 5 LLKISKGDITKWSVDGSSDAIVNPANERMLGGGGADGAIHRAAGPQLRDACYTVPEVRPG 64 Query 674 VRCPTGEARITPGFRLPASHVIHTVGPIYDVDKNPKASLRNAYRNSLKVANENNIQYIAF 495 VRCPTGEARITPGF LPA VIHTVGPIYDVD NP+ASLRNAYRNSL +A +NNI+YIAF Sbjct 65 VRCPTGEARITPGFNLPAFRVIHTVGPIYDVDGNPEASLRNAYRNSLILAKDNNIKYIAF 124 Query 494 TAISCGVYGYPYEEAAMVAISTVKEFAGNIKEVHFVLFSDDIYDVWLKKARELLQ 330 AISCGVYGYPYEEAA VAISTVKEFA ++KEVHFVLFSD+IY+VWL+KA+ELLQ Sbjct 125 PAISCGVYGYPYEEAAKVAISTVKEFADDLKEVHFVLFSDEIYNVWLEKAKELLQ 179 >ref|XP_003588477.1| Appr-1-p processing enzyme family protein [Medicago truncatula] gb|AES58728.1| Appr-1-p processing enzyme family protein [Medicago truncatula] Length=233 Score = 295 bits (755), Expect = 1e-96 Identities = 144/184 (78%), Positives = 157/184 (85%), Gaps = 0/184 (0%) Frame = -3 Query 881 VELKLSPSSVLKIQKGDITGWSVDGSSDAIVNPANERMLGGGGADGAIHQAAGPELRAAC 702 V LS S+ L IQKGDIT WS+DGS+DAIVNPANERMLGGGGADGAIH+AAGP+L AC Sbjct 49 VRFPLSSSNALIIQKGDITKWSIDGSTDAIVNPANERMLGGGGADGAIHRAAGPDLLRAC 108 Query 701 YKVLEVKPGVRCPTGEARITPGFRLPASHVIHTVGPIYDVDKNPKASLRNAYRNSLKVAN 522 V EV+PGVRCPTGEARITPGF LPASHVIHTVGPIYDVD NP ASL +AYRNSL+VA Sbjct 109 RNVPEVRPGVRCPTGEARITPGFLLPASHVIHTVGPIYDVDSNPAASLASAYRNSLRVAK 168 Query 521 ENNIQYIAFTAISCGVYGYPYEEAAMVAISTVKEFAGNIKEVHFVLFSDDIYDVWLKKAR 342 ENNIQYIAF AISCGVYGYPY+EAA VAIST+KEF + KEVHFVLF DIYD WL K+ Sbjct 169 ENNIQYIAFPAISCGVYGYPYDEAATVAISTIKEFQNDFKEVHFVLFMSDIYDTWLNKSD 228 Query 341 ELLQ 330 ELL+ Sbjct 229 ELLK 232 >gb|AAB87596.2| expressed protein [Arabidopsis thaliana] Length=193 Score = 293 bits (749), Expect = 2e-96 Identities = 138/180 (77%), Positives = 161/180 (89%), Gaps = 0/180 (0%) Frame = -3 Query 869 
LSPSSVLKIQKGDITGWSVDGSSDAIVNPANERMLGGGGADGAIHQAAGPELRAACYKVL 690 LS SS+LKI KGDIT WSVD SSDAIVNPANERMLGGGGADGAIH+AAGP+LRAACY+V Sbjct 12 LSDSSLLKILKGDITKWSVDSSSDAIVNPANERMLGGGGADGAIHRAAGPQLRAACYEVP 71 Query 689 EVKPGVRCPTGEARITPGFRLPASHVIHTVGPIYDVDKNPKASLRNAYRNSLKVANENNI 510 EV+PGVRCPTGEARITPGF LPAS VIHTVGPIYD D NP+ SL N+Y+NSL+VA ENNI Sbjct 72 EVRPGVRCPTGEARITPGFNLPASRVIHTVGPIYDSDVNPQESLTNSYKNSLRVAKENNI 131 Query 509 QYIAFTAISCGVYGYPYEEAAMVAISTVKEFAGNIKEVHFVLFSDDIYDVWLKKARELLQ 330 +YIAF AISCG+YGYP++EAA + IST+K+F+ + KEVHFVLF+DDI+ VW+ KA+E+LQ Sbjct 132 KYIAFPAISCGIYGYPFDEAAAIGISTIKQFSTDFKEVHFVLFADDIFSVWVNKAKEVLQ 191 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 443984567 Number of extensions: 9097265 Number of successful extensions: 23242 Number of sequences better than 1e-10: 94 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 23059 Number of HSP's successfully gapped: 94 Length of query: 886 Length of database: 6150218869 Length adjustment: 139 Effective length of query: 747 Effective length of database: 3659466193 Effective search space: 570876726108 Effective search space used: 570876726108 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 176 (72.4 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. 
RID: UPEF2CHP012 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 18,076,563 sequences; 6,200,364,692 total letters Query= TrVeIntMedtrGB1_13435 Length=408 Score E Sequences producing significant alignments: (Bits) Value ref|XP_002893532.1| hypothetical protein ARALYDRAFT_473065 [A... 109 4e-28 ref|XP_002262673.1| PREDICTED: glutaredoxin-C9-like [Vitis vi... 106 1e-26 emb|CAN60168.1| hypothetical protein VITISV_003664 [Vitis vin... 104 2e-26 ref|NP_174170.1| glutaredoxin-C9 [Arabidopsis thaliana] >sp|Q... 105 2e-26 ref|XP_002331955.1| glutaredoxin C9 [Populus trichocarpa] >gb... 105 5e-26 ALIGNMENTS >ref|XP_002893532.1| hypothetical protein ARALYDRAFT_473065 [Arabidopsis lyrata subsp. lyrata] gb|EFH69791.1| hypothetical protein ARALYDRAFT_473065 [Arabidopsis lyrata subsp. lyrata] Length=121 Score = 109 bits (273), Expect = 4e-28 Identities = 51/92 (55%), Positives = 68/92 (74%), Gaps = 0/92 (0%) Frame = -3 Query 397 KNVKILVAENAVIVFGRKGCCMCHVVKLLLQGHGVNPTILDVDEQNEPDVaallgiegla 218 ++V+++V ENAVIV GR+GCCMCHVV+ LL G GVNP +L+++E+ E +V L G Sbjct 20 ESVRMVVEENAVIVIGRRGCCMCHVVRRLLLGLGVNPAVLEIEEEREEEVLRELERIGGG 79 Query 217 aGVNFPAVFVGGELFGGIDEIMGAHITGELVP 122 V PAV+VGG LFGG+D +M HI+GELVP Sbjct 80 DTVKLPAVYVGGRLFGGLDRVMATHISGELVP 111 >ref|XP_002262673.1| PREDICTED: glutaredoxin-C9-like [Vitis vinifera] Length=141 Score = 106 bits (265), Expect = 1e-26 Identities = 54/95 (57%), Positives = 69/95 (73%), Gaps = 4/95 (4%) Frame = -3 Query 394 NVKILVAENAVIVFGRKGCCMCHVVKLLLQGHGVNPTILDVDEQNE----PDVaallgie 227 N+ +V+ENAVIVFGR+GCCM HVVK LL G GVNP + +V+E++E ++ + E Sbjct 37 NITNMVSENAVIVFGRRGCCMTHVVKRLLLGLGVNPAVCEVNEEDEIGVLDELGMVGAGE 96 Query 226 glaaGVNFPAVFVGGELFGGIDEIMGAHITGELVP 122 G V FPAVF+GG LFGG+D +M AHITGELVP Sbjct 97 GKQGAVQFPAVFIGGRLFGGLDRVMAAHITGELVP 131 >emb|CAN60168.1| hypothetical protein VITISV_003664 [Vitis vinifera] Length=101 Score = 104 bits (260), Expect = 2e-26 Identities = 53/91 (58%), 
Positives = 67/91 (74%), Gaps = 4/91 (4%) Frame = -3 Query 382 LVAENAVIVFGRKGCCMCHVVKLLLQGHGVNPTILDVDEQNE----PDVaallgieglaa 215 +V+ENAVIVFGR+GCCM HVVK LL G GVNP + +V+E++E ++ + EG Sbjct 1 MVSENAVIVFGRRGCCMTHVVKRLLLGLGVNPAVCEVNEEDEIGVLDELGMVGAGEGKQG 60 Query 214 GVNFPAVFVGGELFGGIDEIMGAHITGELVP 122 V FPAVF+GG LFGG+D +M AHITGELVP Sbjct 61 AVQFPAVFIGGRLFGGLDRVMAAHITGELVP 91 >ref|NP_174170.1| glutaredoxin-C9 [Arabidopsis thaliana] sp|Q9SGP6.1|GRXC9_ARATH RecName: Full=Glutaredoxin-C9; Short=AtGrxC9; AltName: Full=Protein ROXY 19 gb|AAF16751.1|AC010155_4 F3M18.8 [Arabidopsis thaliana] gb|AAG40382.1|AF325030_1 At1g28480 [Arabidopsis thaliana] dbj|BAF01232.1| hypothetical protein [Arabidopsis thaliana] gb|ABK32150.1| At1g28480 [Arabidopsis thaliana] gb|ACO50423.1| glutaredoxin [Arabidopsis thaliana] gb|AEE30981.1| glutaredoxin-C9 [Arabidopsis thaliana] Length=137 Score = 105 bits (262), Expect = 2e-26 Identities = 52/95 (55%), Positives = 70/95 (74%), Gaps = 3/95 (3%) Frame = -3 Query 397 KNVKILVAENAVIVFGRKGCCMCHVVKLLLQGHGVNPTILDVDEQNEPDV---aallgie 227 + V+++V ENAVIV GR+GCCMCHVV+ LL G GVNP +L++DE+ E +V +G++ Sbjct 33 ERVRMVVEENAVIVIGRRGCCMCHVVRRLLLGLGVNPAVLEIDEEREDEVLSELENIGVQ 92 Query 226 glaaGVNFPAVFVGGELFGGIDEIMGAHITGELVP 122 G V PAV+VGG LFGG+D +M HI+GELVP Sbjct 93 GGGGTVKLPAVYVGGRLFGGLDRVMATHISGELVP 127 >ref|XP_002331955.1| glutaredoxin C9 [Populus trichocarpa] gb|EEF11863.1| glutaredoxin C9 [Populus trichocarpa] Length=152 Score = 105 bits (261), Expect = 5e-26 Identities = 52/97 (54%), Positives = 65/97 (67%), Gaps = 6/97 (6%) Frame = -3 Query 394 NVKILVAENAVIVFGRKGCCMCHVVKLLLQGHGVNPTILDVDEQNEPDV------aallg 233 +V+ LV EN+VIVFG++GCCMCHVVK LL G GVNP + +V+E+ E DV Sbjct 46 HVQKLVLENSVIVFGKRGCCMCHVVKRLLLGLGVNPPVFEVEEKEEDDVIKELSMIDSDR 105 Query 232 ieglaaGVNFPAVFVGGELFGGIDEIMGAHITGELVP 122 V FP VFVGG+LFGG++ +M HITGELVP Sbjct 106 GGEGVDQVQFPVVFVGGKLFGGLERVMATHITGELVP 142 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples 
from WGS projects Posted date: May 9, 2012 2:26 PM Number of letters in database: 6,200,364,692 Number of sequences in database: 18,076,563 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 18076563 Number of Hits to DB: 173621065 Number of extensions: 3259378 Number of successful extensions: 8445 Number of sequences better than 1e-10: 0 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 8439 Number of HSP's successfully gapped: 0 Length of query: 408 Length of database: 6200364692 Length adjustment: 101 Effective length of query: 307 Effective length of database: 4374631829 Effective search space: 153112114015 Effective search space used: 153112114015 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 171 (70.5 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362 RID: UPE2Z4R101N Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_1393 Length=876 Score E Sequences producing significant alignments: (Bits) Value ref|XP_002518452.1| peroxisomal membrane protein 2, pxmp2, pu... 238 4e-75 ref|XP_002316547.1| predicted protein [Populus trichocarpa] >... 234 1e-73 ref|NP_001237804.1| uncharacterized protein LOC100499909 [Gly... 233 3e-73 ref|XP_003534581.1| PREDICTED: peroxisomal membrane protein P... 
229 8e-72 gb|ACU19135.1| unknown [Glycine max] 222 6e-69 ALIGNMENTS >ref|XP_002518452.1| peroxisomal membrane protein 2, pxmp2, putative [Ricinus communis] gb|EEF43839.1| peroxisomal membrane protein 2, pxmp2, putative [Ricinus communis] Length=185 Score = 238 bits (607), Expect = 4e-75 Identities = 147/185 (79%), Positives = 166/185 (90%), Gaps = 0/185 (0%) Frame = -3 Query 841 MGSVAkkglqqylgqlqKHPLRTKVLTAGVLSAISDIVAQKLSGIqklqlkrlllKVIFG 662 MGSVAKKGLQ YL QLQ HPLRTK +TAG LSA+SDI+AQK+SGIQKLQL+RLLLKV+FG Sbjct 1 MGSVAKKGLQLYLLQLQHHPLRTKAITAGFLSAVSDIIAQKISGIQKLQLRRLLLKVLFG 60 Query 661 AAYLGPFGHFYHMMLDKIFkgkkdtktvakkvVFEQLTSSPLNNLLFMVYYGFVIEGRPW 482 +AYLGPFGHF H++LDKIFKGKKDTKTVAKKVV EQLTSSP NN+LFM+YYG ++E RPW Sbjct 61 SAYLGPFGHFLHIILDKIFKGKKDTKTVAKKVVVEQLTSSPWNNMLFMIYYGVIVERRPW 120 Query 481 IHVKSKIKKEYPAVQFTAWSFWPVVGWVNHQYVPLQFRVIVQSVVACFWAIFLNLRARSM 302 +HVK++IKKEYP VQ T+W+FWPVVGW+NHQYVPLQ RVI VVACFW IFLNLRARSM Sbjct 121 MHVKARIKKEYPKVQLTSWTFWPVVGWINHQYVPLQLRVIFHMVVACFWGIFLNLRARSM 180 Query 301 TLTKG 287 L KG Sbjct 181 ALAKG 185 >ref|XP_002316547.1| predicted protein [Populus trichocarpa] gb|EEE97159.1| predicted protein [Populus trichocarpa] Length=185 Score = 234 bits (597), Expect = 1e-73 Identities = 144/185 (78%), Positives = 167/185 (90%), Gaps = 0/185 (0%) Frame = -3 Query 841 MGSVAkkglqqylgqlqKHPLRTKVLTAGVLSAISDIVAQKLSGIqklqlkrlllKVIFG 662 MGSVAKKGLQQY+ QLQ+HPLRTK +TAGVLSA+SDIV+QKLSGIQKLQ+KR+LLKV+FG Sbjct 1 MGSVAKKGLQQYMLQLQQHPLRTKAITAGVLSALSDIVSQKLSGIQKLQIKRILLKVLFG 60 Query 661 AAYLGPFGHFYHMMLDKIFkgkkdtktvakkvVFEQLTSSPLNNLLFMVYYGFVIEGRPW 482 YLGPFGH+ H++LDK+FKGKKDT TVAKKV EQLT+SP NNL+FMVYYG VI+GRPW Sbjct 61 FGYLGPFGHYLHILLDKLFKGKKDTTTVAKKVAVEQLTASPWNNLVFMVYYGMVIDGRPW 120 Query 481 IHVKSKIKKEYPAVQFTAWSFWPVVGWVNHQYVPLQFRVIVQSVVACFWAIFLNLRARSM 302 + VK+K+KKEYPAVQFT+W+FWPVVGWVNHQY+P QFRVI S++A W IFLNLRARSM Sbjct 121 LQVKTKLKKEYPAVQFTSWTFWPVVGWVNHQYIPQQFRVIFHSLIAVGWGIFLNLRARSM 180 Query 301 TLTKG 287 LTKG Sbjct 181 ALTKG 185 >ref|NP_001237804.1| uncharacterized 
protein LOC100499909 [Glycine max] gb|ACU14136.1| unknown [Glycine max] Length=185 Score = 233 bits (595), Expect = 3e-73 Identities = 137/184 (74%), Positives = 170/184 (92%), Gaps = 0/184 (0%) Frame = -3 Query 841 MGSVAkkglqqylgqlqKHPLRTKVLTAGVLSAISDIVAQKLSGIqklqlkrlllKVIFG 662 MGS+AKKGL Y+ QLQ+HPLRTKV+TAGVLSAISD+V+QKL+GIQKLQLKRLL KVIFG Sbjct 1 MGSLAKKGLNNYVKQLQQHPLRTKVITAGVLSAISDVVSQKLTGIQKLQLKRLLFKVIFG 60 Query 661 AAYLGPFGHFYHMMLDKIFkgkkdtktvakkvVFEQLTSSPLNNLLFMVYYGFVIEGRPW 482 AAYLGPFGHF+H++LDKIFKGK+D+KTVAKKV+ EQLTS+P NNLLFM+YYG V+EG+PW Sbjct 61 AAYLGPFGHFFHLILDKIFKGKRDSKTVAKKVLIEQLTSNPWNNLLFMIYYGLVVEGQPW 120 Query 481 IHVKSKIKKEYPAVQFTAWSFWPVVGWVNHQYVPLQFRVIVQSVVACFWAIFLNLRARSM 302 ++VK+K+KK+YP+VQ+T+W+ WPVVGW+NH+++PL FRV+ QS+VA FW +FLNLRARSM Sbjct 121 VNVKAKVKKDYPSVQYTSWTVWPVVGWINHKFMPLHFRVVFQSLVAFFWGVFLNLRARSM 180 Query 301 TLTK 290 L K Sbjct 181 ALIK 184 >ref|XP_003534581.1| PREDICTED: peroxisomal membrane protein PMP22-like [Glycine max] Length=185 Score = 229 bits (585), Expect = 8e-72 Identities = 135/184 (73%), Positives = 169/184 (92%), Gaps = 0/184 (0%) Frame = -3 Query 841 MGSVAkkglqqylgqlqKHPLRTKVLTAGVLSAISDIVAQKLSGIqklqlkrlllKVIFG 662 MGS+AKKGL Y+ QLQ+HPLRTKV+TAGVLSAISD+V+QKL+GIQK+QLKRLL KVIFG Sbjct 1 MGSLAKKGLNNYVKQLQQHPLRTKVITAGVLSAISDVVSQKLTGIQKIQLKRLLFKVIFG 60 Query 661 AAYLGPFGHFYHMMLDKIFkgkkdtktvakkvVFEQLTSSPLNNLLFMVYYGFVIEGRPW 482 AAYLGPFGHF+H++LDKIFKGK+D+KTVAKKV+ EQLTS+P NNLLFM+YYG V+EG+PW Sbjct 61 AAYLGPFGHFFHLILDKIFKGKRDSKTVAKKVLIEQLTSNPWNNLLFMIYYGLVVEGQPW 120 Query 481 IHVKSKIKKEYPAVQFTAWSFWPVVGWVNHQYVPLQFRVIVQSVVACFWAIFLNLRARSM 302 ++VK+K+KK+Y +VQ+T+W+ WPVVGW+NH+++PL FRV+ QS+VA FW +FLNLRARSM Sbjct 121 VNVKAKVKKDYLSVQYTSWTVWPVVGWINHKFMPLHFRVVFQSLVAFFWGVFLNLRARSM 180 Query 301 TLTK 290 L K Sbjct 181 ALIK 184 >gb|ACU19135.1| unknown [Glycine max] Length=185 Score = 222 bits (566), Expect = 6e-69 Identities = 132/184 (72%), Positives = 166/184 (90%), Gaps = 0/184 (0%) Frame = -3 Query 841 
MGSVAkkglqqylgqlqKHPLRTKVLTAGVLSAISDIVAQKLSGIqklqlkrlllKVIFG 662 MGS+AKKGL Y+ QLQ+HPLRTKV+TAGVLSAISD+V+QKL+GIQK+QLKRLL KVIFG Sbjct 1 MGSLAKKGLNNYVKQLQQHPLRTKVITAGVLSAISDVVSQKLTGIQKIQLKRLLFKVIFG 60 Query 661 AAYLGPFGHFYHMMLDKIFkgkkdtktvakkvVFEQLTSSPLNNLLFMVYYGFVIEGRPW 482 AAY GPFGH +H++LDKIFKGK+D+KTVAKKV+ EQLTS+P NNLLFM+YYG V+EG+PW Sbjct 61 AAYPGPFGHLFHLILDKIFKGKRDSKTVAKKVLIEQLTSNPWNNLLFMIYYGLVVEGQPW 120 Query 481 IHVKSKIKKEYPAVQFTAWSFWPVVGWVNHQYVPLQFRVIVQSVVACFWAIFLNLRARSM 302 ++VK+K+KK+Y +VQ+T+W+ WPVVGW+NH+++PL FRV+ QS+VA FW +FLNLRAR M Sbjct 121 VNVKAKVKKDYLSVQYTSWTVWPVVGWINHKFMPLHFRVVFQSLVAFFWGVFLNLRARFM 180 Query 301 TLTK 290 L K Sbjct 181 ALIK 184 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 1,855,251,573 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 438472506 Number of extensions: 9096450 Number of successful extensions: 17563 Number of sequences better than 1e-10: 2 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 17537 Number of HSP's successfully gapped: 2 Length of query: 876 Length of database: 6150218869 Length adjustment: 139 Effective length of query: 737 Effective length of database: 3659466193 Effective search space: 559898327529 Effective search space used: 559898327529 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 176 (72.4 bits)A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. 
RID: UPDZ3HTU016 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_147 Length=1254 Score E Sequences producing significant alignments: (Bits) Value ref|XP_003606383.1| Aquaporin PIP2-7 [Medicago truncatula] >g... 404 9e-137 emb|CAB61749.1| putative water channel protein [Cicer arietinum] 393 4e-133 emb|CAB45651.1| putative plasma membrane intrinsic protein [P... 392 6e-132 ref|NP_001105639.1| aquaporin PIP2-7 [Zea mays] >sp|Q9ATM4.1|... 390 3e-131 ref|NP_001237286.1| PIP2,2 [Glycine max] >gb|AAX86046.1| PIP2... 389 1e-130 ALIGNMENTS >ref|XP_003606383.1| Aquaporin PIP2-7 [Medicago truncatula] gb|AES88580.1| Aquaporin PIP2-7 [Medicago truncatula] Length=285 Score = 404 bits (1038), Expect = 9e-137 Identities = 197/203 (97%), Positives = 200/203 (99%), Gaps = 0/203 (0%) Frame = -3 Query 1252 IAWAFGGMIFILVYCTAGISGGHINPAVTFGLFVGRKVSLIRAVFYMAAQSAGAICGTGL 1073 IAWAFGGMIFILVYCTAGISGGHINPAVTFGLFVGRKVSLIRAVFYMAAQSAGAICGTGL Sbjct 83 IAWAFGGMIFILVYCTAGISGGHINPAVTFGLFVGRKVSLIRAVFYMAAQSAGAICGTGL 142 Query 1072 AKGFQKAYFDRYGGGANFVHDGYNKGTALGAEIIGTFVLVYTVFSATDPKRNARDSHVPV 893 AKGFQKAYFDRYGGGANFVHDGYNKGTALGAEIIGTFVLVYTVFSATDPKRNARDSHVPV Sbjct 143 AKGFQKAYFDRYGGGANFVHDGYNKGTALGAEIIGTFVLVYTVFSATDPKRNARDSHVPV 202 Query 892 LAPLPIGFAVFMVHLATIPVTGTGINPARSFGPAVIFNNEKAWDDQWIYWVGPFIGAAVA 713 LAPLPIGFAVFMVHLATIPVTGTGINPARSFG AVI+N+ K WDDQWI+WVGPFIGAAVA Sbjct 203 LAPLPIGFAVFMVHLATIPVTGTGINPARSFGSAVIYNDGKIWDDQWIFWVGPFIGAAVA 262 Query 712 AIYHQYILRGSAIKALGSFRSNA 644 AIYHQYILRGSAIKALGSFRSNA Sbjct 263 AIYHQYILRGSAIKALGSFRSNA 285 >emb|CAB61749.1| putative water channel protein [Cicer arietinum] Length=237 Score = 393 bits (1009), Expect = 4e-133 Identities = 191/203 (94%), Positives = 195/203 (96%), Gaps = 0/203 (0%) Frame = -3 Query 1252 IAWAFGGMIFILVYCTAGISGGHINPAVTFGLFVGRKVSLIRAVFYMAAQSAGAICGTGL 1073 
IAWAFGGMIFILVYCTAGISGGHINPAVTFGLFVGRKVSL+RAVFYMAAQ AGAI GTGL Sbjct 35 IAWAFGGMIFILVYCTAGISGGHINPAVTFGLFVGRKVSLLRAVFYMAAQCAGAISGTGL 94 Query 1072 AKGFQKAYFDRYGGGANFVHDGYNKGTALGAEIIGTFVLVYTVFSATDPKRNARDSHVPV 893 AKGFQKAYFDRYGGGANFVHDGYNKGTALGAEIIGTFVLVYTVFSATDPKRNARDSHVPV Sbjct 95 AKGFQKAYFDRYGGGANFVHDGYNKGTALGAEIIGTFVLVYTVFSATDPKRNARDSHVPV 154 Query 892 LAPLPIGFAVFMVHLATIPVTGTGINPARSFGPAVIFNNEKAWDDQWIYWVGPFIGAAVA 713 LAPLPIGFAVFMVHLATIP+TGTGINPARSFG AVI+N K WDDQWI+WVGP IGA VA Sbjct 155 LAPLPIGFAVFMVHLATIPITGTGINPARSFGSAVIYNEGKIWDDQWIFWVGPIIGATVA 214 Query 712 AIYHQYILRGSAIKALGSFRSNA 644 AIYHQYILRGSAIKALGSFRSNA Sbjct 215 AIYHQYILRGSAIKALGSFRSNA 237 >emb|CAB45651.1| putative plasma membrane intrinsic protein [Pisum sativum] Length=285 Score = 392 bits (1006), Expect = 6e-132 Identities = 187/203 (92%), Positives = 195/203 (96%), Gaps = 0/203 (0%) Frame = -3 Query 1252 IAWAFGGMIFILVYCTAGISGGHINPAVTFGLFVGRKVSLIRAVFYMAAQSAGAICGTGL 1073 IAWAFGGMIFILVYCTAGISGGHINPAVTFGLFVGRKVSL+RAVFYMAAQ AGA+CGTGL Sbjct 83 IAWAFGGMIFILVYCTAGISGGHINPAVTFGLFVGRKVSLLRAVFYMAAQCAGAVCGTGL 142 Query 1072 AKGFQKAYFDRYGGGANFVHDGYNKGTALGAEIIGTFVLVYTVFSATDPKRNARDSHVPV 893 AKGFQK++FDRYGGGANF+HDGYNKGTALGAEIIGTFVLVYTVFSATDPKRNARDSHVPV Sbjct 143 AKGFQKSFFDRYGGGANFIHDGYNKGTALGAEIIGTFVLVYTVFSATDPKRNARDSHVPV 202 Query 892 LAPLPIGFAVFMVHLATIPVTGTGINPARSFGPAVIFNNEKAWDDQWIYWVGPFIGAAVA 713 LAPLPIGFAVFMVHLATIP+TGTGINPARSFG AVI N K WDDQW++WVGP IGA VA Sbjct 203 LAPLPIGFAVFMVHLATIPITGTGINPARSFGSAVILNQGKIWDDQWVFWVGPIIGATVA 262 Query 712 AIYHQYILRGSAIKALGSFRSNA 644 AIYHQYILRGSAIKALGSFRSNA Sbjct 263 AIYHQYILRGSAIKALGSFRSNA 285 >ref|NP_001105639.1| aquaporin PIP2-7 [Zea mays] sp|Q9ATM4.1|PIP27_MAIZE RecName: Full=Aquaporin PIP2-7; AltName: Full=Plasma membrane intrinsic protein 2-7; AltName: Full=ZmPIP2-7; AltName: Full=ZmPIP2;7 gb|AAK26763.1| plasma membrane integral protein ZmPIP2-7 [Zea mays] gb|ACU23788.1| unknown [Glycine max] Length=287 Score = 390 bits (1002), Expect = 3e-131 Identities = 187/203 (92%), 
Positives = 196/203 (97%), Gaps = 0/203 (0%) Frame = -3 Query 1252 IAWAFGGMIFILVYCTAGISGGHINPAVTFGLFVGRKVSLIRAVFYMAAQSAGAICGTGL 1073 IAWAFGGMIFILVYCTAGISGGHINPAVTFGLF+GRKVSL+RA+ YM AQ AGAICG GL Sbjct 85 IAWAFGGMIFILVYCTAGISGGHINPAVTFGLFLGRKVSLVRALLYMIAQCAGAICGAGL 144 Query 1072 AKGFQKAYFDRYGGGANFVHDGYNKGTALGAEIIGTFVLVYTVFSATDPKRNARDSHVPV 893 AKGFQK++++RYGGG N V DGYNKGTALGAEIIGTFVLVYTVFSATDPKRNARDSHVPV Sbjct 145 AKGFQKSFYNRYGGGVNTVSDGYNKGTALGAEIIGTFVLVYTVFSATDPKRNARDSHVPV 204 Query 892 LAPLPIGFAVFMVHLATIPVTGTGINPARSFGPAVIFNNEKAWDDQWIYWVGPFIGAAVA 713 LAPLPIGFAVFMVHLATIPVTGTGINPARSFGPAVIFNN+KAWDDQWIYWVGPF+GAAVA Sbjct 205 LAPLPIGFAVFMVHLATIPVTGTGINPARSFGPAVIFNNDKAWDDQWIYWVGPFVGAAVA 264 Query 712 AIYHQYILRGSAIKALGSFRSNA 644 AIYHQYILRGSAIKALGSFRSNA Sbjct 265 AIYHQYILRGSAIKALGSFRSNA 287 >ref|NP_001237286.1| PIP2,2 [Glycine max] gb|AAX86046.1| PIP2,2 [Glycine max] Length=287 Score = 389 bits (998), Expect = 1e-130 Identities = 186/203 (92%), Positives = 195/203 (96%), Gaps = 0/203 (0%) Frame = -3 Query 1252 IAWAFGGMIFILVYCTAGISGGHINPAVTFGLFVGRKVSLIRAVFYMAAQSAGAICGTGL 1073 IAWAFGGMIFILVYCTAGISGGHINPAVTFGLF+GRKVSL+RA+ YM AQ AGAICG GL Sbjct 85 IAWAFGGMIFILVYCTAGISGGHINPAVTFGLFLGRKVSLVRALLYMIAQCAGAICGAGL 144 Query 1072 AKGFQKAYFDRYGGGANFVHDGYNKGTALGAEIIGTFVLVYTVFSATDPKRNARDSHVPV 893 AKGFQK++++RYGGG N V DGYNKGTALGAEIIGTFVLVYTVFSATDPKRNARDSHVPV Sbjct 145 AKGFQKSFYNRYGGGVNTVSDGYNKGTALGAEIIGTFVLVYTVFSATDPKRNARDSHVPV 204 Query 892 LAPLPIGFAVFMVHLATIPVTGTGINPARSFGPAVIFNNEKAWDDQWIYWVGPFIGAAVA 713 LAPLPIGFAVFM HLATIPVTGTGINPARSFGPAVIFNN+KAWDDQWIYWVGPF+GAAVA Sbjct 205 LAPLPIGFAVFMAHLATIPVTGTGINPARSFGPAVIFNNDKAWDDQWIYWVGPFVGAAVA 264 Query 712 AIYHQYILRGSAIKALGSFRSNA 644 AIYHQYILRGSAIKALGSFRSNA Sbjct 265 AIYHQYILRGSAIKALGSFRSNA 287 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 
0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 704560993 Number of extensions: 16073247 Number of successful extensions: 44958 Number of sequences better than 1e-10: 350 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 44043 Number of HSP's successfully gapped: 352 Length of query: 1254 Length of database: 6150218869 Length adjustment: 143 Effective length of query: 1111 Effective length of database: 3587789857 Effective search space: 986642210675 Effective search space used: 986642210675 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 178 (73.2 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362 RID: UPEF4CNU01S Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 18,076,563 sequences; 6,200,364,692 total letters Query= TrVeIntMedtrGB1_15159 Length=369 Score E Sequences producing significant alignments: (Bits) Value ref|NP_001051179.1| Os03g0734300 [Oryza sativa Japonica Group... 81.3 9e-18 ref|XP_002263203.1| PREDICTED: uncharacterized protein LOC100... 70.1 1e-13 ref|NP_001105983.1| LOC100037813 precursor [Zea mays] >gb|ABN... 64.7 1e-11 ref|XP_003566353.1| PREDICTED: uncharacterized protein LOC100... 63.9 3e-11 ref|XP_002521828.1| serine-type endopeptidase inhibitor, puta... 
63.5 4e-11 ALIGNMENTS >ref|NP_001051179.1| Os03g0734300 [Oryza sativa Japonica Group] gb|AAT78791.1| putative roteinase inhibitor [Oryza sativa Japonica Group] gb|ABF98725.1| type II proteinase inhibitor family protein, expressed [Oryza sativa Japonica Group] dbj|BAF13093.1| Os03g0734300 [Oryza sativa Japonica Group] gb|EAY91770.1| hypothetical protein OsI_13414 [Oryza sativa Indica Group] gb|EAZ28490.1| hypothetical protein OsJ_12473 [Oryza sativa Japonica Group] dbj|BAG97220.1| unnamed protein product [Oryza sativa Japonica Group] Length=82 Score = 81.3 bits (199), Expect = 9e-18 Identities = 38/81 (47%), Positives = 56/81 (69%), Gaps = 5/81 (6%) Frame = -3 Query 367 ALKVGTILLVFVCGAILSGGNLKNVDAQ--KICPQFCYDSVAYMTCPSSGDQHLTPKCNC 194 ++K+ + + +CG ++ G ++++ +AQ K CPQFCYD + YMTCPS+G QHL P CNC Sbjct 3 SIKLALPMALLLCGLMVIG-SIQSAEAQGGKFCPQFCYDGLEYMTCPSTGSQHLKPACNC 61 Query 193 CLA-STGCILYEADGTPI-CT 137 C+A GC+LY +G I CT Sbjct 62 CIAGEKGCVLYLNNGQVINCT 82 >ref|XP_002263203.1| PREDICTED: uncharacterized protein LOC100267991 [Vitis vinifera] emb|CBI29703.3| unnamed protein product [Vitis vinifera] Length=78 Score = 70.1 bits (170), Expect = 1e-13 Identities = 34/74 (46%), Positives = 45/74 (61%), Gaps = 2/74 (3%) Frame = -3 Query 364 LKVGTILLVFVCGAILSGGNLKNVDAQKICPQFCYDSVAYMTCPSSGDQHLTPKCNCCLA 185 +K +L+ VCG +L G K+ A K CP +C D V YMTC SSG++ LT CNCCLA Sbjct 4 MKFSVFVLLLVCGVVLLGETSKSFGA-KACPLYCLD-VDYMTCVSSGEEKLTAPCNCCLA 61 Query 184 STGCILYEADGTPI 143 C L+ DG+ + Sbjct 62 PKQCTLHLVDGSEV 75 >ref|NP_001105983.1| LOC100037813 precursor [Zea mays] gb|ABN54444.1| putative serine type endopeptidase inhibitor [Zea mays] Length=78 Score = 64.7 bits (156), Expect = 1e-11 Identities = 29/64 (45%), Positives = 40/64 (63%), Gaps = 3/64 (5%) Frame = -3 Query 340 VFVCGAILSGGNLKNVDAQKICPQFCYDSVAYMTCPSSGDQHLTPKCNCCLASTGCILYE 161 + + G +L G + +D CPQFC D V Y+TCPSSG + L +CNCC+ GC L+ Sbjct 13 LLLIGVVLLGQ--QGIDGAVACPQFCLD-VDYVTCPSSGSEKLPARCNCCMTPKGCTLHL 69 Query 160 ADGT 149 +DGT Sbjct 70 SDGT 73 
>ref|XP_003566353.1| PREDICTED: uncharacterized protein LOC100841120 [Brachypodium distachyon] Length=79 Score = 63.9 bits (154), Expect = 3e-11 Identities = 32/70 (46%), Positives = 43/70 (61%), Gaps = 3/70 (4%) Frame = -3 Query 358 VGTILLVFVCGAILSGGNLKNVDAQKICPQFCYDSVAYMTCPSSGDQHLTPKCNCCLAST 179 + LL+F GA+L G + K CPQ+C + V Y TCPSSG + L +CNCC+A Sbjct 8 IACALLLF--GAVLLGQDGKAGMEAVACPQYCLE-VEYTTCPSSGSEKLPARCNCCMAPK 64 Query 178 GCILYEADGT 149 GC L+ +DGT Sbjct 65 GCTLHLSDGT 74 >ref|XP_002521828.1| serine-type endopeptidase inhibitor, putative [Ricinus communis] gb|EEF40638.1| serine-type endopeptidase inhibitor, putative [Ricinus communis] Length=75 Score = 63.5 bits (153), Expect = 4e-11 Identities = 29/60 (48%), Positives = 36/60 (60%), Gaps = 1/60 (2%) Frame = -3 Query 322 ILSGGNLKNVDAQKICPQFCYDSVAYMTCPSSGDQHLTPKCNCCLASTGCILYEADGTPI 143 +L G + K CP +C D V YMTC SSGD+ L P CNCCLA C L+ +DGT + Sbjct 12 LLYGAISLQATSGKACPLYCLD-VEYMTCQSSGDEKLNPSCNCCLAPKNCTLHLSDGTSL 70 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 186386003 Number of extensions: 3776373 Number of successful extensions: 6771 Number of sequences better than 1e-10: 0 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 6759 Number of HSP's successfully gapped: 0 Length of query: 369 Length of database: 6150218869 Length adjustment: 89 Effective length of query: 280 Effective length of database: 4555420393 Effective search space: 154884293362 Effective search space used: 154884293362 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 171 (70.5 bits) ka-blk-alpha gapped: 1.9 
ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. RID: UPE32YJ101N Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_1550 Length=859 Score E Sequences producing significant alignments: (Bits) Value ref|XP_003546558.1| PREDICTED: CASP-like protein 9-like [Glyc... 271 6e-88 ref|XP_003535007.1| PREDICTED: LOW QUALITY PROTEIN: CASP-like... 256 4e-82 ref|NP_001241476.1| CASP-like protein 9 [Glycine max] >sp|C6T... 233 6e-73 ref|XP_003531310.1| PREDICTED: CASP-like protein 9-like [Glyc... 230 6e-72 ref|XP_003630011.1| hypothetical protein MTR_8g089300 [Medica... 229 9e-72 ALIGNMENTS >ref|XP_003546558.1| PREDICTED: CASP-like protein 9-like [Glycine max] Length=201 Score = 271 bits (693), Expect = 6e-88 Identities = 131/155 (85%), Positives = 145/155 (94%), Gaps = 0/155 (0%) Frame = +1 Query 40 LVMALNKQTKSFVIGTVGNTPLTATLSAKFNQTPAFVFFVIANGNASLHNLVMIALDILG 219 LVM NKQTKSFV+ TVG+TP+TATL+AKFNQTPAFVFFVIANGNASLHNLVMI +++LG Sbjct 45 LVMTFNKQTKSFVVATVGSTPITATLAAKFNQTPAFVFFVIANGNASLHNLVMIVMEVLG 104 Query 220 PQYDYKGLRLALIAILDMLTMALASAGDGAATFMSALGRNGNSHARWDKICDKFESYCNR 399 P+YDYKGLRLALIAILDM+TMALASAGDGAATFMS LG+NGNSHARWDKICDKFE+YCNR Sbjct 105 PRYDYKGLRLALIAILDMMTMALASAGDGAATFMSELGKNGNSHARWDKICDKFETYCNR 164 Query 400 GGGALIASFIGFILLLIITVMSISKLLKPNRINHA 504 GG ALI SF+GFILL II+VMS+ KLLK NRINHA Sbjct 165 GGAALIVSFVGFILLFIISVMSVIKLLKSNRINHA 199 >ref|XP_003535007.1| PREDICTED: LOW QUALITY PROTEIN: CASP-like protein 9-like [Glycine max] Length=207 Score = 256 bits (655), Expect = 4e-82 Identities = 129/156 (83%), Positives = 144/156 (92%), Gaps = 1/156 (1%) Frame = +1 Query 40 
LVMALNKQTKSFVIGTVGNTPLTATLSAKFNQTPAFVFFVIANGNASLHN-LVMIALDIL 216 LVMA NKQTKSFV+ TVG+TP+TAT +AKFNQTPAFVFFVIANGNA+LHN LVMIA++IL Sbjct 50 LVMAFNKQTKSFVVATVGSTPITATFAAKFNQTPAFVFFVIANGNAALHNNLVMIAMEIL 109 Query 217 GPQYDYKGLRLALIAILDMLTMALASAGDGAATFMSALGRNGNSHARWDKICDKFESYCN 396 G +YDYKG RLALIAILDM+TMALAS GDGAATFMS LG+NGNSHA+WDKICDKFE+YC+ Sbjct 110 GTRYDYKGPRLALIAILDMMTMALASDGDGAATFMSELGKNGNSHAKWDKICDKFETYCD 169 Query 397 RGGGALIASFIGFILLLIITVMSISKLLKPNRINHA 504 RG ALI SF+GFILLLII+VMSI KLLKPNRINHA Sbjct 170 RGVVALIVSFVGFILLLIISVMSIIKLLKPNRINHA 205 >ref|NP_001241476.1| CASP-like protein 9 [Glycine max] sp|C6TG62.1|CSPL9_SOYBN RecName: Full=CASP-like protein 9 gb|ACU20814.1| unknown [Glycine max] Length=194 Score = 233 bits (593), Expect = 6e-73 Identities = 111/152 (73%), Positives = 131/152 (86%), Gaps = 0/152 (0%) Frame = +1 Query 40 LVMALNKQTKSFVIGTVGNTPLTATLSAKFNQTPAFVFFVIANGNASLHNLVMIALDILG 219 LVMA NKQTK V+ T+G P+T TL+A F TPAF+FFVI N AS +NL++I ++ILG Sbjct 43 LVMAFNKQTKGMVVATIGTNPVTITLTAMFQHTPAFIFFVIVNAIASFYNLLVIGVEILG 102 Query 220 PQYDYKGLRLALIAILDMLTMALASAGDGAATFMSALGRNGNSHARWDKICDKFESYCNR 399 PQYDYKGLRL LIAILD++TMALA+ GDGAATFM+ LGRNGNSHARWDKICDKFE+YCNR Sbjct 103 PQYDYKGLRLGLIAILDVMTMALAATGDGAATFMAELGRNGNSHARWDKICDKFEAYCNR 162 Query 400 GGGALIASFIGFILLLIITVMSISKLLKPNRI 495 GG AL+ASF+G ILLL++TVMSI+KLLK NRI Sbjct 163 GGVALVASFVGLILLLVVTVMSITKLLKLNRI 194 >ref|XP_003531310.1| PREDICTED: CASP-like protein 9-like [Glycine max] Length=194 Score = 230 bits (586), Expect = 6e-72 Identities = 111/152 (73%), Positives = 130/152 (86%), Gaps = 0/152 (0%) Frame = +1 Query 40 LVMALNKQTKSFVIGTVGNTPLTATLSAKFNQTPAFVFFVIANGNASLHNLVMIALDILG 219 LVMA NKQTKS V+ T+G P+T TL+A F TPAF FFVI N AS +N+V+I ++ILG Sbjct 43 LVMAFNKQTKSMVVATIGTNPVTITLTAMFQHTPAFTFFVIVNAIASFYNMVVIGVEILG 102 Query 220 PQYDYKGLRLALIAILDMLTMALASAGDGAATFMSALGRNGNSHARWDKICDKFESYCNR 399 PQYDYK LRL LIAILD++TMALA+ GDGAATFM+ LGRNGNSHARWDKICDKFE+YCNR Sbjct 103 PQYDYKELRLGLIAILDVMTMALAATGDGAATFMAELGRNGNSHARWDKICDKFEAYCNR 162 
Query 400 GGGALIASFIGFILLLIITVMSISKLLKPNRI 495 GG ALIASF+G ILLL++TVMSI+K+LK NRI Sbjct 163 GGVALIASFVGLILLLVVTVMSITKMLKLNRI 194 >ref|XP_003630011.1| hypothetical protein MTR_8g089300 [Medicago truncatula] gb|AET04487.1| hypothetical protein MTR_8g089300 [Medicago truncatula] Length=194 Score = 229 bits (585), Expect = 9e-72 Identities = 112/151 (74%), Positives = 133/151 (88%), Gaps = 0/151 (0%) Frame = +1 Query 40 LVMALNKQTKSFVIGTVGNTPLTATLSAKFNQTPAFVFFVIANGNASLHNLVMIALDILG 219 LVM+LNKQTK+FV+ T+G+TP+T L+AKF TPAFV+FV+ NG SLHNLVMIA+ ILG Sbjct 43 LVMSLNKQTKTFVVATIGSTPITVPLTAKFQHTPAFVYFVVPNGIVSLHNLVMIAMYILG 102 Query 220 PQYDYKGLRLALIAILDMLTMALASAGDGAATFMSALGRNGNSHARWDKICDKFESYCNR 399 P++ KGL+LALIA+ D + +ALAS+GDGAAT MS LGRNGNSHA+W+KICDKFESYCNR Sbjct 103 PKFHNKGLQLALIAVFDTMALALASSGDGAATAMSELGRNGNSHAKWNKICDKFESYCNR 162 Query 400 GGGALIASFIGFILLLIITVMSISKLLKPNR 492 GGG+LIASFIG ILLLIITVMSI+KLLK NR Sbjct 163 GGGSLIASFIGLILLLIITVMSINKLLKLNR 193 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 404990501 Number of extensions: 7972247 Number of successful extensions: 19031 Number of sequences better than 1e-10: 2 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 19025 Number of HSP's successfully gapped: 2 Length of query: 859 Length of database: 6150218869 Length adjustment: 139 Effective length of query: 720 Effective length of database: 3659466193 Effective search space: 537941530371 Effective search space used: 537941530371 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 176 (72.4 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 
0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. RID: UPEF6YZM013 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 18,076,563 sequences; 6,200,364,692 total letters Query= TrVeIntMedtrGB1_15676 Length=357 Score E Sequences producing significant alignments: (Bits) Value ref|XP_002303808.1| predicted protein [Populus trichocarpa] >... 89.7 2e-21 dbj|BAC98492.1| AG-motif binding protein-2 [Nicotiana tabacum] 94.0 4e-21 ref|XP_002442575.1| hypothetical protein SORBIDRAFT_08g022276... 91.3 4e-20 dbj|BAE99493.1| GATA transcription factor 1 [Arabidopsis thal... 88.2 6e-20 ref|XP_003579113.1| PREDICTED: GATA transcription factor 2-li... 90.9 6e-20 ALIGNMENTS >ref|XP_002303808.1| predicted protein [Populus trichocarpa] gb|EEE78787.1| predicted protein [Populus trichocarpa] Length=55 Score = 89.7 bits (221), Expect = 2e-21 Identities = 40/44 (91%), Positives = 42/44 (95%), Gaps = 0/44 (0%) Frame = -1 Query 357 RAGPMGPKTLCNACGVRYKSGRLLPEYRPANSPTFSSGVHSNSH 226 RAGP GPKTLCNACGVRYKSGRL+PEYRPANSPTFSS +HSNSH Sbjct 3 RAGPDGPKTLCNACGVRYKSGRLVPEYRPANSPTFSSKLHSNSH 46 >dbj|BAC98492.1| AG-motif binding protein-2 [Nicotiana tabacum] Length=289 Score = 94.0 bits (232), Expect = 4e-21 Identities = 56/77 (73%), Positives = 63/77 (82%), Gaps = 6/77 (8%) Frame = -1 Query 357 RAGPMGPKTLCNACGVRYKSGRLLPEYRPANSPTFSSGVHSNSHrkvvemrrkkevVPGE 178 RAGP+GPKTLCNACGVRYKSGRLLPEYRPANSPTFS VHSNSHRKV+EMR++K G Sbjct 219 RAGPLGPKTLCNACGVRYKSGRLLPEYRPANSPTFSPTVHSNSHRKVLEMRKQKI---GV 275 Query 177 EGDMVNVAVVSCGYSLG 127 G M++ A CGY +G Sbjct 276 GGMMIHEA---CGYRVG 289 >ref|XP_002442575.1| hypothetical protein SORBIDRAFT_08g022276 [Sorghum bicolor] gb|EES16413.1| hypothetical protein 
SORBIDRAFT_08g022276 [Sorghum bicolor] Length=306 Score = 91.3 bits (225), Expect = 4e-20 Identities = 40/44 (91%), Positives = 42/44 (95%), Gaps = 0/44 (0%) Frame = -1 Query 357 RAGPMGPKTLCNACGVRYKSGRLLPEYRPANSPTFSSGVHSNSH 226 RAGP+GPKTLCNACGVRYKSGRLLPEYRPANSPTF S +HSNSH Sbjct 245 RAGPLGPKTLCNACGVRYKSGRLLPEYRPANSPTFMSCIHSNSH 288 >dbj|BAE99493.1| GATA transcription factor 1 [Arabidopsis thaliana] Length=134 Score = 88.2 bits (217), Expect = 6e-20 Identities = 38/44 (86%), Positives = 42/44 (95%), Gaps = 0/44 (0%) Frame = -1 Query 357 RAGPMGPKTLCNACGVRYKSGRLLPEYRPANSPTFSSGVHSNSH 226 RAGP GPKTLCNACGVRYKSGRL+PEYRPANSPTF++ +HSNSH Sbjct 68 RAGPAGPKTLCNACGVRYKSGRLVPEYRPANSPTFTAELHSNSH 111 >ref|XP_003579113.1| PREDICTED: GATA transcription factor 2-like [Brachypodium distachyon] Length=321 Score = 90.9 bits (224), Expect = 6e-20 Identities = 40/44 (91%), Positives = 42/44 (95%), Gaps = 0/44 (0%) Frame = -1 Query 357 RAGPMGPKTLCNACGVRYKSGRLLPEYRPANSPTFSSGVHSNSH 226 R GP+GPKTLCNACGVRYKSGRLLPEYRPANSPTFSS +HSNSH Sbjct 263 RTGPLGPKTLCNACGVRYKSGRLLPEYRPANSPTFSSYMHSNSH 306 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: May 9, 2012 2:26 PM Number of letters in database: 6,200,364,692 Number of sequences in database: 18,076,563 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 18076563 Number of Hits to DB: 151315773 Number of extensions: 2768988 Number of successful extensions: 5604 Number of sequences better than 1e-10: 6 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 5604 Number of HSP's successfully gapped: 6 Length of query: 357 Length of database: 6200364692 Length adjustment: 86 Effective length of query: 271 Effective length of database: 4645780274 Effective search space: 153310749042 Effective search space used: 153310749042 T: 12 A: 40 X1: 16 (7.3 bits) X2: 
38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 171 (70.5 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362 RID: UPEF903P01S Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 18,076,563 sequences; 6,200,364,692 total letters Query= TrVeIntMedtrGB1_16638 Length=336 Score E Sequences producing significant alignments: (Bits) Value gb|AAO86692.1| small blue copper protein Bcp1 [Paraboea crass... 119 2e-31 gb|ADV57644.1| copper binding protein 9 [Gossypium hirsutum] 107 4e-27 emb|CBI33124.3| unnamed protein product [Vitis vinifera] 105 4e-26 ref|XP_002265268.1| PREDICTED: uncharacterized protein LOC100... 105 8e-26 ref|XP_002271669.2| PREDICTED: blue copper protein [Vitis vin...
103 1e-25 ALIGNMENTS >gb|AAO86692.1| small blue copper protein Bcp1 [Paraboea crassifolia] Length=201 Score = 119 bits (299), Expect = 2e-31 Identities = 58/91 (64%), Positives = 69/91 (76%), Gaps = 0/91 (0%) Frame = -1 Query 333 GSGPALDIESWLSGRVFRVGDKISFNCSGTEDNIVELEGPDEFHSCDLTNPIRMYTDPMA 154 G A +I SWLSGRVFRVGDK+ F+ T D+IVEL+ +E +CDL NPIRMY D Sbjct 38 GWNSASNISSWLSGRVFRVGDKLWFSVPATADSIVELQSLEELATCDLRNPIRMYADGSN 97 Query 153 HVVLEKEGTRYFTSKNPESCKNGLKLPVSVQ 61 HV L+KEGTRYF+S N ESCKNG+KLPV+VQ Sbjct 98 HVTLDKEGTRYFSSGNLESCKNGMKLPVTVQ 128 >gb|ADV57644.1| copper binding protein 9 [Gossypium hirsutum] Length=149 Score = 107 bits (266), Expect = 4e-27 Identities = 50/91 (55%), Positives = 64/91 (70%), Gaps = 0/91 (0%) Frame = -1 Query 336 RGSGPALDIESWLSGRVFRVGDKISFNCSGTEDNIVELEGPDEFHSCDLTNPIRMYTDPM 157 RG P+ D+ SW SGR+FRVGDKI F S +++IVE++ DE+ SCD+ NPIRMYT + Sbjct 33 RGWDPSFDVASWSSGRIFRVGDKICFPYSAAQESIVEVKSKDEYESCDVGNPIRMYTVGL 92 Query 156 AHVVLEKEGTRYFTSKNPESCKNGLKLPVSV 64 + L+ EG RYF S PESCK GLKL V + Sbjct 93 DGIELDGEGIRYFMSSKPESCKKGLKLRVEL 123 >emb|CBI33124.3| unnamed protein product [Vitis vinifera] Length=186 Score = 105 bits (262), Expect = 4e-26 Identities = 49/92 (53%), Positives = 68/92 (74%), Gaps = 0/92 (0%) Frame = -1 Query 336 RGSGPALDIESWLSGRVFRVGDKISFNCSGTEDNIVELEGPDEFHSCDLTNPIRMYTDPM 157 RG + D+++WLS +VFRVGDKI F SG ++ +VEL+ +EF SCD++NPIR YT+ + Sbjct 31 RGWDTSSDVQAWLSNKVFRVGDKIWFIYSGGQEGVVELKSREEFDSCDVSNPIRTYTEGL 90 Query 156 AHVVLEKEGTRYFTSKNPESCKNGLKLPVSVQ 61 V++ EG RYFTS P+SCK+GL+L V VQ Sbjct 91 DAVLMGSEGIRYFTSSKPKSCKDGLRLLVEVQ 122 >ref|XP_002265268.1| PREDICTED: uncharacterized protein LOC100255445 [Vitis vinifera] Length=224 Score = 105 bits (262), Expect = 8e-26 Identities = 49/92 (53%), Positives = 68/92 (74%), Gaps = 0/92 (0%) Frame = -1 Query 336 RGSGPALDIESWLSGRVFRVGDKISFNCSGTEDNIVELEGPDEFHSCDLTNPIRMYTDPM 157 RG + D+++WLS +VFRVGDKI F SG ++ +VEL+ +EF SCD++NPIR YT+ + Sbjct 31 RGWDTSSDVQAWLSNKVFRVGDKIWFIYSGGQEGVVELKSREEFDSCDVSNPIRTYTEGL 90 Query 156 
AHVVLEKEGTRYFTSKNPESCKNGLKLPVSVQ 61 V++ EG RYFTS P+SCK+GL+L V VQ Sbjct 91 DAVLMGSEGIRYFTSSKPKSCKDGLRLLVEVQ 122 >ref|XP_002271669.2| PREDICTED: blue copper protein [Vitis vinifera] Length=171 Score = 103 bits (257), Expect = 1e-25 Identities = 49/92 (53%), Positives = 63/92 (68%), Gaps = 0/92 (0%) Frame = -1 Query 336 RGSGPALDIESWLSGRVFRVGDKISFNCSGTEDNIVELEGPDEFHSCDLTNPIRMYTDPM 157 RG + ++ WLS +VFRVGDKI F S ++ + EL +EF SCD++NPI+MYTD + Sbjct 31 RGWAKSSEVRDWLSDKVFRVGDKIWFIYSAAQEGVAELRSKEEFESCDVSNPIKMYTDGL 90 Query 156 AHVVLEKEGTRYFTSKNPESCKNGLKLPVSVQ 61 V L+ EG RYFTS ESCK+GLKL V VQ Sbjct 91 DSVPLDGEGIRYFTSSKTESCKDGLKLHVDVQ 122 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 194106224 Number of extensions: 4248433 Number of successful extensions: 12638 Number of sequences better than 1e-10: 0 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 12607 Number of HSP's successfully gapped: 0 Length of query: 336 Length of database: 6150218869 Length adjustment: 80 Effective length of query: 256 Effective length of database: 4716692149 Effective search space: 150934148768 Effective search space used: 150934148768 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 171 (70.5 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. 
RID: UPE383FU01N Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_2050 Length=814 Score E Sequences producing significant alignments: (Bits) Value ref|XP_002266452.2| PREDICTED: disease resistance response pr... 158 2e-44 ref|XP_002323339.1| predicted protein [Populus trichocarpa] >... 152 5e-42 gb|ABK93789.1| unknown [Populus trichocarpa] 152 8e-42 ref|XP_002880356.1| hypothetical protein ARALYDRAFT_480959 [A... 146 1e-39 ref|NP_850009.1| disease resistance-responsive, dirigent doma... 145 2e-39 ALIGNMENTS >ref|XP_002266452.2| PREDICTED: disease resistance response protein 206-like [Vitis vinifera] emb|CAN62170.1| hypothetical protein VITISV_027159 [Vitis vinifera] emb|CBI17737.3| unnamed protein product [Vitis vinifera] Length=182 Score = 158 bits (399), Expect = 2e-44 Identities = 85/160 (53%), Positives = 111/160 (69%), Gaps = 3/160 (2%) Frame = -2 Query 690 SQRSKNDFGFWPRPKITKLLFYFYDKPSSPDATAEVIVNGTTPFGFGTVVMIDDALLKTA 511 SQ+ + K++ L FYF+D S + TA I G FG +M+DDAL + Sbjct 26 SQQFAEEIATMRLEKVSHLHFYFHDILSGKNPTATQIA-GPKKGHFGVTMMVDDALTEGP 84 Query 510 NKKSKIIGRAQGFYAMADQKTTALLMVVNYVFTEGSEYNGSSLSVLGRNPVLQTSGRELP 331 SK++GRAQG YA++ Q+ ALLMV+N+ F EG +YNGSS+SVLGRNPV+ RE+P Sbjct 85 EPSSKLLGRAQGLYALSAQQEPALLMVMNFAFMEG-KYNGSSISVLGRNPVMHAV-REMP 142 Query 330 IVGGTGVFRFARGFALASTKWFDPRTGDAIVEYKVTVLHF 211 IVGG+G+FR+ARG+ALA T WFD +TGDAIVEY V+VLHF Sbjct 143 IVGGSGLFRYARGYALAHTVWFDGKTGDAIVEYNVSVLHF 182 >ref|XP_002323339.1| predicted protein [Populus trichocarpa] gb|EEF05100.1| predicted protein [Populus trichocarpa] Length=178 Score = 152 bits (383), Expect = 5e-42 Identities = 87/168 (52%), Positives = 109/168 (65%), Gaps = 7/168 (4%) Frame = -2 Query 702 QQFSSQRSKNDFGFWPRPKITKLLFYFYDKPSSPDATAEVI----VNGTTPFGFGTVVMI 535 Q+F+ S G + K++ L FYF+D S + TA I + T+ GFG V MI Sbjct 14 QRFTKYLSPATLGL-KKEKLSHLHFYFHDIVSGKNPTAVRIARADMTNTSSTGFGMVAMI 
72 Query 534 DDALLKTANKKSKIIGRAQGFYAMADQKTTALLMVVNYVFTEGSEYNGSSLSVLGRNPVL 355 DD L T SK++GRAQGFYA A Q LLM +N+VF EG ++NGS+LSVLGRN V Sbjct 73 DDPLTMTPELSSKLVGRAQGFYASASQNDVGLLMTMNFVFMEG-KFNGSTLSVLGRNSVF 131 Query 354 QTSGRELPIVGGTGVFRFARGFALASTKWFDPRTGDAIVEYKVTVLHF 211 T RE+PIVGG+G+FRFARG+A AST FD TGDA+VEY V V H+ Sbjct 132 STV-REMPIVGGSGLFRFARGYAQASTHMFDRTTGDAVVEYNVYVFHY 178 >gb|ABK93789.1| unknown [Populus trichocarpa] Length=194 Score = 152 bits (383), Expect = 8e-42 Identities = 87/168 (52%), Positives = 109/168 (65%), Gaps = 7/168 (4%) Frame = -2 Query 702 QQFSSQRSKNDFGFWPRPKITKLLFYFYDKPSSPDATAEVI----VNGTTPFGFGTVVMI 535 Q+F+ S G + K++ L FYF+D S + TA I + T+ GFG V MI Sbjct 30 QRFTKYLSPATLGL-KKEKLSHLHFYFHDIVSGKNPTAVRIARADMTNTSSTGFGMVAMI 88 Query 534 DDALLKTANKKSKIIGRAQGFYAMADQKTTALLMVVNYVFTEGSEYNGSSLSVLGRNPVL 355 DD L T SK++GRAQGFYA A Q LLM +N+VF EG ++NGS+LSVLGRN V Sbjct 89 DDPLTMTPELSSKLVGRAQGFYASASQNDVGLLMTMNFVFMEG-KFNGSTLSVLGRNSVF 147 Query 354 QTSGRELPIVGGTGVFRFARGFALASTKWFDPRTGDAIVEYKVTVLHF 211 T RE+PIVGG+G+FRFARG+A AST FD TGDA+VEY V V H+ Sbjct 148 STV-REMPIVGGSGLFRFARGYAQASTHMFDRTTGDAVVEYNVYVFHY 194 >ref|XP_002880356.1| hypothetical protein ARALYDRAFT_480959 [Arabidopsis lyrata subsp. lyrata] gb|EFH56615.1| hypothetical protein ARALYDRAFT_480959 [Arabidopsis lyrata subsp. 
lyrata] Length=187 Score = 146 bits (368), Expect = 1e-39 Identities = 73/151 (48%), Positives = 104/151 (69%), Gaps = 6/151 (4%) Frame = -2 Query 654 RPKITKLLFYFYDKPSSPDATAEVIVNGT----TPFGFGTVVMIDDALLKTANKKSKIIG 487 + K+T L FYF+D S + TA + GT +P FG V M+DDAL +TA+ KSK++G Sbjct 39 KEKVTNLQFYFHDTLSGKNPTAVKVAQGTDTDKSPTLFGAVFMVDDALTETADPKSKLVG 98 Query 486 RAQGFYAMADQKTTALLMVVNYVFTEGSEYNGSSLSVLGRNPVLQTSGRELPIVGGTGVF 307 RAQG Y + ++ L+M +++ F +G Y S++S++G+N + RE+PIVGGTG+F Sbjct 99 RAQGLYGSSCKEEVGLIMAMSFCFEDGP-YKDSTISMIGKNSAMNPI-REMPIVGGTGMF 156 Query 306 RFARGFALASTKWFDPRTGDAIVEYKVTVLH 214 R ARG+A+A T WFDP+TGDAIV Y VTV+H Sbjct 157 RMARGYAIAKTNWFDPKTGDAIVGYNVTVVH 187 >ref|NP_850009.1| disease resistance-responsive, dirigent domain-containing protein [Arabidopsis thaliana] gb|AAO64191.1| putative disease resistance response protein/dirigent protein [Arabidopsis thaliana] gb|AAT71988.1| At2g21100 [Arabidopsis thaliana] dbj|BAF00324.1| putative disease resistance response protein [Arabidopsis thaliana] gb|AEC07123.1| disease resistance-responsive, dirigent domain-containing protein [Arabidopsis thaliana] Length=187 Score = 145 bits (367), Expect = 2e-39 Identities = 72/151 (48%), Positives = 104/151 (69%), Gaps = 6/151 (4%) Frame = -2 Query 654 RPKITKLLFYFYDKPSSPDATAEVIVNGT----TPFGFGTVVMIDDALLKTANKKSKIIG 487 + K+T L FYF+D S + TA + GT +P FG V M+DDAL +TA+ KSK++G Sbjct 39 KDKVTNLQFYFHDTLSGKNPTAVKVAQGTDTEKSPTLFGAVFMVDDALTETADPKSKLVG 98 Query 486 RAQGFYAMADQKTTALLMVVNYVFTEGSEYNGSSLSVLGRNPVLQTSGRELPIVGGTGVF 307 RAQG Y + ++ L+M +++ F +G Y S++S++G+N + RE+PIVGGTG+F Sbjct 99 RAQGLYGSSCKEEVGLIMAMSFCFEDGP-YKDSTISMIGKNSAMNPI-REMPIVGGTGMF 156 Query 306 RFARGFALASTKWFDPRTGDAIVEYKVTVLH 214 R ARG+A+A T WFDP+TGDAIV Y VT++H Sbjct 157 RMARGYAIARTNWFDPKTGDAIVGYNVTIMH 187 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in 
database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 423147972 Number of extensions: 9121653 Number of successful extensions: 24229 Number of sequences better than 1e-10: 23 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 24052 Number of HSP's successfully gapped: 23 Length of query: 814 Length of database: 6150218869 Length adjustment: 138 Effective length of query: 676 Effective length of database: 3677385277 Effective search space: 489092241841 Effective search space used: 489092241841 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 176 (72.4 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362 RID: UPE3E3ET012 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_2072 Length=813 Score E Sequences producing significant alignments: (Bits) Value ref|XP_002263257.2| PREDICTED: BRASSINOSTEROID INSENSITIVE 1-... 97.1 8e-21 gb|ABK24079.1| unknown [Picea sitchensis] 96.3 9e-21 gb|AAZ91738.1| leucine rich repeat protein 1 [Nicotiana tabacum] 95.5 2e-20 gb|ABK21187.1| unknown [Picea sitchensis] >gb|ACN40548.1| unk...
93.6 8e-20 gb|ABK24062.1| unknown [Picea sitchensis] 92.4 2e-19 ALIGNMENTS >ref|XP_002263257.2| PREDICTED: BRASSINOSTEROID INSENSITIVE 1-associated receptor kinase 1-like [Vitis vinifera] emb|CBI40693.3| unnamed protein product [Vitis vinifera] Length=250 Score = 97.1 bits (240), Expect = 8e-21 Identities = 68/144 (47%), Positives = 87/144 (60%), Gaps = 11/144 (8%) Frame = -3 Query 643 ALNAFK---DDPTNTTSSLD-------KWFHIDY-IEGDVQRVNIMNRELSGALVKRLGE 497 ALNA K +DP N S + KWFH+ V RV+++N LSG LV +LG+ Sbjct 31 ALNALKSNLEDPNNVLQSWNATLVNPCKWFHVTRNSHNSVTRVDLVNANLSGQLVPQLGQ 90 Query 496 SDKMQYLVLHNNKINGSIPAELGNLTNLKGLDLHNNSLNGYIPQElgnlknltylllndn 317 +QYL LHNN I+G IP ELGNLTNL LDL N+LNG IP LG L L +L LN+N Sbjct 91 LTNLQYLELHNNNISGKIPKELGNLTNLVSLDLSMNNLNGTIPDTLGKLTKLRFLRLNNN 150 Query 316 nLSGRVPEVLTRLPNLRTVNLKGN 245 L+G +P LT + L+ ++L N Sbjct 151 ALTGTIPMSLTAVITLQVLDLSNN 174 >gb|ABK24079.1| unknown [Picea sitchensis] Length=216 Score = 96.3 bits (238), Expect = 9e-21 Identities = 71/149 (48%), Positives = 94/149 (63%), Gaps = 12/149 (8%) Frame = -3 Query 643 ALNAFKD---DPTNTTSSLDK-------WFHIDYIEGD-VQRVNIMNRELSGALVKRLGE 497 AL+AF+ DP N S D WFH+ + + V RV++ N LSG LV LG Sbjct 32 ALHAFRRSLLDPDNVLQSWDPTLVNPCTWFHVTCDQNNRVIRVDLGNSNLSGHLVPELGM 91 Query 496 SDKMQYLVLHNNKINGSIPAELGNLTNLKGLDLHNNSLNGYIPQElgnlknltylllndn 317 + +QYL L+ N I G+I ELGNL NL LDL+NN L G IP+ LGNLK+L +L +N+N Sbjct 92 LEHLQYLELYKNNITGNILEELGNLKNLISLDLYNNKLTGEIPRSLGNLKSLVFLRINNN 151 Query 316 nLSGRVPEVLTRLPNLRTVNLKGNPNLCG 230 L+G++P LT LPNL+ V++ N NLCG Sbjct 152 MLTGQIPRGLTSLPNLKVVDISSN-NLCG 179 >gb|AAZ91738.1| leucine rich repeat protein 1 [Nicotiana tabacum] Length=232 Score = 95.5 bits (236), Expect = 2e-20 Identities = 71/149 (48%), Positives = 93/149 (62%), Gaps = 12/149 (8%) Frame = -3 Query 643 ALNAFKD---DPTNTTSSLDK-------WFHIDY-IEGDVQRVNIMNRELSGALVKRLGE 497 ALNA K DP N S D WFH+ E V RV++ N LSG LV +LG+ Sbjct 34 ALNALKTNLADPNNVLQSWDPTLVNPCTWFHVTCNSENSVTRVDLGNANLSGQLVPQLGQ 93 Query 496 
SDKMQYLVLHNNKINGSIPAELGNLTNLKGLDLHNNSLNGYIPQElgnlknltylllndn 317 +QYL L++N I+G IP ELGNLTNL LDL+ N LNG IP LG L+ L +L LN+N Sbjct 94 LPNLQYLELYSNNISGRIPFELGNLTNLVSLDLYLNRLNGPIPDTLGKLQKLRFLRLNNN 153 Query 316 nLSGRVPEVLTRLPNLRTVNLKGNPNLCG 230 +L+GR+P +LT + +L+ ++L N NL G Sbjct 154 SLNGRIPMLLTTVISLQVLDLSNN-NLTG 181 >gb|ABK21187.1| unknown [Picea sitchensis] gb|ACN40548.1| unknown [Picea sitchensis] Length=216 Score = 93.6 bits (231), Expect = 8e-20 Identities = 66/149 (44%), Positives = 97/149 (65%), Gaps = 12/149 (8%) Frame = -3 Query 643 ALNAFK---DDPTNTTSSLDK-------WFHIDYIEGD-VQRVNIMNRELSGALVKRLGE 497 AL+AF+ DP N S D WFHI + + V R+++ N LSG+LV LG Sbjct 32 ALHAFRRSLSDPLNVLQSWDPTLVNPCTWFHITCNQDNRVTRIDLGNSNLSGSLVPELGR 91 Query 496 SDKMQYLVLHNNKINGSIPAELGNLTNLKGLDLHNNSLNGYIPQElgnlknltylllndn 317 + +QYL L+ N+I GSIP E GNL +L +DL+NN++ G IP+ LGNLK+L +L LN+N Sbjct 92 LEHLQYLELYKNRIGGSIPEEFGNLKSLISMDLYNNNITGEIPRSLGNLKSLVFLRLNNN 151 Query 316 nLSGRVPEVLTRLPNLRTVNLKGNPNLCG 230 +L+G++P LT++ NL+ ++ N +LCG Sbjct 152 SLTGQIPRELTKISNLKVSDVSNN-DLCG 179 >gb|ABK24062.1| unknown [Picea sitchensis] Length=216 Score = 92.4 bits (228), Expect = 2e-19 Identities = 65/149 (44%), Positives = 97/149 (65%), Gaps = 12/149 (8%) Frame = -3 Query 643 ALNAFK---DDPTNTTSSLDK-------WFHIDYIEGD-VQRVNIMNRELSGALVKRLGE 497 AL+AF+ DP N S D WFHI + + V R+++ N LSG+L+ LG Sbjct 32 ALHAFRRSLSDPLNVLQSWDPTLVNPCTWFHITCNQDNRVTRIDLGNSNLSGSLMPELGR 91 Query 496 SDKMQYLVLHNNKINGSIPAELGNLTNLKGLDLHNNSLNGYIPQElgnlknltylllndn 317 + +QYL L+ N+I GSIP E GNL +L +DL+NN++ G IP+ LGNLK+L +L LN+N Sbjct 92 LEHLQYLELYKNRIGGSIPEEFGNLKSLISMDLYNNNITGEIPRSLGNLKSLVFLRLNNN 151 Query 316 nLSGRVPEVLTRLPNLRTVNLKGNPNLCG 230 +L+G++P LT++ NL+ ++ N +LCG Sbjct 152 SLTGQIPRELTKISNLKVSDVSNN-DLCG 179 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 
0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 378401929 Number of extensions: 7587992 Number of successful extensions: 21346 Number of sequences better than 1e-10: 25 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 19932 Number of HSP's successfully gapped: 25 Length of query: 813 Length of database: 6150218869 Length adjustment: 138 Effective length of query: 675 Effective length of database: 3677385277 Effective search space: 489092241841 Effective search space used: 489092241841 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 176 (72.4 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362 RID: UPEGC8UN01S Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 18,076,563 sequences; 6,200,364,692 total letters Query= TrVeIntMedtrGB1_20761 Length=260 Score E Sequences producing significant alignments: (Bits) Value gb|ACB12048.1| pathogenesis-related protein [Rehmannia glutin... 76.3 1e-15 gb|ABR10301.1| pathogen-related protein STH-2 [Salvia miltior... 71.2 8e-14 ref|XP_002271428.2| PREDICTED: pathogenesis-related protein S...
63.9 3e-11 ALIGNMENTS >gb|ACB12048.1| pathogenesis-related protein [Rehmannia glutinosa] Length=154 Score = 76.3 bits (186), Expect = 1e-15 Identities = 37/66 (56%), Positives = 45/66 (68%), Gaps = 0/66 (0%) Frame = +1 Query 16 KIESFVYDVKFEEASGGGCTIKIVNEYNTKGDAALKDGDIKETVDRTKGFYVAAEAYLIA 195 KIES Y+ KFE++S GGC KIV EY+TKGD LK+ +K D+ GFY +E YL A Sbjct 89 KIESIHYEQKFEDSSDGGCVAKIVCEYHTKGDIQLKEEGVKAINDQALGFYTLSEEYLHA 148 Query 196 NSNVCA 213 N NVCA Sbjct 149 NPNVCA 154 >gb|ABR10301.1| pathogen-related protein STH-2 [Salvia miltiorrhiza] Length=160 Score = 71.2 bits (173), Expect = 8e-14 Identities = 30/66 (45%), Positives = 44/66 (67%), Gaps = 0/66 (0%) Frame = +1 Query 16 KIESFVYDVKFEEASGGGCTIKIVNEYNTKGDAALKDGDIKETVDRTKGFYVAAEAYLIA 195 K+E YD+KFE+ GGC +K+ +EY+TKG L D D+K +++ G Y + E YL+A Sbjct 95 KLEKICYDMKFEDTEDGGCVVKVTSEYHTKGGYELADEDLKGAKEQSLGMYKSCEDYLLA 154 Query 196 NSNVCA 213 N +VCA Sbjct 155 NPHVCA 160 >ref|XP_002271428.2| PREDICTED: pathogenesis-related protein STH-2 [Vitis vinifera] emb|CBI22930.3| unnamed protein product [Vitis vinifera] Length=160 Score = 63.9 bits (154), Expect = 3e-11 Identities = 28/61 (46%), Positives = 42/61 (69%), Gaps = 0/61 (0%) Frame = +1 Query 16 KIESFVYDVKFEEASGGGCTIKIVNEYNTKGDAALKDGDIKETVDRTKGFYVAAEAYLIA 195 ++ES VY++KFEE+ GGC K +EY+TKG+ +K+ I+E ++ G Y EAYL+A Sbjct 95 QLESIVYEMKFEESGDGGCICKTRSEYHTKGEFEIKEESIREGKEKAMGVYKLVEAYLLA 154 Query 196 N 198 N Sbjct 155 N 155 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 131337325 Number of extensions: 2704680 Number of successful extensions: 6378 Number of sequences better than 1e-10: 0 Number of HSP's better than 
1e-10 without gapping: 0 Number of HSP's gapped: 6376 Number of HSP's successfully gapped: 0 Length of query: 260 Length of database: 6150218869 Length adjustment: 56 Effective length of query: 204 Effective length of database: 5146750165 Effective search space: 154402504950 Effective search space used: 154402504950 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 171 (70.5 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362 RID: UPEH7DTE01S Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 18,076,563 sequences; 6,200,364,692 total letters Query= TrVeIntMedtrGB1_22366 Length=240 Score E Sequences producing significant alignments: (Bits) Value ref|XP_003536253.1| PREDICTED: uncharacterized protein At4g14... 70.1 5e-14 gb|ABK93173.1| unknown [Populus trichocarpa] 67.8 5e-13 ref|XP_002284531.1| PREDICTED: uncharacterized protein At4g14... 67.4 7e-13 emb|CAN71216.1| hypothetical protein VITISV_033484 [Vitis vin...
67.4 7e-13 emb|CBI23583.3| unnamed protein product [Vitis vinifera] 67.4 3e-11 ALIGNMENTS >ref|XP_003536253.1| PREDICTED: uncharacterized protein At4g14450, chloroplastic-like [Glycine max] Length=95 Score = 70.1 bits (170), Expect = 5e-14 Identities = 38/74 (51%), Positives = 44/74 (59%), Gaps = 7/74 (9%) Frame = -3 Query 238 ASLQISPVTEWNVAIPLLSPLVSPTSEFNPTVEIKTSCSGGNKEAGAAEKPVVVMKNWQH 59 +SLQI+ EWNVAIPLLSPL S P +E+K +EA EK V K WQH Sbjct 22 SSLQINRAVEWNVAIPLLSPLASSP----PPMELKPPQEPPQREA---EKVTVSFKKWQH 74 Query 58 PAAPFCYDATPFRP 17 PAAPFCY+ P P Sbjct 75 PAAPFCYEPAPMVP 88 >gb|ABK93173.1| unknown [Populus trichocarpa] Length=105 Score = 67.8 bits (164), Expect = 5e-13 Identities = 41/80 (51%), Positives = 51/80 (64%), Gaps = 12/80 (15%) Frame = -3 Query 238 ASLQISPVTE--WNVAIPLLSPLV-SPTSEFNPTVEIKTSC---SGGNKEAGAAEKPVVV 77 ASLQISP + WN AIPLLSPL+ SPT+ +++K+ S + EKPVV Sbjct 26 ASLQISPASSSSWNAAIPLLSPLITSPTA-----MDMKSRDDPPSPPRIQVTEGEKPVV- 79 Query 76 MKNWQHPAAPFCYDATPFRP 17 K WQHPAAPFCY+ PF+P Sbjct 80 FKKWQHPAAPFCYELAPFKP 99 >ref|XP_002284531.1| PREDICTED: uncharacterized protein At4g14450, chloroplastic-like [Vitis vinifera] Length=104 Score = 67.4 bits (163), Expect = 7e-13 Identities = 42/80 (53%), Positives = 49/80 (61%), Gaps = 8/80 (10%) Frame = -3 Query 238 ASLQISPVTEWNVAIPLLSPL-VSPTSE--FNPTVEIKTSCSGGNKEAGAAEKPVVVMKN 68 +SLQISP +W VAIPLLSPL SP+S + T E+K + G EKP V K Sbjct 27 SSLQISPAADWKVAIPLLSPLATSPSSPKLIDRTAEVKPKEEPRQMKEG--EKP--VFKK 82 Query 67 WQHPAAPFCYDATPF-RPFV 11 WQHPAAPFCY+ F R FV Sbjct 83 WQHPAAPFCYEPASFVRSFV 102 >emb|CAN71216.1| hypothetical protein VITISV_033484 [Vitis vinifera] Length=104 Score = 67.4 bits (163), Expect = 7e-13 Identities = 42/80 (53%), Positives = 49/80 (61%), Gaps = 8/80 (10%) Frame = -3 Query 238 ASLQISPVTEWNVAIPLLSPL-VSPTSE--FNPTVEIKTSCSGGNKEAGAAEKPVVVMKN 68 +SLQISP +W VAIPLLSPL SP+S + T E+K + G EKP V K Sbjct 27 SSLQISPAADWKVAIPLLSPLATSPSSPKLIDRTAEVKPKEEPRQTKEG--EKP--VFKK 82 Query 67 WQHPAAPFCYDATPF-RPFV 11 WQHPAAPFCY+ F R FV Sbjct 83 
WQHPAAPFCYEPASFVRSFV 102 >emb|CBI23583.3| unnamed protein product [Vitis vinifera] Length=1287 Score = 67.4 bits (163), Expect = 3e-11 Identities = 42/80 (53%), Positives = 49/80 (61%), Gaps = 8/80 (10%) Frame = -3 Query 238 ASLQISPVTEWNVAIPLLSPL-VSPTSE--FNPTVEIKTSCSGGNKEAGAAEKPVVVMKN 68 +SLQISP +W VAIPLLSPL SP+S + T E+K + G EKP V K Sbjct 1210 SSLQISPAADWKVAIPLLSPLATSPSSPKLIDRTAEVKPKEEPRQMKEG--EKP--VFKK 1265 Query 67 WQHPAAPFCYDATPF-RPFV 11 WQHPAAPFCY+ F R FV Sbjct 1266 WQHPAAPFCYEPASFVRSFV 1285 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 93585762 Number of extensions: 1737241 Number of successful extensions: 6813 Number of sequences better than 1e-10: 0 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 6801 Number of HSP's successfully gapped: 0 Length of query: 240 Length of database: 6150218869 Length adjustment: 51 Effective length of query: 189 Effective length of database: 5236345585 Effective search space: 151854021965 Effective search space used: 151854021965 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 171 (70.5 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. 
RID: UPE4758N012 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_2255 Length=796 Score E Sequences producing significant alignments: (Bits) Value gb|ACB12048.1| pathogenesis-related protein [Rehmannia glutin... 147 9e-41 gb|ABR10301.1| pathogen-related protein STH-2 [Salvia miltior... 145 6e-40 ref|XP_002328585.1| predicted protein [Populus trichocarpa] >... 141 4e-38 ref|XP_002271428.2| PREDICTED: pathogenesis-related protein S... 139 1e-37 ref|XP_002273790.2| PREDICTED: major allergen Pru ar 1 [Vitis... 138 4e-37 ALIGNMENTS >gb|ACB12048.1| pathogenesis-related protein [Rehmannia glutinosa] Length=154 Score = 147 bits (372), Expect = 9e-41 Identities = 80/163 (49%), Positives = 113/163 (69%), Gaps = 10/163 (6%) Frame = +3 Query 87 MAVTKHTQEITVSISAKRMIKAMVTEASTF-LPKITSGAIQSIVVLNGNGGPGSILQTNF 263 M +TKH QE+ + +SAKRM KA+VTE+ + LP AI+SI +L+G+G G+I +TN Sbjct 1 MGITKHIQELKLRVSAKRMFKALVTESHSIPLPD----AIKSIEILHGDGSAGTIRKTNL 56 Query 264 SDVAGASVACAKHRIDALDAEKGTTKFTMMEGDWLGDKIESVVFDVKFEEVSGGGCTVKI 443 +D G+ V K RI+A+D + +K+T++EG LGDKIES+ ++ KFE+ S GGC KI Sbjct 57 AD--GSYV---KIRIEAVDIDNQVSKYTVIEGPMLGDKIESIHYEQKFEDSSDGGCVAKI 111 Query 444 LNEYNTKGDVVLKDEDIKETVDRTKGFYVAAEAYLIANPNECA 572 + EY+TKGD+ LK+E +K D+ GFY +E YL ANPN CA Sbjct 112 VCEYHTKGDIQLKEEGVKAINDQALGFYTLSEEYLHANPNVCA 154 >gb|ABR10301.1| pathogen-related protein STH-2 [Salvia miltiorrhiza] Length=160 Score = 145 bits (367), Expect = 6e-40 Identities = 74/163 (45%), Positives = 109/163 (67%), Gaps = 4/163 (2%) Frame = +3 Query 87 MAVTKHTQEITVSISAKRMIKAMVTEASTFLPKITSGAIQSIVVLNGNG-GPGSILQTNF 263 M V QE+ IS+ R+ KA+VTE+ +PK T+ +I+SI ++ G+G PG+I QTNF Sbjct 1 MGVKSFFQEMKTKISSSRLFKALVTESPEVVPKFTT-SIKSIELIQGSGYAPGAIFQTNF 59 Query 264 SDVAGASVACAKHRIDALDAEKGTTKFTMMEGDWLGDKIESVVFDVKFEEVSGGGCTVKI 443 + GA K R+D +D EK + K+T++EGD LGDK+E + +D+KFE+ GGC VK+ Sbjct 60 
PE--GAHFKYMKCRVDEIDHEKHSIKYTLIEGDMLGDKLEKICYDMKFEDTEDGGCVVKV 117 Query 444 LNEYNTKGDVVLKDEDIKETVDRTKGFYVAAEAYLIANPNECA 572 +EY+TKG L DED+K +++ G Y + E YL+ANP+ CA Sbjct 118 TSEYHTKGGYELADEDLKGAKEQSLGMYKSCEDYLLANPHVCA 160 >ref|XP_002328585.1| predicted protein [Populus trichocarpa] gb|EEE76932.1| predicted protein [Populus trichocarpa] Length=160 Score = 141 bits (355), Expect = 4e-38 Identities = 68/159 (43%), Positives = 102/159 (64%), Gaps = 2/159 (1%) Frame = +3 Query 87 MAVTKHTQEITVSISAKRMIKAMVTEASTFLPKITSGAIQSIVVLNGNGGPGSILQTNFS 266 M V +TQE T IS RM KA++ +++ +PK+ ++S+ +++G+GG GSI Q NF+ Sbjct 1 MGVASYTQEFTCPISPARMFKALILDSNNLIPKLLPQIVKSVDLIHGDGGAGSIEQVNFT 60 Query 267 DVAGASVACAKHRIDALDAEKGTTKFTMMEGDWLGDKIESVVFDVKFEEVSGGGCTVKIL 446 + G + KHRID LD K+TM+EGD LG+K+ES+ ++V+FE S GGC K+ Sbjct 61 E--GTDIKYVKHRIDELDRVNLVCKYTMIEGDSLGEKLESIAYEVRFEVGSDGGCDCKMT 118 Query 447 NEYNTKGDVVLKDEDIKETVDRTKGFYVAAEAYLIANPN 563 + Y GD LK+E+IK D+ +G Y EAYL+ NP+ Sbjct 119 SSYLMLGDFTLKEEEIKAGQDKARGIYKVVEAYLLENPH 157 >ref|XP_002271428.2| PREDICTED: pathogenesis-related protein STH-2 [Vitis vinifera] emb|CBI22930.3| unnamed protein product [Vitis vinifera] Length=160 Score = 139 bits (351), Expect = 1e-37 Identities = 68/160 (43%), Positives = 105/160 (66%), Gaps = 2/160 (1%) Frame = +3 Query 87 MAVTKHTQEITVSISAKRMIKAMVTEASTFLPKITSGAIQSIVVLNGNGGPGSILQTNFS 266 M VT TQE +S RM KA+V ++ +P++ +++SI + G+GG GSI QTNFS Sbjct 1 MGVTTFTQEFVTPVSPARMFKALVVDSHILVPRLVPESVKSIEFVEGDGGAGSITQTNFS 60 Query 267 DVAGASVACAKHRIDALDAEKGTTKFTMMEGDWLGDKIESVVFDVKFEEVSGGGCTVKIL 446 + K++I+A+D EK ++T++EG LGD++ES+V+++KFEE GGC K Sbjct 61 --GDSDCEYLKYKINAVDKEKLECRYTLIEGGVLGDQLESIVYEMKFEESGDGGCICKTR 118 Query 447 NEYNTKGDVVLKDEDIKETVDRTKGFYVAAEAYLIANPNE 566 +EY+TKG+ +K+E I+E ++ G Y EAYL+ANP+E Sbjct 119 SEYHTKGEFEIKEESIREGKEKAMGVYKLVEAYLLANPDE 158 >ref|XP_002273790.2| PREDICTED: major allergen Pru ar 1 [Vitis vinifera] emb|CBI22941.3| unnamed protein product [Vitis vinifera] Length=158 Score = 138 bits (348), Expect = 4e-37 
Identities = 70/159 (44%), Positives = 103/159 (65%), Gaps = 2/159 (1%) Frame = +3 Query 87 MAVTKHTQEITVSISAKRMIKAMVTEASTFLPKITSGAIQSIVVLNGNGGPGSILQTNFS 266 M V +T E+T + A R+ KA++ EA + LPKI AI+SI + GNGGPG+I Q NF+ Sbjct 1 MGVVTYTDELTSPVPAPRLFKALILEADSLLPKIVPQAIKSIETVEGNGGPGTIKQLNFA 60 Query 267 DVAGASVACAKHRIDALDAEKGTTKFTMMEGDWLGDKIESVVFDVKFEEVSGGGCTVKIL 446 + G+ KHRID LD EK K+T++EGD L DKIE + +++ FE GGC K + Sbjct 61 E--GSQFKYVKHRIDELDKEKMIYKYTLIEGDALMDKIEYISYEISFEASPDGGCKSKNV 118 Query 447 NEYNTKGDVVLKDEDIKETVDRTKGFYVAAEAYLIANPN 563 + Y++K V +K+E+IK+ ++ + A EAYL+ANP+ Sbjct 119 SVYHSKPGVEIKEEEIKDGKEKAAAVFKAVEAYLLANPD 157 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 374443729 Number of extensions: 7491674 Number of successful extensions: 19377 Number of sequences better than 1e-10: 33 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 19345 Number of HSP's successfully gapped: 33 Length of query: 796 Length of database: 6150218869 Length adjustment: 138 Effective length of query: 658 Effective length of database: 3677385277 Effective search space: 467027930179 Effective search space used: 467027930179 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 175 (72.0 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.
RID: UPE4A0UZ016 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_2274 Length=795 Score E Sequences producing significant alignments: (Bits) Value ref|XP_003529742.1| PREDICTED: uncharacterized protein LOC100... 320 6e-105 ref|XP_003533024.1| PREDICTED: uncharacterized protein LOC100... 318 6e-104 ref|XP_002276490.1| PREDICTED: uncharacterized protein LOC100... 312 9e-102 ref|XP_002515894.1| conserved hypothetical protein [Ricinus c... 311 2e-101 ref|NP_196734.1| Core-2/I-branching beta-1,6-N-acetylglucosam... 310 6e-101 ALIGNMENTS >ref|XP_003529742.1| PREDICTED: uncharacterized protein LOC100793448 [Glycine max] Length=387 Score = 320 bits (821), Expect = 6e-105 Identities = 148/215 (69%), Positives = 178/215 (83%), Gaps = 1/215 (0%) Frame = +3 Query 3 VFYGRQIPSKVTRWGTMSMCDAERRLLANAMLDVSNEYFILVSEACIPLHNFSTIYSYIS 182 VFY RQIPS+V WG MSMCDAERRLLANA+LD+SNE+FIL+SE+CIPL NFS +Y YI+ Sbjct 159 VFYRRQIPSQVAEWGMMSMCDAERRLLANALLDISNEWFILLSESCIPLQNFSIVYLYIA 218 Query 183 GSRTSFMGAVDVLGPVGRGRYDDYMSPLVNLTNWRKGYQWFEVNRKLAIDIVRDDVYYPV 362 SR SFMGAVD GP GRGRYD M+P +N+++WRKG QWFE+NR+LA+ IV D+ YYP Sbjct 219 RSRYSFMGAVDEPGPYGRGRYDGNMAPEINMSDWRKGSQWFEINRELALRIVEDNTYYPK 278 Query 363 FKEYCQPHYCYVDEHYFQTMLTIQSPHLLANRSLTWVDWSRGGPHPVMFGKDGNIDTWIF 542 KE+C+PH CYVDEHYFQTMLTI +PHLLANRSLT+VDWSRGG HP FGKD +I F Sbjct 279 LKEFCKPHKCYVDEHYFQTMLTINTPHLLANRSLTYVDWSRGGAHPATFGKD-DIKEEFF 337 Query 543 KKIFDNKECSYNNRSTTVCFMFARKFSPNALGVLL 647 KKI ++ C YNN+ +++CF+FARKF+PNALG LL Sbjct 338 KKILQDQTCLYNNQPSSLCFLFARKFAPNALGPLL 372 >ref|XP_003533024.1| PREDICTED: uncharacterized protein LOC100819579 [Glycine max] Length=387 Score = 318 bits (814), Expect = 6e-104 Identities = 147/215 (68%), Positives = 177/215 (82%), Gaps = 1/215 (0%) Frame = +3 Query 3 VFYGRQIPSKVTRWGTMSMCDAERRLLANAMLDVSNEYFILVSEACIPLHNFSTIYSYIS 182 VFY RQIPS+V WG 
MSMCDAERRLLANA+LD+SNE+FIL+SE+CIPL NFS +Y YI+ Sbjct 159 VFYRRQIPSQVAEWGMMSMCDAERRLLANALLDISNEWFILLSESCIPLQNFSIVYRYIA 218 Query 183 GSRTSFMGAVDVLGPVGRGRYDDYMSPLVNLTNWRKGYQWFEVNRKLAIDIVRDDVYYPV 362 SR SFMGAVD GP GRGRYD M+P +N+++WRKG QWFE+ R+LA+ IV D YYP Sbjct 219 HSRYSFMGAVDEPGPYGRGRYDGNMAPEINVSDWRKGSQWFEIKRELALRIVEDRTYYPK 278 Query 363 FKEYCQPHYCYVDEHYFQTMLTIQSPHLLANRSLTWVDWSRGGPHPVMFGKDGNIDTWIF 542 KE+C+PH CYVDEHYFQTMLTI +PHLLANRSLT+VDWSRGG HP FGKD +I F Sbjct 279 LKEFCRPHKCYVDEHYFQTMLTINTPHLLANRSLTYVDWSRGGAHPATFGKD-DIKEEFF 337 Query 543 KKIFDNKECSYNNRSTTVCFMFARKFSPNALGVLL 647 KKI +++C YNN+ +++CF+FARKF+PNALG LL Sbjct 338 KKILQDQKCLYNNQPSSLCFLFARKFAPNALGPLL 372 >ref|XP_002276490.1| PREDICTED: uncharacterized protein LOC100266878 [Vitis vinifera] Length=380 Score = 312 bits (799), Expect = 9e-102 Identities = 148/224 (66%), Positives = 178/224 (79%), Gaps = 2/224 (1%) Frame = +3 Query 3 VFYGRQIPSKVTRWGTMSMCDAERRLLANAMLDVSNEYFILVSEACIPLHNFSTIYSYIS 182 VFY RQIPS+V WG MSMCDAERRLLANA+LD+ NE+FIL+SE+CIPLHNFS +Y Y+S Sbjct 159 VFYKRQIPSQVVEWGMMSMCDAERRLLANALLDIDNEWFILLSESCIPLHNFSIVYRYLS 218 Query 183 GSRTSFMGAVDVLGPVGRGRYDDYMSPLVNLTNWRKGYQWFEVNRKLAIDIVRDDVYYPV 362 SR SF+GA D P GRGRY+ ++P VNLT WRKG QWFEVNRKLAIDIV D+ +YP Sbjct 219 RSRYSFIGAFDEDSPFGRGRYNPNLAPQVNLTEWRKGSQWFEVNRKLAIDIVGDNTFYPR 278 Query 363 FKEYCQPHYCYVDEHYFQTMLTIQSPHLLANRSLTWVDWSRGGPHPVMFGKDGNIDTWIF 542 FKE+C+P CYVDEHYFQTMLTI +PHLLANR+ TWVDWSRGG HP FG+ +I F Sbjct 279 FKEFCRPS-CYVDEHYFQTMLTILAPHLLANRTTTWVDWSRGGAHPATFGQ-ADITKEFF 336 Query 543 KKIFDNKECSYNNRSTTVCFMFARKFSPNALGVLLRRSFELFGF 674 KKI + C YNN+ T++CF+FARKF+P+AL LL + E+FG+ Sbjct 337 KKIIEGGTCIYNNQPTSLCFLFARKFAPSALEPLLDLASEVFGY 380 >ref|XP_002515894.1| conserved hypothetical protein [Ricinus communis] gb|EEF46314.1| conserved hypothetical protein [Ricinus communis] Length=385 Score = 311 bits (797), Expect = 2e-101 Identities = 147/224 (66%), Positives = 175/224 (78%), Gaps = 2/224 (1%) Frame = +3 Query 3 
VFYGRQIPSKVTRWGTMSMCDAERRLLANAMLDVSNEYFILVSEACIPLHNFSTIYSYIS 182 VFY RQIPS++ WG MSMCD ERRLLANA+LD+SNE+FIL+SEACIPLHNFS IY YIS Sbjct 164 VFYRRQIPSQIVEWGRMSMCDGERRLLANALLDISNEWFILLSEACIPLHNFSIIYRYIS 223 Query 183 GSRTSFMGAVDVLGPVGRGRYDDYMSPLVNLTNWRKGYQWFEVNRKLAIDIVRDDVYYPV 362 SR SFMG+ D P GRGRY+ M P V L WRKG QWFEVNR+ A++IV D YYP Sbjct 224 RSRHSFMGSFDENSPYGRGRYNWNMQPEVTLEQWRKGSQWFEVNRRFAVNIVEDTTYYPK 283 Query 363 FKEYCQPHYCYVDEHYFQTMLTIQSPHLLANRSLTWVDWSRGGPHPVMFGKDGNIDTWIF 542 F+++CQP CYVDEHYF TMLTIQ PHLLANR+LTW DWSRGG HP FGK +I F Sbjct 284 FRDFCQP-ACYVDEHYFPTMLTIQVPHLLANRTLTWTDWSRGGAHPATFGK-ADITEEFF 341 Query 543 KKIFDNKECSYNNRSTTVCFMFARKFSPNALGVLLRRSFELFGF 674 K++F+ + C+YNN+ TTVC++FARKF+P+AL LL S ++FGF Sbjct 342 KRMFEGQSCTYNNQPTTVCYLFARKFAPSALEPLLGLSSKVFGF 385 >ref|NP_196734.1| Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein [Arabidopsis thaliana] emb|CAB87691.1| putative protein [Arabidopsis thaliana] gb|AAM13183.1| putative protein [Arabidopsis thaliana] gb|AAP68292.1| At5g11730 [Arabidopsis thaliana] dbj|BAE99143.1| hypothetical protein [Arabidopsis thaliana] gb|AED91715.1| Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein [Arabidopsis thaliana] Length=386 Score = 310 bits (794), Expect = 6e-101 Identities = 148/224 (66%), Positives = 179/224 (80%), Gaps = 2/224 (1%) Frame = +3 Query 3 VFYGRQIPSKVTRWGTMSMCDAERRLLANAMLDVSNEYFILVSEACIPLHNFSTIYSYIS 182 VF+ RQIPS+V WG MSMCDAE+RLLANA+LDVSNE+F+LVSE+CIPL+NF+TIYSY+S Sbjct 165 VFHRRQIPSQVAEWGRMSMCDAEKRLLANALLDVSNEWFVLVSESCIPLYNFTTIYSYLS 224 Query 183 GSRTSFMGAVDVLGPVGRGRYDDYMSPLVNLTNWRKGYQWFEVNRKLAIDIVRDDVYYPV 362 S+ SFMGA D GP GRGRY+ M P V LT WRKG QWFEVNR LA IV+D +YYP Sbjct 225 RSKHSFMGAFDDPGPFGRGRYNGNMEPEVPLTKWRKGSQWFEVNRDLAATIVKDTLYYPK 284 Query 363 FKEYCQPHYCYVDEHYFQTMLTIQSPHLLANRSLTWVDWSRGGPHPVMFGKDGNIDTWIF 542 FKE+C+P CYVDEHYF TMLTI+ P +LANRSLTWVDWSRGGPHP FG+ +I F Sbjct 285 FKEFCRP-ACYVDEHYFPTMLTIEKPTVLANRSLTWVDWSRGGPHPATFGR-SDITENFF 342 Query 543 
KKIFDNKECSYNNRSTTVCFMFARKFSPNALGVLLRRSFELFGF 674 KIFD + CSYN R+T++C++FARKF+P+AL LL + ++ GF Sbjct 343 GKIFDGRNCSYNGRNTSMCYLFARKFAPSALEPLLHIAPKILGF 386 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 425742378 Number of extensions: 9273448 Number of successful extensions: 22362 Number of sequences better than 1e-10: 5 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 22349 Number of HSP's successfully gapped: 5 Length of query: 795 Length of database: 6150218869 Length adjustment: 138 Effective length of query: 657 Effective length of database: 3677385277 Effective search space: 467027930179 Effective search space used: 467027930179 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 175 (72.0 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. RID: UPEHB6WZ01N Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 18,076,563 sequences; 6,200,364,692 total letters Query= TrVeIntMedtrGB1_24566 Length=216 Score E Sequences producing significant alignments: (Bits) Value ref|XP_003544026.1| PREDICTED: uncharacterized protein LOC100... 106 2e-27 ref|XP_003614382.1| hypothetical protein MTR_5g050970 [Medica...
80.5 6e-16 ref|XP_003614392.1| hypothetical protein MTR_5g051110 [Medica... 77.8 2e-15 ref|XP_003614384.1| hypothetical protein MTR_5g051010 [Medica... 77.8 5e-15 ref|XP_003638451.1| hypothetical protein MTR_132s0010, partia... 77.8 5e-15 ALIGNMENTS >ref|XP_003544026.1| PREDICTED: uncharacterized protein LOC100801029 [Glycine max] Length=167 Score = 106 bits (265), Expect = 2e-27 Identities = 56/71 (79%), Positives = 56/71 (79%), Gaps = 0/71 (0%) Frame = -3 Query 214 RLQRILPAARWEL*FKADNTAIPPCWLSQRHVPLGA*RPLLRVGKRTAGACVASSPDSDL 35 RLQRILPAARWEL FKA A PP QRHVPLG PLL VGKRTAGA VASSPDSDL Sbjct 75 RLQRILPAARWELRFKASRRANPPHEGHQRHVPLGGRGPLLLVGKRTAGARVASSPDSDL 134 Query 34 EVFSHNPTHGS 2 E FSHNPTHGS Sbjct 135 EAFSHNPTHGS 145 >ref|XP_003614382.1| hypothetical protein MTR_5g050970 [Medicago truncatula] gb|AES97340.1| hypothetical protein MTR_5g050970 [Medicago truncatula] Length=1065 Score = 80.5 bits (197), Expect = 6e-16 Identities = 40/60 (67%), Positives = 43/60 (72%), Gaps = 0/60 (0%) Frame = -3 Query 181 EL*FKADNTAIPPCWLSQRHVPLGA*RPLLRVGKRTAGACVASSPDSDLEVFSHNPTHGS 2 +L FK A PP + QRHVPLG PLL VGKRT G +ASSPDSDLE FSHNPTHGS Sbjct 25 KLRFKVSRKAHPPYGVHQRHVPLGGQGPLLLVGKRTTGTRIASSPDSDLEAFSHNPTHGS 84 >ref|XP_003614392.1| hypothetical protein MTR_5g051110 [Medicago truncatula] gb|AES97350.1| hypothetical protein MTR_5g051110 [Medicago truncatula] Length=407 Score = 77.8 bits (190), Expect = 2e-15 Identities = 39/60 (65%), Positives = 42/60 (70%), Gaps = 0/60 (0%) Frame = -3 Query 181 EL*FKADNTAIPPCWLSQRHVPLGA*RPLLRVGKRTAGACVASSPDSDLEVFSHNPTHGS 2 +L FK A PP + QRHVPLG PLL VGKRT G +ASSPD DLE FSHNPTHGS Sbjct 25 KLRFKVSRKAHPPYEVHQRHVPLGGQGPLLLVGKRTTGTRIASSPDFDLEAFSHNPTHGS 84 >ref|XP_003614384.1| hypothetical protein MTR_5g051010 [Medicago truncatula] gb|AES97342.1| hypothetical protein MTR_5g051010 [Medicago truncatula] Length=1153 Score = 77.8 bits (190), Expect = 5e-15 Identities = 39/60 (65%), Positives = 42/60 (70%), Gaps = 0/60 (0%) Frame = -3 Query 181 
EL*FKADNTAIPPCWLSQRHVPLGA*RPLLRVGKRTAGACVASSPDSDLEVFSHNPTHGS 2 +L FK A PP + QRHVPLG PLL VGKRT G +ASSPD DLE FSHNPTHGS Sbjct 25 KLRFKVSRKAHPPYEVHQRHVPLGGQGPLLLVGKRTTGTRIASSPDFDLEAFSHNPTHGS 84 >ref|XP_003638451.1| hypothetical protein MTR_132s0010, partial [Medicago truncatula] gb|AES85589.1| hypothetical protein MTR_132s0010, partial [Medicago truncatula] Length=1458 Score = 77.8 bits (190), Expect = 5e-15 Identities = 39/60 (65%), Positives = 42/60 (70%), Gaps = 0/60 (0%) Frame = -3 Query 181 EL*FKADNTAIPPCWLSQRHVPLGA*RPLLRVGKRTAGACVASSPDSDLEVFSHNPTHGS 2 +L FK A PP + QRHVPLG PLL VGKRT G +ASSPD DLE FSHNPTHGS Sbjct 68 KLRFKVSRKAHPPYEVHQRHVPLGGQGPLLLVGKRTTGTRIASSPDFDLEAFSHNPTHGS 127 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 229559892 Number of extensions: 4633336 Number of successful extensions: 9639 Number of sequences better than 1e-10: 0 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 9636 Number of HSP's successfully gapped: 0 Length of query: 216 Length of database: 6150218869 Length adjustment: 44 Effective length of query: 172 Effective length of database: 5361779173 Effective search space: 150129816844 Effective search space used: 150129816844 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 171 (70.5 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. RID: UPE4N93W01S Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_2665 Length=766 Score E Sequences producing significant alignments: (Bits) Value ref|XP_003538935.1| PREDICTED: cytochrome P450 81D1-like [Gly... 311 4e-99 emb|CBI33752.3| unnamed protein product [Vitis vinifera] 251 7e-79 emb|CBI33755.3| unnamed protein product [Vitis vinifera] 248 1e-78 emb|CBI33749.3| unnamed protein product [Vitis vinifera] 244 1e-77 ref|XP_002283502.1| PREDICTED: isoflavone 2'-hydroxylase-like... 253 2e-77 ALIGNMENTS >ref|XP_003538935.1| PREDICTED: cytochrome P450 81D1-like [Glycine max] Length=580 Score = 311 bits (797), Expect = 4e-99 Identities = 147/197 (75%), Positives = 168/197 (85%), Gaps = 2/197 (1%) Frame = +3 Query 3 ATTMEWALSLLLNHPEAMNKVRAEIDTCVGQDKLVNESDASKLKYLQMVLMETLRLYPPA 182 ATTMEWA SLLLNHP+ MNKV+ EIDT VGQD+++N D +KLKYLQ V+ ETLRLYP A Sbjct 386 ATTMEWAFSLLLNHPKKMNKVKEEIDTYVGQDQMLNGLDTTKLKYLQNVITETLRLYPVA 445 Query 183 PLMLPHESSNDCNVCGFDIPKGTMLLVNLWALHRDPNLWKDPTRFVPERFeegelgggeI 362 PL+LPHESSNDC VCGFDIP+GTMLLVNLW LHRD NLW DP FVPERF E+ Sbjct 446 PLLLPHESSNDCKVCGFDIPRGTMLLVNLWTLHRDANLWVDPAMFVPERF--EGEEADEV 503 Query 363 YNMIPFGVGRRSCPGAALAKRFIGHAIGSLIQCFEWERIGDEEIDMNEGIGLTMPKVEPL 542 YNMIPFG+GRR+CPGA LAKR +GHA+G+LIQCFEWERIG +EIDM EGIGLTMPK+EPL Sbjct 504 YNMIPFGIGRRACPGAVLAKRVMGHALGTLIQCFEWERIGHQEIDMTEGIGLTMPKLEPL 563 Query 543 VALCKPRQVMVEVISNI 593 VALC+PRQ M++V+SNI Sbjct 564 VALCRPRQSMIKVLSNI 580 >emb|CBI33752.3| unnamed protein product [Vitis vinifera] Length=316 Score = 251 bits (640), Expect = 7e-79 Identities = 115/196 (59%), Positives = 156/196 (80%), Gaps = 3/196 (2%) Frame = +3 Query 3 ATTMEWALSLLLNHPEAMNKVRAEIDTCVGQDKLVNESDASKLKYLQMVLMETLRLYPPA 182 A TMEWA++LLLNHP+ + K +AE+D VG+D+L+ 
ESD KL+YL+ ++ ETLR++P A Sbjct 124 AATMEWAMTLLLNHPDVLEKAKAELDMHVGKDRLIEESDLPKLRYLRSIISETLRVFPVA 183 Query 183 PLMLPHESSNDCNVCGFDIPKGTMLLVNLWALHRDPNLWKDPTRFVPERFeegelgggeI 362 PL+LPH SS+DC + GFDIP+GT+LLVN+WALHRDP +W+DPT F PERFE GE Sbjct 184 PLLLPHMSSDDCQIGGFDIPRGTLLLVNVWALHRDPQVWEDPTSFKPERFENGEREN--- 240 Query 363 YNMIPFGVGRRSCPGAALAKRFIGHAIGSLIQCFEWERIGDEEIDMNEGIGLTMPKVEPL 542 Y ++PFG+GRR+CPGA LA+R +G A+GSLIQC++W++I + ID EG GLTMPK++PL Sbjct 241 YKLVPFGIGRRACPGAGLAQRVVGLALGSLIQCYDWKKISNTAIDTIEGKGLTMPKLQPL 300 Query 543 VALCKPRQVMVEVISN 590 A+CK R+++ EV N Sbjct 301 EAMCKAREIINEVHLN 316 >emb|CBI33755.3| unnamed protein product [Vitis vinifera] Length=242 Score = 248 bits (632), Expect = 1e-78 Identities = 113/196 (58%), Positives = 148/196 (76%), Gaps = 2/196 (1%) Frame = +3 Query 3 ATTMEWALSLLLNHPEAMNKVRAEIDTCVGQDKLVNESDASKLKYLQMVLMETLRLYPPA 182 A TMEWA+SLLLNHP+ + K + E+DTCVGQ++L+ E+D KL YLQ ++ ET RL PPA Sbjct 37 AATMEWAMSLLLNHPDVLKKAKVELDTCVGQERLLEEADLPKLHYLQNIISETFRLCPPA 96 Query 183 PLMLPHESSNDCNVCGFDIPKGTMLLVNLWALHRDPNLWKDPTRFVPERFeegelgggeI 362 PL LPH SS +C + GFDIP+ TMLLVN W LHRDP LW DPT F PERF GE Sbjct 97 PLWLPHMSSENCQLGGFDIPRDTMLLVNSWTLHRDPKLWDDPTSFKPERF--EGGERGET 154 Query 363 YNMIPFGVGRRSCPGAALAKRFIGHAIGSLIQCFEWERIGDEEIDMNEGIGLTMPKVEPL 542 Y ++PFG GRR+CPG+ LA + +G +GSLIQC+EWERI ++++DM EG GLTMPK+EPL Sbjct 155 YKLLPFGTGRRACPGSGLANKVVGLTLGSLIQCYEWERISEKKVDMMEGKGLTMPKMEPL 214 Query 543 VALCKPRQVMVEVISN 590 A+C+ +++ +V+ + Sbjct 215 EAMCRAYEIVKKVLQD 230 >emb|CBI33749.3| unnamed protein product [Vitis vinifera] Length=227 Score = 244 bits (624), Expect = 1e-77 Identities = 115/194 (59%), Positives = 151/194 (78%), Gaps = 3/194 (2%) Frame = +3 Query 3 ATTMEWALSLLLNHPEAMNKVRAEIDTCVGQDKLVNESDASKLKYLQMVLMETLRLYPPA 182 ATT+EWA+SLLLNHP+ + K RAE+DT VG+D+L ESD KL+YL+ ++ ETLRL+P Sbjct 37 ATTIEWAMSLLLNHPDVLKKARAELDTHVGKDRLTEESDFPKLQYLRSIISETLRLFPAT 96 Query 183 PLMLPHESSNDCNVCGFDIPKGTMLLVNLWALHRDPNLWKDPTRFVPERFeegelgggeI 362 PL++PH SS++C + GFDIP+GT+LLVN WA+HRDP WKDPT F PERF E 
GE Sbjct 97 PLLMPHISSDNCQIGGFDIPRGTILLVNAWAIHRDPKSWKDPTSFKPERF---ENEEGEA 153 Query 363 YNMIPFGVGRRSCPGAALAKRFIGHAIGSLIQCFEWERIGDEEIDMNEGIGLTMPKVEPL 542 Y ++PFG+GRR+CPGA LA R IG +G LIQC+E ER ++E+DM EG G+TMPK+EPL Sbjct 154 YKLLPFGLGRRACPGAGLANRVIGLTLGLLIQCYELERASEKEVDMAEGKGVTMPKLEPL 213 Query 543 VALCKPRQVMVEVI 584 A+CK R ++ +V+ Sbjct 214 EAMCKARAIIRKVL 227 >ref|XP_002283502.1| PREDICTED: isoflavone 2'-hydroxylase-like [Vitis vinifera] Length=508 Score = 253 bits (645), Expect = 2e-77 Identities = 121/197 (61%), Positives = 150/197 (76%), Gaps = 3/197 (2%) Frame = +3 Query 3 ATTMEWALSLLLNHPEAMNKVRAEIDTCVGQDKLVNESDASKLKYLQMVLMETLRLYPPA 182 A T+EWA+SLLLNHPE + K R E+DT +G D L++E+D KL+YLQ ++ E+LRL+P Sbjct 311 AVTIEWAMSLLLNHPEVLKKARDELDTHIGHDCLIDETDLPKLQYLQSIISESLRLFPST 370 Query 183 PLMLPHESSNDCNVCGFDIPKGTMLLVNLWALHRDPNLWKDPTRFVPERFeegelgggeI 362 PL++PH S+ DC + GFD+P GTMLLVN WALHRDP LW DPT F PERF E G E Sbjct 371 PLLVPHFSTEDCKLRGFDVPGGTMLLVNAWALHRDPKLWNDPTSFKPERF---ETGESET 427 Query 363 YNMIPFGVGRRSCPGAALAKRFIGHAIGSLIQCFEWERIGDEEIDMNEGIGLTMPKVEPL 542 Y ++PFGVGRR+CPG LA R +G +GSLIQCF+W+R+ ++EIDM EG GLTMPKVEPL Sbjct 428 YKLLPFGVGRRACPGIGLANRVMGLTLGSLIQCFDWKRVDEKEIDMAEGQGLTMPKVEPL 487 Query 543 VALCKPRQVMVEVISNI 593 A+CK RQVM V S I Sbjct 488 EAMCKTRQVMNNVSSKI 504 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 412414645 Number of extensions: 8587968 Number of successful extensions: 23775 Number of sequences better than 1e-10: 973 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 21750 Number of HSP's successfully gapped: 975 Length of query: 766 Length of database: 6150218869 
Length adjustment: 137 Effective length of query: 629 Effective length of database: 3695304361 Effective search space: 436045914598 Effective search space used: 436045914598 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 175 (72.0 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. RID: UPE57CWY01S Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_3020 Length=744 Score E Sequences producing significant alignments: (Bits) Value dbj|BAD83475.2| ribosomal protein L16 [Nicotiana tabacum] 295 2e-98 ref|YP_005090456.1| rpl16 gene product (mitochondrion) [Mille... 295 4e-98 ref|YP_005090475.1| rpl16 gene product (mitochondrion) [Lotus... 292 4e-97 sp|Q95747.3|RM16_ARATH RecName: Full=60S ribosomal protein L1...
291 1e-96 ref|YP_173411.1| ribosomal protein L16 [Nicotiana tabacum] 288 9e-96 ALIGNMENTS >dbj|BAD83475.2| ribosomal protein L16 [Nicotiana tabacum] Length=158 Score = 295 bits (755), Expect = 2e-98 Identities = 154/158 (97%), Positives = 155/158 (98%), Gaps = 0/158 (0%) Frame = +2 Query 5 VSKCGFHIVKKERGVLYPKRTKYSKYRKGRCSRGCKPDGTQLGFGRYGIKSCRAGRLSYr 184 VSKCGF IVKK+R VLYPKRTKYSKYRKGRCSRGCKPDGTQLGFGRYGIKSCRAGRLSYR Sbjct 1 VSKCGFPIVKKKRDVLYPKRTKYSKYRKGRCSRGCKPDGTQLGFGRYGIKSCRAGRLSYR 60 Query 185 aieaarraiiGHFHRAMSGQFRRNGKIWVRVFADIPITGKPTEVRMGRGKGNPTGWIARV 364 AIEAARRAIIGHFHRAMSGQFRRNGKIWVRVFADIPITGKPTEVRMGRGKGNPTGWIARV Sbjct 61 AIEAARRAIIGHFHRAMSGQFRRNGKIWVRVFADIPITGKPTEVRMGRGKGNPTGWIARV 120 Query 365 STGQILFEMDGVSLSNARQAATLAAHKLCLSTKFVQWS 478 S GQILFEMDGVSLSNARQAATLAAHKLCLSTKFVQWS Sbjct 121 SRGQILFEMDGVSLSNARQAATLAAHKLCLSTKFVQWS 158 >ref|YP_005090456.1| rpl16 gene product (mitochondrion) [Millettia pinnata] gb|AET62916.1| ribosomal protein L16 (mitochondrion) [Millettia pinnata] Length=185 Score = 295 bits (755), Expect = 4e-98 Identities = 151/158 (96%), Positives = 157/158 (99%), Gaps = 0/158 (0%) Frame = +2 Query 5 VSKCGFHIVKKERGVLYPKRTKYSKYRKGRCSRGCKPDGTQLGFGRYGIKSCRAGRLSYr 184 VSKCGFHIVKK+R VLYPKRTK+SKYRKGRCSRGCKPDGT+LGFGRYGI+SCRAGRLSYR Sbjct 28 VSKCGFHIVKKKRDVLYPKRTKFSKYRKGRCSRGCKPDGTKLGFGRYGIQSCRAGRLSYR 87 Query 185 aieaarraiiGHFHRAMSGQFRRNGKIWVRVFADIPITGKPTEVRMGRGKGNPTGWIARV 364 AIEAARRAIIGHFHRAMSGQFR+NGKIWVRVFADIPITGKPTEVRMGRGKGNPTGWIARV Sbjct 88 AIEAARRAIIGHFHRAMSGQFRKNGKIWVRVFADIPITGKPTEVRMGRGKGNPTGWIARV 147 Query 365 STGQILFEMDGVSLSNARQAATLAAHKLCLSTKFVQWS 478 STGQ+LFEMDGVSLSNARQAATLAAHKLCLSTKFVQWS Sbjct 148 STGQVLFEMDGVSLSNARQAATLAAHKLCLSTKFVQWS 185 >ref|YP_005090475.1| rpl16 gene product (mitochondrion) [Lotus japonicus] gb|AET62935.1| ribosomal protein L16 (mitochondrion) [Lotus japonicus] Length=185 Score = 292 bits (748), Expect = 4e-97 Identities = 150/158 (95%), Positives = 156/158 (99%), Gaps = 0/158 (0%) Frame = +2 Query 5 
VSKCGFHIVKKERGVLYPKRTKYSKYRKGRCSRGCKPDGTQLGFGRYGIKSCRAGRLSYr 184 VSKCGFHIVKK+ VLYPKRTK+SKYRKGRCSRGCKPDGT+LGFGRYGI+SCRAGRLSYR Sbjct 28 VSKCGFHIVKKKGDVLYPKRTKFSKYRKGRCSRGCKPDGTKLGFGRYGIQSCRAGRLSYR 87 Query 185 aieaarraiiGHFHRAMSGQFRRNGKIWVRVFADIPITGKPTEVRMGRGKGNPTGWIARV 364 AIEAARRAIIGHFHRAMSGQFR+NGKIWVRVFADIPITGKPTEVRMGRGKGNPTGWIARV Sbjct 88 AIEAARRAIIGHFHRAMSGQFRKNGKIWVRVFADIPITGKPTEVRMGRGKGNPTGWIARV 147 Query 365 STGQILFEMDGVSLSNARQAATLAAHKLCLSTKFVQWS 478 STGQ+LFEMDGVSLSNARQAATLAAHKLCLSTKFVQWS Sbjct 148 STGQVLFEMDGVSLSNARQAATLAAHKLCLSTKFVQWS 185 >sp|Q95747.3|RM16_ARATH RecName: Full=60S ribosomal protein L16, mitochondrial Length=179 Score = 291 bits (744), Expect = 1e-96 Identities = 150/158 (95%), Positives = 155/158 (98%), Gaps = 0/158 (0%) Frame = +2 Query 5 VSKCGFHIVKKERGVLYPKRTKYSKYRKGRCSRGCKPDGTQLGFGRYGIKSCRAGRLSYr 184 VSKCGFHIVKK+ VLYPKRTKYSKYRKGRCSRGCKPDGT+LGFGRYGIKSC+AG LSYR Sbjct 22 VSKCGFHIVKKKGDVLYPKRTKYSKYRKGRCSRGCKPDGTKLGFGRYGIKSCKAGCLSYR 81 Query 185 aieaarraiiGHFHRAMSGQFRRNGKIWVRVFADIPITGKPTEVRMGRGKGNPTGWIARV 364 AIEAARRAIIGHFHRAMSGQFRRNGKIWVRVFAD+PITGKPTEVRMGRGKGNPTGWIARV Sbjct 82 AIEAARRAIIGHFHRAMSGQFRRNGKIWVRVFADLPITGKPTEVRMGRGKGNPTGWIARV 141 Query 365 STGQILFEMDGVSLSNARQAATLAAHKLCLSTKFVQWS 478 STGQILFEMDGVSL+NARQAATLAAHKLCLSTKFVQWS Sbjct 142 STGQILFEMDGVSLANARQAATLAAHKLCLSTKFVQWS 179 >ref|YP_173411.1| ribosomal protein L16 [Nicotiana tabacum] Length=171 Score = 288 bits (738), Expect = 9e-96 Identities = 151/158 (96%), Positives = 152/158 (96%), Gaps = 0/158 (0%) Frame = +2 Query 5 VSKCGFHIVKKERGVLYPKRTKYSKYRKGRCSRGCKPDGTQLGFGRYGIKSCRAGRLSYr 184 VSKCGF IVKK+R VLYPKRTKYSKYRKGRCSRGCKPDGTQLGFGRYG KSCRAGRLSYR Sbjct 14 VSKCGFPIVKKKRDVLYPKRTKYSKYRKGRCSRGCKPDGTQLGFGRYGTKSCRAGRLSYR 73 Query 185 aieaarraiiGHFHRAMSGQFRRNGKIWVRVFADIPITGKPTEVRMGRGKGNPTGWIARV 364 AIEAARRAIIGHFHRAMSGQFRRNGKIWVRV ADIPITGKPTEVRMGRGKGNPTGWIARV Sbjct 74 AIEAARRAIIGHFHRAMSGQFRRNGKIWVRVLADIPITGKPTEVRMGRGKGNPTGWIARV 133 Query 365 STGQILFEMDGVSLSNARQAATLAAHKLCLSTKFVQWS 478 S 
GQILFEMDGVSLSNARQAATLAAHKLC STKFVQWS Sbjct 134 SRGQILFEMDGVSLSNARQAATLAAHKLCSSTKFVQWS 171 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 385964068 Number of extensions: 8213243 Number of successful extensions: 18553 Number of sequences better than 1e-10: 113 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 18400 Number of HSP's successfully gapped: 113 Length of query: 744 Length of database: 6150218869 Length adjustment: 137 Effective length of query: 607 Effective length of database: 3695304361 Effective search space: 410178784071 Effective search space used: 410178784071 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 175 (72.0 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. RID: UPE5E6JX01S Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_3129 Length=738 Score E Sequences producing significant alignments: (Bits) Value ref|XP_002277851.2| PREDICTED: uncharacterized protein LOC100... 139 3e-35 ref|XP_002326236.1| predicted protein [Populus trichocarpa] >... 132 9e-33 ref|XP_002528638.1| conserved hypothetical protein [Ricinus c...
130 6e-32 emb|CBI30220.3| unnamed protein product [Vitis vinifera] 124 8e-30 ref|XP_003542565.1| PREDICTED: uncharacterized protein LOC100... 121 9e-29 ALIGNMENTS >ref|XP_002277851.2| PREDICTED: uncharacterized protein LOC100259589 [Vitis vinifera] Length=470 Score = 139 bits (351), Expect = 3e-35 Identities = 73/128 (57%), Positives = 94/128 (73%), Gaps = 12/128 (9%) Frame = -1 Query 555 SRKRMKKGLSPS------IEQLGFELTGVLQDRTKSQAEKRHWMRKKLVRLEEERVGIQS 394 SRKR +KGL PS + QL E+ VLQD TKS EK+ WMR ++++LEE+RV Q Sbjct 335 SRKRARKGLFPSPSPSPLMRQLSSEVMSVLQDGTKSTLEKKQWMRSRMMQLEEQRVSYQC 394 Query 393 RTLELEKQRLKWLKFSTKKEREMEWERLTNERLRLENERMVLVVRQKELDLLD------H 232 + ELEKQRLKW+KFS+KKEREME E+L N+R RLENERM L++RQKE++LL+ Sbjct 395 KAFELEKQRLKWVKFSSKKEREMEREKLVNQRKRLENERMALLLRQKEVELLEIHHQQQQ 454 Query 231 QQHSSNRK 208 QQHSSN++ Sbjct 455 QQHSSNKR 462 >ref|XP_002326236.1| predicted protein [Populus trichocarpa] gb|EEE71906.1| predicted protein [Populus trichocarpa] Length=483 Score = 132 bits (333), Expect = 9e-33 Identities = 76/134 (57%), Positives = 97/134 (72%), Gaps = 9/134 (7%) Frame = -1 Query 558 SSRKRMKKGL----SPSIEQLGFELTGVLQDRTKSQAEKRHWMRKKLVRLEEERVGIQSR 391 SSRKR +K + S ++QL E+ VL+D KS EK WM+ KL++LEE++V Q + Sbjct 351 SSRKRPRKEVFSATSSLMQQLNGEIMNVLRDGAKSSWEKNQWMKLKLMQLEEQQVNYQCQ 410 Query 390 TLELEKQRLKWLKFSTKKEREMEWERLTNERLRLENERMVLVVRQKELDLLD--HQQHS- 220 ELEKQRLKW++FS+KKEREME +L NER RLENERMVL VR+KEL+LLD HQQ Sbjct 411 AFELEKQRLKWVRFSSKKEREMERAKLENERKRLENERMVLTVRKKELELLDTTHQQQQL 470 Query 219 -SNRKSVDPSSVTG 181 SN++S DPSS+ G Sbjct 471 PSNKRS-DPSSIAG 483 >ref|XP_002528638.1| conserved hypothetical protein [Ricinus communis] gb|EEF33741.1| conserved hypothetical protein [Ricinus communis] Length=499 Score = 130 bits (327), Expect = 6e-32 Identities = 70/139 (50%), Positives = 96/139 (69%), Gaps = 14/139 (10%) Frame = -1 Query 555 SRKRMKKGL----SPSIEQLGFELTGVLQDRTKSQAEKRHWMRKKLVRLEEERVGIQSRT 388 ++KR + G+ S ++QL EL GV+QD KS EK+HWM+ + ++LEE+++ Q + Sbjct 361 
AQKRQRTGVHSLSSSLMQQLNSELMGVIQDGAKSPWEKKHWMKLRSMQLEEQQLSYQCQA 420 Query 387 LELEKQRLKWLKFSTKKEREMEWERLTNERLRLENERMVLVVRQKELDLLD--------- 235 ELEKQRLKW+KFS+KKEREME +L N+R LE+ERMVL++RQKEL+LLD Sbjct 421 FELEKQRLKWVKFSSKKEREMEKAKLDNDRRMLESERMVLLIRQKELELLDLQQQQHHHH 480 Query 234 -HQQHSSNRKSVDPSSVTG 181 QQ S+ K DPSS+TG Sbjct 481 HQQQQLSSNKRGDPSSITG 499 >emb|CBI30220.3| unnamed protein product [Vitis vinifera] Length=410 Score = 124 bits (310), Expect = 8e-30 Identities = 63/106 (59%), Positives = 79/106 (75%), Gaps = 6/106 (6%) Frame = -1 Query 549 KRMKKGLSPS------IEQLGFELTGVLQDRTKSQAEKRHWMRKKLVRLEEERVGIQSRT 388 KR +KGL PS + QL E+ VLQD TKS EK+ WMR ++++LEE+RV Q + Sbjct 253 KRARKGLFPSPSPSPLMRQLSSEVMSVLQDGTKSTLEKKQWMRSRMMQLEEQRVSYQCKA 312 Query 387 LELEKQRLKWLKFSTKKEREMEWERLTNERLRLENERMVLVVRQKE 250 ELEKQRLKW+KFS+KKEREME E+L N+R RLENERM L++RQKE Sbjct 313 FELEKQRLKWVKFSSKKEREMEREKLVNQRKRLENERMALLLRQKE 358 >ref|XP_003542565.1| PREDICTED: uncharacterized protein LOC100816661 [Glycine max] Length=478 Score = 121 bits (303), Expect = 9e-29 Identities = 63/124 (51%), Positives = 89/124 (72%), Gaps = 7/124 (6%) Frame = -1 Query 558 SSRKRMKK----GLSPSI-EQLGFELTGVLQDRTKSQAEKRHWMRKKLVRLEEERVGIQS 394 S RKR +K +SP + +QL E++GV QD KS +K+ WMR ++++LEE+++ + Sbjct 350 SMRKRARKVGGVSMSPQLMQQLSAEVSGVFQDVGKSAWDKKQWMRSRIMQLEEQQISYHT 409 Query 393 RTLELEKQRLKWLKFSTKKEREMEWERLTNERLRLENERMVLVVRQKELDL--LDHQQHS 220 + ELEKQRLKW +FS+KKEREME +L NER RLENERMVL++RQKE +L L HQQ Sbjct 410 QAFELEKQRLKWARFSSKKEREMETAKLENERRRLENERMVLLIRQKEFELMSLQHQQQQ 469 Query 219 SNRK 208 +++ Sbjct 470 QHQQ 473 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 
321482715
Number of extensions: 6325429
Number of successful extensions: 21451
Number of sequences better than 1e-10: 2
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 21012
Number of HSP's successfully gapped: 2
Length of query: 738
Length of database: 6150218869
Length adjustment: 137
Effective length of query: 601
Effective length of database: 3695304361
Effective search space: 402788175349
Effective search space used: 402788175349
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 175 (72.0 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362

A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

RID: UPDZD6PS016
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_331
Length=1109

Sequences producing significant alignments: Score (Bits) E Value
gb|ABR10301.1| pathogen-related protein STH-2 [Salvia miltior... 213 2e-64
ref|XP_003632378.1| PREDICTED: LOW QUALITY PROTEIN: major all... 175 2e-49
ref|XP_002271428.2| PREDICTED: pathogenesis-related protein S... 167 3e-47
ref|XP_002273815.2| PREDICTED: pathogenesis-related protein S... 165 3e-46
ref|NP_001237916.1| uncharacterized protein LOC100527307 [Gly...
164 1e-45 ALIGNMENTS >gb|ABR10301.1| pathogen-related protein STH-2 [Salvia miltiorrhiza] Length=160 Score = 213 bits (541), Expect = 2e-64 Identities = 98/160 (61%), Positives = 128/160 (80%), Gaps = 0/160 (0%) Frame = -2 Query 721 MGVKKFFQELKTKVSPSRLFKALVTESHEVIPKVTPKVKSIETIEGDFGAVGCLRKTTFP 542 MGVK FFQE+KTK+S SRLFKALVTES EV+PK T +KSIE I+G A G + +T FP Sbjct 1 MGVKSFFQEMKTKISSSRLFKALVTESPEVVPKFTTSIKSIELIQGSGYAPGAIFQTNFP 60 Query 541 ESSQIKSVTHRVDAIDHDNYTCKYTMVAGDVLGDKLEKVCYEIKFEATEDGGCMLKLESE 362 E + K + RVD IDH+ ++ KYT++ GD+LGDKLEK+CY++KFE TEDGGC++K+ SE Sbjct 61 EGAHFKYMKCRVDEIDHEKHSIKYTLIEGDMLGDKLEKICYDMKFEDTEDGGCVVKVTSE 120 Query 361 YHTKGDFELKDEEVEAGKEQALWLYKACEDYLVAHPDVCA 242 YHTKG +EL DE+++ KEQ+L +YK+CEDYL+A+P VCA Sbjct 121 YHTKGGYELADEDLKGAKEQSLGMYKSCEDYLLANPHVCA 160 >ref|XP_003632378.1| PREDICTED: LOW QUALITY PROTEIN: major allergen Pru ar 1-like [Vitis vinifera] Length=204 Score = 175 bits (443), Expect = 2e-49 Identities = 78/155 (50%), Positives = 115/155 (74%), Gaps = 3/155 (2%) Frame = -2 Query 706 FFQELKTKVSPSRLFKALVTESHEVIPKVTPKVKSIETIEGDFGAVGCLRKTTFPESSQI 527 F QE+ T ++P+ +FKAL+ +SH ++P + P +KSIE +EGD G VG +++T FPE S Sbjct 53 FSQEITTPIAPAIMFKALIVDSHNLVPTLMPSIKSIEFVEGD-GGVGSIKQTNFPEGSHF 111 Query 526 KSVTHRVDAIDHDNYTCKYTMVAGDVLGDKLEKVCYEIKFEATEDGGCMLKLESEYHTKG 347 K + HR+DAIDHDNY+CKYT++ G+VLGD LE + YE+KFEA+ G + K+ S YH+K Sbjct 112 KYLKHRIDAIDHDNYSCKYTLIEGEVLGDTLESISYEVKFEASGSGSSVCKMTSHYHSK- 170 Query 346 DFELKDEEVEAGKEQALWLYKACEDYLVAHPDVCA 242 ELKDE+++ GK++A+ +YK +YL+A+PD A Sbjct 171 -IELKDEDIKTGKDKAMGMYKVVGEYLLANPDAYA 204 >ref|XP_002271428.2| PREDICTED: pathogenesis-related protein STH-2 [Vitis vinifera] emb|CBI22930.3| unnamed protein product [Vitis vinifera] Length=160 Score = 167 bits (424), Expect = 3e-47 Identities = 79/158 (50%), Positives = 115/158 (73%), Gaps = 2/158 (1%) Frame = -2 Query 721 MGVKKFFQELKTKVSPSRLFKALVTESHEVIPKVTPK-VKSIETIEGDFGAVGCLRKTTF 545 MGV F QE T VSP+R+FKALV +SH ++P++ P+ VKSIE +EGD GA G + +T F Sbjct 1 
MGVTTFTQEFVTPVSPARMFKALVVDSHILVPRLVPESVKSIEFVEGDGGA-GSITQTNF 59 Query 544 PESSQIKSVTHRVDAIDHDNYTCKYTMVAGDVLGDKLEKVCYEIKFEATEDGGCMLKLES 365 S + + ++++A+D + C+YT++ G VLGD+LE + YE+KFE + DGGC+ K S Sbjct 60 SGDSDCEYLKYKINAVDKEKLECRYTLIEGGVLGDQLESIVYEMKFEESGDGGCICKTRS 119 Query 364 EYHTKGDFELKDEEVEAGKEQALWLYKACEDYLVAHPD 251 EYHTKG+FE+K+E + GKE+A+ +YK E YL+A+PD Sbjct 120 EYHTKGEFEIKEESIREGKEKAMGVYKLVEAYLLANPD 157 >ref|XP_002273815.2| PREDICTED: pathogenesis-related protein STH-2 [Vitis vinifera] emb|CAN79554.1| hypothetical protein VITISV_025728 [Vitis vinifera] emb|CBI22943.3| unnamed protein product [Vitis vinifera] Length=159 Score = 165 bits (418), Expect = 3e-46 Identities = 77/161 (48%), Positives = 118/161 (73%), Gaps = 3/161 (2%) Frame = -2 Query 721 MGVKKFFQELKTKVSPSRLFKALVTESHEVIPKVTPK-VKSIETIEGDFGAVGCLRKTTF 545 MGV F QE ++PSR+FKAL+ +S+ +IPK+ P+ ++SI+ ++GD GA G + + F Sbjct 1 MGVTSFTQEFTCPIAPSRIFKALILDSNNLIPKLLPQTIRSIDVVQGDGGA-GTIEQVNF 59 Query 544 PESSQIKSVTHRVDAIDHDNYTCKYTMVAGDVLGDKLEKVCYEIKFEATEDGGCMLKLES 365 E+S +K V H+++ +D +N+ CKY M+ GDVLG++LE + +E+KFEA DGG + K+ S Sbjct 60 TEASNLKYVKHQIEELDKENFVCKYRMIEGDVLGEELESIAHEVKFEAA-DGGSICKMAS 118 Query 364 EYHTKGDFELKDEEVEAGKEQALWLYKACEDYLVAHPDVCA 242 EYHTKG FE+K+EE++AGK +A+ +YK E YL+ +P V A Sbjct 119 EYHTKGKFEIKEEEIKAGKARAMGIYKVVEAYLLENPHVYA 159 >ref|NP_001237916.1| uncharacterized protein LOC100527307 [Glycine max] gb|ACU16378.1| unknown [Glycine max] Length=160 Score = 164 bits (414), Expect = 1e-45 Identities = 75/161 (47%), Positives = 110/161 (68%), Gaps = 2/161 (1%) Frame = -2 Query 721 MGVKKFFQELKTKVSPSRLFKALVTESHEVIPKVTPK-VKSIETIEGDFGAVGCLRKTTF 545 MG+ F QE + V+PS +FKAL+ +S ++PK+ P+ VK + I+GD G G + + F Sbjct 1 MGITTFTQEYSSSVAPSPMFKALIVDSRNLLPKLLPQFVKDVNVIQGD-GEAGSIEQVNF 59 Query 544 PESSQIKSVTHRVDAIDHDNYTCKYTMVAGDVLGDKLEKVCYEIKFEATEDGGCMLKLES 365 E + K + HR+D +D DN CKYTM+ GD LGDKLE + YE+KFEAT DGGC+ K+ S Sbjct 60 NEDNPFKYLKHRIDVLDKDNLVCKYTMIEGDPLGDKLESIGYEVKFEATSDGGCLCKMTS 119 Query 364 
EYHTKGDFELKDEEVEAGKEQALWLYKACEDYLVAHPDVCA 242 Y+T G+F++K+EEV+ G+E + +Y+ E YL+ +P V A Sbjct 120 NYNTIGEFDVKEEEVKEGRESGIAVYRVVESYLLENPQVYA 160 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 510140310 Number of extensions: 9975518 Number of successful extensions: 26073 Number of sequences better than 1e-10: 45 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 26017 Number of HSP's successfully gapped: 45 Length of query: 1109 Length of database: 6150218869 Length adjustment: 142 Effective length of query: 967 Effective length of database: 3605708941 Effective search space: 818495929607 Effective search space used: 818495929607 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 178 (73.2 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. RID: UPDZFM7P01S Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_337 Length=1105 Score E Sequences producing significant alignments: (Bits) Value ref|XP_003552036.1| PREDICTED: uncharacterized protein LOC100... 303 2e-95 ref|XP_003519788.1| PREDICTED: uncharacterized protein LOC100... 
300 7e-95 ref|XP_002893280.1| hypothetical protein ARALYDRAFT_472593 [A... 299 8e-95 ref|XP_003530633.1| PREDICTED: uncharacterized protein LOC100... 301 1e-94 ref|XP_002315678.1| predicted protein [Populus trichocarpa] >... 300 2e-94 ALIGNMENTS >ref|XP_003552036.1| PREDICTED: uncharacterized protein LOC100788950 [Glycine max] Length=464 Score = 303 bits (775), Expect = 2e-95 Identities = 156/294 (53%), Positives = 202/294 (69%), Gaps = 13/294 (4%) Frame = -2 Query 987 ESCPKNIVPIRRITKEDVLRASSVRTFVHKPRV--IIKDSSGNTHEHAIAYVTGGSYYGA 814 ESCP+ +PIRR T+ED+LRA+SVR F K + + +D+SGN HEHAI YVTG YYGA Sbjct 172 ESCPEGTIPIRRTTEEDMLRANSVRRFGRKKVINRVRRDTSGNGHEHAIGYVTGDQYYGA 231 Query 813 STAMDVWAPKVVNPNEFSLSQIWVMAGSFEKNNLNSIEAGWQI---WYGDKYPKLFVFWT 643 +++VWAP V NP EFSLSQ+WV++GSF ++LN+IEAGWQ+ YGD YP+ F +WT Sbjct 232 KASINVWAPLVENPYEFSLSQMWVISGSFG-DDLNTIEAGWQVSPELYGDSYPRFFTYWT 290 Query 642 RDAYQTTGCYNLLCKGFVQKSKTIVPGRAISPFSVYNNANKHYDISFDIYKSSRTGNWLL 463 DAYQ TGCYNLLC GFVQ + I G AISP S Y+ +DIS I+K + GNW L Sbjct 291 TDAYQATGCYNLLCSGFVQTNSKIAIGAAISPTSSYSGGQ--FDISLLIWKDPKHGNWWL 348 Query 462 KVNN-VQVGYWPWHIFTTLRGPADIIQFGGEIVNTGMFGAHTKTEMGSGHFSNEGYGQAA 286 + + + VGYWP +FT L A +IQFGGEIVN+G G+HT T+MGSGHF+ EG+ +A+ Sbjct 349 EFGSGILVGYWPSFLFTHLGDHASMIQFGGEIVNSGSSGSHTSTQMGSGHFAEEGFAKAS 408 Query 285 CFKKIQVYSSTNCQNPLGSLDLQVYNETDKCYNTRVSGRDYNILGNHFYYGGPG 124 F+ +QV N PL +L+V + CY+ + G N GN+FYYGGPG Sbjct 409 YFRNMQVVDWDNNLIPLS--NLKVLADHPNCYD--IQGGVNNAWGNYFYYGGPG 458 >ref|XP_003519788.1| PREDICTED: uncharacterized protein LOC100776135 [Glycine max] Length=417 Score = 300 bits (767), Expect = 7e-95 Identities = 152/293 (52%), Positives = 205/293 (70%), Gaps = 12/293 (4%) Frame = -2 Query 987 ESCPKNIVPIRRITKEDVLRASSVRTFVHK-PRVIIKDSSGNTHEHAIAYVTGGSYYGAS 811 ESCP+ +PIRR T++D+LRASSV F K R + +D++ N HEHA+ YV+G YYGA Sbjct 126 ESCPEGTIPIRRTTEQDMLRASSVSRFGRKIRRRVRRDTNSNGHEHAVGYVSGEQYYGAK 185 Query 810 TAMDVWAPKVVNPNEFSLSQIWVMAGSFEKNNLNSIEAGWQIW---YGDKYPKLFVFWTR 640 +++VWAP+V N +EFSLSQ+WV++GSF ++LN+IEAGWQ+ 
YGD+YP+ F +WT Sbjct 186 ASINVWAPRVENQDEFSLSQMWVISGSFG-DDLNTIEAGWQVSPEIYGDRYPRFFTYWTS 244 Query 639 DAYQTTGCYNLLCKGFVQKSKTIVPGRAISPFSVYNNANKHYDISFDIYKSSRTGNWLLK 460 DAYQ TGCYNLLC GFVQ + I G AISP S Y A +DIS I+K + GNW L+ Sbjct 245 DAYQATGCYNLLCSGFVQTNNRIAIGAAISPTSSY--AGGQFDISLLIWKDPKHGNWWLE 302 Query 459 VNN-VQVGYWPWHIFTTLRGPADIIQFGGEIVNTGMFGAHTKTEMGSGHFSNEGYGQAAC 283 + + VGYWP +FT LR A ++QFGGEIVN+ G+HT T+MGSGHF++EG+G+A+ Sbjct 303 FGSGILVGYWPSFLFTHLRDHASMVQFGGEIVNSRQSGSHTSTQMGSGHFASEGFGKASY 362 Query 282 FKKIQVYSSTNCQNPLGSLDLQVYNETDKCYNTRVSGRDYNILGNHFYYGGPG 124 F+ +QV N PL +L+V + CY+ + G N+ GN+FYYGGPG Sbjct 363 FRNMQVVDWDNNLVPLS--NLRVLADHPNCYD--IQGGINNVWGNYFYYGGPG 411 >ref|XP_002893280.1| hypothetical protein ARALYDRAFT_472593 [Arabidopsis lyrata subsp. lyrata] gb|EFH69539.1| hypothetical protein ARALYDRAFT_472593 [Arabidopsis lyrata subsp. lyrata] Length=408 Score = 299 bits (766), Expect = 8e-95 Identities = 156/316 (49%), Positives = 207/316 (66%), Gaps = 20/316 (6%) Frame = -2 Query 1032 PEMQKGPTSRNNTIK---------ESCPKNIVPIRRITKEDVLRASSVRTFVHKPRVIIK 880 PEM G + N + + ESCP+ +PIRR T++D+LRASSVR F K R + + Sbjct 94 PEMPIGYSQENESYENFQLWSLSGESCPEGTIPIRRTTEQDMLRASSVRRFGRKIRRVRR 153 Query 879 DSSGNTHEHAIAYVTGGSYYGASTAMDVWAPKVVNPNEFSLSQIWVMAGSFEKNNLNSIE 700 DSS N HEHA+ YV+G YYGA +++VW P+V++ EFSLSQIWV+AGSF ++LN+IE Sbjct 154 DSSSNGHEHAVGYVSGSQYYGAKASINVWTPRVISQYEFSLSQIWVIAGSF-ADDLNTIE 212 Query 699 AGWQI---WYGDKYPKLFVFWTRDAYQTTGCYNLLCKGFVQKSKTIVPGRAISPFSVYNN 529 AGWQI YGD P+ F +WT DAYQ TGCYNLLC GFVQ + I G AISP S Y Sbjct 213 AGWQISPELYGDTNPRFFTYWTSDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYKG 272 Query 528 ANKHYDISFDIYKSSRTGNWLLKV-NNVQVGYWPWHIFTTLRGPADIIQFGGEIVNTGMF 352 +DIS I+K + G+W L+ + VGYWP +FT LR +++QFGGEIVNT Sbjct 273 G--QFDISLLIWKDPKHGHWWLQFGSGTLVGYWPVSLFTHLREHGNMVQFGGEIVNTRPG 330 Query 351 GAHTKTEMGSGHFSNEGYGQAACFKKIQVYSSTNCQNPLGSLDLQVYNETDKCYNTRVSG 172 G+HT T+MGSGHF+ EG+G+A+ F+ +Q+ N P+ +L+V + CY+ R G Sbjct 331 GSHTSTQMGSGHFAGEGFGKASYFRNLQMVDWDNTLIPIS--NLKVLADHPNCYDIR--G 386 
Query 171 RDYNILGNHFYYGGPG 124 + GN+FYYGGPG Sbjct 387 GVNRVWGNYFYYGGPG 402 >ref|XP_003530633.1| PREDICTED: uncharacterized protein LOC100792240 [Glycine max] Length=463 Score = 301 bits (770), Expect = 1e-94 Identities = 158/312 (51%), Positives = 208/312 (67%), Gaps = 17/312 (5%) Frame = -2 Query 1029 EMQKGPTSRNNTI----KESCPKNIVPIRRITKEDVLRASSVRTFVHKPRV--IIKDSSG 868 +M G S N + ESCP+ +PIRR T++D+LRA+SVR F K + + +D+SG Sbjct 153 QMDDGDLSENFQLWSFSGESCPEGTIPIRRTTEQDMLRATSVRRFGRKKIINRVRRDTSG 212 Query 867 NTHEHAIAYVTGGSYYGASTAMDVWAPKVVNPNEFSLSQIWVMAGSFEKNNLNSIEAGWQ 688 N HEHAI YVTG YYG+ +++VWAP V NP EFSLSQ+WV++GSF ++LN+IEAGWQ Sbjct 213 NGHEHAIGYVTGDQYYGSKASINVWAPLVENPYEFSLSQMWVISGSFG-DDLNTIEAGWQ 271 Query 687 I---WYGDKYPKLFVFWTRDAYQTTGCYNLLCKGFVQKSKTIVPGRAISPFSVYNNANKH 517 + YGD YP+ F +WT DAYQ TGCYNLLC GFVQ + I G AISP S Y+ Sbjct 272 VSPELYGDSYPRFFTYWTTDAYQATGCYNLLCSGFVQTNSKIAIGAAISPTSSYSGGQ-- 329 Query 516 YDISFDIYKSSRTGNWLLKVNN-VQVGYWPWHIFTTLRGPADIIQFGGEIVNTGMFGAHT 340 +DIS I+K + GNW L+ + + VGYWP +FT L A +IQFGGEIVN+G G+HT Sbjct 330 FDISLLIWKDPKHGNWWLEFGSGILVGYWPSFLFTHLGDHASMIQFGGEIVNSGSSGSHT 389 Query 339 KTEMGSGHFSNEGYGQAACFKKIQVYSSTNCQNPLGSLDLQVYNETDKCYNTRVSGRDYN 160 T+MGSGHF+ EG+ +A+ F+ +QV N PL +L+V + CY+ + G N Sbjct 390 STQMGSGHFAEEGFAKASYFRNMQVVDWDNNLIPLS--NLKVLADHPNCYD--IQGGVNN 445 Query 159 ILGNHFYYGGPG 124 GN+FYYGGPG Sbjct 446 AWGNYFYYGGPG 457 >ref|XP_002315678.1| predicted protein [Populus trichocarpa] gb|EEF01849.1| predicted protein [Populus trichocarpa] Length=466 Score = 300 bits (768), Expect = 2e-94 Identities = 154/293 (53%), Positives = 201/293 (69%), Gaps = 12/293 (4%) Frame = -2 Query 987 ESCPKNIVPIRRITKEDVLRASSVRTFVHK-PRVIIKDSSGNTHEHAIAYVTGGSYYGAS 811 ESCP+ VPIRR T++D+LRASSVR F K R + +D++ N HEHA+ YVTG YYGA Sbjct 175 ESCPEGTVPIRRTTEQDMLRASSVRRFGRKLRRHVRRDTNSNGHEHAVGYVTGDQYYGAK 234 Query 810 TAMDVWAPKVVNPNEFSLSQIWVMAGSFEKNNLNSIEAGWQI---WYGDKYPKLFVFWTR 640 +++VWAP+V N EFSLSQ+WV++GSF ++LN+IEAGWQ+ YGD YP+ F +WT Sbjct 235 
ASINVWAPRVSNQYEFSLSQMWVISGSFG-DDLNTIEAGWQVSPELYGDNYPRFFTYWTT 293 Query 639 DAYQTTGCYNLLCKGFVQKSKTIVPGRAISPFSVYNNANKHYDISFDIYKSSRTGNWLLK 460 DAYQ TGCYNLLC GFVQ + I G AISP S Y+ +DIS ++K + GNW L+ Sbjct 294 DAYQATGCYNLLCSGFVQTNSRIAIGAAISPTSSYSGGQ--FDISLLVWKDPKHGNWWLE 351 Query 459 VNN-VQVGYWPWHIFTTLRGPADIIQFGGEIVNTGMFGAHTKTEMGSGHFSNEGYGQAAC 283 N V VGYWP +FT LR A ++QFGGEIVN+ G HT T+MGSGHF+ EG+G+A+ Sbjct 352 FGNGVLVGYWPSFLFTHLRDHASMVQFGGEIVNSRPSGFHTSTQMGSGHFAGEGFGKASY 411 Query 282 FKKIQVYSSTNCQNPLGSLDLQVYNETDKCYNTRVSGRDYNILGNHFYYGGPG 124 F+ +QV N PL +L+V + CY+ + G + GN+FYYGGPG Sbjct 412 FRNLQVVDWDNNLIPLS--NLRVLADHPNCYD--IQGGINRVWGNYFYYGGPG 460 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 618672975 Number of extensions: 14276356 Number of successful extensions: 36175 Number of sequences better than 1e-10: 34 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 36001 Number of HSP's successfully gapped: 36 Length of query: 1105 Length of database: 6150218869 Length adjustment: 142 Effective length of query: 963 Effective length of database: 3605708941 Effective search space: 814890220666 Effective search space used: 814890220666 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 177 (72.8 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. 
RID: UPDZG0NG01S Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_376 Length=1089 Score E Sequences producing significant alignments: (Bits) Value ref|XP_003534672.1| PREDICTED: isoliquiritigenin 2'-O-methylt... 414 2e-140 ref|NP_001242767.1| uncharacterized protein LOC100793053 [Gly... 414 3e-140 ref|NP_001242792.1| uncharacterized protein LOC100779492 [Gly... 408 4e-138 ref|XP_003556605.1| PREDICTED: LOW QUALITY PROTEIN: isoliquir... 360 3e-119 dbj|BAA13683.1| O-methyltransferase [Glycyrrhiza echinata] 318 6e-103 ALIGNMENTS >ref|XP_003534672.1| PREDICTED: isoliquiritigenin 2'-O-methyltransferase-like [Glycine max] Length=369 Score = 414 bits (1065), Expect = 2e-140 Identities = 195/286 (68%), Positives = 236/286 (83%), Gaps = 1/286 (0%) Frame = +3 Query 3 LTVLASYSLLTCSIRTNEDGKRERVYALSSIGQYFACDSD-GGSLAPLSTLIHRGCNSVW 179 L +LASYSLL CSIRTNEDGKRERVYALS +G YFA D D G SLAPLS+LIHRG + +W Sbjct 84 LPLLASYSLLNCSIRTNEDGKRERVYALSPVGAYFAFDKDEGSSLAPLSSLIHRGFHDMW 143 Query 180 GDAKDAILDPDVKNIFQSINGTSFYQYTKTNKELNDTCNKAMAHSGPLEIKRILQFYKGF 359 D KDAI+DP+ N F++++G Y Y + N ELND KA+ H+ PLE+KR L+ YKGF Sbjct 144 KDVKDAIVDPNNNNHFENVHGIPPYDYMEKNAELNDIFYKAVIHAAPLELKRALKLYKGF 203 Query 360 EGVSTLVDVGGGVGKTLKLIISQYPSIKGINFDMPQVVQNAPSHPGLEHVGGDMFESVPT 539 EGVSTLVDVGGG G+TLK I+ +YPS+KGINFD+P V+Q AP HPG+E + GDMFESVPT Sbjct 204 EGVSTLVDVGGGAGETLKQILPKYPSMKGINFDLPLVIQKAPPHPGIEQIAGDMFESVPT 263 Query 540 GDAIVLKLVCHNWADEECVKFLKNCHKALPKHGKVIVLDYIIPEVPNPSNMSKHACAIDN 719 GDAI++K VCHNWADE+C+KFL+N HKALP+HGKVIV +YIIPEVPNPS +SKH C +DN Sbjct 264 GDAILVKFVCHNWADEDCIKFLRNFHKALPQHGKVIVFEYIIPEVPNPSYISKHTCTLDN 323 Query 720 LMFLVTTGKERTEEEFESLCKRAGFSKFHVACSDVSAMSGVMEFYK 857 +MFL G+ERT++EFE+LCK +GFSKFHVA SD+S+ GVMEFYK Sbjct 324 VMFLAHGGRERTQKEFENLCKSSGFSKFHVASSDISSTLGVMEFYK 369 >ref|NP_001242767.1| uncharacterized protein LOC100793053 [Glycine max] gb|ACU22842.1| 
unknown [Glycine max] Length=370 Score = 414 bits (1064), Expect = 3e-140 Identities = 199/285 (70%), Positives = 236/285 (83%), Gaps = 0/285 (0%) Frame = +3 Query 3 LTVLASYSLLTCSIRTNEDGKRERVYALSSIGQYFACDSDGGSLAPLSTLIHRGCNSVWG 182 L VLASYSLL CSIRTNEDG RER+YALS IGQYFACD+DGGSL PLS+L HRG V Sbjct 86 LPVLASYSLLNCSIRTNEDGVRERLYALSPIGQYFACDNDGGSLGPLSSLFHRGYFHVLK 145 Query 183 DAKDAILDPDVKNIFQSINGTSFYQYTKTNKELNDTCNKAMAHSGPLEIKRILQFYKGFE 362 D KDAI+DP+ + FQ+++G YQY KT++ELN NKA+A +GP +K +L+ YKGFE Sbjct 146 DVKDAIMDPNNNDHFQNVHGMPPYQYMKTDEELNKLFNKALAQTGPPAMKMLLKLYKGFE 205 Query 363 GVSTLVDVGGGVGKTLKLIISQYPSIKGINFDMPQVVQNAPSHPGLEHVGGDMFESVPTG 542 VSTLVDVGGGVG+TLK II YPSIKGINFD+PQV+Q+AP HPG+EHV GDMFESVP G Sbjct 206 QVSTLVDVGGGVGETLKQIIFDYPSIKGINFDLPQVIQDAPPHPGIEHVEGDMFESVPKG 265 Query 543 DAIVLKLVCHNWADEECVKFLKNCHKALPKHGKVIVLDYIIPEVPNPSNMSKHACAIDNL 722 DAI+LKLVCHNW DE+CVKFL+NC+KALP+HGKVIV+DYIIPEVP+ S +S C D+L Sbjct 266 DAILLKLVCHNWLDEDCVKFLRNCYKALPQHGKVIVIDYIIPEVPDSSKISMQTCVADSL 325 Query 723 MFLVTTGKERTEEEFESLCKRAGFSKFHVACSDVSAMSGVMEFYK 857 MFLVT+GKERTE+EFESLC+ +GFS FHVAC D ++ V+EFYK Sbjct 326 MFLVTSGKERTEKEFESLCRNSGFSGFHVACRDSPSVLSVVEFYK 370 >ref|NP_001242792.1| uncharacterized protein LOC100779492 [Glycine max] gb|ACU18726.1| unknown [Glycine max] Length=357 Score = 408 bits (1048), Expect = 4e-138 Identities = 197/285 (69%), Positives = 234/285 (82%), Gaps = 0/285 (0%) Frame = +3 Query 3 LTVLASYSLLTCSIRTNEDGKRERVYALSSIGQYFACDSDGGSLAPLSTLIHRGCNSVWG 182 L VLASYSLL C IRT EDG RER+YALS IGQYFA D DGGSL PLS+L HRG V Sbjct 73 LPVLASYSLLNCFIRTTEDGVRERLYALSPIGQYFASDDDGGSLGPLSSLFHRGYFHVLK 132 Query 183 DAKDAILDPDVKNIFQSINGTSFYQYTKTNKELNDTCNKAMAHSGPLEIKRILQFYKGFE 362 D KDAI+DP+ + FQ+++G YQY KT++ELN NKA+A +GP +K +L+ YKGFE Sbjct 133 DVKDAIMDPNNNDHFQNVHGMPPYQYMKTDEELNKLFNKALAQTGPPAMKMLLKLYKGFE 192 Query 363 GVSTLVDVGGGVGKTLKLIISQYPSIKGINFDMPQVVQNAPSHPGLEHVGGDMFESVPTG 542 VSTLVDVGGGVG+TLK II +YPSIKGINFD+PQVVQ+AP +PG+EHV GDMFESVP G Sbjct 193 
QVSTLVDVGGGVGETLKQIIFEYPSIKGINFDLPQVVQDAPPYPGIEHVEGDMFESVPKG 252 Query 543 DAIVLKLVCHNWADEECVKFLKNCHKALPKHGKVIVLDYIIPEVPNPSNMSKHACAIDNL 722 DAI+LKLVCHNW DE+CVKFL+NCHKALP+HGKVIV+DYIIPEVP+ S +S C D+L Sbjct 253 DAILLKLVCHNWLDEDCVKFLRNCHKALPQHGKVIVIDYIIPEVPDSSKISMQTCVADSL 312 Query 723 MFLVTTGKERTEEEFESLCKRAGFSKFHVACSDVSAMSGVMEFYK 857 MFLVT+GKERTE+EFESLC+ +GFS+FHVAC D ++ V+EFYK Sbjct 313 MFLVTSGKERTEKEFESLCRNSGFSRFHVACRDSPSVLSVIEFYK 357 >ref|XP_003556605.1| PREDICTED: LOW QUALITY PROTEIN: isoliquiritigenin 2'-O-methyltransferase-like [Glycine max] Length=352 Score = 360 bits (923), Expect = 3e-119 Identities = 182/285 (64%), Positives = 220/285 (77%), Gaps = 6/285 (2%) Frame = +3 Query 9 VLASYSLLTCSIRTNEDGKRERVYALSSIGQYFACDSD-GGSLAPLSTLIHRGCNSVWGD 185 +LASYSLL CSIRTNEDGKRERVYALS +GQYFA D D G SLAPLSTLIHRG + Sbjct 72 LLASYSLLNCSIRTNEDGKRERVYALSPVGQYFAFDKDEGNSLAPLSTLIHRGFHDFI-- 129 Query 186 AKDAILDPDVKNIFQSINGTSFYQYTKTNKELNDTC-NKAMAHSGPLEIKRILQFYKGFE 362 KDAI+DP+ N F+ ++G Y Y + N ELND KA PLE+KR L+ Y GFE Sbjct 130 EKDAIVDPNNNNHFEYVHGIPPYDYMEKNAELNDIFFYKARILDAPLELKRALKLYIGFE 189 Query 363 GVSTLVDVGGGVGKTLKLIISQYPSIKGINFDMPQVVQNAPSHPGLEHVGGDMFESVPTG 542 VS LVDVGGGVG+TLK ++ +YPS+KGINFD+PQV+Q AP H G+EH+ GDMFESVPTG Sbjct 190 RVSILVDVGGGVGETLKQLLPKYPSMKGINFDLPQVIQKAPPHQGIEHIEGDMFESVPTG 249 Query 543 DAIVLKLVCHNWADEECVKFLKNCHKALPKHGKVIVLDYIIPEVPNPSNMSKHACAIDNL 722 D I++K VCH+WADE+ +KFL+NCHKAL +HGKV+V +YIIPEVPNP +SKH C +DN+ Sbjct 250 DVILMKFVCHSWADEDGIKFLRNCHKALLQHGKVVVFEYIIPEVPNPRYISKHTCTLDNV 309 Query 723 MFLVTTGKERTEEEFESLCKRAGFSKFHVACSDVSAMSGVMEFYK 857 MFL +ERT+ EFE+L + GFSKF VA SD+S+ GVMEFYK Sbjct 310 MFLAQGRRERTQGEFENLXE--GFSKFDVASSDISSTLGVMEFYK 352 >dbj|BAA13683.1| O-methyltransferase [Glycyrrhiza echinata] Length=367 Score = 318 bits (816), Expect = 6e-103 Identities = 156/286 (55%), Positives = 215/286 (75%), Gaps = 8/286 (3%) Frame = +3 Query 3 LTVLASYSLLTCSIRTNEDGKRERVYALSSIGQYFACDSDGGSLAPLST-LIHRGCNSVW 179 L +LASYS+LTC+ R+ E RVY LS +G+Y D G LA +T L + +VW Sbjct 89 
LRLLASYSVLTCATRSTE-----RVYGLSQVGKYLVPDGSRGYLASFTTFLCYPALMNVW 143 Query 180 GDAKDAILDPDVKNIFQSINGTSFYQYTKTNKELNDTCNKAMAHSGPLEIKRILQFYKGF 359 + K+A++D D+ ++F+ ++G S Y+Y +T+ ++N NK+MA E+KRILQ YKGF Sbjct 144 LNFKEAVVDEDI-DLFKKLHGVSKYEYMETDPKMNHIFNKSMADVCATEMKRILQIYKGF 202 Query 360 EGVSTLVDVGGGVGKTLKLIISQYPSIKGINFDMPQVVQNAPSHPGLEHVGGDMFESVPT 539 EG+STLVDVGGG G+ LK+IIS+YP IKGINFD+PQV++NAP PG+E VGGDMF SVP Sbjct 203 EGISTLVDVGGGNGQNLKMIISKYPLIKGINFDLPQVIENAPPIPGIELVGGDMFASVPQ 262 Query 540 GDAIVLKLVCHNWADEECVKFLKNCHKALPKHGKVIVLDYIIPEVPNPSNMSKHACAIDN 719 GDA++LK VCHNW+DE+C++FL NCHKAL +GKVIV+++I+PE P P+ S+ A +DN Sbjct 263 GDAMILKAVCHNWSDEKCLEFLSNCHKALSPNGKVIVVEFILPEEPEPTEESQLASTLDN 322 Query 720 LMFLVTTGKERTEEEFESLCKRAGFSKFHVACSDVSAMSGVMEFYK 857 +MF+ G+ERT++++E++CK AGFSKF VAC S++ GVMEFYK Sbjct 323 IMFITVGGRERTQKQYENMCKLAGFSKFQVACRAFSSL-GVMEFYK 367 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 572842297 Number of extensions: 12400418 Number of successful extensions: 33187 Number of sequences better than 1e-10: 13 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 33091 Number of HSP's successfully gapped: 13 Length of query: 1089 Length of database: 6150218869 Length adjustment: 142 Effective length of query: 947 Effective length of database: 3605708941 Effective search space: 796861675961 Effective search space used: 796861675961 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 177 (72.8 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 
43.6362A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. RID: UPE5FBTY016 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_4028 Length=689 Score E Sequences producing significant alignments: (Bits) Value ref|XP_003555129.1| PREDICTED: LOB domain-containing protein ... 95.5 2e-20 ref|XP_003541285.1| PREDICTED: LOB domain-containing protein ... 92.8 2e-19 ref|XP_002276899.1| PREDICTED: LOB domain-containing protein ... 92.4 2e-19 emb|CAN77368.1| hypothetical protein VITISV_011159 [Vitis vin... 92.4 2e-19 ref|XP_002283795.1| PREDICTED: LOB domain-containing protein ... 91.7 5e-19 ALIGNMENTS >ref|XP_003555129.1| PREDICTED: LOB domain-containing protein 41-like [Glycine max] Length=293 Score = 95.5 bits (236), Expect = 2e-20 Identities = 80/200 (40%), Positives = 105/200 (53%), Gaps = 19/200 (10%) Frame = -1 Query 689 QLCQDAVEAVLKGDPITQMESDVAKNG--PPLKACDIRHINKEDNLSGSNELRRVRTRCR 516 QLCQ AVEA+LKG+PIT + S+ A NG PPLKA DIRH++K++N +NE + +TR R Sbjct 94 QLCQAAVEAILKGEPITPITSEAAANGRAPPLKAYDIRHVSKDEN--SANETPKAKTRSR 151 Query 515 FKRSGA---KSKRSR-VWEGPVKESVNEKEVTRspshesslshqsEMAEPVVERASQLSE 348 F R+G+ K K S+ G V E E + S A +VE S+ SE Sbjct 152 FNRTGSTLIKPKASKGTPTGFVPIEPVEPETANRTTSHESGLSHLSEAAVMVEGESKESE 211 Query 347 IVGSAETS----AEQESVG--VEAALNSDIEIELDLTLGLEPLRSC-----VKPKEVKPS 201 S ETS E ESV + S EI L+LTLG EP+ +K +++ S Sbjct 212 SDVSVETSNLFHEEPESVAKTSDRTGESGNEIGLELTLGFEPVSRVHHVVPMKKRKIIAS 271 Query 200 HSDEPVCEDGLWKMELGLDY 141 S E KMELGL+Y Sbjct 272 KSYGDSAEKDSCKMELGLEY 291 >ref|XP_003541285.1| PREDICTED: LOB domain-containing protein 41-like [Glycine max] Length=288 Score = 92.8 bits (229), Expect = 2e-19 Identities = 76/196 (39%), Positives = 102/196 (52%), Gaps = 16/196 (8%) Frame = -1 
Query 689 QLCQDAVEAVLKGDPITQMESDVAKNG--PPLKACDIRHINKEDNLSGSNELRRVRTRCR 516 QLCQ AVEAVLKG+PIT + S+ A NG PPLKA DIRH++K+ N +NE + +TR R Sbjct 94 QLCQAAVEAVLKGEPITPITSEAAANGRAPPLKAYDIRHVSKDQN--SANETPKTKTRSR 151 Query 515 FKRSGAKSKRSRVWEGPVKESVNEKEVTRspshesslshqsEMAEPVVERASQLSEIVGS 336 FKR+ + + +G V E E+ + S A +VE S+ SE V S Sbjct 152 FKRTSGTLIKPKASKGTGFVPV-EPEMANRTASHESGLSHLSEATAMVEGESKESESVVS 210 Query 335 AETS----AEQESVG--VEAALNSDIEIELDLTLGLEPLRSC-----VKPKEVKPSHSDE 189 +TS E E V + S EI L+LTLG EP+ +K +++ S Sbjct 211 MDTSNLIHEEPEWVAKTSDGTGESGNEIGLELTLGFEPVSRLHHVVPMKKRKIIELKSCG 270 Query 188 PVCEDGLWKMELGLDY 141 E KMELGL+Y Sbjct 271 DSAEKDSCKMELGLEY 286 >ref|XP_002276899.1| PREDICTED: LOB domain-containing protein 41 [Vitis vinifera] Length=272 Score = 92.4 bits (228), Expect = 2e-19 Identities = 76/193 (39%), Positives = 104/193 (54%), Gaps = 26/193 (13%) Frame = -1 Query 689 QLCQDAVEAVLKGDPITQM--ESDVAKNGPPLKACDIRHINKEDNLSG-SNELRRVRTRC 519 QLCQ AVEAVL G PI Q+ ES V+ PPLKA DIRH++K++N +G S+EL +V++R Sbjct 94 QLCQSAVEAVLSGAPIMQISAESAVSSMSPPLKAGDIRHVSKDENSAGSSHELHKVKSRG 153 Query 518 RFKRSGAKSKRSRVWEGPVKESVNEKEVTRspshesslshqsEMAEPVVERASQLSEIVG 339 RFKRS K R+RV ES E + + + + R S++S G Sbjct 154 RFKRSSGK-PRARV------ESAAEFDESAGVILSRCY-------KSELSRDSRVSH-PG 198 Query 338 SAETSAEQESVGVEAALNSDI-------EIELDLTLGLEPLRSCVKPKEVKPSHSDEPVC 180 S E S E +S+ VE S + ++EL+LTLGLEP+ K V + Sbjct 199 SRE-SGEADSMSVETVEASLVKPTRDGSDVELELTLGLEPIPKVQKACAVVREKEEMAGS 257 Query 179 EDGLWKMELGLDY 141 ED K+EL L+Y Sbjct 258 EDDTCKVELALEY 270 >emb|CAN77368.1| hypothetical protein VITISV_011159 [Vitis vinifera] Length=272 Score = 92.4 bits (228), Expect = 2e-19 Identities = 76/193 (39%), Positives = 104/193 (54%), Gaps = 26/193 (13%) Frame = -1 Query 689 QLCQDAVEAVLKGDPITQM--ESDVAKNGPPLKACDIRHINKEDNLSG-SNELRRVRTRC 519 QLCQ AVEAVL G PI Q+ ES V+ PPLKA DIRH++K++N +G S+EL +V++R Sbjct 94 QLCQSAVEAVLSGAPIMQISAESAVSSMSPPLKAGDIRHVSKDENSAGSSHELHKVKSRG 153 Query 518 
RFKRSGAKSKRSRVWEGPVKESVNEKEVTRspshesslshqsEMAEPVVERASQLSEIVG 339 RFKRS K R+RV ES E + + + + R S++S G Sbjct 154 RFKRSSGK-PRARV------ESAAEFDESAGVILSRCY-------KSELSRDSRVSH-PG 198 Query 338 SAETSAEQESVGVEAALNSDI-------EIELDLTLGLEPLRSCVKPKEVKPSHSDEPVC 180 S E S E +S+ VE S + ++EL+LTLGLEP+ K V + Sbjct 199 SRE-SGEADSMSVETVEASLVKPTRDGSDVELELTLGLEPIPKVQKACAVVREKEEMAGS 257 Query 179 EDGLWKMELGLDY 141 ED K+EL L+Y Sbjct 258 EDDTCKVELALEY 270 >ref|XP_002283795.1| PREDICTED: LOB domain-containing protein 41 [Vitis vinifera] emb|CAN72182.1| hypothetical protein VITISV_004355 [Vitis vinifera] Length=283 Score = 91.7 bits (226), Expect = 5e-19 Identities = 70/167 (42%), Positives = 97/167 (58%), Gaps = 16/167 (10%) Frame = -1 Query 686 LCQDAVEAVLKGDPITQMESDVAK--NGPPLKACDIRHINKEDNLSGSNELRRVRTRCRF 513 LCQ AVEAVLKG+PITQ+ S+ A GPPLKA DIRH++K++N S +N+L R + RCRF Sbjct 95 LCQAAVEAVLKGEPITQITSEAAATGQGPPLKAYDIRHVSKDEN-SAANDLHRAKNRCRF 153 Query 512 KRSGAKSKR-SRVWEGPVKES---VNEKEVT-RspshesslshqsEMAEP-VVERASQLS 351 KRSG ++K ++ V ES +N + S S S +EP VE S+ + Sbjct 154 KRSGVRAKSGNKAGTNSVAESKQGMNRGSLADELNRSTSHDSTVSHPSEPHTVEGESRET 213 Query 350 EIVGSAETS-------AEQESVGVEAALNSDIEIELDLTLGLEPLRS 231 + S ET+ AEQ+ V + +EL+LTLGL P++S Sbjct 214 VSMVSVETAEPSFLFRAEQKRVQKPEKREENRGLELELTLGLGPIQS 260 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 335060043 Number of extensions: 7155018 Number of successful extensions: 22451 Number of sequences better than 1e-10: 0 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 22399 Number of HSP's successfully gapped: 0 Length of query: 689 Length of database: 6150218869 Length 
adjustment: 136
Effective length of query: 553
Effective length of database: 3713223445
Effective search space: 345329780385
Effective search space used: 345329780385
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 174 (71.6 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362

A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

RID: UPE0M6BB016
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
17,919,084 sequences; 6,150,218,869 total letters
Query= TrVeIntMedtrGB1_416
Length=1072

Sequences producing significant alignments: Score (Bits) E Value
emb|CAN73038.1| hypothetical protein VITISV_044348 [Vitis vin... 161 6e-44
ref|XP_003597035.1| Cysteine-rich repeat secretory protein [M... 160 1e-43
dbj|BAJ34612.1| unnamed protein product [Thellungiella haloph... 157 5e-42
ref|XP_003543377.1| PREDICTED: cysteine-rich repeat secretory... 156 5e-42
ref|XP_002327834.1| predicted protein [Populus trichocarpa] >...
155 2e-41 ALIGNMENTS >emb|CAN73038.1| hypothetical protein VITISV_044348 [Vitis vinifera] Length=243 Score = 161 bits (408), Expect = 6e-44 Identities = 109/251 (43%), Positives = 139/251 (55%), Gaps = 15/251 (6%) Frame = +1 Query 49 MAYSISIRAFSLIISLILIPTALSQQSDLLNKVCIATSKNYTQNSTFSNNLKNLLDDLVQ 228 M +S SI L++SL+L + +D L C +NYT N FS+NL L L Sbjct 1 MYFSKSIPLCFLLLSLLL---QAAIGADPLFHFCFIP-ENYTANDPFSSNLNELTTLLSS 56 Query 229 KTPHQIHLFSNEKTGQGPDTTYGLALCRGDVSPDYCDFCLYDARCKITQRC-YTKSAIVM 405 K P F TGQG D+ GLALCRGDVS C C+ +A ++ RC Y K AI+ Sbjct 57 KVPPTG--FGLASTGQGQDSVNGLALCRGDVSSRDCKTCVTEATKELHNRCPYNKGAIIW 114 Query 406 YDYCQVKYSNENFFGKIDHTNDNVLINVYNV-SANQTKQFKDASIGLLNKLSNQAAHSKS 582 YD C YSN FFG+ID+ N VYNV SA+ F + LL+ LSN+A S Sbjct 115 YDNCLFNYSNVKFFGEIDNKNK---FYVYNVQSADDPTSFNVKTKELLSSLSNKAYGSPM 171 Query 583 LFAKGDKLFDKKENVSIYGMVQCTQDLSNDSCKKCLGDVVKKLPIYKDEIYSVGARVFGS 762 LFA G+ + + E++ +YG+ QCT+DLS+ CKK L D V +LP Y D G RV G Sbjct 172 LFATGELVLE--ESMKLYGLAQCTRDLSSLDCKKSLDDAVSELPSYCDG--KRGGRVVGG 227 Query 763 SCTVRYEEYGF 795 SC VRYE Y F Sbjct 228 SCNVRYELYPF 238 >ref|XP_003597035.1| Cysteine-rich repeat secretory protein [Medicago truncatula] gb|AES67286.1| Cysteine-rich repeat secretory protein [Medicago truncatula] Length=246 Score = 160 bits (406), Expect = 1e-43 Identities = 101/252 (40%), Positives = 144/252 (57%), Gaps = 14/252 (6%) Frame = +1 Query 46 SMAYSISIRAFSLIISLILIPTALSQQSDLLNKVCIATSKNYTQNSTFSNNLKNLLDDLV 225 S ++I + +F+ + LI T L +D L +C +TS+N+T +S + +NLK L++ L+ Sbjct 4 SKLFTIILFSFTFVF---LIQTTLG--TDPLFHIC-STSENFTAHSPYESNLKTLINSLI 57 Query 226 QKTPHQ-IHLFSNEKTGQGPDTTYGLALCRGDVSPDYCDFCLYDARCKITQRC-YTKSAI 399 KTP S + T YGLALCRGDVS C C+ A +I C Y K AI Sbjct 58 YKTPSTGFGSGSIDLTQYQNQKAYGLALCRGDVSTSECKTCVSQATKEILNVCPYKKGAI 117 Query 400 VMYDYCQVKYSNENFFGKIDHTNDNVLINVYNVSANQTKQFKDASIGLLNKLSNQAAHSK 579 + YD C KY + +FFGKID+TN L+NV NVS +F + + LL+ L+N+A+ + Sbjct 118 IWYDNCMFKYLDNDFFGKIDNTNKFALLNVQNVS--DPIKFNNMTNDLLSFLANEASMNH 175 Query 580 
SLFAKGDKLFDKKENVSIYGMVQCTQDLSNDSCKKCLGDVVKKLPIYKDEIYSVGARVFG 759 L+A G+ + E V YG+ QCT+D+S+ CKKCL + +LP D G RV G Sbjct 176 KLYATGELKIGESERV--YGLTQCTRDISSVDCKKCLDGAISELPNCCDG--KKGGRVVG 231 Query 760 SSCTVRYEEYGF 795 SC +RYE Y F Sbjct 232 GSCNIRYEIYPF 243 >dbj|BAJ34612.1| unnamed protein product [Thellungiella halophila] Length=255 Score = 157 bits (396), Expect = 5e-42 Identities = 98/250 (39%), Positives = 133/250 (53%), Gaps = 14/250 (6%) Frame = +1 Query 61 ISIRAFSLIISLILIPTALSQQSD--LLNKVCIATSKNYTQNSTFSNNLKNLLDDLVQKT 234 +S+ +L I L+ I + SQ + L C N+T S + +NL +L + + Sbjct 11 VSVSVLALAIQLLFIQSVSSQSQNNAFLYHKCSDIEGNFTSKSPYESNLDSLFRRISYRV 70 Query 235 PHQIHLFSNEKTGQGPDTTYGLALCRGDVSPDYCDFCLYDARCKITQRC-YTKSAIVMYD 411 P F+ G PD GLALCRGD S C CL A ++ QRC K+ I+ YD Sbjct 71 PSSG--FAASSAGNSPDNVNGLALCRGDASSSDCGSCLATAIPELRQRCPNNKAGIIWYD 128 Query 412 YCQVKYSNENFFGKIDHTNDNVLINVYNVSANQTKQFKDASIGLLNKLSNQA--AHSKSL 585 C VKYS+ NFFGKID+ N L NV NVS F + LL +L+ +A ++ L Sbjct 129 NCLVKYSSTNFFGKIDYENRFYLYNVNNVS--DPASFNTQTKALLTELTQKATTGDNQKL 186 Query 586 FAKGDKLFDKKENVSIYGMVQCTQDLSNDSCKKCLGDVVKKLPIYKDEIYSVGARVFGSS 765 FA G+K +KK+ +YG+VQCT+DL +SCK CL ++ +LP D G RV G S Sbjct 187 FATGEKNLEKKK---LYGLVQCTRDLRRESCKACLDGIIGELPNCCDG--KEGGRVVGGS 241 Query 766 CTVRYEEYGF 795 C RYE Y F Sbjct 242 CNFRYEIYPF 251 >ref|XP_003543377.1| PREDICTED: cysteine-rich repeat secretory protein 38 [Glycine max] Length=244 Score = 156 bits (395), Expect = 5e-42 Identities = 100/258 (39%), Positives = 142/258 (55%), Gaps = 24/258 (9%) Frame = +1 Query 43 MSMAYSISIRAFSLIISLILIPTALSQQSDLLNKVCIATSKNYTQNSTFSNNLKNLLDDL 222 MS + + FSL ++L+L S +D L C + S+N+T NS + +NLK L++ L Sbjct 1 MSSSKLFTTFLFSLNLALLL---QTSFGADPLFHFC-SNSENFTANSPYESNLKTLINSL 56 Query 223 VQKTPH------QIHLFSNEKTGQGPDTTYGLALCRGDVSPDYCDFCLYDARCKITQRC- 381 + KTP + + N+K Y LALCRGDVS C C+ +A +I RC Sbjct 57 IYKTPSTGFGVGSVGQYQNQKA-------YALALCRGDVSASECKTCVSEAPKEILSRCP 109 Query 382 YTKSAIVMYDYCQVKYSNENFFGKIDHTNDNVLINVYNVSANQTKQFKDASIGLLNKLSN 561 Y K AI+ YDYC KY + +F 
GKID+TN + N+ NVS T F + LL++L+ Sbjct 110 YNKGAIIWYDYCMFKYLDTDFLGKIDNTNKFYMWNLKNVSDPAT--FNYNTRDLLSQLAQ 167 Query 562 QAAHSKSLFAKGDKLFDKKENVSIYGMVQCTQDLSNDSCKKCLGDVVKKLPIYKDEIYSV 741 +A + L+A G+ + E ++YG+ QCT+DLS+ CKKCL D + +LP D Sbjct 168 KAYVNNKLYATGEAKLENSE--TLYGLTQCTRDLSSSDCKKCLDDAINELPNCCDG--KE 223 Query 742 GARVFGSSCTVRYEEYGF 795 G RV SC RYE Y F Sbjct 224 GGRVVSGSCNFRYEIYPF 241 >ref|XP_002327834.1| predicted protein [Populus trichocarpa] gb|EEE75622.1| predicted protein [Populus trichocarpa] Length=242 Score = 155 bits (391), Expect = 2e-41 Identities = 96/251 (38%), Positives = 137/251 (55%), Gaps = 17/251 (7%) Frame = +1 Query 46 SMAYSISIRAFSLIISLILIPTALSQQSDLLNKVCIATSKNYTQNSTFSNNLKNLLDDLV 225 + A+S+ + FSL++ + N ++ +N+T N + +NL L L Sbjct 5 NFAFSLCLITFSLLLHTVFGAGP--------NFHLCSSPENFTANGPYESNLNKLTSYLY 56 Query 226 QKTPHQIHLFSNEKTGQGPDTTYGLALCRGDVSPDYCDFCLYDARCKITQRC-YTKSAIV 402 K P F G PD TYGLALCRGDVS C C+ +A +I +RC Y K+AI+ Sbjct 57 YKAPPTG--FGMGSRGHTPDQTYGLALCRGDVSTSDCKTCVVEASSEIRKRCPYNKAAII 114 Query 403 MYDYCQVKYSNENFFGKIDHTNDNVLINVYNVSANQTKQFKDASIGLLNKLSNQAAHSKS 582 YD C +KYSN FFG+ID+ N + NV+ VS + F + + LL++L+N+A + Sbjct 115 WYDNCLLKYSNNAFFGQIDNGNKFYMWNVHVVS--EPAPFNEKTKELLSQLANEAQATPK 172 Query 583 LFAKGDKLFDKKENVSIYGMVQCTQDLSNDSCKKCLGDVVKKLPIYKDEIYSVGARVFGS 762 LFA G++ K + +YG+VQCT DLS+ CKKCL ++ +LPI D G RV Sbjct 173 LFATGERELGK--STKLYGLVQCTGDLSSAVCKKCLDGIIGELPICCDG--KQGGRVVSG 228 Query 763 SCTVRYEEYGF 795 SC YE Y F Sbjct 229 SCNFIYEIYPF 239 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 544842042 Number of extensions: 11412868 Number of successful extensions: 28347 Number of sequences 
better than 1e-10: 65 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 28143 Number of HSP's successfully gapped: 66 Length of query: 1072 Length of database: 6150218869 Length adjustment: 142 Effective length of query: 930 Effective length of database: 3605708941 Effective search space: 775227422315 Effective search space used: 775227422315 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 177 (72.8 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362 RID: UPE5H41901N Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_4555 Length=665 Score E Sequences producing significant alignments: (Bits) Value ref|NP_001235512.1| disease resistance protein [Glycine max] ... 237 6e-70 ref|XP_003530247.1| PREDICTED: LOW QUALITY PROTEIN: protein B... 235 2e-68 ref|XP_003551773.1| PREDICTED: LRR receptor-like serine/threo... 233 3e-67 ref|XP_002324588.1| predicted protein [Populus trichocarpa] >... 192 2e-57 ref|XP_002324600.1| predicted protein [Populus trichocarpa] >...
191 3e-54 ALIGNMENTS >ref|NP_001235512.1| disease resistance protein [Glycine max] gb|ACM89590.1| disease resistance protein [Glycine max] Length=771 Score = 237 bits (605), Expect = 6e-70 Identities = 127/173 (73%), Positives = 137/173 (79%), Gaps = 3/173 (2%) Frame = +2 Query 2 RALNLSNNLLTGKVPATFSNLVQVESLDLSFNMLSGQIPPQLSGLHYLEVFSVAHNNLSG 181 RALNLS+N LTGK+P TFS L Q ESLDLSFNML+ QIPPQLS L LEVFSVAHNNLSG Sbjct 597 RALNLSHNDLTGKIPVTFSLLAQTESLDLSFNMLNSQIPPQLSMLTSLEVFSVAHNNLSG 656 Query 182 ATPEWKGQLSTFDESSYEGNQFLCGPPLPKSCNPSEQAPATLPnglnndgdndIWVDMYV 361 TP++KGQ STFDESSYEGN FLCGPPLPKSCNP P +PN N DGDND +DMYV Sbjct 657 PTPDFKGQFSTFDESSYEGNPFLCGPPLPKSCNP---PPTIIPNDSNTDGDNDSLLDMYV 713 Query 362 FRVSFVVAYTSIVLVIPIVLYINPYWRQAWFYYIGLVCMNCYYFIVDNLYKFF 520 F VSF V+Y S +LV LYINPYWRQAWFYY+ LV MNCYYFI DNL K F Sbjct 714 FCVSFAVSYISTLLVTAAALYINPYWRQAWFYYMELVSMNCYYFIKDNLSKVF 766 >ref|XP_003530247.1| PREDICTED: LOW QUALITY PROTEIN: protein BRASSINOSTEROID INSENSITIVE 1-like [Glycine max] Length=936 Score = 235 bits (599), Expect = 2e-68 Identities = 124/173 (72%), Positives = 135/173 (78%), Gaps = 3/173 (2%) Frame = +2 Query 2 RALNLSNNLLTGKVPATFSNLVQVESLDLSFNMLSGQIPPQLSGLHYLEVFSVAHNNLSG 181 R LNLS+N LTG++PATFS+LVQ ESLDLSFNML+GQIPPQL+ L LEVFSVAHNNLSG Sbjct 762 RTLNLSHNDLTGQIPATFSHLVQTESLDLSFNMLNGQIPPQLTMLTSLEVFSVAHNNLSG 821 Query 182 ATPEWKGQLSTFDESSYEGNQFLCGPPLPKSCNPSEQAPATLPnglnndgdndIWVDMYV 361 TPE+K Q STFDESSYEGN FLCG PLPKSCNP P +PN N DG D VDMY Sbjct 822 PTPEFKEQFSTFDESSYEGNPFLCGLPLPKSCNP---PPTVIPNDSNTDGHYDTLVDMYF 878 Query 362 FRVSFVVAYTSIVLVIPIVLYINPYWRQAWFYYIGLVCMNCYYFIVDNLYKFF 520 F VSFVV+YTS +LV LYINPYWR AWFYY+ L MNCYYFIVDN K F Sbjct 879 FCVSFVVSYTSALLVTAAALYINPYWRHAWFYYMELASMNCYYFIVDNCSKVF 931 >ref|XP_003551773.1| PREDICTED: LRR receptor-like serine/threonine-protein kinase GSO2-like [Glycine max] Length=1133 Score = 233 bits (593), Expect = 3e-67 Identities = 118/171 (69%), Positives = 134/171 (78%), Gaps = 3/171 (2%) Frame = +2 Query 2 
RALNLSNNLLTGKVPATFSNLVQVESLDLSFNMLSGQIPPQLSGLHYLEVFSVAHNNLSG 181 RALNLS+N L G++PATFSNLVQ ESLDLSFN LSGQIPPQLS L LEVFSVAHNNLSG Sbjct 879 RALNLSHNDLIGQIPATFSNLVQTESLDLSFNKLSGQIPPQLSKLTSLEVFSVAHNNLSG 938 Query 182 ATPEWKGQLSTFDESSYEGNQFLCGPPLPKSCNPSEQAPATLPnglnndgdndIWVDMYV 361 TPEWKGQ STF+ SSYEGN FLCGPPL KSCNP P+ +PN + D+ VDMYV Sbjct 939 TTPEWKGQFSTFENSSYEGNPFLCGPPLSKSCNP---PPSIIPNDSHTHVDDGSLVDMYV 995 Query 362 FRVSFVVAYTSIVLVIPIVLYINPYWRQAWFYYIGLVCMNCYYFIVDNLYK 514 F VSF V++++ +L I LYINPY R+AWFYY+ LVC NCYYFIVD+ K Sbjct 996 FYVSFAVSFSAALLATAIALYINPYCRRAWFYYMELVCSNCYYFIVDSFSK 1046 >ref|XP_002324588.1| predicted protein [Populus trichocarpa] gb|EEF03153.1| predicted protein [Populus trichocarpa] Length=243 Score = 192 bits (488), Expect = 2e-57 Identities = 95/169 (56%), Positives = 120/169 (71%), Gaps = 1/169 (1%) Frame = +2 Query 2 RALNLSNNLLTGKVPATFSNLVQVESLDLSFNMLSGQIPPQLSGLHYLEVFSVAHNNLSG 181 + LNLS+N LTG +P TFSNL ++E+LDLS+N L+G+IPPQL L++L FSVAHNNLSG Sbjct 63 KLLNLSHNSLTGPIPPTFSNLKEIETLDLSYNNLNGEIPPQLLDLNFLSAFSVAHNNLSG 122 Query 182 ATPEWKGQLSTFDESSYEGNQFLCGPPLPKSCNPSEQAPATLPnglnndgdndIWVDMYV 361 TP+ Q STF++S YEGN LCGPPL K+C P+ LP + + + +DM Sbjct 123 KTPKMVAQFSTFNKSCYEGNPLLCGPPLAKNCT-GAIPPSPLPRSQTHKKEENGVIDMEA 181 Query 362 FRVSFVVAYTSIVLVIPIVLYINPYWRQAWFYYIGLVCMNCYYFIVDNL 508 F V+F VAY ++L I VLYINP WRQAWFY+IG NCYYF+VDNL Sbjct 182 FYVTFSVAYIMVLLAIGAVLYINPQWRQAWFYFIGESINNCYYFLVDNL 230 >ref|XP_002324600.1| predicted protein [Populus trichocarpa] gb|EEF03165.1| predicted protein [Populus trichocarpa] Length=534 Score = 191 bits (486), Expect = 3e-54 Identities = 95/167 (57%), Positives = 119/167 (71%), Gaps = 1/167 (1%) Frame = +2 Query 8 LNLSNNLLTGKVPATFSNLVQVESLDLSFNMLSGQIPPQLSGLHYLEVFSVAHNNLSGAT 187 LNLS+N LTG +P TFSNL ++E+LDLS+N L+G+IPPQL L++L FSVAHNNLSG T Sbjct 356 LNLSHNSLTGPIPPTFSNLKEIETLDLSYNNLNGEIPPQLLDLNFLSAFSVAHNNLSGKT 415 Query 188 PEWKGQLSTFDESSYEGNQFLCGPPLPKSCNPSEQAPATLPnglnndgdndIWVDMYVFR 367 PE Q STF++S YEGN LCGPPL K+C P+ +P + + + +DM F Sbjct 416 
PEMVAQFSTFNKSCYEGNLLLCGPPLAKNCT-GAIPPSPVPRSQTHKKEENGVIDMEAFY 474 Query 368 VSFVVAYTSIVLVIPIVLYINPYWRQAWFYYIGLVCMNCYYFIVDNL 508 V+F VAY ++L I VLYINP WRQAWFY+IG NCYYF+VDNL Sbjct 475 VTFSVAYIIVLLAIGAVLYINPQWRQAWFYFIGESINNCYYFLVDNL 521 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 361315866 Number of extensions: 7764389 Number of successful extensions: 30031 Number of sequences better than 1e-10: 119 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 28965 Number of HSP's successfully gapped: 120 Length of query: 665 Length of database: 6150218869 Length adjustment: 135 Effective length of query: 530 Effective length of database: 3731142529 Effective search space: 320878257494 Effective search space used: 320878257494 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 174 (71.6 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. RID: UPE5YZ9E01N Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_4931 Length=650 Score E Sequences producing significant alignments: (Bits) Value ref|XP_002298184.1| predicted protein [Populus trichocarpa] >... 
85.9 4e-18 ref|NP_001150703.1| chemocyanin precursor [Zea mays] >gb|ACG4... 81.6 2e-16 dbj|BAJ95771.1| predicted protein [Hordeum vulgare subsp. vul... 81.3 2e-16 gb|ACG40792.1| chemocyanin precursor [Zea mays] >gb|ACR37830.... 80.9 3e-16 gb|ACG29532.1| chemocyanin precursor [Zea mays] 80.5 4e-16 ALIGNMENTS >ref|XP_002298184.1| predicted protein [Populus trichocarpa] gb|EEE82989.1| predicted protein [Populus trichocarpa] Length=125 Score = 85.9 bits (211), Expect = 4e-18 Identities = 50/136 (37%), Positives = 74/136 (54%), Gaps = 13/136 (10%) Frame = -3 Query 648 MSGKGALVVAAAMLTIMLLVLHFDMANSKNYVVGAEDGSYKWAAKIHRPTYSPSRKTLNK 469 + G+G+ +VA + + +L+LHFDMA++ Y VG G W + + P K+ Sbjct 2 VQGRGSAMVATVAVMLCMLLLHFDMAHAATYTVGGPGG---WTFNV---SGWPKGKSFKA 55 Query 468 GDTLQFIYNKDVHNVVGVGLNGYTNCDDSKGSNTYPTTGNNTISVKKGMNYFICTVPESR 289 GD L F Y+ HNVV V GY++C +G+ Y T+G + I + KG N+FIC+ Sbjct 56 GDILVFNYSTAAHNVVAVNKAGYSSCTSPRGAKVY-TSGKDQIKLVKGQNFFICSF---- 110 Query 288 GGNKCSNYGMKIAVKA 241 C + GMKIAV A Sbjct 111 -AGHCQS-GMKIAVNA 124 >ref|NP_001150703.1| chemocyanin precursor [Zea mays] gb|ACG40061.1| chemocyanin precursor [Zea mays] Length=132 Score = 81.6 bits (200), Expect = 2e-16 Identities = 52/134 (39%), Positives = 68/134 (51%), Gaps = 13/134 (10%) Frame = -3 Query 642 GKGALVVAAAMLTIMLLVLHFDMANSKNYVVGAEDGSYKWAAKIHRPTYSPSRKTLNKGD 463 G+G+ AA L ++ ++LH ++A S Y VG G W+ P K GD Sbjct 11 GRGSGAAAALALVLLCVLLHGELAESAVYTVGDRGG---WS---FNTANWPKGKRFRAGD 64 Query 462 TLQFIYNKDVHNVVGVGLNGYTNCDDSKGSNTYPTTGNNTISVKKGMNYFICTVPESRGG 283 L F YN HNVV V GY +C KG TTGN+ +++K+G NYFIC+ P Sbjct 65 VLAFRYNAKAHNVVPVSAAGYKSCSAPKGVRAL-TTGNDRVTLKRGTNYFICSFP----- 118 Query 282 NKCSNYGMKIAVKA 241 C GMKIAV A Sbjct 119 GHC-QAGMKIAVTA 131 >dbj|BAJ95771.1| predicted protein [Hordeum vulgare subsp. 
vulgare] Length=121 Score = 81.3 bits (199), Expect = 2e-16 Identities = 49/134 (37%), Positives = 67/134 (50%), Gaps = 17/134 (13%) Frame = -3 Query 642 GKGALVVAAAMLTIMLLVLHFDMANSKNYVVGAEDGSYKWAAKIHRPTYSPSRKTLNKGD 463 G G A L ++ ++LH + A SK Y VG S W P K GD Sbjct 4 GSGGSATAVLALVLLCVLLHGEFAESKVYTVGWAVSSGGW----------PRGKRFRAGD 53 Query 462 TLQFIYNKDVHNVVGVGLNGYTNCDDSKGSNTYPTTGNNTISVKKGMNYFICTVPESRGG 283 L F Y + HNVV V GY +C ++GS TY +G++ +++ +G NYFIC+VP Sbjct 54 VLLFKYGRGAHNVVAVNAAGYKSCSAARGSRTY-NSGSDRVTLSRGTNYFICSVP----- 107 Query 282 NKCSNYGMKIAVKA 241 C GMK+AV A Sbjct 108 GHC-QAGMKMAVTA 120 >gb|ACG40792.1| chemocyanin precursor [Zea mays] gb|ACR37830.1| unknown [Zea mays] Length=130 Score = 80.9 bits (198), Expect = 3e-16 Identities = 52/135 (39%), Positives = 68/135 (50%), Gaps = 13/135 (10%) Frame = -3 Query 645 SGKGALVVAAAMLTIMLLVLHFDMANSKNYVVGAEDGSYKWAAKIHRPTYSPSRKTLNKG 466 S +G+ AA L ++ ++LH ++A S Y VG G W+ P K G Sbjct 8 SARGSGAAAALALVLLCVLLHGELAESAVYTVGDRGG---WS---FNTANWPKGKRFRAG 61 Query 465 DTLQFIYNKDVHNVVGVGLNGYTNCDDSKGSNTYPTTGNNTISVKKGMNYFICTVPESRG 286 D L F YN HNVV V GY +C KG TTGN+ +++K+G NYFIC+ P Sbjct 62 DVLAFRYNAKAHNVVPVSAAGYKSCSAPKGVRAL-TTGNDRVTLKRGANYFICSFP---- 116 Query 285 GNKCSNYGMKIAVKA 241 C GMKIAV A Sbjct 117 -GHC-QAGMKIAVTA 129 >gb|ACG29532.1| chemocyanin precursor [Zea mays] Length=129 Score = 80.5 bits (197), Expect = 4e-16 Identities = 49/134 (37%), Positives = 71/134 (53%), Gaps = 13/134 (10%) Frame = -3 Query 642 GKGALVVAAAMLTIMLLVLHFDMANSKNYVVGAEDGSYKWAAKIHRPTYSPSRKTLNKGD 463 G GA+ +AAA ++ ++LH +A S + VG G W+ T + K GD Sbjct 8 GSGAVALAAAAAVLLCVLLHAHVAESAVFTVGDRGG---WSFSTGTWT---NGKRFKAGD 61 Query 462 TLQFIYNKDVHNVVGVGLNGYTNCDDSKGSNTYPTTGNNTISVKKGMNYFICTVPESRGG 283 L F Y+ HNVV V GY C +G+ Y T+GN+ +++ +G NYFIC++P Sbjct 62 VLVFKYDSTAHNVVAVNAAGYKGCSAPRGAKVY-TSGNDRVTLARGTNYFICSIP----- 115 Query 282 NKCSNYGMKIAVKA 241 C + GMKIAV A Sbjct 116 GHCQS-GMKIAVTA 128 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding 
environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 278294648 Number of extensions: 5229159 Number of successful extensions: 11404 Number of sequences better than 1e-10: 1 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 11386 Number of HSP's successfully gapped: 1 Length of query: 650 Length of database: 6150218869 Length adjustment: 135 Effective length of query: 515 Effective length of database: 3731142529 Effective search space: 302222544849 Effective search space used: 302222544849 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 174 (71.6 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362 RID: UPE6FV6M016 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_5067 Length=644 No significant similarity found.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 294853118 Number of extensions: 5561555 Number of successful extensions: 12950 Number of sequences better than 1e-10: 0 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 12938 Number of HSP's successfully gapped: 0 Length of query: 644 Length of database: 6150218869 Length adjustment: 135 Effective length of query: 509 Effective length of database: 3731142529 Effective search space: 294760259791 Effective search space used: 294760259791 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 174 (71.6 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362 RID: UPE6P1NR013 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_5537 Length=626 Score E Sequences producing significant alignments: (Bits) Value ref|XP_003593357.1| Pathogenesis-related protein 1a [Medicago... 302 2e-99 ref|XP_003593358.1| Pathogenesis-related protein 1a [Medicago... 278 2e-92 ref|XP_003593453.1| Pathogenesis-related protein 1a [Medicago... 276 2e-91 ref|XP_003606006.1| Maturase K [Medicago truncatula] >gb|AES8...
271 4e-82 ref|XP_003593359.1| Pathogenesis-related protein 1a [Medicago... 240 6e-78 ALIGNMENTS >ref|XP_003593357.1| Pathogenesis-related protein 1a [Medicago truncatula] gb|AES63608.1| Pathogenesis-related protein 1a [Medicago truncatula] Length=338 Score = 302 bits (774), Expect = 2e-99 Identities = 141/148 (95%), Positives = 145/148 (98%), Gaps = 1/148 (1%) Frame = -3 Query 624 GLFIIIGHVAHAQNSQADYVNSHNEARRQVGVANVVWDNNLATVAQNYANSRRGDCRLTH 445 GLFIIIGHVAHAQNSQADYVNSHNEARRQVGVANVVWDNNLATVAQNYANSRRGDCRLTH Sbjct 11 GLFIIIGHVAHAQNSQADYVNSHNEARRQVGVANVVWDNNLATVAQNYANSRRGDCRLTH 70 Query 444 SGGRYGENLAGSTGDLSGTDAVRLWVNEKNDYNYNSNTCASGKVCGHYTQVVWRNSQRVG 265 SGGRYGENLAGSTGDLSGTDAVRLWVNEKNDYNYNSNTCASGKVCGHYTQVVWRN++R+G Sbjct 71 SGGRYGENLAGSTGDLSGTDAVRLWVNEKNDYNYNSNTCASGKVCGHYTQVVWRNTKRIG 130 Query 264 CAKVRCDNNRGTFITCNYDPPGNYVGEK 181 CAKVRC NN GTFI CNYDPPGNYVG+K Sbjct 131 CAKVRC-NNGGTFIICNYDPPGNYVGQK 157 >ref|XP_003593358.1| Pathogenesis-related protein 1a [Medicago truncatula] gb|AES63609.1| Pathogenesis-related protein 1a [Medicago truncatula] Length=162 Score = 278 bits (711), Expect = 2e-92 Identities = 126/152 (83%), Positives = 142/152 (93%), Gaps = 2/152 (1%) Frame = -3 Query 624 GLFIIIGHVAHAQNSQADYVNSHNEARRQVGVANVVWDNNLATVAQNYANSRRGDCRLTH 445 GL +I+ HVAHAQNSQ+DYVN+HN+ARRQVGVAN+VWDN +A+ AQ+YAN R+GDC+L H Sbjct 11 GLSLIMVHVAHAQNSQSDYVNAHNDARRQVGVANIVWDNTVASFAQDYANQRKGDCQLIH 70 Query 444 SGG--RYGENLAGSTGDLSGTDAVRLWVNEKNDYNYNSNTCASGKVCGHYTQVVWRNSQR 271 SGG RYGENLA S+GD+SG+DAV+LWVNEK DY+YNSNTCASGKVCGHYTQVVWRNSQR Sbjct 71 SGGGGRYGENLAWSSGDMSGSDAVKLWVNEKADYDYNSNTCASGKVCGHYTQVVWRNSQR 130 Query 270 VGCAKVRCDNNRGTFITCNYDPPGNYVGEKPY 175 VGCAKVRCDNNRGTFITCNYDPPGNYVGEKPY Sbjct 131 VGCAKVRCDNNRGTFITCNYDPPGNYVGEKPY 162 >ref|XP_003593453.1| Pathogenesis-related protein 1a [Medicago truncatula] gb|AES63704.1| Pathogenesis-related protein 1a [Medicago truncatula] Length=181 Score = 276 bits (706), Expect = 2e-91 Identities = 124/151 (82%), Positives = 141/151 (93%), Gaps = 2/151 (1%) 
Frame = -3 Query 621 LFIIIGHVAHAQNSQADYVNSHNEARRQVGVANVVWDNNLATVAQNYANSRRGDCRLTHS 442 L ++IGHVA+AQNS+ADYVN+HN+ARRQVGV ++VWDN +A+ AQ+YAN R+GDC+L HS Sbjct 12 LLLVIGHVANAQNSRADYVNAHNDARRQVGVGDIVWDNTVASFAQDYANQRKGDCQLIHS 71 Query 441 GG--RYGENLAGSTGDLSGTDAVRLWVNEKNDYNYNSNTCASGKVCGHYTQVVWRNSQRV 268 GG RYGENLA S+GD+SG+DAV+LWVNEK DYNYNSNTCASGKVCGHYTQVVWRNSQRV Sbjct 72 GGGGRYGENLAWSSGDMSGSDAVKLWVNEKADYNYNSNTCASGKVCGHYTQVVWRNSQRV 131 Query 267 GCAKVRCDNNRGTFITCNYDPPGNYVGEKPY 175 GCAKVRCDNNRGTFITCNYDPPGNYVGEKPY Sbjct 132 GCAKVRCDNNRGTFITCNYDPPGNYVGEKPY 162 >ref|XP_003606006.1| Maturase K [Medicago truncatula] gb|AES88203.1| Maturase K [Medicago truncatula] Length=855 Score = 271 bits (692), Expect = 4e-82 Identities = 122/149 (82%), Positives = 139/149 (93%), Gaps = 2/149 (1%) Frame = -3 Query 621 LFIIIGHVAHAQNSQADYVNSHNEARRQVGVANVVWDNNLATVAQNYANSRRGDCRLTHS 442 L ++IGHVA+AQNS+ADYVN+HN+ARRQVGV ++VWDN +A+ AQ+YAN R+GDC+L HS Sbjct 342 LLLVIGHVANAQNSRADYVNAHNDARRQVGVGDIVWDNTVASFAQDYANQRKGDCQLIHS 401 Query 441 GG--RYGENLAGSTGDLSGTDAVRLWVNEKNDYNYNSNTCASGKVCGHYTQVVWRNSQRV 268 GG RYGENLA S+GD+SG+DAV+LWVNEK DYNYNSNTCASGKVCGHYTQVVWRNSQRV Sbjct 402 GGGGRYGENLAWSSGDMSGSDAVKLWVNEKADYNYNSNTCASGKVCGHYTQVVWRNSQRV 461 Query 267 GCAKVRCDNNRGTFITCNYDPPGNYVGEK 181 GCAKVRCDNNRGTFITCNYDPPGNYVGEK Sbjct 462 GCAKVRCDNNRGTFITCNYDPPGNYVGEK 490 >ref|XP_003593359.1| Pathogenesis-related protein 1a [Medicago truncatula] gb|AES63610.1| Pathogenesis-related protein 1a [Medicago truncatula] Length=138 Score = 240 bits (613), Expect = 6e-78 Identities = 115/150 (77%), Positives = 124/150 (83%), Gaps = 22/150 (15%) Frame = -3 Query 624 GLFIIIGHVAHAQNSQADYVNSHNEARRQVGVANVVWDNNLATVAQNYANSRRGDCRLTH 445 GL +IIGHVAHAQ+SQADYVN+HNEAR +VGV GDC+L H Sbjct 11 GLLLIIGHVAHAQDSQADYVNAHNEARSEVGV---------------------GDCQLIH 49 Query 444 SGGRYGENLAGSTGDLSGTDAVRLWVNEKNDYNYNSNTCASGKVCGHYTQVVWRNSQRVG 265 SGGRYGENLAGSTGDLSG+DAV+LWVNEK DY+YNSNTCASGKVCGHYTQVVWRNSQRVG Sbjct 50 
SGGRYGENLAGSTGDLSGSDAVKLWVNEKADYDYNSNTCASGKVCGHYTQVVWRNSQRVG 109 Query 264 CAKVRCDNNRGTFITCNYDPPGNYVGEKPY 175 CAKVRCDNNRGTFITCNYDPPGN+ GEKPY Sbjct 110 CAKVRCDNNRGTFITCNYDPPGNF-GEKPY 138 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 6905539149 Number of extensions: 154640758 Number of successful extensions: 389294 Number of sequences better than 1e-10: 1563 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 385289 Number of HSP's successfully gapped: 1575 Length of query: 626 Length of database: 6150218869 Length adjustment: 134 Effective length of query: 492 Effective length of database: 3749061613 Effective search space: 277430559362 Effective search space used: 277430559362 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 173 (71.2 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. RID: UPE6TT4M01N Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_5544 Length=626 Score E Sequences producing significant alignments: (Bits) Value ref|XP_002326231.1| predicted protein [Populus trichocarpa] >... 235 5e-73 ref|XP_002528628.1| zinc finger protein, putative [Ricinus co... 
232 6e-72 ref|XP_002284087.2| PREDICTED: GDSL esterase/lipase At5g03820... 229 1e-70 emb|CBI30208.3| unnamed protein product [Vitis vinifera] 229 2e-70 ref|XP_002527431.1| zinc finger protein, putative [Ricinus co... 222 8e-68 ALIGNMENTS >ref|XP_002326231.1| predicted protein [Populus trichocarpa] gb|EEE71901.1| predicted protein [Populus trichocarpa] Length=351 Score = 235 bits (599), Expect = 5e-73 Identities = 112/150 (75%), Positives = 126/150 (84%), Gaps = 0/150 (0%) Frame = -2 Query 625 KSLNSLGVNRIGVTSLPPLGCLPAAITLFGLGSNNCVSRLNDDAVMFNNKLNATSQSLKT 446 ++L LG RIGVTSLPP GCLPAAITLFG GSN CV LN DA++FN+KLN+TSQ L Sbjct 202 QNLYGLGARRIGVTSLPPTGCLPAAITLFGAGSNQCVESLNQDAILFNDKLNSTSQGLVQ 261 Query 445 KLHSLKLVVFDIYHPLLDLVTKPSDSGFSESRRGCCGTGTLETSYLCNVRSIGTCSNATQ 266 KL LKLVVFDIY PLLD++ KPSD+GF ESRR CCGTGTLETS LCN RS+GTCSNAT+ Sbjct 262 KLPGLKLVVFDIYQPLLDMIRKPSDNGFFESRRACCGTGTLETSVLCNDRSVGTCSNATE 321 Query 265 YVFWDGFHPSESANEKLAQTLLEQGFDLIS 176 YVFWDGFHPSE+AN+ LA LL+QGFDLIS Sbjct 322 YVFWDGFHPSEAANQVLAGDLLQQGFDLIS 351 >ref|XP_002528628.1| zinc finger protein, putative [Ricinus communis] gb|EEF33731.1| zinc finger protein, putative [Ricinus communis] Length=352 Score = 232 bits (592), Expect = 6e-72 Identities = 111/150 (74%), Positives = 125/150 (83%), Gaps = 0/150 (0%) Frame = -2 Query 625 KSLNSLGVNRIGVTSLPPLGCLPAAITLFGLGSNNCVSRLNDDAVMFNNKLNATSQSLKT 446 ++L LG RIGVT LPP GCLPAAITLFG GSN CV RLN DA+ FNNKLN+TSQSL + Sbjct 203 QNLYQLGARRIGVTGLPPTGCLPAAITLFGAGSNQCVERLNRDAISFNNKLNSTSQSLVS 262 Query 445 KLHSLKLVVFDIYHPLLDLVTKPSDSGFSESRRGCCGTGTLETSYLCNVRSIGTCSNATQ 266 L LKLVVFDIY PLLD++ KP+D+GF E+RR CCGTGTLETS LCN RS+GTCS+ATQ Sbjct 263 NLPGLKLVVFDIYQPLLDMILKPTDNGFFEARRACCGTGTLETSVLCNARSLGTCSDATQ 322 Query 265 YVFWDGFHPSESANEKLAQTLLEQGFDLIS 176 YVFWDGFHPSE+AN+ LA LL QGFDLIS Sbjct 323 YVFWDGFHPSEAANKVLAGDLLAQGFDLIS 352 >ref|XP_002284087.2| PREDICTED: GDSL esterase/lipase At5g03820-like [Vitis vinifera] Length=351 Score = 229 bits (583), Expect = 1e-70 Identities = 108/149 (72%), 
Positives = 127/149 (85%), Gaps = 0/149 (0%) Frame = -2 Query 625 KSLNSLGVNRIGVTSLPPLGCLPAAITLFGLGSNNCVSRLNDDAVMFNNKLNATSQSLKT 446 ++L LGV +IGVT+LPP GCLPAAITLF GSN CV+RLN DA+ FN+KLN TSQ L+ Sbjct 202 QNLYGLGVRKIGVTTLPPTGCLPAAITLFSSGSNQCVARLNQDAINFNSKLNITSQVLQN 261 Query 445 KLHSLKLVVFDIYHPLLDLVTKPSDSGFSESRRGCCGTGTLETSYLCNVRSIGTCSNATQ 266 KL LKLVVFDIY PLL+L+TKP+D+GF ESR+ CCGTGT+ETS LCN RS+GTCSNA+Q Sbjct 262 KLPGLKLVVFDIYQPLLNLITKPTDNGFFESRKACCGTGTIETSLLCNARSVGTCSNASQ 321 Query 265 YVFWDGFHPSESANEKLAQTLLEQGFDLI 179 YVFWDGFHPSESAN+ LA +LLEQG +LI Sbjct 322 YVFWDGFHPSESANQLLAGSLLEQGINLI 350 >emb|CBI30208.3| unnamed protein product [Vitis vinifera] Length=363 Score = 229 bits (583), Expect = 2e-70 Identities = 108/149 (72%), Positives = 127/149 (85%), Gaps = 0/149 (0%) Frame = -2 Query 625 KSLNSLGVNRIGVTSLPPLGCLPAAITLFGLGSNNCVSRLNDDAVMFNNKLNATSQSLKT 446 ++L LGV +IGVT+LPP GCLPAAITLF GSN CV+RLN DA+ FN+KLN TSQ L+ Sbjct 202 QNLYGLGVRKIGVTTLPPTGCLPAAITLFSSGSNQCVARLNQDAINFNSKLNITSQVLQN 261 Query 445 KLHSLKLVVFDIYHPLLDLVTKPSDSGFSESRRGCCGTGTLETSYLCNVRSIGTCSNATQ 266 KL LKLVVFDIY PLL+L+TKP+D+GF ESR+ CCGTGT+ETS LCN RS+GTCSNA+Q Sbjct 262 KLPGLKLVVFDIYQPLLNLITKPTDNGFFESRKACCGTGTIETSLLCNARSVGTCSNASQ 321 Query 265 YVFWDGFHPSESANEKLAQTLLEQGFDLI 179 YVFWDGFHPSESAN+ LA +LLEQG +LI Sbjct 322 YVFWDGFHPSESANQLLAGSLLEQGINLI 350 >ref|XP_002527431.1| zinc finger protein, putative [Ricinus communis] gb|EEF34923.1| zinc finger protein, putative [Ricinus communis] Length=359 Score = 222 bits (565), Expect = 8e-68 Identities = 105/149 (70%), Positives = 123/149 (83%), Gaps = 0/149 (0%) Frame = -2 Query 625 KSLNSLGVNRIGVTSLPPLGCLPAAITLFGLGSNNCVSRLNDDAVMFNNKLNATSQSLKT 446 K+L +LG +IGVT+LPPLGCLPAAIT+FG SN+CV+ LN D+V FNNKLNATSQSL+ Sbjct 210 KNLYNLGARKIGVTTLPPLGCLPAAITIFGSDSNDCVANLNQDSVSFNNKLNATSQSLRN 269 Query 445 KLHSLKLVVFDIYHPLLDLVTKPSDSGFSESRRGCCGTGTLETSYLCNVRSIGTCSNATQ 266 KL LKLVVFDIY PL D+VTKPSD+GF E+RR CCGTG LE+S LCN +SIGTC NA++ Sbjct 270 
KLSGLKLVVFDIYQPLYDIVTKPSDNGFVEARRACCGTGLLESSILCNSKSIGTCKNASE 329 Query 265 YVFWDGFHPSESANEKLAQTLLEQGFDLI 179 YVFWDGFHPSE+AN+ LA LL G LI Sbjct 330 YVFWDGFHPSEAANKILADDLLTSGISLI 358 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 323187050 Number of extensions: 6813078 Number of successful extensions: 18085 Number of sequences better than 1e-10: 65 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 17933 Number of HSP's successfully gapped: 67 Length of query: 626 Length of database: 6150218869 Length adjustment: 134 Effective length of query: 492 Effective length of database: 3749061613 Effective search space: 277430559362 Effective search space used: 277430559362 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 173 (71.2 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. RID: UPE7PCVP012 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_5779 Length=617 Score E Sequences producing significant alignments: (Bits) Value ref|XP_002520279.1| ATP binding protein, putative [Ricinus co... 
71.6 2e-11 ALIGNMENTS >ref|XP_002520279.1| ATP binding protein, putative [Ricinus communis] gb|EEF42065.1| ATP binding protein, putative [Ricinus communis] Length=1433 Score = 71.6 bits (174), Expect = 2e-11 Identities = 32/58 (55%), Positives = 41/58 (71%), Gaps = 0/58 (0%) Frame = -1 Query 605 VLREGSLEQIKGMGELVRRCLRLQSEERPTMNEVAMELEGFRKISRHPWAVQEEVDEE 432 +L EG++EQIK + L +RCLR++ EERPTM EVAMELEG R + +HPW E E Sbjct 653 ILNEGNIEQIKEVSSLAKRCLRVKGEERPTMKEVAMELEGLRLMVKHPWVNNESNSSE 710 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 244073953 Number of extensions: 4381041 Number of successful extensions: 11480 Number of sequences better than 1e-10: 0 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 11478 Number of HSP's successfully gapped: 0 Length of query: 617 Length of database: 6150218869 Length adjustment: 134 Effective length of query: 483 Effective length of database: 3749061613 Effective search space: 266183374523 Effective search space used: 266183374523 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 173 (71.2 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362
RID: UPE84NHW016 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_5876 Length=613 Score E Sequences producing significant alignments: (Bits) Value ref|XP_002281329.1| PREDICTED: uncharacterized protein LOC100... 119 2e-30 emb|CAN66257.1| hypothetical protein VITISV_001236 [Vitis vin... 119 2e-30 ref|NP_200876.1| uncharacterized protein [Arabidopsis thalian... 118 5e-30 ref|XP_003597242.1| hypothetical protein MTR_2g094410 [Medica... 117 6e-30 ref|XP_003543482.1| PREDICTED: uncharacterized protein LOC100... 114 1e-28 ALIGNMENTS >ref|XP_002281329.1| PREDICTED: uncharacterized protein LOC100258555 isoform 1 [Vitis vinifera] ref|XP_003632611.1| PREDICTED: uncharacterized protein LOC100258555 isoform 2 [Vitis vinifera] Length=163 Score = 119 bits (298), Expect = 2e-30 Identities = 61/85 (72%), Positives = 66/85 (78%), Gaps = 1/85 (1%) Frame = -3 Query 611 LPVNVPDWSKILRGEYTPRACEWGDGYESDDWGDDGND-KIPPHEYLARTRVASLSVHEG 435 LPVN+PDWSKILR +Y + D D DD +D +IPPHEYLARTRVAS SVHEG Sbjct 79 LPVNIPDWSKILRDDYRLSQRKESDEDVDDVEEDDDHDSRIPPHEYLARTRVASFSVHEG 138 Query 434 IGRTLKGRDLSRVRNAIWKKTGFED 360 IGRTLKGRDLSRVRNAIWKK GFED Sbjct 139 IGRTLKGRDLSRVRNAIWKKVGFED 163 >emb|CAN66257.1| hypothetical protein VITISV_001236 [Vitis vinifera] Length=163 Score = 119 bits (298), Expect = 2e-30 Identities = 61/85 (72%), Positives = 66/85 (78%), Gaps = 1/85 (1%) Frame = -3 Query 611 LPVNVPDWSKILRGEYTPRACEWGDGYESDDWGDDGND-KIPPHEYLARTRVASLSVHEG 435 LPVN+PDWSKILR +Y + D D DD +D +IPPHEYLARTRVAS SVHEG Sbjct 79 LPVNIPDWSKILRDDYRLSQRKESDEDVDDVEEDDDHDSRIPPHEYLARTRVASFSVHEG 138 Query 434 IGRTLKGRDLSRVRNAIWKKTGFED 360 IGRTLKGRDLSRVRNAIWKK GFED Sbjct 139 IGRTLKGRDLSRVRNAIWKKVGFED 163 >ref|NP_200876.1| uncharacterized protein [Arabidopsis thaliana] dbj|BAB09841.1| unnamed protein product [Arabidopsis thaliana] gb|AAL36227.1| unknown protein [Arabidopsis thaliana] 
gb|AAM14170.1| unknown protein [Arabidopsis thaliana] gb|AED97366.1| uncharacterized protein [Arabidopsis thaliana] Length=163 Score = 118 bits (295), Expect = 5e-30 Identities = 58/85 (68%), Positives = 68/85 (80%), Gaps = 1/85 (1%) Frame = -3 Query 611 LPVNVPDWSKILRGEYTP-RACEWGDGYESDDWGDDGNDKIPPHEYLARTRVASLSVHEG 435 LPVNVPDWSKILRGEY R D + DD +DG D +PPHE+LA+TR+AS SVHEG Sbjct 79 LPVNVPDWSKILRGEYRDNRRRSIEDNDDDDDDNEDGGDWLPPHEFLAKTRMASFSVHEG 138 Query 434 IGRTLKGRDLSRVRNAIWKKTGFED 360 +GRTLKGRDLSRVRNAI++K GF+D Sbjct 139 VGRTLKGRDLSRVRNAIFEKFGFQD 163 >ref|XP_003597242.1| hypothetical protein MTR_2g094410 [Medicago truncatula] gb|AES67493.1| hypothetical protein MTR_2g094410 [Medicago truncatula] Length=156 Score = 117 bits (294), Expect = 6e-30 Identities = 58/84 (69%), Positives = 66/84 (79%), Gaps = 2/84 (2%) Frame = -3 Query 611 LPVNVPDWSKILRGEYTPRACEWGDGYESDDWGDDGNDKIPPHEYLARTRVASLSVHEGI 432 LPVNVPDWSKIL +Y D + +D GDD +KIPPHE+LARTR+AS SVHEG+ Sbjct 75 LPVNVPDWSKILGEDYRHNRRRNYDDVDEEDEGDD--EKIPPHEFLARTRMASFSVHEGV 132 Query 431 GRTLKGRDLSRVRNAIWKKTGFED 360 GRTLKGRDLSRVRNAIW KTGF+D Sbjct 133 GRTLKGRDLSRVRNAIWAKTGFQD 156 >ref|XP_003543482.1| PREDICTED: uncharacterized protein LOC100814909 [Glycine max] Length=154 Score = 114 bits (285), Expect = 1e-28 Identities = 56/85 (66%), Positives = 66/85 (78%), Gaps = 1/85 (1%) Frame = -3 Query 611 LPVNVPDWSKILRGEY-TPRACEWGDGYESDDWGDDGNDKIPPHEYLARTRVASLSVHEG 435 LPVNVPDWSKIL EY + + D E+ +DG ++PPHE+LARTR+AS SVHEG Sbjct 70 LPVNVPDWSKILGDEYGRNQRRNYDDDDEARSDEEDGVGRVPPHEFLARTRIASFSVHEG 129 Query 434 IGRTLKGRDLSRVRNAIWKKTGFED 360 +GRTLKGRDLSRVRNAIW KTGF+D Sbjct 130 VGRTLKGRDLSRVRNAIWAKTGFQD 154 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: 
Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 246135601 Number of extensions: 4540019 Number of successful extensions: 9641 Number of sequences better than 1e-10: 3 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 9632 Number of HSP's successfully gapped: 3 Length of query: 613 Length of database: 6150218869 Length adjustment: 134 Effective length of query: 479 Effective length of database: 3749061613 Effective search space: 262434312910 Effective search space used: 262434312910 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 173 (71.2 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. RID: UPE8KNMW01N Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_5883 Length=613 Score E Sequences producing significant alignments: (Bits) Value ref|XP_002304535.1| predicted protein [Populus trichocarpa] >... 192 1e-58 ref|NP_176598.1| Disease resistance-responsive (dirigent-like... 172 1e-50 ref|XP_002887917.1| disease resistance-responsive family prot... 170 5e-50 ref|XP_002276477.1| PREDICTED: disease resistance response pr... 167 6e-49 gb|ACC91257.1| disease resistance-responsive family protein [... 
167 9e-49 ALIGNMENTS >ref|XP_002304535.1| predicted protein [Populus trichocarpa] gb|EEE79514.1| predicted protein [Populus trichocarpa] Length=162 Score = 192 bits (487), Expect = 1e-58 Identities = 103/147 (70%), Positives = 119/147 (81%), Gaps = 3/147 (2%) Frame = -3 Query 608 VLYLHDILFNGTDQanatsaattnsTRLALNPSGHFGAIVVFNDPVTKDNRLQSEPVAQA 429 VLY HD LFNGTD ANATSAA T+ T+L + FG +VVF+DP+TKDN L S PVA+A Sbjct 19 VLYYHDTLFNGTDVANATSAAATDPTKLG---NFKFGMLVVFDDPMTKDNHLLSRPVARA 75 Query 428 QGFYIYNKKEEANAWFAFTLVFNSTEYKGTLNIMGADIMNAKTRDFSVVGGTGDFFMARG 249 QGFY Y+KK AWFAFTL+FNST++KGTLNIMGAD+M +TRDFSVVGGTGDFFMARG Sbjct 76 QGFYFYDKKSTYTAWFAFTLIFNSTKHKGTLNIMGADLMTEETRDFSVVGGTGDFFMARG 135 Query 248 ICTVSTDVLQGGFYFRLILDIKLYECY 168 I T+ TD QG +YFRL +DIKLYECY Sbjct 136 IATIHTDTFQGDYYFRLKMDIKLYECY 162 >ref|NP_176598.1| Disease resistance-responsive (dirigent-like protein) family protein [Arabidopsis thaliana] gb|AAF24580.1|AC007764_22 F22C12.8 [Arabidopsis thaliana] gb|AAQ65109.1| At1g64160 [Arabidopsis thaliana] dbj|BAD43018.1| putative dirigent protein [Arabidopsis thaliana] dbj|BAD44501.1| putative dirigent protein [Arabidopsis thaliana] gb|AEE34203.1| Disease resistance-responsive (dirigent-like protein) family protein [Arabidopsis thaliana] Length=182 Score = 172 bits (435), Expect = 1e-50 Identities = 86/151 (57%), Positives = 106/151 (70%), Gaps = 11/151 (7%) Frame = -3 Query 608 VLYLHDILFNGTDQanatsaattnsTRLALNPSG----HFGAIVVFNDPVTKDNRLQSEP 441 VLY HDI+F D ++ NP G FG +V+F+DP+T D QSEP Sbjct 39 VLYYHDIMFGVDD-------VQNATSAAVTNPPGLGNFKFGKLVIFDDPMTIDKNFQSEP 91 Query 440 VAQAQGFYIYNKKEEANAWFAFTLVFNSTEYKGTLNIMGADIMNAKTRDFSVVGGTGDFF 261 VA+AQGFY Y+ K + NAWFA+TLVFNST++KGTLNIMGAD+M ++RD SVVGGTGDFF Sbjct 92 VARAQGFYFYDMKNDYNAWFAYTLVFNSTQHKGTLNIMGADLMMVQSRDLSVVGGTGDFF 151 Query 260 MARGICTVSTDVLQGGFYFRLILDIKLYECY 168 M+RGI T TD +G YFR+ +DIKLYECY Sbjct 152 MSRGIVTFETDTFEGAKYFRVKMDIKLYECY 182 >ref|XP_002887917.1| disease resistance-responsive family protein [Arabidopsis lyrata subsp. 
lyrata] gb|EFH64176.1| disease resistance-responsive family protein [Arabidopsis lyrata subsp. lyrata] Length=182 Score = 170 bits (431), Expect = 5e-50 Identities = 85/151 (56%), Positives = 106/151 (70%), Gaps = 11/151 (7%) Frame = -3 Query 608 VLYLHDILFNGTDQanatsaattnsTRLALNPSG----HFGAIVVFNDPVTKDNRLQSEP 441 VLY HDI+F D ++ NP G FG +V+F+DP+T D QSEP Sbjct 39 VLYYHDIMFGVDD-------VQNATSAAITNPPGLGNFKFGKLVIFDDPMTIDKNFQSEP 91 Query 440 VAQAQGFYIYNKKEEANAWFAFTLVFNSTEYKGTLNIMGADIMNAKTRDFSVVGGTGDFF 261 VA+AQGFY Y+ K + NAWFA+TLVFNST++KGTLNIMGAD+M ++RD SVVGGTGDFF Sbjct 92 VARAQGFYFYDMKNDYNAWFAYTLVFNSTQHKGTLNIMGADLMMVQSRDLSVVGGTGDFF 151 Query 260 MARGICTVSTDVLQGGFYFRLILDIKLYECY 168 M+RGI T TD +G YFR+ +DIKLY+CY Sbjct 152 MSRGIVTFETDTFEGAKYFRVKMDIKLYDCY 182 >ref|XP_002276477.1| PREDICTED: disease resistance response protein 206 [Vitis vinifera] Length=186 Score = 167 bits (424), Expect = 6e-49 Identities = 85/150 (57%), Positives = 106/150 (71%), Gaps = 3/150 (2%) Frame = -3 Query 608 VLYLHDILFNGTDQanatsaattnsT---RLALNPSGHFGAIVVFNDPVTKDNRLQSEPV 438 V Y HDI++NG + NAT+A + L HFG +VVF+DP+T DN L S PV Sbjct 37 VFYFHDIIYNGKNSKNATAAIVGAPAWGNKTILGGKNHFGDLVVFDDPITLDNNLHSTPV 96 Query 437 AQAQGFYIYNKKEEANAWFAFTLVFNSTEYKGTLNIMGADIMNAKTRDFSVVGGTGDFFM 258 +AQGFYIY+KK+ AW F+ VFNSTE+KG++N GAD + KTRD SVVGGTGDFFM Sbjct 97 GRAQGFYIYDKKDVFTAWLGFSFVFNSTEHKGSINFAGADPLMNKTRDISVVGGTGDFFM 156 Query 257 ARGICTVSTDVLQGGFYFRLILDIKLYECY 168 ARGI T++TD +G YFRL +DIKLYEC+ Sbjct 157 ARGIATLTTDAFEGEVYFRLCVDIKLYECW 186 >gb|ACC91257.1| disease resistance-responsive family protein [Capsella rubella] Length=187 Score = 167 bits (423), Expect = 9e-49 Identities = 90/145 (62%), Positives = 108/145 (74%), Gaps = 3/145 (2%) Frame = -3 Query 602 YLHDILFNGTDQanatsaattnsTRLALNPSGHFGAIVVFNDPVTKDNRLQSEPVAQAQG 423 Y HDIL++G + ANATSAA + L + FG V+F+DP+T D SEPVA+AQG Sbjct 46 YFHDILYDGDNVANATSAAIVSPPGLG---NFKFGKFVIFDDPITMDKNYLSEPVARAQG 102 Query 422 FYIYNKKEEANAWFAFTLVFNSTEYKGTLNIMGADIMNAKTRDFSVVGGTGDFFMARGIC 243 FY Y+ K + NAWF 
+TLVFNST++KGTLNIMGAD+M TRD SVVGGTGDFFMARGI Sbjct 103 FYFYDMKMDFNAWFCYTLVFNSTQHKGTLNIMGADLMMEPTRDLSVVGGTGDFFMARGIA 162 Query 242 TVSTDVLQGGFYFRLILDIKLYECY 168 T TD+ QG YFR+ +DIKLYECY Sbjct 163 TFVTDLFQGAKYFRVKMDIKLYECY 187 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 364805134 Number of extensions: 8583160 Number of successful extensions: 22093 Number of sequences better than 1e-10: 27 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 22019 Number of HSP's successfully gapped: 27 Length of query: 613 Length of database: 6150218869 Length adjustment: 134 Effective length of query: 479 Effective length of database: 3749061613 Effective search space: 262434312910 Effective search space used: 262434312910 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 173 (71.2 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. RID: UPDWXTR301N Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_6 Length=2060 Score E Sequences producing significant alignments: (Bits) Value ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precurs... 
258 6e-75 emb|CBI34137.3| unnamed protein product [Vitis vinifera] 246 3e-74 ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesi... 256 4e-74 ref|XP_002300215.1| predicted protein [Populus trichocarpa] >... 254 1e-73 ref|XP_002307860.1| ZIP transporter [Populus trichocarpa] >gb... 246 8e-72 ALIGNMENTS >ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus communis] gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus communis] Length=442 Score = 258 bits (659), Expect = 6e-75 Identities = 172/413 (42%), Positives = 225/413 (54%), Gaps = 36/413 (9%) Frame = +1 Query 100 GFMATLSHVDQGKSLTKTELLRHAIQRRTMRVEMPQAVMDQLASPLPIKAPVSGYNGEFS 279 GF TL HVD K+LTK + ++H I+R R+E A++ +S I +PV NGEF Sbjct 42 GFRITLKHVDSDKNLTKFQRIQHGIKRANHRLERLNAMVLAASSNAEINSPVLSGNGEFL 101 Query 280 MTLAIGTPPINHSLIVDTDTDLIW-----INQC-DNHSP--------SNTKLSCSSNLCK 417 M LAIGTPP +S I+DT +DLIW QC D SP S +KLSCSS LCK Sbjct 102 MNLAIGTPPETYSAIMDTGSDLIWTQCKPCTQCFDQPSPIFDPKKSSSFSKLSCSSQLCK 161 Query 418 NAGTCSKKNTCQYKYQYTSNKTVQVDLATETFTFGK-NVLKLGFSCGVSLEG---VVAKG 585 S ++C+Y Y Y + Q +ATETFTFGK ++ +GF CG EG G Sbjct 162 ALPQSSCSDSCEYLYTYGDYSSTQGTMATETFTFGKVSIPNVGFGCGEDNEGDGFTQGSG 221 Query 586 AVGLGHGPLSLVSQMNAQNFSMCLPSYDNPNNTGFVLIGSTIK---TRAKMITASLIKTN 756 VGLG GPLSLVSQ+ FS CL S D+ T +L+GS T A + T LI+ Sbjct 222 LVGLGRGPLSLVSQLKEAKFSYCLTSIDD-TKTSTLLMGSLASVNGTSAAIRTTPLIQNP 280 Query 757 FFPSKYYINVTGISVGKVRLNIDSSSFSINANDGTGGMAIDTGSSVTYFEQSIFDKVIKA 936 PS YY+++ GISVG RL I S+F + +DGTGG+ ID+G+++TY E+S FD V K Sbjct 281 LQPSFYYLSLEGISVGGTRLPIKESTFQLQ-DDGTGGLIIDSGTTITYLEESAFDLVKKE 339 Query 937 FKTQTRHPTSIDPATSLPICFPTPPN-SKNAYPYITFHLGGPNGPSKLVLSIGNTYREYD 1113 F +Q P AT L +C+ P + S+ P + H G + L L G Y D Sbjct 340 FTSQMGLPVDNSGATGLELCYNLPSDTSELEVPKLVLHFTG----ADLELP-GENYMIAD 394 Query 1114 NYVPGKNKTCLAFKAAPPSDRGMSTLGNIQLQDTLVTFDLSKNTVAFLQKKCG 1272 + CLA S GMS GN+Q Q+ V+ DL K T++FL CG Sbjct 395 S---SMGVICLAM----GSSGGMSIFGNVQQQNMFVSHDLEKETLSFLPTNCG 440 
>emb|CBI34137.3| unnamed protein product [Vitis vinifera] Length=165 Score = 246 bits (629), Expect = 3e-74 Identities = 119/158 (75%), Positives = 140/158 (89%), Gaps = 0/158 (0%) Frame = -3 Query 1974 GETELLRHRIISQVLELGIIVHSVIIGIALGASQNPKTIRPLIAALTFHQFFEGIGLGGC 1795 G EL+RHR+ISQVLELGI+VHSVIIGI+LGAS++PKTI+PL+AALTFHQFFEG+GLGGC Sbjct 2 GSAELIRHRVISQVLELGIVVHSVIIGISLGASESPKTIKPLVAALTFHQFFEGMGLGGC 61 Query 1794 IAQARFNTRAVVMMAFFFSITTPSGIAIGIGISRFYSETSRNALIIEGVFNSASAGILIY 1615 I QA+F RA +MA FFS+TTP GIAIGIGIS Y E S ALI+EG+FN+ASAGIL+Y Sbjct 62 IVQAKFKLRAAAIMALFFSLTTPVGIAIGIGISNVYDENSSTALIVEGIFNAASAGILVY 121 Query 1614 MALVDLLAADFMSPKMQNNGKLQLLANASLLIGAGCMS 1501 MALVDLLAADFM+P+MQ NG+LQ+ AN SLL+GAGCMS Sbjct 122 MALVDLLAADFMNPRMQGNGRLQVGANISLLVGAGCMS 159 >ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max] Length=453 Score = 256 bits (654), Expect = 4e-74 Identities = 162/423 (38%), Positives = 228/423 (54%), Gaps = 48/423 (11%) Frame = +1 Query 94 TTGFMATLSHVDQGKSLTKTELLRHAIQRRTMRVEMPQAVM---DQLASPLPIKAPVSGY 264 T GF L HVD GK+LTK E ++H I+R R++ A++ L S ++AP+ Sbjct 45 TKGFRVMLRHVDSGKNLTKLERVQHGIKRGKSRLQRLNAMVLAASTLDSEDQLEAPIHAG 104 Query 265 NGEFSMTLAIGTPPINHSLIVDTDTDLIWIN-----QC---------DNHSPSNTKLSCS 402 NGE+ M LAIGTPP+++ ++DT +DLIW QC S S +K+SC Sbjct 105 NGEYLMELAIGTPPVSYPAVLDTGSDLIWTQCKPCTQCYKQPTPIFDPKKSSSFSKVSCG 164 Query 403 SNLCKNAGTCSKKNTCQYKYQYTSNKTVQVDLATETFTFGKN-----VLKLGFSCGVSLE 567 S+LC + + + C+Y Y Y Q LATETFTFGK+ V +GF CG E Sbjct 165 SSLCSAVPSSTCSDGCEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIGFGCGEDNE 224 Query 568 G---VVAKGAVGLGHGPLSLVSQMNAQNFSMCLPSYDNPNNTGFVLIGST--IKTRAKMI 732 G A G VGLG GPLSLVSQ+ FS CL D+ + +L+GS +K +++ Sbjct 225 GDGFEQASGLVGLGRGPLSLVSQLKEPRFSYCLTPMDDTKES-ILLLGSLGKVKDAKEVV 283 Query 733 TASLIKTNFFPSKYYINVTGISVGKVRLNIDSSSFSINANDGTGGMAIDTGSSVTYFEQS 912 T L+K PS YY+++ GISVG RL+I+ S+F + +DG GG+ ID+G+++TY EQ Sbjct 284 TTPLLKNPLQPSFYYLSLEGISVGDTRLSIEKSTFEVG-DDGNGGVIIDSGTTITYIEQK 342 Query 913 
IFDKVIKAFKTQTRHPTSIDPATSLPICFPTPPNSKNA-YPYITFHLGGPNGPSKLVLSI 1089 F+ + K F +QT+ P +T L +CF P S P I FH G Sbjct 343 AFEALKKEFISQTKLPLDKTSSTGLDLCFSLPSGSTQVEIPKIVFHFKG----------- 391 Query 1090 GNTYREYDNYVPGKNK---TCLAFKAAPPSDRGMSTLGNIQLQDTLVTFDLSKNTVAFLQ 1260 G+ +NY+ G + CLA A+ GMS GN+Q Q+ LV DL K T++F+ Sbjct 392 GDLELPAENYMIGDSNLGVACLAMGAS----SGMSIFGNVQQQNILVNHDLEKETISFVP 447 Query 1261 KKC 1269 C Sbjct 448 TSC 450 >ref|XP_002300215.1| predicted protein [Populus trichocarpa] gb|EEE85020.1| predicted protein [Populus trichocarpa] Length=439 Score = 254 bits (650), Expect = 1e-73 Identities = 166/412 (40%), Positives = 220/412 (53%), Gaps = 36/412 (9%) Frame = +1 Query 100 GFMATLSHVDQGKSLTKTELLRHAIQRRTMRVEMPQAVMDQLASPLPIKAPVSGYNGEFS 279 GF L HVD GK+LTK E +RH ++R R++ QA+ +S I+APV NGEF Sbjct 39 GFRVRLKHVDSGKNLTKLERIRHGVKRGRNRLQRLQAMALVASSSSEIEAPVLPGNGEFL 98 Query 280 MTLAIGTPPINHSLIVDTDTDLIW-----INQC---------DNHSPSNTKLSCSSNLCK 417 M LAIGTPP +S I+DT +DLIW QC S S +KLSCSS LC+ Sbjct 99 MKLAIGTPPETYSAILDTGSDLIWTQCKPCTQCFHQSTPIFDPKKSSSFSKLSCSSQLCE 158 Query 418 NAGTCSKKNTCQYKYQYTSNKTVQVDLATETFTFGK-NVLKLGFSCGVSLEG---VVAKG 585 S N C+Y Y Y + Q LA+ET TFGK +V + F CG EG G Sbjct 159 ALPQSSCNNGCEYLYSYGDYSSTQGILASETLTFGKASVPNVAFGCGADNEGSGFSQGAG 218 Query 586 AVGLGHGPLSLVSQMNAQNFSMCLPSYDNPNNTGFVLIGSTIKTRAK---MITASLIKTN 756 VGLG GPLSLVSQ+ FS CL + D+ T +L+GS A + T LI + Sbjct 219 LVGLGRGPLSLVSQLKEPKFSYCLTTVDD-TKTSTLLMGSLASVNASSSAIKTTPLIHSP 277 Query 757 FFPSKYYINVTGISVGKVRLNIDSSSFSINANDGTGGMAIDTGSSVTYFEQSIFDKVIKA 936 PS YY+++ GISVG RL I S+FS+ +DG+GG+ ID+G+++TY E+S F+ V K Sbjct 278 AHPSFYYLSLEGISVGDTRLPIKKSTFSLQ-DDGSGGLIIDSGTTITYLEESAFNLVAKE 336 Query 937 FKTQTRHPTSIDPATSLPICFPTPPNSKN-AYPYITFHLGGPNGPSKLVLSIGNTYREYD 1113 F + P +T L +CF P S N P + FH G + + Sbjct 337 FTAKINLPVDSSGSTGLDVCFTLPSGSTNIEVPKLVFHFDG-----------ADLELPAE 385 Query 1114 NYVPGKNKTCLAFKAAPPSDRGMSTLGNIQLQDTLVTFDLSKNTVAFLQKKC 1269 NY+ G + +A A S GMS GN+Q Q+ LV DL K T++FL +C Sbjct 386 
NYMIGDSSMGVAC-LAMGSSSGMSIFGNVQQQNMLVLHDLEKETLSFLPTQC 436 >ref|XP_002307860.1| ZIP transporter [Populus trichocarpa] gb|EEE91383.1| ZIP transporter [Populus trichocarpa] Length=343 Score = 246 bits (629), Expect = 8e-72 Identities = 118/160 (74%), Positives = 144/160 (90%), Gaps = 0/160 (0%) Frame = -3 Query 1980 GSGETELLRHRIISQVLELGIIVHSVIIGIALGASQNPKTIRPLIAALTFHQFFEGIGLG 1801 GSG ++L+RHR+I+QVLELGI+VHSVIIG++LGAS +PKTIRPL+AAL+FHQFFEG+GLG Sbjct 178 GSGPSQLIRHRVITQVLELGIVVHSVIIGVSLGASGSPKTIRPLVAALSFHQFFEGMGLG 237 Query 1800 GCIAQARFNTRAVVMMAFFFSITTPSGIAIGIGISRFYSETSRNALIIEGVFNSASAGIL 1621 GCI QA+F T+ +V+MA FFS+TTP GIAIG+GIS Y+E+S NALI+EG+FN+ASAGIL Sbjct 238 GCITQAKFKTKTIVIMALFFSLTTPVGIAIGLGISNVYNESSPNALIVEGIFNAASAGIL 297 Query 1620 IYMALVDLLAADFMSPKMQNNGKLQLLANASLLIGAGCMS 1501 IYMALVDLLAADFM PK+Q+NG LQ N SLL+GAGCMS Sbjct 298 IYMALVDLLAADFMHPKVQSNGALQFGVNVSLLLGAGCMS 337 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 1137216252 Number of extensions: 25999694 Number of successful extensions: 64215 Number of sequences better than 1e-10: 85 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 63937 Number of HSP's successfully gapped: 85 Length of query: 2060 Length of database: 6150218869 Length adjustment: 148 Effective length of query: 1912 Effective length of database: 3498194437 Effective search space: 1882028607106 Effective search space used: 1882028607106 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 181 (74.3 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 
4.96466 ka-blk-sigma gapped: 43.6362A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. RID: UPE8X1XU013 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_6242 Length=601 Score E Sequences producing significant alignments: (Bits) Value gb|ACG60665.1| basic helix-loop-helix protein [Nicotiana taba... 166 8e-48 ref|XP_002518568.1| DNA binding protein, putative [Ricinus co... 163 1e-46 emb|CBI28976.3| unnamed protein product [Vitis vinifera] 162 2e-46 ref|XP_002274829.2| PREDICTED: transcription factor ILR3-like... 162 3e-46 ref|XP_002316706.1| predicted protein [Populus trichocarpa] >... 158 1e-44 ALIGNMENTS >gb|ACG60665.1| basic helix-loop-helix protein [Nicotiana tabacum] Length=233 Score = 166 bits (420), Expect = 8e-48 Identities = 82/134 (61%), Positives = 99/134 (74%), Gaps = 0/134 (0%) Frame = -3 Query 599 LLEHGRPIKIDKNVVLSDTIRMVIQLREEAQKLKESNDDLQGKVNELKVEKNELRDEKTM 420 LLE GRP K DK+ +L D +RMV QLR EAQKLK+SN +LQ K+ ELK EKNELRDEK Sbjct 100 LLEPGRPPKTDKSAILVDAVRMVTQLRGEAQKLKDSNLNLQEKIKELKAEKNELRDEKQK 159 Query 419 LKAKKEKLEHQVKELNARPQISPHHLPPMHSPYAHVVGSKFVPVIGFSGVPMWQMMPPTS 240 LKA+KEKLE Q+K NA+P P +P +P+ V GSK VP++ + GV MWQ MPP + Sbjct 160 LKAEKEKLEQQLKTTNAQPGFLPPAIPAAFAPHGQVPGSKLVPIMSYPGVAMWQFMPPAA 219 Query 239 VDTSKDHTLRPPVA 198 VDTS+DH LRPPVA Sbjct 220 VDTSQDHVLRPPVA 233 >ref|XP_002518568.1| DNA binding protein, putative [Ricinus communis] gb|EEF43955.1| DNA binding protein, putative [Ricinus communis] Length=229 Score = 163 bits (412), Expect = 1e-46 Identities = 82/137 (60%), Positives = 106/137 (77%), Gaps = 4/137 (3%) Frame = -3 Query 599 LLEHGRPIKIDKNVVLSDTIRMVIQLREEAQKLKESNDDLQGKVNELKVEKNELRDEKTM 420 LL+ GRP K+DK+V+L+D ++MV QLR EAQKLKESN++LQ KVNELKVEKNELRDEK 
Sbjct 94 LLDPGRPPKMDKSVILADAMKMVNQLRAEAQKLKESNENLQEKVNELKVEKNELRDEKQR 153 Query 419 LKAKKEKLEHQVKELNARPQISPHHLPPMHSPY---AHVVGSKFVPVIGFSGVPMWQMMP 249 LK +KE +E QV L+A + P HLP + +P+ + V+GSK VP++G+ GVPMWQ+MP Sbjct 154 LKTEKESIERQVNALSASARFLP-HLPAIPAPFSSPSQVIGSKLVPIVGYPGVPMWQLMP 212 Query 248 PTSVDTSKDHTLRPPVA 198 P +VDTS+D LR P A Sbjct 213 PATVDTSQDPVLRSPAA 229 >emb|CBI28976.3| unnamed protein product [Vitis vinifera] Length=223 Score = 162 bits (410), Expect = 2e-46 Identities = 82/136 (60%), Positives = 97/136 (71%), Gaps = 2/136 (1%) Frame = -3 Query 599 LLEHGRPIKIDKNVVLSDTIRMVIQLREEAQKLKESNDDLQGKVNELKVEKNELRDEKTM 420 +LE GRP K DK +LSD +RMV QLR EAQKLKESN DLQ K+ ELK EKNELRDEK Sbjct 88 ILEPGRPPKTDKAAILSDAVRMVTQLRSEAQKLKESNGDLQEKIKELKAEKNELRDEKQR 147 Query 419 LKAKKEKLEHQVKELNARPQISPH--HLPPMHSPYAHVVGSKFVPVIGFSGVPMWQMMPP 246 LKA+KEKLE QVK ++A+P PH +P + G+K +P IG+ V MWQ MPP Sbjct 148 LKAEKEKLEQQVKAISAQPGFLPHPSAMPAAFAAQGRAPGNKLMPFIGYPSVAMWQFMPP 207 Query 245 TSVDTSKDHTLRPPVA 198 +VDTS+DH LRPPVA Sbjct 208 AAVDTSQDHVLRPPVA 223 >ref|XP_002274829.2| PREDICTED: transcription factor ILR3-like [Vitis vinifera] Length=240 Score = 162 bits (410), Expect = 3e-46 Identities = 82/136 (60%), Positives = 97/136 (71%), Gaps = 2/136 (1%) Frame = -3 Query 599 LLEHGRPIKIDKNVVLSDTIRMVIQLREEAQKLKESNDDLQGKVNELKVEKNELRDEKTM 420 +LE GRP K DK +LSD +RMV QLR EAQKLKESN DLQ K+ ELK EKNELRDEK Sbjct 105 ILEPGRPPKTDKAAILSDAVRMVTQLRSEAQKLKESNGDLQEKIKELKAEKNELRDEKQR 164 Query 419 LKAKKEKLEHQVKELNARPQISPH--HLPPMHSPYAHVVGSKFVPVIGFSGVPMWQMMPP 246 LKA+KEKLE QVK ++A+P PH +P + G+K +P IG+ V MWQ MPP Sbjct 165 LKAEKEKLEQQVKAISAQPGFLPHPSAMPAAFAAQGRAPGNKLMPFIGYPSVAMWQFMPP 224 Query 245 TSVDTSKDHTLRPPVA 198 +VDTS+DH LRPPVA Sbjct 225 AAVDTSQDHVLRPPVA 240 >ref|XP_002316706.1| predicted protein [Populus trichocarpa] gb|EEE97318.1| predicted protein [Populus trichocarpa] Length=243 Score = 158 bits (400), Expect = 1e-44 Identities = 80/136 (59%), Positives = 96/136 (71%), Gaps = 2/136 (1%) Frame = -3 Query 599 
LLEHGRPIKIDKNVVLSDTIRMVIQLREEAQKLKESNDDLQGKVNELKVEKNELRDEKTM 420 LL+ GRP K+DK+ +L D RMV QLR+E+QKLKESN LQ K++ELK EKNELRDEK Sbjct 108 LLDPGRPPKVDKSAILVDAARMVTQLRDESQKLKESNVSLQEKIDELKAEKNELRDEKQR 167 Query 419 LKAKKEKLEHQVKELNARPQISPH--HLPPMHSPYAHVVGSKFVPVIGFSGVPMWQMMPP 246 LK +KE LE QVK L+ P PH +P S VVGSK +P +G+ G+ MWQ MPP Sbjct 168 LKTEKENLERQVKALSTPPNFLPHPSAIPAPFSAPGQVVGSKLMPFVGYPGISMWQFMPP 227 Query 245 TSVDTSKDHTLRPPVA 198 VDTS+DH LRPPVA Sbjct 228 AVVDTSQDHVLRPPVA 243 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 249484877 Number of extensions: 4467113 Number of successful extensions: 21439 Number of sequences better than 1e-10: 5 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 21287 Number of HSP's successfully gapped: 5 Length of query: 601 Length of database: 6150218869 Length adjustment: 133 Effective length of query: 468 Effective length of database: 3766980697 Effective search space: 252387706699 Effective search space used: 252387706699 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 173 (71.2 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. 
RID: UPE8ZT8Z012 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_6381 Length=596 No significant similarity found.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 279756301 Number of extensions: 5600591 Number of successful extensions: 12703 Number of sequences better than 1e-10: 0 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 12576 Number of HSP's successfully gapped: 0 Length of query: 596 Length of database: 6150218869 Length adjustment: 133 Effective length of query: 463 Effective length of database: 3766980697 Effective search space: 244853745305 Effective search space used: 244853745305 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 173 (71.2 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362
A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. RID: UPE9BP3001N Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_6390 Length=596 Score E Sequences producing significant alignments: (Bits) Value ref|XP_003627280.1| Germin-like protein [Medicago truncatula]... 231 2e-73 ref|XP_003627279.1| Germin-like protein [Medicago truncatula]... 219 8e-69 ref|XP_003627277.1| Germin-like protein [Medicago truncatula]... 219 8e-69 emb|CAI56441.1| germin-like protein [Cicer arietinum] 204 3e-63 ref|XP_003528531.1| PREDICTED: auxin-binding protein ABP19a-l...
203 1e-62 ALIGNMENTS >ref|XP_003627280.1| Germin-like protein [Medicago truncatula] gb|AET01756.1| Germin-like protein [Medicago truncatula] Length=209 Score = 231 bits (589), Expect = 2e-73 Identities = 123/124 (99%), Positives = 124/124 (100%), Gaps = 0/124 (0%) Frame = +1 Query 1 ISAARLDIAKGGSIPMHTHPGATELLIMVHGEITAGFLTTTAVYSKTLKPGDLMVFPQGM 180 ISAARLDIAKGGSIPMHTHPGATELLIMVHGEITAGFLTTTAVYSKTLKPGDLMVFPQGM Sbjct 86 ISAARLDIAKGGSIPMHTHPGATELLIMVHGEITAGFLTTTAVYSKTLKPGDLMVFPQGM 145 Query 181 LHFQVNSGKGKATAFLTFSSANPGAQlldlllFSNNLPSELVAQTTFLDLAQVQKLKARF 360 LHFQVNSGKGKATAFLTFSSANPGAQLLDLLLFSNNLPSELVAQTTFLDLAQV+KLKARF Sbjct 146 LHFQVNSGKGKATAFLTFSSANPGAQLLDLLLFSNNLPSELVAQTTFLDLAQVKKLKARF 205 Query 361 GGRG 372 GGRG Sbjct 206 GGRG 209 >ref|XP_003627279.1| Germin-like protein [Medicago truncatula] gb|AET01755.1| Germin-like protein [Medicago truncatula] Length=209 Score = 219 bits (558), Expect = 8e-69 Identities = 116/124 (94%), Positives = 121/124 (98%), Gaps = 0/124 (0%) Frame = +1 Query 1 ISAARLDIAKGGSIPMHTHPGATELLIMVHGEITAGFLTTTAVYSKTLKPGDLMVFPQGM 180 ISAARLDIA+ GSIPMHTHPGATELLI+V GEITAGFLT T+VYSKTLKPGDLMVFPQGM Sbjct 86 ISAARLDIAENGSIPMHTHPGATELLIIVQGEITAGFLTPTSVYSKTLKPGDLMVFPQGM 145 Query 181 LHFQVNSGKGKATAFLTFSSANPGAQlldlllFSNNLPSELVAQTTFLDLAQVQKLKARF 360 LHFQ+N+GKGKATAFLTFSSANPGAQLLDLLLFSNNLPSELVAQTTFLDLAQVQKLKARF Sbjct 146 LHFQINTGKGKATAFLTFSSANPGAQLLDLLLFSNNLPSELVAQTTFLDLAQVQKLKARF 205 Query 361 GGRG 372 GGRG Sbjct 206 GGRG 209 >ref|XP_003627277.1| Germin-like protein [Medicago truncatula] gb|AET01753.1| Germin-like protein [Medicago truncatula] Length=209 Score = 219 bits (558), Expect = 8e-69 Identities = 116/124 (94%), Positives = 121/124 (98%), Gaps = 0/124 (0%) Frame = +1 Query 1 ISAARLDIAKGGSIPMHTHPGATELLIMVHGEITAGFLTTTAVYSKTLKPGDLMVFPQGM 180 ISAARLDIA+ GSIPMHTHPGATELLI+V GEITAGFLT TAVYSKTLKPGDLMVFPQGM Sbjct 86 ISAARLDIAENGSIPMHTHPGATELLIIVQGEITAGFLTPTAVYSKTLKPGDLMVFPQGM 145 Query 181 LHFQVNSGKGKATAFLTFSSANPGAQlldlllFSNNLPSELVAQTTFLDLAQVQKLKARF 360 
LHFQ+N+GKGKATAFLTFSSANPGAQLLDLLLFSNNLPS+LVAQTTFLDLAQVQKLKARF Sbjct 146 LHFQINTGKGKATAFLTFSSANPGAQLLDLLLFSNNLPSQLVAQTTFLDLAQVQKLKARF 205 Query 361 GGRG 372 GGRG Sbjct 206 GGRG 209 >emb|CAI56441.1| germin-like protein [Cicer arietinum] Length=185 Score = 204 bits (519), Expect = 3e-63 Identities = 108/124 (87%), Positives = 118/124 (95%), Gaps = 0/124 (0%) Frame = +1 Query 1 ISAARLDIAKGGSIPMHTHPGATELLIMVHGEITAGFLTTTAVYSKTLKPGDLMVFPQGM 180 ISAARLDIA+GGSIPMHTHPGATELLIMV GEITAGF+TT+AVYSK LK GD+MVFPQGM Sbjct 62 ISAARLDIAEGGSIPMHTHPGATELLIMVKGEITAGFMTTSAVYSKKLKVGDVMVFPQGM 121 Query 181 LHFQVNSGKGKATAFLTFSSANPGAQlldlllFSNNLPSELVAQTTFLDLAQVQKLKARF 360 LHFQVNSGKG+ATAFL+FSSANPGAQLLDLLLF+NNL S+ VAQTTFLD+AQV+KLK RF Sbjct 122 LHFQVNSGKGEATAFLSFSSANPGAQLLDLLLFANNLSSDFVAQTTFLDVAQVKKLKTRF 181 Query 361 GGRG 372 GGRG Sbjct 182 GGRG 185 >ref|XP_003528531.1| PREDICTED: auxin-binding protein ABP19a-like [Glycine max] Length=208 Score = 203 bits (517), Expect = 1e-62 Identities = 105/124 (85%), Positives = 115/124 (93%), Gaps = 0/124 (0%) Frame = +1 Query 1 ISAARLDIAKGGSIPMHTHPGATELLIMVHGEITAGFLTTTAVYSKTLKPGDLMVFPQGM 180 +S ARLDIAKGGSIPMHTHP ATELLIMV G+ITAGF+T TA+Y+KTLKPGD+MVFPQG Sbjct 85 VSVARLDIAKGGSIPMHTHPAATELLIMVEGQITAGFMTPTALYTKTLKPGDIMVFPQGQ 144 Query 181 LHFQVNSGKGKATAFLTFSSANPGAQlldlllFSNNLPSELVAQTTFLDLAQVQKLKARF 360 LHFQVNSG GKATAFL FSSANPGAQLLDLLLF N LPS+LVAQTTFLD+AQV+K+KARF Sbjct 145 LHFQVNSGNGKATAFLAFSSANPGAQLLDLLLFGNTLPSDLVAQTTFLDVAQVKKVKARF 204 Query 361 GGRG 372 GGRG Sbjct 205 GGRG 208 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 314077135 Number of extensions: 6648021 Number of successful 
extensions: 11798 Number of sequences better than 1e-10: 77 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 11710 Number of HSP's successfully gapped: 77 Length of query: 596 Length of database: 6150218869 Length adjustment: 133 Effective length of query: 463 Effective length of database: 3766980697 Effective search space: 244853745305 Effective search space used: 244853745305 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 173 (71.2 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. RID: UPE0RJN301N Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_646 Length=1000 Score E Sequences producing significant alignments: (Bits) Value ref|XP_003519788.1| PREDICTED: uncharacterized protein LOC100... 270 5e-84 ref|XP_003517602.1| PREDICTED: uncharacterized protein LOC100... 270 3e-83 ref|NP_974121.1| uncharacterized protein [Arabidopsis thalian... 268 3e-83 ref|NP_177212.2| uncharacterized protein [Arabidopsis thalian... 268 1e-82 ref|XP_002893280.1| hypothetical protein ARALYDRAFT_472593 [A... 
264 9e-82 ALIGNMENTS >ref|XP_003519788.1| PREDICTED: uncharacterized protein LOC100776135 [Glycine max] Length=417 Score = 270 bits (691), Expect = 5e-84 Identities = 143/278 (51%), Positives = 188/278 (68%), Gaps = 27/278 (10%) Frame = -2 Query 999 QIWRMSGESCPENTVPIRRITEKDVLRASSVRTFGRKLGV-VKKESDGYTHERAIAYMEG 823 Q+W MSGESCPE T+PIRR TE+D+LRASSV FGRK+ V+++++ HE A+ Y+ G Sbjct 119 QLWTMSGESCPEGTIPIRRTTEQDMLRASSVSRFGRKIRRRVRRDTNSNGHEHAVGYVSG 178 Query 822 KKYYGASATMDVWGPKVVNENEFSLSQIWVRAGSFDKGDLNTIEAGWQ------------ 679 ++YYGA A+++VW P+V N++EFSLSQ+WV +GSF DLNTIEAGWQ Sbjct 179 EQYYGAKASINVWAPRVENQDEFSLSQMWVISGSFGD-DLNTIEAGWQVSPEIYGDRYPR 237 Query 678 ------RDAYQKTGCYNLNCQGFVQKSTTIVLGEAIR-TTQYGGQLKELPFDIYKDSISG 520 DAYQ TGCYNL C GFVQ + I +G AI T+ Y G ++ I+KD G Sbjct 238 FFTYWTSDAYQATGCYNLLCSGFVQTNNRIAIGAAISPTSSYAGGQFDISLLIWKDPKHG 297 Query 519 NWWLKVNN-VPVGYWPSILFTYLKVGPADSIQFGGEIINLKSSGTHTKTQMGSGHFPNEG 343 NWWL+ + + VGYWPS LFT+L+ A +QFGGEI+N + SG+HT TQMGSGHF +EG Sbjct 298 NWWLEFGSGILVGYWPSFLFTHLR-DHASMVQFGGEIVNSRQSGSHTSTQMGSGHFASEG 356 Query 342 YGKAAYFKQIQVYSSTNSKNP-ANLLELATSYDNQKCY 232 +GKA+YF+ +QV N+ P +NL LA D+ CY Sbjct 357 FGKASYFRNMQVVDWDNNLVPLSNLRVLA---DHPNCY 391 >ref|XP_003517602.1| PREDICTED: uncharacterized protein LOC100776639 [Glycine max] Length=471 Score = 270 bits (690), Expect = 3e-83 Identities = 142/278 (51%), Positives = 188/278 (68%), Gaps = 27/278 (10%) Frame = -2 Query 999 QIWRMSGESCPENTVPIRRITEKDVLRASSVRTFGRKLGV-VKKESDGYTHERAIAYMEG 823 Q+W MSGESCPE T+PIRR TE+D+LRASSV FGRK+ V+++++ HE A+ Y+ G Sbjct 173 QLWTMSGESCPEGTIPIRRTTEQDMLRASSVSRFGRKIRRRVRRDTNSNGHEHAVGYVSG 232 Query 822 KKYYGASATMDVWGPKVVNENEFSLSQIWVRAGSFDKGDLNTIEAGWQ------------ 679 ++YYGA A+++VW P+V N++EFSLSQ+WV +GSF DLNTIE+GWQ Sbjct 233 EQYYGAKASINVWAPRVANQDEFSLSQMWVISGSFGD-DLNTIESGWQVSPELYGDRYPR 291 Query 678 ------RDAYQKTGCYNLNCQGFVQKSTTIVLGEAIR-TTQYGGQLKELPFDIYKDSISG 520 DAYQ TGCYNL C GFVQ + I +G AI T+ Y G ++ I+KD G Sbjct 292 FFTYWTSDAYQATGCYNLLCSGFVQTNNRIAIGAAISPTSSYAGGQFDISLLIWKDPKHG 
351 Query 519 NWWLKVNN-VPVGYWPSILFTYLKVGPADSIQFGGEIINLKSSGTHTKTQMGSGHFPNEG 343 NWWL+ + + VGYWPS LFT+L+ A +QFGGEI+N + SG+HT TQMGSGHF +EG Sbjct 352 NWWLEFGSGILVGYWPSFLFTHLR-DHASMVQFGGEIVNSRQSGSHTSTQMGSGHFASEG 410 Query 342 YGKAAYFKQIQVYSSTNSKNP-ANLLELATSYDNQKCY 232 +GKA+YF+ +QV N+ P +NL LA D+ CY Sbjct 411 FGKASYFRNMQVVDWDNNLVPLSNLRVLA---DHPNCY 445 >ref|NP_974121.1| uncharacterized protein [Arabidopsis thaliana] gb|AAG52324.1|AC011663_3 unknown protein; 106914-104701 [Arabidopsis thaliana] gb|AAG52474.1|AC010796_13 unknown protein; 47588-49801 [Arabidopsis thaliana] gb|AEE35077.1| uncharacterized protein [Arabidopsis thaliana] Length=410 Score = 268 bits (685), Expect = 3e-83 Identities = 141/285 (49%), Positives = 188/285 (66%), Gaps = 26/285 (9%) Frame = -2 Query 999 QIWRMSGESCPENTVPIRRITEKDVLRASSVRTFGRKLGVVKKESDGYTHERAIAYMEGK 820 Q+W +SGESCPE T+PIRR TE+D+LRASSV+ FGRK+ VK++S HE A+ Y+ G+ Sbjct 113 QLWSLSGESCPEGTIPIRRTTEQDMLRASSVQRFGRKIRRVKRDSTNNGHEHAVGYVTGR 172 Query 819 KYYGASATMDVWGPKVVNENEFSLSQIWVRAGSFDKGDLNTIEAGWQ------------- 679 +YYGA A+++VW P+V ++ EFSLSQIWV AGSF DLNTIEAGWQ Sbjct 173 QYYGAKASINVWSPRVTSQYEFSLSQIWVIAGSFTH-DLNTIEAGWQISPELYGDTYPRF 231 Query 678 -----RDAYQKTGCYNLNCQGFVQKSTTIVLGEAI--RTTQYGGQLKELPFDIYKDSISG 520 DAY+ TGCYNL C GFVQ + I +G AI R++ GGQ ++ I+KD G Sbjct 232 FTYWTSDAYRTTGCYNLLCSGFVQTNRRIAIGAAISPRSSYKGGQF-DISLLIWKDPKHG 290 Query 519 NWWLKV-NNVPVGYWPSILFTYLKVGPADSIQFGGEIINLKSSGTHTKTQMGSGHFPNEG 343 +WWL+ + VGYWP+ LFT+LK +QFGGEI+N + G+HT TQMGSGHF EG Sbjct 291 HWWLQFGSGALVGYWPAFLFTHLK-QHGSMVQFGGEIVNNRPGGSHTTTQMGSGHFAGEG 349 Query 342 YGKAAYFKQIQVYSSTNSKNPANLLELATSYDNQKCYFSGFGKDR 208 +GKA+YF+ +Q+ N+ PA+ L++ + N CY G +R Sbjct 350 FGKASYFRNLQIVDWDNTLIPASNLKILADHPN--CYDIRGGTNR 392 >ref|NP_177212.2| uncharacterized protein [Arabidopsis thaliana] gb|AEE35078.1| uncharacterized protein [Arabidopsis thaliana] Length=465 Score = 268 bits (685), Expect = 1e-82 Identities = 141/285 (49%), Positives = 188/285 (66%), Gaps = 26/285 (9%) Frame = -2 Query 999 
QIWRMSGESCPENTVPIRRITEKDVLRASSVRTFGRKLGVVKKESDGYTHERAIAYMEGK 820 Q+W +SGESCPE T+PIRR TE+D+LRASSV+ FGRK+ VK++S HE A+ Y+ G+ Sbjct 168 QLWSLSGESCPEGTIPIRRTTEQDMLRASSVQRFGRKIRRVKRDSTNNGHEHAVGYVTGR 227 Query 819 KYYGASATMDVWGPKVVNENEFSLSQIWVRAGSFDKGDLNTIEAGWQ------------- 679 +YYGA A+++VW P+V ++ EFSLSQIWV AGSF DLNTIEAGWQ Sbjct 228 QYYGAKASINVWSPRVTSQYEFSLSQIWVIAGSFTH-DLNTIEAGWQISPELYGDTYPRF 286 Query 678 -----RDAYQKTGCYNLNCQGFVQKSTTIVLGEAI--RTTQYGGQLKELPFDIYKDSISG 520 DAY+ TGCYNL C GFVQ + I +G AI R++ GGQ ++ I+KD G Sbjct 287 FTYWTSDAYRTTGCYNLLCSGFVQTNRRIAIGAAISPRSSYKGGQF-DISLLIWKDPKHG 345 Query 519 NWWLKV-NNVPVGYWPSILFTYLKVGPADSIQFGGEIINLKSSGTHTKTQMGSGHFPNEG 343 +WWL+ + VGYWP+ LFT+LK +QFGGEI+N + G+HT TQMGSGHF EG Sbjct 346 HWWLQFGSGALVGYWPAFLFTHLK-QHGSMVQFGGEIVNNRPGGSHTTTQMGSGHFAGEG 404 Query 342 YGKAAYFKQIQVYSSTNSKNPANLLELATSYDNQKCYFSGFGKDR 208 +GKA+YF+ +Q+ N+ PA+ L++ + N CY G +R Sbjct 405 FGKASYFRNLQIVDWDNTLIPASNLKILADHPN--CYDIRGGTNR 447 >ref|XP_002893280.1| hypothetical protein ARALYDRAFT_472593 [Arabidopsis lyrata subsp. lyrata] gb|EFH69539.1| hypothetical protein ARALYDRAFT_472593 [Arabidopsis lyrata subsp. 
lyrata] Length=408 Score = 264 bits (675), Expect = 9e-82 Identities = 136/276 (49%), Positives = 181/276 (66%), Gaps = 24/276 (9%) Frame = -2 Query 999 QIWRMSGESCPENTVPIRRITEKDVLRASSVRTFGRKLGVVKKESDGYTHERAIAYMEGK 820 Q+W +SGESCPE T+PIRR TE+D+LRASSVR FGRK+ V+++S HE A+ Y+ G Sbjct 111 QLWSLSGESCPEGTIPIRRTTEQDMLRASSVRRFGRKIRRVRRDSSSNGHEHAVGYVSGS 170 Query 819 KYYGASATMDVWGPKVVNENEFSLSQIWVRAGSFDKGDLNTIEAGWQ------------- 679 +YYGA A+++VW P+V+++ EFSLSQIWV AGSF DLNTIEAGWQ Sbjct 171 QYYGAKASINVWTPRVISQYEFSLSQIWVIAGSF-ADDLNTIEAGWQISPELYGDTNPRF 229 Query 678 -----RDAYQKTGCYNLNCQGFVQKSTTIVLGEAIR-TTQYGGQLKELPFDIYKDSISGN 517 DAYQ TGCYNL C GFVQ + I +G AI + Y G ++ I+KD G+ Sbjct 230 FTYWTSDAYQATGCYNLLCSGFVQTNNRIAIGAAISPVSSYKGGQFDISLLIWKDPKHGH 289 Query 516 WWLKV-NNVPVGYWPSILFTYLKVGPADSIQFGGEIINLKSSGTHTKTQMGSGHFPNEGY 340 WWL+ + VGYWP LFT+L+ + +QFGGEI+N + G+HT TQMGSGHF EG+ Sbjct 290 WWLQFGSGTLVGYWPVSLFTHLR-EHGNMVQFGGEIVNTRPGGSHTSTQMGSGHFAGEGF 348 Query 339 GKAAYFKQIQVYSSTNSKNPANLLELATSYDNQKCY 232 GKA+YF+ +Q+ N+ P + L++ + N CY Sbjct 349 GKASYFRNLQMVDWDNTLIPISNLKVLADHPN--CY 382 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 1,855,251,573 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 535716129 Number of extensions: 12055677 Number of successful extensions: 27009 Number of sequences better than 1e-10: 37 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 26862 Number of HSP's successfully gapped: 38 Length of query: 1000 Length of database: 6150218869 Length adjustment: 141 Effective length of query: 859 Effective length of database: 3623628025 Effective search space: 695736580800 Effective search space used: 695736580800 T: 12 A: 40 X1: 16 (7.3 
bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 177 (72.8 bits)A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. RID: UPE9VTXP01N Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_6809 Length=581 Score E Sequences producing significant alignments: (Bits) Value ref|XP_002271428.2| PREDICTED: pathogenesis-related protein S... 145 1e-40 ref|XP_002328585.1| predicted protein [Populus trichocarpa] >... 144 3e-40 gb|ACB12048.1| pathogenesis-related protein [Rehmannia glutin... 141 4e-39 gb|ABR10301.1| pathogen-related protein STH-2 [Salvia miltior... 135 7e-37 gb|AAS00053.1| Mal d 1-like [Malus x domestica] >emb|CBL94148... 134 3e-36 ALIGNMENTS >ref|XP_002271428.2| PREDICTED: pathogenesis-related protein STH-2 [Vitis vinifera] emb|CBI22930.3| unnamed protein product [Vitis vinifera] Length=160 Score = 145 bits (365), Expect = 1e-40 Identities = 71/160 (44%), Positives = 110/160 (69%), Gaps = 2/160 (1%) Frame = -2 Query 502 MAVTKHSQEITVSVSPKRMIKALVTDASTFLPKIMSGTIQSIVVLSGNGGPGTIIQTNFS 323 M VT +QE VSP RM KALV D+ +P+++ +++SI + G+GG G+I QTNFS Sbjct 1 MGVTTFTQEFVTPVSPARMFKALVVDSHILVPRLVPESVKSIEFVEGDGGAGSITQTNFS 60 Query 322 DVAGAPVPTAKHRIDALDAEKGTTKFTMIEGAYLGEKIESVVYDVKFEEAGNGGCTIKVT 143 + K++I+A+D EK ++T+IEG LG+++ES+VY++KFEE+G+GGC K Sbjct 61 --GDSDCEYLKYKINAVDKEKLECRYTLIEGGVLGDQLESIVYEMKFEESGDGGCICKTR 118 Query 142 NEYNSKGDAALKDEDIKEIVDRAKGFYIAAEAYLIANLNE 23 +EY++KG+ +K+E I+E ++A G Y EAYL+AN +E Sbjct 119 SEYHTKGEFEIKEESIREGKEKAMGVYKLVEAYLLANPDE 158 >ref|XP_002328585.1| predicted protein [Populus trichocarpa] gb|EEE76932.1| predicted protein [Populus trichocarpa] Length=160 Score = 144 bits (363), Expect = 3e-40 Identities = 70/157 (45%), Positives = 102/157 (65%), 
Gaps = 2/157 (1%) Frame = -2 Query 502 MAVTKHSQEITVSVSPKRMIKALVTDASTFLPKIMSGTIQSIVVLSGNGGPGTIIQTNFS 323 M V ++QE T +SP RM KAL+ D++ +PK++ ++S+ ++ G+GG G+I Q NF+ Sbjct 1 MGVASYTQEFTCPISPARMFKALILDSNNLIPKLLPQIVKSVDLIHGDGGAGSIEQVNFT 60 Query 322 DVAGAPVPTAKHRIDALDAEKGTTKFTMIEGAYLGEKIESVVYDVKFEEAGNGGCTIKVT 143 + G + KHRID LD K+TMIEG LGEK+ES+ Y+V+FE +GGC K+T Sbjct 61 E--GTDIKYVKHRIDELDRVNLVCKYTMIEGDSLGEKLESIAYEVRFEVGSDGGCDCKMT 118 Query 142 NEYNSKGDAALKDEDIKEIVDRAKGFYIAAEAYLIAN 32 + Y GD LK+E+IK D+A+G Y EAYL+ N Sbjct 119 SSYLMLGDFTLKEEEIKAGQDKARGIYKVVEAYLLEN 155 >gb|ACB12048.1| pathogenesis-related protein [Rehmannia glutinosa] Length=154 Score = 141 bits (355), Expect = 4e-39 Identities = 78/162 (48%), Positives = 111/162 (69%), Gaps = 8/162 (5%) Frame = -2 Query 502 MAVTKHSQEITVSVSPKRMIKALVTDASTFLPKIMSGTIQSIVVLSGNGGPGTIIQTNFS 323 M +TKH QE+ + VS KRM KALVT++ + +P + I+SI +L G+G GTI +TN + Sbjct 1 MGITKHIQELKLRVSAKRMFKALVTESHS-IP--LPDAIKSIEILHGDGSAGTIRKTNLA 57 Query 322 DVAGAPVPTAKHRIDALDAEKGTTKFTMIEGAYLGEKIESVVYDVKFEEAGNGGCTIKVT 143 D G+ V K RI+A+D + +K+T+IEG LG+KIES+ Y+ KFE++ +GGC K+ Sbjct 58 D--GSYV---KIRIEAVDIDNQVSKYTVIEGPMLGDKIESIHYEQKFEDSSDGGCVAKIV 112 Query 142 NEYNSKGDAALKDEDIKEIVDRAKGFYIAAEAYLIANLNECA 17 EY++KGD LK+E +K I D+A GFY +E YL AN N CA Sbjct 113 CEYHTKGDIQLKEEGVKAINDQALGFYTLSEEYLHANPNVCA 154 >gb|ABR10301.1| pathogen-related protein STH-2 [Salvia miltiorrhiza] Length=160 Score = 135 bits (340), Expect = 7e-37 Identities = 71/163 (44%), Positives = 106/163 (65%), Gaps = 4/163 (2%) Frame = -2 Query 502 MAVTKHSQEITVSVSPKRMIKALVTDASTFLPKIMSGTIQSIVVLSGNG-GPGTIIQTNF 326 M V QE+ +S R+ KALVT++ +PK + +I+SI ++ G+G PG I QTNF Sbjct 1 MGVKSFFQEMKTKISSSRLFKALVTESPEVVPKFTT-SIKSIELIQGSGYAPGAIFQTNF 59 Query 325 SDVAGAPVPTAKHRIDALDAEKGTTKFTMIEGAYLGEKIESVVYDVKFEEAGNGGCTIKV 146 + GA K R+D +D EK + K+T+IEG LG+K+E + YD+KFE+ +GGC +KV Sbjct 60 PE--GAHFKYMKCRVDEIDHEKHSIKYTLIEGDMLGDKLEKICYDMKFEDTEDGGCVVKV 117 Query 145 TNEYNSKGDAALKDEDIKEIVDRAKGFYIAAEAYLIANLNECA 17 T+EY++KG L DED+K 
+++ G Y + E YL+AN + CA Sbjct 118 TSEYHTKGGYELADEDLKGAKEQSLGMYKSCEDYLLANPHVCA 160 >gb|AAS00053.1| Mal d 1-like [Malus x domestica] emb|CBL94148.1| putative Mal d 1.11 isoallergen [Malus x domestica] Length=163 Score = 134 bits (336), Expect = 3e-36 Identities = 75/160 (47%), Positives = 101/160 (63%), Gaps = 5/160 (3%) Frame = -2 Query 502 MAVTKHSQEITVSVSPKRMIKALVTDASTFLPKIMSGTIQSIVVLSGNGGPGTIIQTNFS 323 M VTK SQ+ V+P+RM AL+ DA PK+M +I+SI LSG+G GTI Q NF+ Sbjct 1 MGVTKISQKFVTQVTPQRMFNALILDAHNICPKLMFSSIKSIEFLSGSGEVGTIKQINFT 60 Query 322 DVAGAPVPTAKHRIDALDAEKGTTKFTMIEGA---YLGEKIESVVYDVKFEEAGNGGCTI 152 + + P+ AKHRIDALD E + +T IE +L +K+E + YDVKFE G GGC Sbjct 61 EAS--PMKYAKHRIDALDKEALSCTYTFIESDATDHLLDKLEYITYDVKFEGYGRGGCIC 118 Query 151 KVTNEYNSKGDAALKDEDIKEIVDRAKGFYIAAEAYLIAN 32 +T+ Y +K D +K+EDI+ DRA G Y EAYL+A+ Sbjct 119 HLTSTYKAKDDIQIKEEDIELGKDRAIGMYEVLEAYLMAH 158 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 327883953 Number of extensions: 7719891 Number of successful extensions: 21928 Number of sequences better than 1e-10: 27 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 21866 Number of HSP's successfully gapped: 28 Length of query: 581 Length of database: 6150218869 Length adjustment: 133 Effective length of query: 448 Effective length of database: 3766980697 Effective search space: 226018841820 Effective search space used: 226018841820 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 173 (71.2 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 
ka-blk-sigma gapped: 43.6362A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. RID: UPEA40KK016 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_7456 Length=562 Score E Sequences producing significant alignments: (Bits) Value ref|XP_002313837.1| predicted protein [Populus trichocarpa] >... 122 1e-31 ref|XP_002885981.1| hypothetical protein ARALYDRAFT_480428 [A... 111 1e-26 ref|XP_002521848.1| hypothetical protein RCOM_0774340 [Ricinu... 108 1e-26 ref|XP_002281892.2| PREDICTED: uncharacterized protein LOC100... 107 9e-26 ref|XP_002281852.1| PREDICTED: uncharacterized protein LOC100... 107 9e-26 ALIGNMENTS >ref|XP_002313837.1| predicted protein [Populus trichocarpa] gb|EEE87792.1| predicted protein [Populus trichocarpa] Length=170 Score = 122 bits (305), Expect = 1e-31 Identities = 61/123 (50%), Positives = 77/123 (63%), Gaps = 10/123 (8%) Frame = -1 Query 529 PGRRNVTQKPKIVYVGDTNTWNFGVDYATWVAKKSPFHLGDTLVFKY-------TKAHSV 371 P R+N T P + VG + W FG++YA W K PF+ DTLVFKY T HSV Sbjct 41 PCRQNSTAAPNKIVVGGSQNWTFGINYADWALKNGPFYFNDTLVFKYDPPSDTNTHPHSV 100 Query 370 YLLPDLDSYTKCDIAKGKLIGA-TNAGSKGFKYTLKA-QKSYFACGEGGNGLHCNQGNMK 197 YLLP+L S+ KCD+++ KL+ + T G GF++ LK+ Q YFACG GG G HCN G MK Sbjct 101 YLLPNLWSFLKCDLSRAKLVASETQGGGDGFEFVLKSWQPHYFACG-GGAGFHCNNGTMK 159 Query 196 FIV 188 F V Sbjct 160 FFV 162 >ref|XP_002885981.1| hypothetical protein ARALYDRAFT_480428 [Arabidopsis lyrata subsp. lyrata] gb|EFH62240.1| hypothetical protein ARALYDRAFT_480428 [Arabidopsis lyrata subsp. 
lyrata] Length=261 Score = 111 bits (277), Expect = 1e-26 Identities = 55/111 (50%), Positives = 72/111 (65%), Gaps = 7/111 (6%) Frame = -1 Query 502 PKIVYVGDTNTWNFGVDYATWVAKKSPFHLGDTLVFKYTK----AHSVYLLPDLDSYTKC 335 P+ + VG WN+GV+YA W +K +PF L D LVFKY HSVYLLP+ SY KC Sbjct 145 PRKIIVGGDKEWNYGVNYAEWASKTAPFFLNDILVFKYNPPAPFTHSVYLLPNPSSYEKC 204 Query 334 DIAKGKLIGATNAGS-KGFKYTLKAQKSYF-ACGEGGNGLHCNQGNMKFIV 188 D+ KGK+I + G+ KGF++ LK + Y+ +CGE +G HCN G MKF V Sbjct 205 DVKKGKMIASPKQGAGKGFEFVLKQMRPYYISCGE-HDGAHCNNGTMKFTV 254 >ref|XP_002521848.1| hypothetical protein RCOM_0774340 [Ricinus communis] gb|EEF40484.1| hypothetical protein RCOM_0774340 [Ricinus communis] Length=138 Score = 108 bits (269), Expect = 1e-26 Identities = 55/117 (47%), Positives = 70/117 (60%), Gaps = 10/117 (9%) Frame = -1 Query 511 TQKPKIVYVGDTNTWNFGVDYATWVAKKSPFHLGDTLVFKY-------TKAHSVYLLPDL 353 T+ PK + VG + W FG DY W + SPF++ DTLVFKY T HSVYLLP+L Sbjct 17 TRTPKKIVVGGSAKWTFGFDYTDWAFRNSPFYVNDTLVFKYKLPKDNSTHPHSVYLLPNL 76 Query 352 DSYTKCDIAKG-KLIGATNAGSKGFKYTLKAQKS-YFACGEGGNGLHCNQGNMKFIV 188 S+ C++ K K+ G KGF++ L K YFACG GG+G+HC G MKF V Sbjct 77 SSFVTCNLTKAVKVADGKQGGGKGFRFVLNKWKPYYFACG-GGDGIHCGLGQMKFYV 132 >ref|XP_002281892.2| PREDICTED: uncharacterized protein LOC100255503 [Vitis vinifera] emb|CBI32457.3| unnamed protein product [Vitis vinifera] Length=181 Score = 107 bits (266), Expect = 9e-26 Identities = 57/126 (45%), Positives = 72/126 (57%), Gaps = 9/126 (7%) Frame = -1 Query 541 PPAAPGRRNVTQKPKIVYVGDTNTWNFGVDYATWVAKKSPFHLGDTLVFKY------TKA 380 P P N T+ PK VG + W +G +Y W K PF++ DTLVFKY T Sbjct 48 PRNNPFHPNNTRPPKKFIVGGSERWRYGFNYTDWALKNGPFYINDTLVFKYDPPNSTTFP 107 Query 379 HSVYLLPDLDSYTKCDIAKGKLIG-ATNAGSKGFKYTLK-AQKSYFACGEGGNGLHCNQG 206 HSVYLLP+ S+ CD+++ K + GSKGF++ LK YFACGE NGLHC +G Sbjct 108 HSVYLLPNFGSFLTCDLSRAKQVATVAQGGSKGFEFVLKNLWPHYFACGE-HNGLHCKEG 166 Query 205 NMKFIV 188 MKF V Sbjct 167 MMKFSV 172 >ref|XP_002281852.1| PREDICTED: uncharacterized protein LOC100248603 [Vitis vinifera] 
emb|CBI32459.3| unnamed protein product [Vitis vinifera] Length=181 Score = 107 bits (266), Expect = 9e-26 Identities = 57/126 (45%), Positives = 72/126 (57%), Gaps = 9/126 (7%) Frame = -1 Query 541 PPAAPGRRNVTQKPKIVYVGDTNTWNFGVDYATWVAKKSPFHLGDTLVFKY------TKA 380 P P N T+ PK VG + W +G +Y W K PF++ DTLVFKY T Sbjct 48 PRNNPFHPNNTRPPKKFIVGGSERWRYGFNYTDWALKNGPFYINDTLVFKYDPPNSTTFP 107 Query 379 HSVYLLPDLDSYTKCDIAKGKLIG-ATNAGSKGFKYTLK-AQKSYFACGEGGNGLHCNQG 206 HSVYLLP+ S+ CD+++ K + GSKGF++ LK YFACGE NGLHC +G Sbjct 108 HSVYLLPNFGSFLTCDLSRAKQVATVAQGGSKGFEFVLKNLWPHYFACGE-HNGLHCKEG 166 Query 205 NMKFIV 188 MKF V Sbjct 167 MMKFSV 172 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 263332965 Number of extensions: 5551168 Number of successful extensions: 11337 Number of sequences better than 1e-10: 0 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 11323 Number of HSP's successfully gapped: 0 Length of query: 562 Length of database: 6150218869 Length adjustment: 132 Effective length of query: 430 Effective length of database: 3784899781 Effective search space: 208169487955 Effective search space used: 208169487955 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 172 (70.9 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. 
RID: UPEA6VHH01N Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_7457 Length=562 Score E Sequences producing significant alignments: (Bits) Value ref|XP_003634199.1| PREDICTED: uncharacterized GPI-anchored p... 141 2e-37 emb|CBI15549.3| unnamed protein product [Vitis vinifera] 141 5e-37 emb|CAN82280.1| hypothetical protein VITISV_044064 [Vitis vin... 139 1e-36 ref|XP_003525199.1| PREDICTED: uncharacterized GPI-anchored p... 125 4e-31 gb|ACU18611.1| unknown [Glycine max] 123 1e-30 ALIGNMENTS >ref|XP_003634199.1| PREDICTED: uncharacterized GPI-anchored protein At4g28100-like [Vitis vinifera] Length=327 Score = 141 bits (355), Expect = 2e-37 Identities = 82/139 (59%), Positives = 102/139 (73%), Gaps = 7/139 (5%) Frame = +2 Query 2 EAVKRLEKDCFNhghggglggCSKCLNSLYSLNVDKsersnttersntttdrtrKMHNQD 181 E+VK+LE+DC + GLGGCSKCLN+LY L DK+ S+ E + KMHN+D Sbjct 181 ESVKKLERDCLSTNGFPGLGGCSKCLNTLYLLGKDKTGNSSKLEARSR------KMHNRD 234 Query 182 CEIMGLTWLLNKNRSAYIHTVSAVLRSLMINLDQDSDPTYCSLNSDGMPLAVDSSEINSQ 361 CE+MGLTWLL KNR+AYIHTVSAVLR++M++ D SDP C+LNSDGMPLAVDS+EI+S Sbjct 235 CELMGLTWLLAKNRTAYIHTVSAVLRAIMMSTD-GSDPLSCTLNSDGMPLAVDSAEISSH 293 Query 362 SLSTVLRFYNHMLPISLLF 418 S ST L F H+ +SL F Sbjct 294 SSSTSLHFSIHLCILSLSF 312 >emb|CBI15549.3| unnamed protein product [Vitis vinifera] Length=374 Score = 141 bits (355), Expect = 5e-37 Identities = 82/139 (59%), Positives = 102/139 (73%), Gaps = 7/139 (5%) Frame = +2 Query 2 EAVKRLEKDCFNhghggglggCSKCLNSLYSLNVDKsersnttersntttdrtrKMHNQD 181 E+VK+LE+DC + GLGGCSKCLN+LY L DK+ S+ E + KMHN+D Sbjct 181 ESVKKLERDCLSTNGFPGLGGCSKCLNTLYLLGKDKTGNSSKLEARSR------KMHNRD 234 Query 182 CEIMGLTWLLNKNRSAYIHTVSAVLRSLMINLDQDSDPTYCSLNSDGMPLAVDSSEINSQ 361 CE+MGLTWLL KNR+AYIHTVSAVLR++M++ D SDP C+LNSDGMPLAVDS+EI+S Sbjct 235 CELMGLTWLLAKNRTAYIHTVSAVLRAIMMSTD-GSDPLSCTLNSDGMPLAVDSAEISSH 293 Query 362 SLSTVLRFYNHMLPISLLF 418 S 
ST L F H+ +SL F Sbjct 294 SSSTSLHFSIHLCILSLSF 312 >emb|CAN82280.1| hypothetical protein VITISV_044064 [Vitis vinifera] Length=328 Score = 139 bits (350), Expect = 1e-36 Identities = 81/139 (58%), Positives = 101/139 (73%), Gaps = 7/139 (5%) Frame = +2 Query 2 EAVKRLEKDCFNhghggglggCSKCLNSLYSLNVDKsersnttersntttdrtrKMHNQD 181 E+VK+LE+DC + GLGGCSKCLN+LY L DK+ S+ E + KMHN+D Sbjct 182 ESVKKLERDCLSTNGFPGLGGCSKCLNTLYLLGKDKTGNSSKLEARSR------KMHNRD 235 Query 182 CEIMGLTWLLNKNRSAYIHTVSAVLRSLMINLDQDSDPTYCSLNSDGMPLAVDSSEINSQ 361 CE+MGLTWLL KNR+ YIHTVSAVLR++M++ D SDP C+LNSDGMPLAVDS+EI+S Sbjct 236 CELMGLTWLLAKNRTXYIHTVSAVLRAIMMSTD-GSDPLSCTLNSDGMPLAVDSAEISSH 294 Query 362 SLSTVLRFYNHMLPISLLF 418 S ST L F H+ +SL F Sbjct 295 SSSTSLHFSIHLCILSLSF 313 >ref|XP_003525199.1| PREDICTED: uncharacterized GPI-anchored protein At4g28100-like [Glycine max] Length=360 Score = 125 bits (313), Expect = 4e-31 Identities = 76/131 (58%), Positives = 97/131 (74%), Gaps = 12/131 (9%) Frame = +2 Query 2 EAVKRLEKDCF----NhghggglggCSKCLNSLYSLNVDKsersnttersntttdrtrKM 169 ++VKRLE+DCF N GLGGCSKCL+SLYSL + S S + +R+ K+ Sbjct 201 QSVKRLERDCFSSSTNVNKFPGLGGCSKCLHSLYSLRKNSSNSSKSEDRTT-------KI 253 Query 170 HNQDCEIMGLTWLLNKNRSAYIHTVSAVLRSLMINLDQDSDPTYCSLNSDGMPLAVDSSE 349 HN+DCE+MGLTWLL KNR+AYIHTVS VLR+LM++ + SDP C+LNSDGMPLAVDSSE Sbjct 254 HNKDCELMGLTWLLAKNRTAYIHTVSGVLRALMLS-TEGSDPQSCTLNSDGMPLAVDSSE 312 Query 350 INSQSLSTVLR 382 ++ +S ST L+ Sbjct 313 MSDESSSTNLQ 323 >gb|ACU18611.1| unknown [Glycine max] Length=360 Score = 123 bits (309), Expect = 1e-30 Identities = 76/131 (58%), Positives = 96/131 (73%), Gaps = 12/131 (9%) Frame = +2 Query 2 EAVKRLEKDCF----NhghggglggCSKCLNSLYSLNVDKsersnttersntttdrtrKM 169 ++VKRLE DCF N GLGGCSKCL+SLYSL + S S + +R+ K+ Sbjct 201 QSVKRLEGDCFSSGTNVNKFPGLGGCSKCLHSLYSLRKNSSNSSKSEDRTT-------KI 253 Query 170 HNQDCEIMGLTWLLNKNRSAYIHTVSAVLRSLMINLDQDSDPTYCSLNSDGMPLAVDSSE 349 HN+DCE+MGLTWLL KNR+AYIHTVS VLR+LM++ + SDP C+LNSDGMPLAVDSSE Sbjct 254 
HNKDCELMGLTWLLAKNRTAYIHTVSGVLRALMLS-TEGSDPQSCTLNSDGMPLAVDSSE 312 Query 350 INSQSLSTVLR 382 ++ +S ST L+ Sbjct 313 MSDESSSTNLQ 323 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 244898668 Number of extensions: 4517584 Number of successful extensions: 10596 Number of sequences better than 1e-10: 6 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 10583 Number of HSP's successfully gapped: 6 Length of query: 562 Length of database: 6150218869 Length adjustment: 132 Effective length of query: 430 Effective length of database: 3784899781 Effective search space: 208169487955 Effective search space used: 208169487955 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 172 (70.9 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. RID: UPEAKT7901N Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_7825 Length=552 Score E Sequences producing significant alignments: (Bits) Value ref|XP_002298184.1| predicted protein [Populus trichocarpa] >... 66.2 3e-11 ref|XP_003560817.1| PREDICTED: chemocyanin-like [Brachypodium... 
66.2 3e-11 gb|ACG29532.1| chemocyanin precursor [Zea mays] 65.5 5e-11 ref|NP_001149596.1| chemocyanin precursor [Zea mays] >gb|ACG3... 65.1 7e-11 gb|AEC10985.1| basic blue protein [Camellia sinensis] 64.7 9e-11 ALIGNMENTS >ref|XP_002298184.1| predicted protein [Populus trichocarpa] gb|EEE82989.1| predicted protein [Populus trichocarpa] Length=125 Score = 66.2 bits (160), Expect = 3e-11 Identities = 34/60 (57%), Positives = 39/60 (65%), Gaps = 1/60 (2%) Frame = -3 Query 334 KHIYAGDTLQFIYNKAAGYNVVHVNKAGYDNCNGNKAIKTYTSGNDTITLTKGLNFFICT 155 K AGD L F Y+ AA +NVV VNKAGY +C + K YTSG D I L KG NFFIC+ Sbjct 51 KSFKAGDILVFNYSTAA-HNVVAVNKAGYSSCTSPRGAKVYTSGKDQIKLVKGQNFFICS 109 >ref|XP_003560817.1| PREDICTED: chemocyanin-like [Brachypodium distachyon] Length=130 Score = 66.2 bits (160), Expect = 3e-11 Identities = 30/60 (50%), Positives = 42/60 (70%), Gaps = 1/60 (2%) Frame = -3 Query 334 KHIYAGDTLQFIYNKAAGYNVVHVNKAGYDNCNGNKAIKTYTSGNDTITLTKGLNFFICT 155 K AGD LQF Y + A +NVV VN AGY +C+ + K Y+SGND++ L++G N+FIC+ Sbjct 56 KRFRAGDVLQFKYGRGA-HNVVAVNAAGYKSCSAPRGAKVYSSGNDSVKLSRGTNYFICS 114 >gb|ACG29532.1| chemocyanin precursor [Zea mays] Length=129 Score = 65.5 bits (158), Expect = 5e-11 Identities = 31/60 (52%), Positives = 39/60 (65%), Gaps = 1/60 (2%) Frame = -3 Query 334 KHIYAGDTLQFIYNKAAGYNVVHVNKAGYDNCNGNKAIKTYTSGNDTITLTKGLNFFICT 155 K AGD L F Y+ A +NVV VN AGY C+ + K YTSGND +TL +G N+FIC+ Sbjct 55 KRFKAGDVLVFKYDSTA-HNVVAVNAAGYKGCSAPRGAKVYTSGNDRVTLARGTNYFICS 113 >ref|NP_001149596.1| chemocyanin precursor [Zea mays] gb|ACG35999.1| chemocyanin precursor [Zea mays] Length=129 Score = 65.1 bits (157), Expect = 7e-11 Identities = 31/60 (52%), Positives = 39/60 (65%), Gaps = 1/60 (2%) Frame = -3 Query 334 KHIYAGDTLQFIYNKAAGYNVVHVNKAGYDNCNGNKAIKTYTSGNDTITLTKGLNFFICT 155 K AGD L F Y+ A +NVV VN AGY C+ + K YTSGND +TL +G N+FIC+ Sbjct 55 KRFKAGDVLVFKYDSTA-HNVVVVNAAGYKGCSAPRGAKVYTSGNDRVTLARGTNYFICS 113 >gb|AEC10985.1| basic blue protein [Camellia sinensis] Length=122 Score = 64.7 bits (156), 
Expect = 9e-11 Identities = 32/60 (53%), Positives = 38/60 (63%), Gaps = 1/60 (2%) Frame = -3 Query 334 KHIYAGDTLQFIYNKAAGYNVVHVNKAGYDNCNGNKAIKTYTSGNDTITLTKGLNFFICT 155 K AGD L F YN A +NVV VNKAGYD+C + ++SG D I L KG NFFIC+ Sbjct 48 KRFRAGDILAFNYN-AQAHNVVSVNKAGYDSCKAPAGARVFSSGKDQIKLVKGQNFFICS 106 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 255915547 Number of extensions: 5362215 Number of successful extensions: 12058 Number of sequences better than 1e-10: 0 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 12044 Number of HSP's successfully gapped: 0 Length of query: 552 Length of database: 6150218869 Length adjustment: 132 Effective length of query: 420 Effective length of database: 3784899781 Effective search space: 196814788612 Effective search space used: 196814788612 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 172 (70.9 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. 
RID: UPEB3BST013 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_7997 Length=547 Score E Sequences producing significant alignments: (Bits) Value ref|XP_003523208.1| PREDICTED: small nuclear ribonucleoprotei... 165 1e-49 ref|XP_003542280.1| PREDICTED: small nuclear ribonucleoprotei... 165 2e-49 ref|XP_003597961.1| Small nuclear ribonucleoprotein E [Medica... 164 4e-49 ref|XP_002522808.1| Small nuclear ribonucleoprotein E, putati... 164 4e-49 ref|XP_002331664.1| predicted protein [Populus trichocarpa] >... 164 4e-49 ALIGNMENTS >ref|XP_003523208.1| PREDICTED: small nuclear ribonucleoprotein E-like [Glycine max] ref|XP_003526883.1| PREDICTED: small nuclear ribonucleoprotein E [Glycine max] Length=88 Score = 165 bits (418), Expect = 1e-49 Identities = 83/86 (97%), Positives = 85/86 (99%), Gaps = 0/86 (0%) Frame = +1 Query 1 ASTKVQRIMTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRIIGFDEYMNLVLDEAEEVN 180 ASTKVQR+MTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRIIGFDEYMNLVLD+AEEVN Sbjct 2 ASTKVQRVMTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRIIGFDEYMNLVLDDAEEVN 61 Query 181 VKKKSRKQLGRILLKGDNITLMMNTG 258 VKKKSRK LGRILLKGDNITLMMNTG Sbjct 62 VKKKSRKTLGRILLKGDNITLMMNTG 87 >ref|XP_003542280.1| PREDICTED: small nuclear ribonucleoprotein E-like [Glycine max] gb|ACU14300.1| unknown [Glycine max] Length=88 Score = 165 bits (417), Expect = 2e-49 Identities = 82/86 (95%), Positives = 85/86 (99%), Gaps = 0/86 (0%) Frame = +1 Query 1 ASTKVQRIMTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRIIGFDEYMNLVLDEAEEVN 180 ASTKVQR+MTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRIIGFDEYMNLVLD+AEEVN Sbjct 2 ASTKVQRVMTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRIIGFDEYMNLVLDDAEEVN 61 Query 181 VKKKSRKQLGRILLKGDNITLMMNTG 258 +KKKSRK LGRILLKGDNITLMMNTG Sbjct 62 IKKKSRKTLGRILLKGDNITLMMNTG 87 >ref|XP_003597961.1| Small nuclear ribonucleoprotein E [Medicago truncatula] ref|XP_003615807.1| Small nuclear ribonucleoprotein E [Medicago 
truncatula] gb|AES68212.1| Small nuclear ribonucleoprotein E [Medicago truncatula] gb|AES98765.1| Small nuclear ribonucleoprotein E [Medicago truncatula] Length=88 Score = 164 bits (415), Expect = 4e-49 Identities = 82/86 (95%), Positives = 85/86 (99%), Gaps = 0/86 (0%) Frame = +1 Query 1 ASTKVQRIMTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRIIGFDEYMNLVLDEAEEVN 180 ASTKVQR+MTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRIIGFDEYMNLVLD+AEEVN Sbjct 2 ASTKVQRVMTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRIIGFDEYMNLVLDDAEEVN 61 Query 181 VKKKSRKQLGRILLKGDNITLMMNTG 258 VKKKS+K LGRILLKGDNITLMMNTG Sbjct 62 VKKKSKKTLGRILLKGDNITLMMNTG 87 >ref|XP_002522808.1| Small nuclear ribonucleoprotein E, putative [Ricinus communis] gb|EEF39659.1| Small nuclear ribonucleoprotein E, putative [Ricinus communis] Length=88 Score = 164 bits (415), Expect = 4e-49 Identities = 83/86 (97%), Positives = 84/86 (98%), Gaps = 0/86 (0%) Frame = +1 Query 1 ASTKVQRIMTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRIIGFDEYMNLVLDEAEEVN 180 ASTKVQRIMTQPINLIFRFLQSKARIQ WLFEQKDLRIEGRIIGFDEYMNLVLD+AEEVN Sbjct 2 ASTKVQRIMTQPINLIFRFLQSKARIQFWLFEQKDLRIEGRIIGFDEYMNLVLDDAEEVN 61 Query 181 VKKKSRKQLGRILLKGDNITLMMNTG 258 VKKKSRK LGRILLKGDNITLMMNTG Sbjct 62 VKKKSRKTLGRILLKGDNITLMMNTG 87 >ref|XP_002331664.1| predicted protein [Populus trichocarpa] ref|XP_002325079.1| predicted protein [Populus trichocarpa] gb|ABK93602.1| unknown [Populus trichocarpa] gb|EEF03644.1| predicted protein [Populus trichocarpa] gb|EEF11214.1| predicted protein [Populus trichocarpa] Length=88 Score = 164 bits (415), Expect = 4e-49 Identities = 82/86 (95%), Positives = 85/86 (99%), Gaps = 0/86 (0%) Frame = +1 Query 1 ASTKVQRIMTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRIIGFDEYMNLVLDEAEEVN 180 ASTKVQRIMTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRIIGFDEYMNLVL++AEEVN Sbjct 2 ASTKVQRIMTQPINLIFRFLQSKARIQIWLFEQKDLRIEGRIIGFDEYMNLVLEDAEEVN 61 Query 181 VKKKSRKQLGRILLKGDNITLMMNTG 258 +KKKSRK LGRILLKGDNITLMMNTG Sbjct 62 IKKKSRKSLGRILLKGDNITLMMNTG 87 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding 
environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 231300735 Number of extensions: 4031322 Number of successful extensions: 9114 Number of sequences better than 1e-10: 11 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 9099 Number of HSP's successfully gapped: 11 Length of query: 547 Length of database: 6150218869 Length adjustment: 132 Effective length of query: 415 Effective length of database: 3784899781 Effective search space: 189244989050 Effective search space used: 189244989050 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 172 (70.9 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. RID: UPE1825Y01S Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_837 Length=960 Score E Sequences producing significant alignments: (Bits) Value ref|NP_566697.1| cysteine-rich repeat secretory protein 38 [A... 122 2e-29 dbj|BAJ34612.1| unnamed protein product [Thellungiella haloph... 119 2e-28 ref|XP_002883339.1| hypothetical protein ARALYDRAFT_479720 [A... 118 4e-28 ref|XP_002274668.1| PREDICTED: cysteine-rich repeat secretory... 117 8e-28 ref|XP_002327925.1| predicted protein [Populus trichocarpa] >... 
115 4e-27 ALIGNMENTS >ref|NP_566697.1| cysteine-rich repeat secretory protein 38 [Arabidopsis thaliana] sp|Q9LRJ9.1|CRR38_ARATH RecName: Full=Cysteine-rich repeat secretory protein 38; Flags: Precursor dbj|BAB01391.1| unnamed protein product [Arabidopsis thaliana] gb|AAK59407.1| unknown protein [Arabidopsis thaliana] gb|ABD91491.1| At3g22060 [Arabidopsis thaliana] gb|AEE76585.1| cysteine-rich repeat secretory protein 38 [Arabidopsis thaliana] Length=252 Score = 122 bits (305), Expect = 2e-29 Identities = 80/222 (36%), Positives = 114/222 (51%), Gaps = 13/222 (6%) Frame = +1 Query 109 YNCSSTSGMYKPQSTYKSNLMKLLANLNNTLYGNNVTFSNTSYGKSPDQVYGLALCHADT 288 + CS G + +S Y+SNL L + L+ + F+ +S G +P+ V GLALC D Sbjct 37 HKCSDIEGSFTSKSLYESNLNNLFSQLSYKVPSTG--FAASSTGNTPNNVNGLALCRGDA 94 Query 289 YSHNCHDCVNFITQYITHFCQNKKAVIYWRDYCYVKYSNRNFFGEVDTDNKLTL--VMDV 462 S +C C+ + C N KA I W D C VKYS+ NFFG++D +N+ L V +V Sbjct 95 SSSDCRSCLETAIPELRQRCPNNKAGIVWYDNCLVKYSSTNFFGKIDFENRFYLYNVKNV 154 Query 463 HDISYsltkslttttfkllsdlkNKATANNNHMIMFARGTAVGRQDRFVIFGLVQCSGDL 642 D S + + T LL++L KAT +N + +G+ ++GLVQC+ DL Sbjct 155 SDPS-----TFNSQTKALLTELTKKATTRDNQKLFATGEKNIGKNK---LYGLVQCTRDL 206 Query 643 SKYNCGQCLNYTIGQLPKNLTSL-GMKAVTGSCVVRYDSYHF 765 C CLN IG+LP G + V GSC RY+ Y F Sbjct 207 KSITCKACLNGIIGELPNCCDGKEGGRVVGGSCNFRYEIYPF 248 >dbj|BAJ34612.1| unnamed protein product [Thellungiella halophila] Length=255 Score = 119 bits (298), Expect = 2e-28 Identities = 82/220 (37%), Positives = 119/220 (54%), Gaps = 9/220 (4%) Frame = +1 Query 109 YNCSSTSGMYKPQSTYKSNLMKLLANLNNTLYGNNVTFSNTSYGKSPDQVYGLALCHADT 288 + CS G + +S Y+SNL L ++ + + F+ +S G SPD V GLALC D Sbjct 40 HKCSDIEGNFTSKSPYESNLDSLFRRISYRVPSSG--FAASSAGNSPDNVNGLALCRGDA 97 Query 289 YSHNCHDCVNFITQYITHFCQNKKAVIYWRDYCYVKYSNRNFFGEVDTDNKLTLVMDVHD 468 S +C C+ + C N KA I W D C VKYS+ NFFG++D +N+ L +V++ Sbjct 98 SSSDCGSCLATAIPELRQRCPNNKAGIIWYDNCLVKYSSTNFFGKIDYENRFYL-YNVNN 156 Query 469 ISYsltkslttttfkllsdlkNKATANNNHMIMFARGTAVGRQDRFVIFGLVQCSGDLSK 648 +S S T T LL++L 
KAT +N +FA G ++ ++GLVQC+ DL + Sbjct 157 VS--DPASFNTQTKALLTELTQKATTGDNQK-LFATGEK--NLEKKKLYGLVQCTRDLRR 211 Query 649 YNCGQCLNYTIGQLPKNLTSL-GMKAVTGSCVVRYDSYHF 765 +C CL+ IG+LP G + V GSC RY+ Y F Sbjct 212 ESCKACLDGIIGELPNCCDGKEGGRVVGGSCNFRYEIYPF 251 >ref|XP_002883339.1| hypothetical protein ARALYDRAFT_479720 [Arabidopsis lyrata subsp. lyrata] gb|EFH59598.1| hypothetical protein ARALYDRAFT_479720 [Arabidopsis lyrata subsp. lyrata] Length=252 Score = 118 bits (296), Expect = 4e-28 Identities = 80/220 (36%), Positives = 113/220 (51%), Gaps = 9/220 (4%) Frame = +1 Query 109 YNCSSTSGMYKPQSTYKSNLMKLLANLNNTLYGNNVTFSNTSYGKSPDQVYGLALCHADT 288 + CS G + +S Y+SNL L L+ + F+ +S G +PD V GLALC D Sbjct 37 HKCSDIEGSFTSKSPYESNLNNLFPQLSYKVPSTG--FATSSAGITPDNVNGLALCRGDA 94 Query 289 YSHNCHDCVNFITQYITHFCQNKKAVIYWRDYCYVKYSNRNFFGEVDTDNKLTLVMDVHD 468 S +C C+ I C + KA I W D C VKYS+ NFFG++D +N+ L +V++ Sbjct 95 SSSDCSSCLATAIPEIRQRCPSNKAGIIWYDNCLVKYSSTNFFGKIDFENRFYL-YNVNN 153 Query 469 ISYsltkslttttfkllsdlkNKATANNNHMIMFARGTAVGRQDRFVIFGLVQCSGDLSK 648 +S + T T LL+ L KAT +N + +G + ++GLVQC+ DL Sbjct 154 VS--DPSTFNTQTKALLTKLTKKATTGDNQKLFATGEKNIGMKK---LYGLVQCTRDLKS 208 Query 649 YNCGQCLNYTIGQLPKNLTSL-GMKAVTGSCVVRYDSYHF 765 C CLN IG+LP G + V GSC RY+ Y F Sbjct 209 EACKACLNGIIGELPNCCDGKEGGRVVGGSCNFRYEIYPF 248 >ref|XP_002274668.1| PREDICTED: cysteine-rich repeat secretory protein 38-like [Vitis vinifera] Length=242 Score = 117 bits (293), Expect = 8e-28 Identities = 79/224 (35%), Positives = 116/224 (52%), Gaps = 10/224 (4%) Frame = +1 Query 97 TKPQYNCSSTSGMYKPQSTYKSNLMKLLANLNNTLYGNNVTFSNTSYGKSPDQVYGLALC 276 T P Y+ S+S + TY++NL KL+ L L F S G++ DQV GLALC Sbjct 25 TDPLYHFCSSSQKFIDNGTYETNLNKLMGYLY--LAAPPTGFRKGSVGENNDQVNGLALC 82 Query 277 HADTYSHNCHDCVNFITQYITHFCQNKKAVIYWRDYCYVKYSNRNFFGEVDTDNKLTLVM 456 D + +C C+ + I C + KA I W DYC +KYSN NFFG+VD N + + Sbjct 83 RGDVSNTDCKACITESSSEIRKRCPDNKAAIIWYDYCLLKYSNVNFFGQVDHQN-MFYMW 141 Query 457 DVHDISYsltkslttttfkllsdlkNKATANNNHMIMFARGTAVGRQDRFVIFGLVQCSG 636 +++++S + K L N 
A N M M+A G + ++ ++GL QC+ Sbjct 142 NLNNVS-----DPDSFNQKTKELLSNLAQQAFNAMKMYATG-ELELEESEKLYGLTQCTR 195 Query 637 DLSKYNCGQCLNYTIGQLPKNLTSL-GMKAVTGSCVVRYDSYHF 765 DLS +C +CL+ I +LP G + V GSC +RY+ Y F Sbjct 196 DLSSSDCKKCLDDAISKLPNCCDGKEGGRVVGGSCNIRYEIYPF 239 >ref|XP_002327925.1| predicted protein [Populus trichocarpa] gb|EEE75713.1| predicted protein [Populus trichocarpa] Length=242 Score = 115 bits (288), Expect = 4e-27 Identities = 81/224 (36%), Positives = 112/224 (50%), Gaps = 14/224 (6%) Frame = +1 Query 103 PQYNCSSTSGMYKPQSTYKSNLMKLLANLNNTLYGNNVT-FSNTSYGKSPDQVYGLALCH 279 P ++ S+ + Y+SNL KL + L Y T F S G +PDQ YGLALC Sbjct 27 PNFHLCSSPENFTANGPYESNLKKLTSYL---YYKAPPTGFGMGSRGHTPDQTYGLALCR 83 Query 280 ADTYSHNCHDCVNFITQYITHFCQNKKAVIYWRDYCYVKYSNRNFFGEVDTDNKLTLVMD 459 D + +C CV + I C KA I W D C +KYSN FFG+VDT NK + + Sbjct 84 GDVSTSDCKTCVFEASSEIRKRCPYNKAAIIWYDNCLLKYSNTGFFGQVDTGNKF-YMWN 142 Query 460 VHDISYsltkslttttfkllsdlkNKATANNNHMIMFARG-TAVGRQDRFVIFGLVQCSG 636 VH +S + T + +AT +FA G +G+ + ++GLVQC+G Sbjct 143 VHVVSKPAPFNKKTKELLSQLANEAQATPK-----LFATGERELGKSTK--LYGLVQCTG 195 Query 637 DLSKYNCGQCLNYTIGQLPKNLT-SLGMKAVTGSCVVRYDSYHF 765 DLS C +CL+ IG+LP G + V+GSC Y+ Y F Sbjct 196 DLSSAVCKKCLDGIIGELPSCCDGKQGGRVVSGSCNFIYELYPF 239 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 510557841 Number of extensions: 10815912 Number of successful extensions: 24913 Number of sequences better than 1e-10: 14 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 24856 Number of HSP's successfully gapped: 18 Length of query: 960 Length of database: 6150218869 Length adjustment: 140 Effective length 
of query: 820 Effective length of database: 3641547109 Effective search space: 655478479620 Effective search space used: 655478479620 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 177 (72.8 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. RID: UPDY3X4S01N Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_84 Length=1352 Score E Sequences producing significant alignments: (Bits) Value gb|AAQ09999.1| putative fructokinase 2 [Petunia integrifolia ... 407 1e-136 gb|AAQ10000.1| putative fructokinase 2 [Petunia integrifolia ... 407 1e-136 ref|XP_002533363.1| fructokinase, putative [Ricinus communis]... 405 6e-136 gb|AAA80675.1| fructokinase [Beta vulgaris] 398 3e-133 emb|CAD31714.1| fructokinase-like protein [Cicer arietinum] 393 9e-133 ALIGNMENTS >gb|AAQ09999.1| putative fructokinase 2 [Petunia integrifolia subsp. 
inflata] Length=328 Score = 407 bits (1045), Expect = 1e-136 Identities = 198/234 (85%), Positives = 216/234 (92%), Gaps = 0/234 (0%) Frame = -2 Query 1351 DKGARTALAFVTLRADGEREFMFYRNPSADMLLTPDELNLGIIRSAKVFHYGSISLIVEP 1172 D GARTALAFVTLRADGEREFMFYRNPSADMLLTPDELNL +IRSAK+FHYGSISLIVEP Sbjct 91 DTGARTALAFVTLRADGEREFMFYRNPSADMLLTPDELNLDVIRSAKIFHYGSISLIVEP 150 Query 1171 CRSAHLKAMDEAKSAGALLSYDPNLRLPLWPSAKYAKEQIMSIWEKADIIKVSDNELEFL 992 CRSAHLKAM+ AK AGALLSYDPNLRLPLWPSA+ A++QI SIW+KAD+IKVSDNELEFL Sbjct 151 CRSAHLKAMEVAKEAGALLSYDPNLRLPLWPSAEEARKQIKSIWDKADVIKVSDNELEFL 210 Query 991 TGSSKIDDEAAMSLWHPKLQLLLVTLGEKGCRYYNKNFRGSIDGYHVKTVDTTGAGDSFV 812 TGS KIDDE+AMSLWHP L+LLLVTLGEKGCRYY KNF G ++G+HVKTVDTTGAGDSFV Sbjct 211 TGSDKIDDESAMSLWHPNLKLLLVTLGEKGCRYYTKNFHGGVEGFHVKTVDTTGAGDSFV 270 Query 811 GALLGKIVDDHSIIHDEARLKEVLKYACACGAITTTKKGAIPALPQHSDVLSIL 650 GALL KIVDD SI+ DEARLKEVL +ACACGAITTTKKGAIPALP S+ L++L Sbjct 271 GALLTKIVDDQSILEDEARLKEVLTFACACGAITTTKKGAIPALPTESEALTLL 324 >gb|AAQ10000.1| putative fructokinase 2 [Petunia integrifolia subsp. 
inflata] Length=328 Score = 407 bits (1045), Expect = 1e-136 Identities = 198/234 (85%), Positives = 216/234 (92%), Gaps = 0/234 (0%) Frame = -2 Query 1351 DKGARTALAFVTLRADGEREFMFYRNPSADMLLTPDELNLGIIRSAKVFHYGSISLIVEP 1172 D GARTALAFVTLRADGEREFMFYRNPSADMLLTPDELNL +IRSAK+FHYGSISLIVEP Sbjct 91 DTGARTALAFVTLRADGEREFMFYRNPSADMLLTPDELNLDVIRSAKIFHYGSISLIVEP 150 Query 1171 CRSAHLKAMDEAKSAGALLSYDPNLRLPLWPSAKYAKEQIMSIWEKADIIKVSDNELEFL 992 CRSAHLKAM+ AK AGALLSYDPNLRLPLWPSA+ A++QI SIW+KAD+IKVSDNELEFL Sbjct 151 CRSAHLKAMEVAKEAGALLSYDPNLRLPLWPSAEEARKQIKSIWDKADVIKVSDNELEFL 210 Query 991 TGSSKIDDEAAMSLWHPKLQLLLVTLGEKGCRYYNKNFRGSIDGYHVKTVDTTGAGDSFV 812 TGS KIDDE+AMSLWHP L+LLLVTLGEKGCRYY KNF G ++G+HVKTVDTTGAGDSFV Sbjct 211 TGSDKIDDESAMSLWHPNLKLLLVTLGEKGCRYYTKNFHGGVEGFHVKTVDTTGAGDSFV 270 Query 811 GALLGKIVDDHSIIHDEARLKEVLKYACACGAITTTKKGAIPALPQHSDVLSIL 650 GALL KIVDD SI+ DEARLKEVL +ACACGAITTTKKGAIPALP S+ L++L Sbjct 271 GALLTKIVDDQSILEDEARLKEVLTFACACGAITTTKKGAIPALPTESEALTLL 324 >ref|XP_002533363.1| fructokinase, putative [Ricinus communis] gb|EEF29025.1| fructokinase, putative [Ricinus communis] Length=330 Score = 405 bits (1040), Expect = 6e-136 Identities = 197/237 (83%), Positives = 220/237 (93%), Gaps = 0/237 (0%) Frame = -2 Query 1351 DKGARTALAFVTLRADGEREFMFYRNPSADMLLTPDELNLGIIRSAKVFHYGSISLIVEP 1172 DKGARTALAFVTLRADGEREFMFYRNPSADMLLTP+ELNL +IRSAK+FHYGSISLIVEP Sbjct 93 DKGARTALAFVTLRADGEREFMFYRNPSADMLLTPEELNLEVIRSAKIFHYGSISLIVEP 152 Query 1171 CRSAHLKAMDEAKSAGALLSYDPNLRLPLWPSAKYAKEQIMSIWEKADIIKVSDNELEFL 992 CRSAHLKAM+EAK+AGALLSYDPNLRLPLWPSA+YA+EQIMSIW+KADIIKVSD ELEFL Sbjct 153 CRSAHLKAMEEAKNAGALLSYDPNLRLPLWPSAEYAREQIMSIWDKADIIKVSDVELEFL 212 Query 991 TGSSKIDDEAAMSLWHPKLQLLLVTLGEKGCRYYNKNFRGSIDGYHVKTVDTTGAGDSFV 812 TGS KIDDE+A+SLWHP L+LLLVTLGE GCRYY KNF GS+D +HVKTVDTTGAGDSFV Sbjct 213 TGSDKIDDESALSLWHPNLKLLLVTLGENGCRYYTKNFHGSVDAFHVKTVDTTGAGDSFV 272 Query 811 GALLGKIVDDHSIIHDEARLKEVLKYACACGAITTTKKGAIPALPQHSDVLSILNGA 641 GALL KIVDD S++ +E RL+EVL++A ACGAITTTKKGAIPALP +DVLS++ + Sbjct 
273 GALLCKIVDDLSVLEEEPRLREVLRFANACGAITTTKKGAIPALPTEADVLSLMKAS 329 >gb|AAA80675.1| fructokinase [Beta vulgaris] Length=331 Score = 398 bits (1022), Expect = 3e-133 Identities = 194/234 (83%), Positives = 215/234 (92%), Gaps = 0/234 (0%) Frame = -2 Query 1351 DKGARTALAFVTLRADGEREFMFYRNPSADMLLTPDELNLGIIRSAKVFHYGSISLIVEP 1172 DKGARTALAFVTL++DGEREFMFYRNPSADMLLTPDELNL +IRSAKVFHYGSI LIVEP Sbjct 93 DKGARTALAFVTLKSDGEREFMFYRNPSADMLLTPDELNLDLIRSAKVFHYGSIRLIVEP 152 Query 1171 CRSAHLKAMDEAKSAGALLSYDPNLRLPLWPSAKYAKEQIMSIWEKADIIKVSDNELEFL 992 CRSAHLKAM+EAK AGALLSYDPNLRLPLWPSA+ A+EQIMSIW+KA++IKVSDNELEFL Sbjct 153 CRSAHLKAMEEAKKAGALLSYDPNLRLPLWPSAEEAREQIMSIWDKAEVIKVSDNELEFL 212 Query 991 TGSSKIDDEAAMSLWHPKLQLLLVTLGEKGCRYYNKNFRGSIDGYHVKTVDTTGAGDSFV 812 TG+S IDD AMSLWHP L+LLLVTLG++GCRYY KNF+GS+DG+ V VDTTGAGDSFV Sbjct 213 TGNSTIDDATAMSLWHPNLKLLLVTLGDQGCRYYTKNFKGSLDGFKVNAVDTTGAGDSFV 272 Query 811 GALLGKIVDDHSIIHDEARLKEVLKYACACGAITTTKKGAIPALPQHSDVLSIL 650 GALL KIVDDHSII DE+RLKEVLK+A ACGAITTTKKGAIPALP +D L ++ Sbjct 273 GALLNKIVDDHSIIEDESRLKEVLKFANACGAITTTKKGAIPALPTVADALELI 326 >emb|CAD31714.1| fructokinase-like protein [Cicer arietinum] Length=238 Score = 393 bits (1010), Expect = 9e-133 Identities = 193/236 (82%), Positives = 213/236 (90%), Gaps = 0/236 (0%) Frame = -2 Query 1351 DKGARTALAFVTLRADGEREFMFYRNPSADMLLTPDELNLGIIRSAKVFHYGSISLIVEP 1172 DKGARTALAFVTLRADGEREFMFYRNPSADMLLTP++LNL +IRSAKVFHYGSISLIVEP Sbjct 2 DKGARTALAFVTLRADGEREFMFYRNPSADMLLTPEDLNLELIRSAKVFHYGSISLIVEP 61 Query 1171 CRSAHLKAMDEAKSAGALLSYDPNLRLPLWPSAKYAKEQIMSIWEKADIIKVSDNELEFL 992 CRSAHLKAM+ AK AG LLSYDPNLRLPLWPS + A+ QI+SIW+KAD+IKVSD ELEFL Sbjct 62 CRSAHLKAMEVAKDAGCLLSYDPNLRLPLWPSPEEARNQILSIWDKADLIKVSDVELEFL 121 Query 991 TGSSKIDDEAAMSLWHPKLQLLLVTLGEKGCRYYNKNFRGSIDGYHVKTVDTTGAGDSFV 812 TGS KIDD +A+SLWHP L+LLLVTLGE G RYY KNF GS+D +HV TVDTTGAGDSFV Sbjct 122 TGSDKIDDASALSLWHPNLKLLLVTLGENGSRYYTKNFHGSVDAFHVNTVDTTGAGDSFV 181 Query 811 GALLGKIVDDHSIIHDEARLKEVLKYACACGAITTTKKGAIPALPQHSDVLSILNG 644 GALLGKIVDD SI+ DEARL+EVLK+A 
ACGAITTTKKGAIPALP +DVLS++ G Sbjct 182 GALLGKIVDDQSILEDEARLREVLKFANACGAITTTKKGAIPALPTEADVLSLIKG 237 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 766044481 Number of extensions: 17406778 Number of successful extensions: 41200 Number of sequences better than 1e-10: 135 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 40779 Number of HSP's successfully gapped: 144 Length of query: 1352 Length of database: 6150218869 Length adjustment: 144 Effective length of query: 1208 Effective length of database: 3569870773 Effective search space: 1092380456538 Effective search space used: 1092380456538 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 179 (73.6 bits)A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. RID: UPEBB8R7016 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_8780 Length=525 Score E Sequences producing significant alignments: (Bits) Value emb|CAJ34815.1| amino acid permease [Plantago major] 183 4e-56 ref|XP_002327101.1| lysine/histidine transporter [Populus tri... 181 3e-51 ref|XP_002510286.1| amino acid transporter, putative [Ricinus... 181 4e-51 ref|XP_002284114.1| PREDICTED: lysine histidine transporter-l... 180 8e-51 ref|XP_003545851.1| PREDICTED: lysine histidine transporter-l... 
178 4e-50 ALIGNMENTS >emb|CAJ34815.1| amino acid permease [Plantago major] Length=136 Score = 183 bits (465), Expect = 4e-56 Identities = 100/110 (91%), Positives = 106/110 (96%), Gaps = 0/110 (0%) Frame = +1 Query 1 ASYTSRTNRPCSIWVRSGFRVFYGFVSLLIGVafpflsslagllggltlpVTFAYPCFMW 180 A YTSRTNRPCSIWVRSGFRVFYGF+SLLIGVA PFLSSLAGLLGGLTLPVTFAYPCFMW Sbjct 26 AGYTSRTNRPCSIWVRSGFRVFYGFISLLIGVALPFLSSLAGLLGGLTLPVTFAYPCFMW 85 Query 181 VLIKRPIKYSFNWYFNWVLGWLGVAFSFAFSMGGIWSMVNSGLKLKFFKP 330 VLIK+P KY+FNWYFNW+LGWLG+AFS AFS+GGIWSMVNSGLKLKFFKP Sbjct 86 VLIKKPTKYTFNWYFNWILGWLGIAFSLAFSIGGIWSMVNSGLKLKFFKP 135 >ref|XP_002327101.1| lysine/histidine transporter [Populus trichocarpa] gb|EEE73851.1| lysine/histidine transporter [Populus trichocarpa] Length=521 Score = 181 bits (459), Expect = 3e-51 Identities = 98/110 (89%), Positives = 105/110 (95%), Gaps = 0/110 (0%) Frame = +1 Query 1 ASYTSRTNRPCSIWVRSGFRVFYGFVSLLIGVafpflsslagllggltlpVTFAYPCFMW 180 ASYT+RTNRPCSIWVRSGFRVFYGF+S IGVA PFLSSLAGLLGGLTLPVTFAYPCFMW Sbjct 410 ASYTTRTNRPCSIWVRSGFRVFYGFISFFIGVALPFLSSLAGLLGGLTLPVTFAYPCFMW 469 Query 181 VLIKRPIKYSFNWYFNWVLGWLGVAFSFAFSMGGIWSMVNSGLKLKFFKP 330 VLIK+P KYSFNWYFNW+LGWLG+AFS AFS+GG+WSMVNSGLKLKFFKP Sbjct 470 VLIKKPSKYSFNWYFNWILGWLGIAFSLAFSIGGVWSMVNSGLKLKFFKP 519 >ref|XP_002510286.1| amino acid transporter, putative [Ricinus communis] gb|EEF52473.1| amino acid transporter, putative [Ricinus communis] Length=521 Score = 181 bits (458), Expect = 4e-51 Identities = 98/110 (89%), Positives = 104/110 (95%), Gaps = 0/110 (0%) Frame = +1 Query 1 ASYTSRTNRPCSIWVRSGFRVFYGFVSLLIGVafpflsslagllggltlpVTFAYPCFMW 180 A YTSRTNRPCSIWVRSGFRVFYGF+S IGVA PFLSSLAGLLGGLTLPVTFAYPCFMW Sbjct 410 AGYTSRTNRPCSIWVRSGFRVFYGFISFFIGVALPFLSSLAGLLGGLTLPVTFAYPCFMW 469 Query 181 VLIKRPIKYSFNWYFNWVLGWLGVAFSFAFSMGGIWSMVNSGLKLKFFKP 330 VLIKRP KYSFNWYFNW+LGWLG+AFS AFS+GG+WSMVNSGL+LKFFKP Sbjct 470 VLIKRPSKYSFNWYFNWILGWLGIAFSLAFSIGGVWSMVNSGLRLKFFKP 519 >ref|XP_002284114.1| PREDICTED: lysine histidine transporter-like 8 [Vitis 
vinifera] emb|CBI19587.3| unnamed protein product [Vitis vinifera] Length=514 Score = 180 bits (456), Expect = 8e-51 Identities = 97/111 (87%), Positives = 105/111 (95%), Gaps = 0/111 (0%) Frame = +1 Query 1 ASYTSRTNRPCSIWVRSGFRVFYGFVSLLIGVafpflsslagllggltlpVTFAYPCFMW 180 A YTSRTNRPCSIWVRSGFRVFYGF+S IGVA PFLSSLAGLLGGLTLPVTFAYPCFMW Sbjct 404 AGYTSRTNRPCSIWVRSGFRVFYGFISFFIGVALPFLSSLAGLLGGLTLPVTFAYPCFMW 463 Query 181 VLIKRPIKYSFNWYFNWVLGWLGVAFSFAFSMGGIWSMVNSGLKLKFFKPA 333 VLIK+P K+SFNWYFNW+LGWLG+AFS AFS+GG+WSMVNSGLKLKFFKP+ Sbjct 464 VLIKKPTKFSFNWYFNWILGWLGIAFSLAFSIGGVWSMVNSGLKLKFFKPS 514 >ref|XP_003545851.1| PREDICTED: lysine histidine transporter-like 8-like [Glycine max] Length=516 Score = 178 bits (451), Expect = 4e-50 Identities = 99/110 (90%), Positives = 103/110 (94%), Gaps = 0/110 (0%) Frame = +1 Query 1 ASYTSRTNRPCSIWVRSGFRVFYGFVSLLIGVafpflsslagllggltlpVTFAYPCFMW 180 A YTSRTNRPCSIWVRSGFRVFYGFVS IGVA PFLSSLAGLLGGLTLPVTFAYPCFMW Sbjct 406 AGYTSRTNRPCSIWVRSGFRVFYGFVSFFIGVALPFLSSLAGLLGGLTLPVTFAYPCFMW 465 Query 181 VLIKRPIKYSFNWYFNWVLGWLGVAFSFAFSMGGIWSMVNSGLKLKFFKP 330 VLIK+P KYSFNWYFNW+LGWLGVAFS AFS+GGIWS+VN GLKLKFFKP Sbjct 466 VLIKQPPKYSFNWYFNWILGWLGVAFSLAFSIGGIWSIVNDGLKLKFFKP 515 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 262786732 Number of extensions: 5527508 Number of successful extensions: 15061 Number of sequences better than 1e-10: 1 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 15038 Number of HSP's successfully gapped: 1 Length of query: 525 Length of database: 6150218869 Length adjustment: 131 Effective length of query: 394 Effective length of 
database: 3802818865 Effective search space: 167324030060 Effective search space used: 167324030060 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 172 (70.9 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. RID: UPEBE7W201N Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_8801 Length=524 Score E Sequences producing significant alignments: (Bits) Value ref|XP_003634857.1| PREDICTED: uncharacterized protein LOC100... 103 5e-25 ref|XP_002265064.1| PREDICTED: uncharacterized protein LOC100... 102 2e-24 ref|XP_002316639.1| predicted protein [Populus trichocarpa] >... 95.5 8e-22 ref|NP_001237779.1| uncharacterized protein LOC100527043 [Gly... 92.0 2e-20 ref|XP_003542773.1| PREDICTED: uncharacterized protein LOC100... 
90.5 6e-20 ALIGNMENTS >ref|XP_003634857.1| PREDICTED: uncharacterized protein LOC100854976 [Vitis vinifera] Length=155 Score = 103 bits (258), Expect = 5e-25 Identities = 53/98 (54%), Positives = 71/98 (72%), Gaps = 6/98 (6%) Frame = -1 Query 524 AIFEKLLRTTSTDISPGGGRRGRPDIEAQGHNVVRFDSKLGYPSPKISVNAREVSVVMPG 345 AIFE+ LR T +SP GR R D+E+Q + F+ KLG+PSPK++V AR VSV+MPG Sbjct 52 AIFERFLRPTPPSLSPATGR-DRGDVESQ----MGFNGKLGHPSPKMTVYARGVSVLMPG 106 Query 344 NKLPTFIANPAPVPCPPDRMPWPSHQQQTNLPDPHSSS 231 +PTFIA+PAPVPCPPDR+PWP ++ + +P S+S Sbjct 107 EDIPTFIAHPAPVPCPPDRIPWPL-KEHNSTANPSSNS 143 >ref|XP_002265064.1| PREDICTED: uncharacterized protein LOC100241762 [Vitis vinifera] Length=155 Score = 102 bits (254), Expect = 2e-24 Identities = 50/83 (60%), Positives = 63/83 (76%), Gaps = 5/83 (6%) Frame = -1 Query 524 AIFEKLLRTTSTDISPGGGRRGRPDIEAQGHNVVRFDSKLGYPSPKISVNAREVSVVMPG 345 AIFE+ LR T +SP GR R D+E+Q + F+ KLG+PSPK++V AR VSV+MPG Sbjct 52 AIFERFLRPTPPSLSPATGR-DRGDVESQ----MGFNGKLGHPSPKMTVYARGVSVLMPG 106 Query 344 NKLPTFIANPAPVPCPPDRMPWP 276 +PTFIA+PAPVPCPPDR+PWP Sbjct 107 EDIPTFIAHPAPVPCPPDRIPWP 129 >ref|XP_002316639.1| predicted protein [Populus trichocarpa] gb|EEE97251.1| predicted protein [Populus trichocarpa] Length=157 Score = 95.5 bits (236), Expect = 8e-22 Identities = 52/95 (55%), Positives = 66/95 (69%), Gaps = 12/95 (13%) Frame = -1 Query 524 AIFEKLLRTTSTDISPGGGRRGRPDIEAQGHNVVRFDSKLGYPSPKISVNAREVSVVMPG 345 AIFE+ L TS GR G D+E+Q RF+SKLG+PSPK++V A VSV+MPG Sbjct 56 AIFERFLGPTS-------GRGGHGDLESQ----TRFNSKLGHPSPKMTVYANGVSVLMPG 104 Query 344 NKLPTFIANPAPVPCPPDRMPWPSHQQQTN-LPDP 243 + +PTFIA PAPVPCPP+R +P +QQ N LP+P Sbjct 105 DNIPTFIALPAPVPCPPERPSYPHNQQHINQLPNP 139 >ref|NP_001237779.1| uncharacterized protein LOC100527043 [Glycine max] gb|ACU16082.1| unknown [Glycine max] Length=155 Score = 92.0 bits (227), Expect = 2e-20 Identities = 45/89 (51%), Positives = 58/89 (65%), Gaps = 6/89 (7%) Frame = -1 Query 524 AIFEKLLRTTSTDISPGGGRRGRPDIEAQGHNVVRFDSKLGYPSPKISVNAREVSVVMPG 345 AIFE+ L+ 
TS I P GGR R + F+ KLG+PSPK+S+ A VSV+MPG Sbjct 58 AIFERFLKPTSPPILPSGGRNRRRSSQMD------FNGKLGHPSPKMSLYASWVSVLMPG 111 Query 344 NKLPTFIANPAPVPCPPDRMPWPSHQQQT 258 + P+FIA+P P PC P+R+ WPSHQ T Sbjct 112 DATPSFIAHPVPAPCCPERISWPSHQHST 140 >ref|XP_003542773.1| PREDICTED: uncharacterized protein LOC100794476 [Glycine max] Length=155 Score = 90.5 bits (223), Expect = 6e-20 Identities = 50/100 (50%), Positives = 65/100 (65%), Gaps = 8/100 (8%) Frame = -1 Query 524 AIFEKLLRTTSTDISPGGG--RRGRPDIEAQGHNVVRFDSKLGYPSPKISVNAREVSVVM 351 AIFE+ LR TS +SP RR D+EAQ + F KL + SPK+SV A VSV+M Sbjct 53 AIFERYLRPTSPPLSPSATTRRRSPSDVEAQ----IGFSGKLAHASPKMSVYASGVSVLM 108 Query 350 PGNKLPTFIANPAPVPCPPDRMPWPSHQQQTNLPDPHSSS 231 PG+++PTFIA+PA PC P+R+ WPSHQ LP S++ Sbjct 109 PGDEIPTFIAHPA--PCYPERISWPSHQHNNTLPCSSSNT 146 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 303724249 Number of extensions: 7275648 Number of successful extensions: 22166 Number of sequences better than 1e-10: 0 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 21999 Number of HSP's successfully gapped: 0 Length of query: 524 Length of database: 6150218869 Length adjustment: 131 Effective length of query: 393 Effective length of database: 3802818865 Effective search space: 163521211195 Effective search space used: 163521211195 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 171 (70.5 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v ungapped: 4.96466 ka-blk-sigma gapped: 43.6362A. 
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. RID: UPECAW31016 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 17,919,084 sequences; 6,150,218,869 total letters Query= TrVeIntMedtrGB1_9545 Length=504 Score E Sequences producing significant alignments: (Bits) Value ref|XP_003632587.1| PREDICTED: regulatory-associated protein ... 243 3e-71 ref|XP_003632588.1| PREDICTED: regulatory-associated protein ... 243 3e-71 ref|XP_003533671.1| PREDICTED: regulatory-associated protein ... 240 3e-70 ref|XP_003623550.1| Regulatory-associated protein of mTOR [Me... 240 3e-70 ref|XP_003551595.1| PREDICTED: regulatory-associated protein ... 238 1e-69 ALIGNMENTS >ref|XP_003632587.1| PREDICTED: regulatory-associated protein of TOR 1-like isoform 1 [Vitis vinifera] emb|CBI18073.3| unnamed protein product [Vitis vinifera] Length=1363 Score = 243 bits (619), Expect = 3e-71 Identities = 118/150 (79%), Positives = 133/150 (89%), Gaps = 0/150 (0%) Frame = +2 Query 2 FSDGYVRLYDIRTPEMLVSETKPHPQPVERVVGIGFQPGLEPAKVVSASQAGNIQFLDMR 181 F DG V+L+D+RTPEMLV +PH Q VERVVGIGFQPGL+PAK+VSASQAG+IQFLD+R Sbjct 1208 FVDGSVKLFDVRTPEMLVCAARPHTQRVERVVGIGFQPGLDPAKIVSASQAGDIQFLDVR 1267 Query 182 FAKEKYLTIHAHRGSLTALAVHRHAPIIASGSAKQVIKIFNLEGDSLGTIRYYPTFMAQK 361 YLTI AHRGSLTALA+HRHAP+IASGSAKQ+IK+FNLEG LGTIR+YPTFMAQK Sbjct 1268 NGNCAYLTIDAHRGSLTALAIHRHAPLIASGSAKQIIKVFNLEGSQLGTIRFYPTFMAQK 1327 Query 362 IGSVSCLTFHPYQVMLGAGAADACVSIYAD 451 IGSV+CLTFHPYQV+L AGAADA VSIYAD Sbjct 1328 IGSVNCLTFHPYQVLLAAGAADALVSIYAD 1357 >ref|XP_003632588.1| PREDICTED: regulatory-associated protein of TOR 1-like isoform 2 [Vitis vinifera] Length=1370 Score = 243 bits (619), Expect = 3e-71 Identities = 118/150 (79%), Positives = 133/150 (89%), Gaps = 0/150 (0%) Frame = +2 Query 2 
FSDGYVRLYDIRTPEMLVSETKPHPQPVERVVGIGFQPGLEPAKVVSASQAGNIQFLDMR 181 F DG V+L+D+RTPEMLV +PH Q VERVVGIGFQPGL+PAK+VSASQAG+IQFLD+R Sbjct 1215 FVDGSVKLFDVRTPEMLVCAARPHTQRVERVVGIGFQPGLDPAKIVSASQAGDIQFLDVR 1274 Query 182 FAKEKYLTIHAHRGSLTALAVHRHAPIIASGSAKQVIKIFNLEGDSLGTIRYYPTFMAQK 361 YLTI AHRGSLTALA+HRHAP+IASGSAKQ+IK+FNLEG LGTIR+YPTFMAQK Sbjct 1275 NGNCAYLTIDAHRGSLTALAIHRHAPLIASGSAKQIIKVFNLEGSQLGTIRFYPTFMAQK 1334 Query 362 IGSVSCLTFHPYQVMLGAGAADACVSIYAD 451 IGSV+CLTFHPYQV+L AGAADA VSIYAD Sbjct 1335 IGSVNCLTFHPYQVLLAAGAADALVSIYAD 1364 >ref|XP_003533671.1| PREDICTED: regulatory-associated protein of TOR 1-like [Glycine max] Length=1373 Score = 240 bits (612), Expect = 3e-70 Identities = 118/150 (79%), Positives = 130/150 (87%), Gaps = 0/150 (0%) Frame = +2 Query 2 FSDGYVRLYDIRTPEMLVSETKPHPQPVERVVGIGFQPGLEPAKVVSASQAGNIQFLDMR 181 F DG VRLYD+RTP+MLV +PH Q VE+VVGIGFQPGL+ K+VSASQAG+IQFLD+R Sbjct 1218 FIDGSVRLYDVRTPDMLVCGLRPHTQRVEKVVGIGFQPGLDQGKIVSASQAGDIQFLDIR 1277 Query 182 FAKEKYLTIHAHRGSLTALAVHRHAPIIASGSAKQVIKIFNLEGDSLGTIRYYPTFMAQK 361 YLTI AHRGSLTALAVHRHAPIIASGSAKQ+IK+F+LEGD LGTIRYYPT MAQK Sbjct 1278 NHSSAYLTIEAHRGSLTALAVHRHAPIIASGSAKQLIKVFSLEGDQLGTIRYYPTLMAQK 1337 Query 362 IGSVSCLTFHPYQVMLGAGAADACVSIYAD 451 IGSVSCL FHPYQV+L AGAADACV IYAD Sbjct 1338 IGSVSCLNFHPYQVLLAAGAADACVCIYAD 1367 >ref|XP_003623550.1| Regulatory-associated protein of mTOR [Medicago truncatula] gb|AES79768.1| Regulatory-associated protein of mTOR [Medicago truncatula] Length=1430 Score = 240 bits (612), Expect = 3e-70 Identities = 118/150 (79%), Positives = 131/150 (87%), Gaps = 0/150 (0%) Frame = +2 Query 2 FSDGYVRLYDIRTPEMLVSETKPHPQPVERVVGIGFQPGLEPAKVVSASQAGNIQFLDMR 181 F DG VRLYD RTPEMLV +PH Q VE+V+GIGFQPGL+P K+VSASQAG+IQFLD+R Sbjct 1275 FVDGSVRLYDARTPEMLVCGLRPHTQRVEKVMGIGFQPGLDPGKLVSASQAGDIQFLDIR 1334 Query 182 FAKEKYLTIHAHRGSLTALAVHRHAPIIASGSAKQVIKIFNLEGDSLGTIRYYPTFMAQK 361 YLTI AHRGSLTALAVHRHAPIIASGSAKQ+IK+F+LEGD LGTIRYYPT MAQK Sbjct 1335 NHSSAYLTIEAHRGSLTALAVHRHAPIIASGSAKQLIKVFSLEGDQLGTIRYYPTLMAQK 
1394 Query 362 IGSVSCLTFHPYQVMLGAGAADACVSIYAD 451 IGSVSCL+FHPYQ++L AGAADACV IYAD Sbjct 1395 IGSVSCLSFHPYQLLLAAGAADACVCIYAD 1424 >ref|XP_003551595.1| PREDICTED: regulatory-associated protein of TOR 1-like [Glycine max] Length=1365 Score = 238 bits (607), Expect = 1e-69 Identities = 117/150 (78%), Positives = 129/150 (86%), Gaps = 0/150 (0%) Frame = +2 Query 2 FSDGYVRLYDIRTPEMLVSETKPHPQPVERVVGIGFQPGLEPAKVVSASQAGNIQFLDMR 181 F DG VRLYD+RTP+MLV +PH Q VE+VVGIGFQPGL+ K+VSASQAG+IQFLD+R Sbjct 1210 FVDGSVRLYDVRTPDMLVCGLRPHTQRVEKVVGIGFQPGLDQGKIVSASQAGDIQFLDIR 1269 Query 182 FAKEKYLTIHAHRGSLTALAVHRHAPIIASGSAKQVIKIFNLEGDSLGTIRYYPTFMAQK 361 YLTI AHRGSLTALAVHRHAPIIASGSAKQ IK+F+LEGD LGTI+YYPT MAQK Sbjct 1270 NHSSAYLTIEAHRGSLTALAVHRHAPIIASGSAKQFIKVFSLEGDQLGTIKYYPTLMAQK 1329 Query 362 IGSVSCLTFHPYQVMLGAGAADACVSIYAD 451 IGSVSCL FHPYQV+L AGAADACV IYAD Sbjct 1330 IGSVSCLNFHPYQVLLAAGAADACVCIYAD 1359 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Apr 23, 2012 4:44 PM Number of letters in database: 6,150,218,869 Number of sequences in database: 17,919,084 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 17919084 Number of Hits to DB: 274557214 Number of extensions: 5904993 Number of successful extensions: 14931 Number of sequences better than 1e-10: 8 Number of HSP's better than 1e-10 without gapping: 0 Number of HSP's gapped: 14866 Number of HSP's successfully gapped: 8 Length of query: 504 Length of database: 6150218869 Length adjustment: 128 Effective length of query: 376 Effective length of database: 3856576117 Effective search space: 154263044680 Effective search space used: 154263044680 T: 12 A: 40 X1: 16 (7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 171 (70.5 bits) ka-blk-alpha gapped: 1.9 ka-blk-alpha ungapped: 0.7916 ka-blk-alpha_v gapped: 42.6028 ka-blk-alpha_v 
ungapped: 4.96466
ka-blk-sigma gapped: 43.6362

RID: UPECJ6WG01S

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
           17,919,084 sequences; 6,150,218,869 total letters

Query= TrVeIntMedtrGB1_9620
Length=502

                                                                   Score     E
Sequences producing significant alignments:                       (Bits)   Value

ref|XP_003544765.1| PREDICTED: ubiquitin-fold modifier 1-like...    170    1e-51
ref|XP_003523461.1| PREDICTED: ubiquitin-fold modifier 1-like...    168    5e-51
ref|XP_002285835.1| PREDICTED: ubiquitin-fold modifier 1 [Vit...    168    6e-51
ref|XP_002519920.1| Ubiquitin-fold modifier 1 precursor, puta...    167    2e-50
ref|XP_003614786.1| Ubiquitin-fold modifier [Medicago truncat...    167    2e-50

ALIGNMENTS

>ref|XP_003544765.1| PREDICTED: ubiquitin-fold modifier 1-like [Glycine max]
Length=94

 Score = 170 bits (430), Expect = 1e-51
 Identities = 84/85 (99%), Positives = 85/85 (100%), Gaps = 0/85 (0%)
 Frame = +2

Query  2    GGKVSFKVTLTSDPKLPFKVFSVPEAAPFTAVLKFAAEEFKVPPQTSAIITNDGVGINPQ  181
            GGKVSFKVTLTSDPKLPFKVFSVPEAAPFTAVLKFAAEEFKVPPQTSAIITNDGVGINPQ
Sbjct  4    GGKVSFKVTLTSDPKLPFKVFSVPEAAPFTAVLKFAAEEFKVPPQTSAIITNDGVGINPQ  63

Query  182  QSAGNVFLKHGSELRLIPRDRVGSY  256
            QSAGNVFLKHGSELRLIPRDRVG+Y
Sbjct  64   QSAGNVFLKHGSELRLIPRDRVGAY  88

>ref|XP_003523461.1| PREDICTED: ubiquitin-fold modifier 1-like [Glycine max]
 ref|XP_003526612.1| PREDICTED: ubiquitin-fold modifier 1 [Glycine max]
 gb|ACU14017.1| unknown [Glycine max]
Length=88

 Score = 168 bits (426), Expect = 5e-51
 Identities = 83/85 (98%), Positives = 85/85 (100%), Gaps = 0/85 (0%)
 Frame = +2

Query  2    GGKVSFKVTLTSDPKLPFKVFSVPEAAPFTAVLKFAAEEFKVPPQTSAIITNDGVGINPQ  181
            GGKVSFKVTLTSDPKLPFKVFSVPEAAPFTAVLKFAAEEFKVPPQTSAIITNDGVGINPQ
Sbjct  4    GGKVSFKVTLTSDPKLPFKVFSVPEAAPFTAVLKFAAEEFKVPPQTSAIITNDGVGINPQ  63

Query  182  QSAGNVFLKHGSELRLIPRDRVGSY  256
            QSAGNVFLKHGSELRLIPRDRVG++
Sbjct  64   QSAGNVFLKHGSELRLIPRDRVGAF  88

>ref|XP_002285835.1| PREDICTED: ubiquitin-fold modifier 1 [Vitis vinifera]
 emb|CBI19064.3| unnamed protein product [Vitis vinifera]
Length=97

 Score = 168 bits (426), Expect = 6e-51
 Identities = 83/85 (98%), Positives = 85/85 (100%), Gaps = 0/85 (0%)
 Frame = +2

Query  2    GGKVSFKVTLTSDPKLPFKVFSVPEAAPFTAVLKFAAEEFKVPPQTSAIITNDGVGINPQ  181
            GGKVSFKVTLTSDPKLPFKVFSVPEAAPFTAVLKFAAEEFKVPPQTSAIITNDGVGINPQ
Sbjct  3    GGKVSFKVTLTSDPKLPFKVFSVPEAAPFTAVLKFAAEEFKVPPQTSAIITNDGVGINPQ  62

Query  182  QSAGNVFLKHGSELRLIPRDRVGSY  256
            QSAGNVFLKHGSELRLIPRDRVG++
Sbjct  63   QSAGNVFLKHGSELRLIPRDRVGAF  87

>ref|XP_002519920.1| Ubiquitin-fold modifier 1 precursor, putative [Ricinus communis]
 gb|EEF42524.1| Ubiquitin-fold modifier 1 precursor, putative [Ricinus communis]
Length=92

 Score = 167 bits (423), Expect = 2e-50
 Identities = 83/84 (99%), Positives = 84/84 (100%), Gaps = 0/84 (0%)
 Frame = +2

Query  2    GGKVSFKVTLTSDPKLPFKVFSVPEAAPFTAVLKFAAEEFKVPPQTSAIITNDGVGINPQ  181
            GGKVSFKVTLTSDPKLPFKVFSVPEAAPFTAVLKFAAEEFKVPPQTSAIITNDGVGINPQ
Sbjct  7    GGKVSFKVTLTSDPKLPFKVFSVPEAAPFTAVLKFAAEEFKVPPQTSAIITNDGVGINPQ  66

Query  182  QSAGNVFLKHGSELRLIPRDRVGS  253
            QSAGNVFLKHGSELRLIPRDRVG+
Sbjct  67   QSAGNVFLKHGSELRLIPRDRVGA  90

>ref|XP_003614786.1| Ubiquitin-fold modifier [Medicago truncatula]
 gb|AES97744.1| Ubiquitin-fold modifier [Medicago truncatula]
Length=95

 Score = 167 bits (423), Expect = 2e-50
 Identities = 83/84 (99%), Positives = 84/84 (100%), Gaps = 0/84 (0%)
 Frame = +2

Query  2    GGKVSFKVTLTSDPKLPFKVFSVPEAAPFTAVLKFAAEEFKVPPQTSAIITNDGVGINPQ  181
            GGKVSFKVTLTSDPKLPFKVFSVPEAAPFTAVLKFAAEEFKVPPQTSAIITNDGVGINPQ
Sbjct  5    GGKVSFKVTLTSDPKLPFKVFSVPEAAPFTAVLKFAAEEFKVPPQTSAIITNDGVGINPQ  64

Query  182  QSAGNVFLKHGSELRLIPRDRVGS  253
            QSAGNVFLKHGSELRLIPRDRVG+
Sbjct  65   QSAGNVFLKHGSELRLIPRDRVGA  88

  Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
    Posted date: Apr 23, 2012 4:44 PM
  Number of letters in database: 6,150,218,869
  Number of sequences in database: 17,919,084

Lambda      K        H
   0.318    0.134    0.401
Gapped
Lambda      K        H
   0.267    0.0410   0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 237055771
Number of extensions: 4771997
Number of successful extensions: 10348
Number of sequences better than 1e-10: 10
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 10345
Number of HSP's successfully gapped: 10
Length of query: 502
Length of database: 6150218869
Length adjustment: 128
Effective length of query: 374
Effective length of database: 3856576117
Effective search space: 150406468563
Effective search space used: 150406468563
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 171 (70.5 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362

RID: UPECMGTD016

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
           17,919,084 sequences; 6,150,218,869 total letters

Query= TrVeIntMedtrGB1_9746
Length=499

                                                                   Score     E
Sequences producing significant alignments:                       (Bits)   Value

ref|XP_002327206.1| predicted protein [Populus trichocarpa] >...    125    1e-33
ref|XP_002326029.1| predicted protein [Populus trichocarpa] >...    125    3e-33
ref|XP_002279275.1| PREDICTED: glucan endo-1,3-beta-glucosida...    124    7e-33
ref|XP_002525141.1| hydrolase, hydrolyzing O-glycosyl compoun...    121    1e-31
ref|XP_002878188.1| predicted protein [Arabidopsis lyrata sub...    115    3e-29

ALIGNMENTS

>ref|XP_002327206.1| predicted protein [Populus trichocarpa]
 gb|EEE74011.1| predicted protein [Populus trichocarpa]
Length=143

 Score = 125 bits (315), Expect = 1e-33
 Identities = 57/81 (70%), Positives = 66/81 (81%), Gaps = 0/81 (0%)
 Frame = +1

Query  1    KNNADDAQLQTAIDWACGLGRVDCGPIQKDGLCYDGWDLQRTASYAFNDYYLRNGPSDDN  180
            KNNA D  LQ AI+WACG G  +CGPIQ+ G CYD  D+QRTAS+AFNDYYL+NG +DD
Sbjct  7    KNNAADQALQEAINWACGQGGANCGPIQQGGACYDSNDIQRTASWAFNDYYLKNGLTDDA  66

Query  181  CNFSNTAALTSIDPSHDKCKF  243
            C FSNTAALTS++PS DKCKF
Sbjct  67   CYFSNTAALTSLNPSFDKCKF  87

>ref|XP_002326029.1| predicted protein [Populus trichocarpa]
 gb|ABK94511.1| unknown [Populus trichocarpa]
 gb|EEF00411.1| predicted protein [Populus trichocarpa]
Length=175

 Score = 125 bits (314), Expect = 3e-33
 Identities = 57/81 (70%), Positives = 66/81 (81%), Gaps = 0/81 (0%)
 Frame = +1

Query  1    KNNADDAQLQTAIDWACGLGRVDCGPIQKDGLCYDGWDLQRTASYAFNDYYLRNGPSDDN  180
            KNNA D  LQ +IDWACG G  +CGPIQ+ G CYD  D+QRTAS+AFNDYYL+NG +DD
Sbjct  40   KNNAADQALQESIDWACGPGGANCGPIQQGGPCYDSSDVQRTASWAFNDYYLKNGLTDDA  99

Query  181  CNFSNTAALTSIDPSHDKCKF  243
            C FSNTAALTS++PS DKCKF
Sbjct  100  CYFSNTAALTSLNPSFDKCKF  120

>ref|XP_002279275.1| PREDICTED: glucan endo-1,3-beta-glucosidase-like protein 1 [Vitis vinifera]
 emb|CBI21947.3| unnamed protein product [Vitis vinifera]
Length=174

 Score = 124 bits (312), Expect = 7e-33
 Identities = 55/81 (68%), Positives = 65/81 (80%), Gaps = 0/81 (0%)
 Frame = +1

Query  1    KNNADDAQLQTAIDWACGLGRVDCGPIQKDGLCYDGWDLQRTASYAFNDYYLRNGPSDDN  180
            KNNAD+  LQTA+DWACG G  DC PIQ+ G CYD  DLQ+TAS+AFNDYYL++G SDD+
Sbjct  40   KNNADNPALQTALDWACGPGGADCSPIQQGGPCYDSQDLQKTASFAFNDYYLKHGLSDDS  99

Query  181  CNFSNTAALTSIDPSHDKCKF  243
            C F NTAALTS++PS   CKF
Sbjct  100  CGFDNTAALTSLNPSFGNCKF  120

>ref|XP_002525141.1| hydrolase, hydrolyzing O-glycosyl compounds, putative [Ricinus communis]
 gb|EEF37268.1| hydrolase, hydrolyzing O-glycosyl compounds, putative [Ricinus communis]
Length=175

 Score = 121 bits (303), Expect = 1e-31
 Identities = 55/81 (68%), Positives = 64/81 (79%), Gaps = 0/81 (0%)
 Frame = +1

Query  1    KNNADDAQLQTAIDWACGLGRVDCGPIQKDGLCYDGWDLQRTASYAFNDYYLRNGPSDDN  180
            KNNADD  LQ+AIDWACG G  +C PIQ+ G CYD  D+Q TAS+AFNDYYL+NG +DD
Sbjct  37   KNNADDQSLQSAIDWACGPGGANCSPIQQGGPCYDPNDIQTTASWAFNDYYLKNGLTDDA  96

Query  181  CNFSNTAALTSIDPSHDKCKF  243
            C FSNTAA TS++PSH  CKF
Sbjct  97   CFFSNTAAPTSLNPSHGNCKF  117

>ref|XP_002878188.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gb|EFH54447.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length=173

 Score = 115 bits (287), Expect = 3e-29
 Identities = 49/81 (60%), Positives = 63/81 (78%), Gaps = 0/81 (0%)
 Frame = +1

Query  1    KNNADDAQLQTAIDWACGLGRVDCGPIQKDGLCYDGWDLQRTASYAFNDYYLRNGPSDDN  180
            KNNA+D+ LQTAI+WACG G  DCGPIQ+ G C D  D+Q+ AS+ FN+YYL+NG  D+
Sbjct  38   KNNAEDSSLQTAIEWACGQGGADCGPIQQGGPCNDPTDVQKMASFVFNNYYLKNGEEDEA  97

Query  181  CNFSNTAALTSIDPSHDKCKF  243
            CNF+N AALTS++PS   CK+
Sbjct  98   CNFNNNAALTSLNPSQGTCKY  118

  Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
    Posted date: Apr 23, 2012 4:44 PM
  Number of letters in database: 6,150,218,869
  Number of sequences in database: 17,919,084

Lambda      K        H
   0.318    0.134    0.401
Gapped
Lambda      K        H
   0.267    0.0410   0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 17919084
Number of Hits to DB: 240022639
Number of extensions: 4743736
Number of successful extensions: 10768
Number of sequences better than 1e-10: 12
Number of HSP's better than 1e-10 without gapping: 0
Number of HSP's gapped: 10745
Number of HSP's successfully gapped: 12
Length of query: 499
Length of database: 6150218869
Length adjustment: 127
Effective length of query: 372
Effective length of database: 3874495201
Effective search space: 151105312839
Effective search space used: 151105312839
T: 12
A: 40
X1: 16 (7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 171 (70.5 bits)
ka-blk-alpha gapped: 1.9
ka-blk-alpha ungapped: 0.7916
ka-blk-alpha_v gapped: 42.6028
ka-blk-alpha_v ungapped: 4.96466
ka-blk-sigma gapped: 43.6362
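
The search-space figures in the statistics footers above can be cross-checked with a little arithmetic. The sketch below is not part of the BLAST output. It assumes the searches were translated (BLASTX-style, as the "Frame" lines suggest), so the query length in nucleotides is divided by 3 before the length adjustment is subtracted. The function name `effective_search_space` and its parameters are hypothetical and used only for illustration. Under these assumptions the numbers reproduce the "Effective length of database" and "Effective search space" values reported for both queries. The reported "Effective length of query" (374 and 372) appears to be expressed in nucleotides and is not reproduced here.

```python
def effective_search_space(query_nt, db_letters, db_seqs, length_adj):
    """Recompute BLAST footer values for a translated (nucleotide) query.

    Assumption (not stated in the report): the query is translated, so its
    length in residues is query_nt // 3.
    """
    query_aa = query_nt // 3                         # translated query length
    eff_query = query_aa - length_adj                # effective query length (residues)
    eff_db = db_letters - db_seqs * length_adj       # effective database length
    return eff_query, eff_db, eff_query * eff_db     # last item: effective search space


# Values copied from the two footers above.
DB_LETTERS = 6150218869
DB_SEQS = 17919084

q9620 = effective_search_space(502, DB_LETTERS, DB_SEQS, 128)
q9746 = effective_search_space(499, DB_LETTERS, DB_SEQS, 127)

print("TrVeIntMedtrGB1_9620:", q9620)
print("TrVeIntMedtrGB1_9746:", q9746)
```

The third element of each tuple should equal the "Effective search space" line of the corresponding footer, and the second element the "Effective length of database" line.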