CSPI04G20580.1 (mRNA) Wild cucumber (PI 183967)

Name: CSPI04G20580.1
Type: mRNA
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Uncharacterised conserved protein UCP015417, vWA
Location: Chr4 : 18827889 .. 18830269 (+)
Sequence length: 1917
The following sequences are available for this feature:

Gene sequence (with intron)

AAATTCCATTCCCTCTTCATTTCTCATTTCCGATTCCCATGGCTCCTCCAAACCTTCTCGGTCCCCCGGAGCTCTACCACGCCGCTGCCCCCGTCTCACTCCAACCAACGGAATCAACCCCCTCTGGAGACCCCTTCGTCGATGCAATGGTCGCCAACTTCAACAAGACCGATGACAGCCTGCCCCCCATGGGCTTCACGGAGAATATGTCCGCGACCTTTCTCTCCACCGGCAATCCTTGCCTTGATTTCTTCTTCCATGTGGTTCCTGATACCCCTGCCAATTCTTTGATCGACAGATTGAGTTTGGCTTGGAATCACAATCCTTTGATGACGCTCAAGCTTATCTGTAATTTGCGAGGTGTTCGTGGTACGGGAAAGTCCGATAAAGAGGGATACTACACGGCCGCGCTCTGGCTCTACAACTTTCATCCCAAAACCCTAGCAGGTAACATTCCTTCTATCGCTGATTTCGGTTATTTCAAGGATCTGCCGGAGATACTCTACCGGCTTCTTGAGGGTTCCGATGTGAGAAAGAATCAGAAGAATGAGTGGAAGAGAAGGGGGCTATCTGTCAGGCATGGAAGGTTCAAGCAAGAGAAGCCGAAGACGAGGAAGAAAGAAATTCAATCTTCAACAGACAGGGAGGCCAATATTTCGAAGGCAATGGAGAAATCGAGAATAGAAAAAGAGAAGGCGAGCGGTGAGAGGAAGTTAAGGAAGGTTTCGATGGCGAGGAAGGTTATGGAACGTTTCCAAGCTGATTCAAATTTCCAACTCTTGCACGATCGAATATCTGACTTCTTCACTGATTGCTTGAAATCTGATCTTCAATTTATGAATTCTGGAGATTTCACGAAAATCAGTCTCGCTGCGAAATGGTGCCCTTCCATCGATTCGTCCTTTGATCGATCGACACTACTCTGTGAGAGCATAGCGAGAAAGATTTTCCCTCGCGAATTGAATCCAGAATACAAAGAGATCGAAGAGGCGCACTATGCGTACAGAGTTCGCGACAGATTGAGGACGGATGTTTTGGTGCCACTCCGGAAGGTTTTGGAGCTGCCGGAGGTTTTCATTGGAGCCAATCGATGGGATTCGATCCCTTACAACAGAGTGGCTTCTGTTGCAATGAAAAACTACAAGGAAAAGTTCATGAAACACGATGGGGAGCGGTTTGCCCAATACTTGAAAGACGTGAAGGACGGTAAGACCAAGATCGCCGCCGGAGCACTGCTTCCTCACGAGATCATATTGTCTTTATTCGACGGACAGGAAGACGGTGGAGAAGTTGCAGAGCTTCAATGGAAGAGAATGGTGGATGACTTGTTGAAGAAAGGGAAGTTGAGAGAATGTATTGCTGTTTGTGATGTGTCCGGAAGTATGATGGGGATTCCCATGGATGTTTGTGTTGGTTTGGGTCTTTTGGTTTCTGAATTAAGCGAAGATCCATGGAAAGGGAAAGTGATCACATTCAGTGCAAACCCAGAGCTTCATATGATCCAAGGGGACAGTCTGAAATCAAAAGCGGAGTTCGTTAAGAGTATGGATTGGGGGGGTAATACTGATTTTCAGAAGGTTTTTGATCAAATTCTCAAAGTGGCTGTAGATGGAAAGTTGAAGGAAGAACAAATGATAAAGAGAGTGTTTGTGTTCAGTGACATGGAGTTCGATCAAGCATCACAGACCTCGTGGGAAACAGATTACCAAGTTATAGTCCGAAAGTTCACAGAAAAAGGGTATGGATCAGCTGTTCCACAGATTGTGTTTTGGAACTTAAGAGATTCGAGGGCGACGCCAGTGCCGTGCAACGAGAAGGGGGTGGCTTTGGTCAGTGGATACTCAAAGAATTTGATGAACTTGTTTTTGGATGGTGATGGTGTCATTCAACCGGAAGCCGTCATGGAGAAAGCTATCTCCGGCAATGAGTACCAGAAGCTTGTTGTTCTTGATTGATCAATCAAGGTACACAGTACAGTACAGTAGTTTTGTTTTTGTTTT
TGTTTTTATTTATTTTCATTTTAGGGAAAGTATATAGAAGGAAAAATAGTGAAGAAGAAGGCTTGGTTATTGCCGTTGTGCTCCACACTCTTGAATTCAAAAAATAATCAATTTTGTTTTTCTACCACAAGTCCAAAACTACTTTCACTGTTGTTGTTTTACTATCAATTCTAGACAACTTCAATGGACATGATGGATGCTGTAAATGTAAGAGATACATTGTGTTTCATTCTGAATTGAAGTGTAGATTGTGACTTCTGACTGGTTCATAATTACAGAACTAAACACATGGTACAATAATATTGTAAATGGCAAAAGGAAAAGGTATCTGAATGTGTTTGTGATTGGTTTGGATGGGTTGAATGAAATTAGAGTCAATTT

mRNA sequence

ATGGCTCCTCCAAACCTTCTCGGTCCCCCGGAGCTCTACCACGCCGCTGCCCCCGTCTCACTCCAACCAACGGAATCAACCCCCTCTGGAGACCCCTTCGTCGATGCAATGGTCGCCAACTTCAACAAGACCGATGACAGCCTGCCCCCCATGGGCTTCACGGAGAATATGTCCGCGACCTTTCTCTCCACCGGCAATCCTTGCCTTGATTTCTTCTTCCATGTGGTTCCTGATACCCCTGCCAATTCTTTGATCGACAGATTGAGTTTGGCTTGGAATCACAATCCTTTGATGACGCTCAAGCTTATCTGTAATTTGCGAGGTGTTCGTGGTACGGGAAAGTCCGATAAAGAGGGATACTACACGGCCGCGCTCTGGCTCTACAACTTTCATCCCAAAACCCTAGCAGGTAACATTCCTTCTATCGCTGATTTCGGTTATTTCAAGGATCTGCCGGAGATACTCTACCGGCTTCTTGAGGGTTCCGATGTGAGAAAGAATCAGAAGAATGAGTGGAAGAGAAGGGGGCTATCTGTCAGGCATGGAAGGTTCAAGCAAGAGAAGCCGAAGACGAGGAAGAAAGAAATTCAATCTTCAACAGACAGGGAGGCCAATATTTCGAAGGCAATGGAGAAATCGAGAATAGAAAAAGAGAAGGCGAGCGGTGAGAGGAAGTTAAGGAAGGTTTCGATGGCGAGGAAGGTTATGGAACGTTTCCAAGCTGATTCAAATTTCCAACTCTTGCACGATCGAATATCTGACTTCTTCACTGATTGCTTGAAATCTGATCTTCAATTTATGAATTCTGGAGATTTCACGAAAATCAGTCTCGCTGCGAAATGGTGCCCTTCCATCGATTCGTCCTTTGATCGATCGACACTACTCTGTGAGAGCATAGCGAGAAAGATTTTCCCTCGCGAATTGAATCCAGAATACAAAGAGATCGAAGAGGCGCACTATGCGTACAGAGTTCGCGACAGATTGAGGACGGATGTTTTGGTGCCACTCCGGAAGGTTTTGGAGCTGCCGGAGGTTTTCATTGGAGCCAATCGATGGGATTCGATCCCTTACAACAGAGTGGCTTCTGTTGCAATGAAAAACTACAAGGAAAAGTTCATGAAACACGATGGGGAGCGGTTTGCCCAATACTTGAAAGACGTGAAGGACGGTAAGACCAAGATCGCCGCCGGAGCACTGCTTCCTCACGAGATCATATTGTCTTTATTCGACGGACAGGAAGACGGTGGAGAAGTTGCAGAGCTTCAATGGAAGAGAATGGTGGATGACTTGTTGAAGAAAGGGAAGTTGAGAGAATGTATTGCTGTTTGTGATGTGTCCGGAAGTATGATGGGGATTCCCATGGATGTTTGTGTTGGTTTGGGTCTTTTGGTTTCTGAATTAAGCGAAGATCCATGGAAAGGGAAAGTGATCACATTCAGTGCAAACCCAGAGCTTCATATGATCCAAGGGGACAGTCTGAAATCAAAAGCGGAGTTCGTTAAGAGTATGGATTGGGGGGGTAATACTGATTTTCAGAAGGTTTTTGATCAAATTCTCAAAGTGGCTGTAGATGGAAAGTTGAAGGAAGAACAAATGATAAAGAGAGTGTTTGTGTTCAGTGACATGGAGTTCGATCAAGCATCACAGACCTCGTGGGAAACAGATTACCAAGTTATAGTCCGAAAGTTCACAGAAAAAGGGTATGGATCAGCTGTTCCACAGATTGTGTTTTGGAACTTAAGAGATTCGAGGGCGACGCCAGTGCCGTGCAACGAGAAGGGGGTGGCTTTGGTCAGTGGATACTCAAAGAATTTGATGAACTTGTTTTTGGATGGTGATGGTGTCATTCAACCGGAAGCCGTCATGGAGAAAGCTATCTCCGGCAATGAGTACCAGAAGCTTGTTGTTCTTGATTGA

Coding sequence (CDS)

ATGGCTCCTCCAAACCTTCTCGGTCCCCCGGAGCTCTACCACGCCGCTGCCCCCGTCTCACTCCAACCAACGGAATCAACCCCCTCTGGAGACCCCTTCGTCGATGCAATGGTCGCCAACTTCAACAAGACCGATGACAGCCTGCCCCCCATGGGCTTCACGGAGAATATGTCCGCGACCTTTCTCTCCACCGGCAATCCTTGCCTTGATTTCTTCTTCCATGTGGTTCCTGATACCCCTGCCAATTCTTTGATCGACAGATTGAGTTTGGCTTGGAATCACAATCCTTTGATGACGCTCAAGCTTATCTGTAATTTGCGAGGTGTTCGTGGTACGGGAAAGTCCGATAAAGAGGGATACTACACGGCCGCGCTCTGGCTCTACAACTTTCATCCCAAAACCCTAGCAGGTAACATTCCTTCTATCGCTGATTTCGGTTATTTCAAGGATCTGCCGGAGATACTCTACCGGCTTCTTGAGGGTTCCGATGTGAGAAAGAATCAGAAGAATGAGTGGAAGAGAAGGGGGCTATCTGTCAGGCATGGAAGGTTCAAGCAAGAGAAGCCGAAGACGAGGAAGAAAGAAATTCAATCTTCAACAGACAGGGAGGCCAATATTTCGAAGGCAATGGAGAAATCGAGAATAGAAAAAGAGAAGGCGAGCGGTGAGAGGAAGTTAAGGAAGGTTTCGATGGCGAGGAAGGTTATGGAACGTTTCCAAGCTGATTCAAATTTCCAACTCTTGCACGATCGAATATCTGACTTCTTCACTGATTGCTTGAAATCTGATCTTCAATTTATGAATTCTGGAGATTTCACGAAAATCAGTCTCGCTGCGAAATGGTGCCCTTCCATCGATTCGTCCTTTGATCGATCGACACTACTCTGTGAGAGCATAGCGAGAAAGATTTTCCCTCGCGAATTGAATCCAGAATACAAAGAGATCGAAGAGGCGCACTATGCGTACAGAGTTCGCGACAGATTGAGGACGGATGTTTTGGTGCCACTCCGGAAGGTTTTGGAGCTGCCGGAGGTTTTCATTGGAGCCAATCGATGGGATTCGATCCCTTACAACAGAGTGGCTTCTGTTGCAATGAAAAACTACAAGGAAAAGTTCATGAAACACGATGGGGAGCGGTTTGCCCAATACTTGAAAGACGTGAAGGACGGTAAGACCAAGATCGCCGCCGGAGCACTGCTTCCTCACGAGATCATATTGTCTTTATTCGACGGACAGGAAGACGGTGGAGAAGTTGCAGAGCTTCAATGGAAGAGAATGGTGGATGACTTGTTGAAGAAAGGGAAGTTGAGAGAATGTATTGCTGTTTGTGATGTGTCCGGAAGTATGATGGGGATTCCCATGGATGTTTGTGTTGGTTTGGGTCTTTTGGTTTCTGAATTAAGCGAAGATCCATGGAAAGGGAAAGTGATCACATTCAGTGCAAACCCAGAGCTTCATATGATCCAAGGGGACAGTCTGAAATCAAAAGCGGAGTTCGTTAAGAGTATGGATTGGGGGGGTAATACTGATTTTCAGAAGGTTTTTGATCAAATTCTCAAAGTGGCTGTAGATGGAAAGTTGAAGGAAGAACAAATGATAAAGAGAGTGTTTGTGTTCAGTGACATGGAGTTCGATCAAGCATCACAGACCTCGTGGGAAACAGATTACCAAGTTATAGTCCGAAAGTTCACAGAAAAAGGGTATGGATCAGCTGTTCCACAGATTGTGTTTTGGAACTTAAGAGATTCGAGGGCGACGCCAGTGCCGTGCAACGAGAAGGGGGTGGCTTTGGTCAGTGGATACTCAAAGAATTTGATGAACTTGTTTTTGGATGGTGATGGTGTCATTCAACCGGAAGCCGTCATGGAGAAAGCTATCTCCGGCAATGAGTACCAGAAGCTTGTTGTTCTTGATTGA
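The location and length fields of this record can be cross-checked against the BLAST alignments, which end at query residue 638. The following is a minimal sketch, not part of the database itself: the function names are hypothetical, and it assumes that the coordinates are 1-based and inclusive and that the stated sequence length of 1917 counts the terminal stop codon (the CDS above ends in TGA).

```python
import re


def parse_location(text):
    """Split a location such as 'Chr4 : 18827889 .. 18830269 (+)' into its parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of a genomic span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length):
    """Residue count for a CDS whose length includes one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length // 3 - 1


seqid, start, end, strand = parse_location("Chr4 : 18827889 .. 18830269 (+)")
print(seqid, strand, span_length(start, end))  # genomic span in bp
print(protein_length(1917))                    # predicted protein length in residues
```

Under these assumptions the genomic span is 2381 bp and the predicted protein has 638 residues, which agrees with the final query coordinate reported in the full-length alignments.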
BLAST of CSPI04G20580.1 vs. Swiss-Prot
Match: YL728_MIMIV (Uncharacterized protein L728 OS=Acanthamoeba polyphaga mimivirus GN=MIMI_L728 PE=4 SV=1)

HSP 1 Score: 204.1 bits (518), Expect = 4.4e-51
Identity = 117/379 (30.87%), Positives = 198/379 (52.24%), Query Frame = 1

Query: 256 FTDCLKSDLQFMNSGDFTK---ISLAAKWCPSIDSSFDRSTLLCESIARK---IFPRELN 315
           F D L+ D   +N+   +    ISL AKW PS    ++++ LL     R    + PR+  
Sbjct: 135 FADQLQKDFDTVNNNTGSSKVAISLCAKWAPSEKQHYNKAPLLIADSIRSQMGLTPRQ-- 194

Query: 316 PEYKEIEEAHYAYRVRDRLRTDVLVPLRKVLELPEVFIGANRWDSIPYNRVASVAMKNYK 375
                       YR        +L  LR  L++ E+ +  +++D I ++++ SVA+   K
Sbjct: 195 ------------YR-------KMLTKLRSHLQVLEMLMSTHQYDKIDFSKLPSVALMKMK 254

Query: 376 EKFMKHDGER-------------FAQYLKDVKDGKTKIAAGALLPHEIILSLFDGQEDGG 435
             F +    +             + +YL+D+  GKTK+    + PHE++   +    D  
Sbjct: 255 NAFNRDTNSQGIKSDFRVNLHTSYTKYLQDLSKGKTKVNTKGIQPHELV-GQYLSSSDFD 314

Query: 436 EVAELQWKRMVDDLLKKGKLRECIAVCDVSGSMMGIPMDVCVGLGLLVSELSEDPWKGKV 495
           ++ E QW  +   +   G      AV DVSGSM G PM V + LG+LV+E +  P+ G+V
Sbjct: 315 QLVESQWDAIKKGVSDSGTFNNVTAVVDVSGSMHGQPMQVAIALGILVAECTSGPYHGRV 374

Query: 496 ITFSANPELHMIQGDSLKSKAEFVKSMDWGGNTDFQKVFDQILKVAVDGKLKEEQMIKRV 555
           ITF   P  H + G +L  K + ++   WGG+T+ + VFD +L+ A++ KLK  +MI  +
Sbjct: 375 ITFHEKPSWHHLTGSNLMEKVKCMRDAPWGGSTNMKSVFDLVLQNAINAKLKPHEMIDTL 434

Query: 556 FVFSDMEFDQASQTSWETDYQVIVRKFTEKGYGSAVPQIVFWNLR--DSRATPVPCNEKG 614
           F+F+DM+F+Q   +  E+ ++   RKFTE GY    P++V WNLR  +S++ P+  N++G
Sbjct: 435 FIFTDMQFNQCDCSGLESTFEYGQRKFTEAGY--TFPKVVCWNLRTSNSKSLPLMKNDEG 489
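
The percentages in each HSP summary line are simply the identical and positive column counts divided by the alignment length. As a hedged illustration (the regular expression and the function name are assumptions made for this sketch, not part of any BLAST tool), the figures can be recomputed from the text of a summary line like this:

```python
import re

# Matches the "Identity = a/b (x%), Positives = c/d (y%)" part of an HSP summary.
# "Pos\w+" is deliberately loose so that spelling variants of the label still match.
HSP_RE = re.compile(r"Identity = (\d+)/(\d+) \([\d.]+%\), Pos\w+ = (\d+)/(\d+)")


def hsp_percentages(line):
    """Return (percent identity, percent positives), each rounded to two decimals."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, ident_len, positive, pos_len = map(int, match.groups())
    return (round(100 * identical / ident_len, 2),
            round(100 * positive / pos_len, 2))


summary = "Identity = 117/379 (30.87%), Positives = 198/379 (52.24%), Query Frame = 1"
print(hsp_percentages(summary))
```

For the mimivirus hit above, 117/379 and 198/379 reproduce the reported 30.87% identity and 52.24% positives.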

BLAST of CSPI04G20580.1 vs. TrEMBL
Match: A0A0A0L2K6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G538590 PE=4 SV=1)

HSP 1 Score: 1281.5 bits (3315), Expect = 0.0e+00
Identity = 637/638 (99.84%), Positives = 637/638 (99.84%), Query Frame = 1

Query: 1   MAPPNLLGPPELYHAAAPVSLQPTESTPSGDPFVDAMVANFNKTDDSLPPMGFTENMSAT 60
           MAPPNLLGPPELYHAAAPVSLQPTESTPSGDPFVDAMVANFNKTDDSLPPMGFTENMSAT
Sbjct: 1   MAPPNLLGPPELYHAAAPVSLQPTESTPSGDPFVDAMVANFNKTDDSLPPMGFTENMSAT 60

Query: 61  FLSTGNPCLDFFFHVVPDTPANSLIDRLSLAWNHNPLMTLKLICNLRGVRGTGKSDKEGY 120
           FLSTGNPCLDFFFHVVPDTPANSLIDRLSLAWNHNPLMTLKLICNLRGVRGTGKSDKEGY
Sbjct: 61  FLSTGNPCLDFFFHVVPDTPANSLIDRLSLAWNHNPLMTLKLICNLRGVRGTGKSDKEGY 120

Query: 121 YTAALWLYNFHPKTLAGNIPSIADFGYFKDLPEILYRLLEGSDVRKNQKNEWKRRGLSVR 180
           YTAALWLYNFHPKTLAGNIPSIADFGYFKDLPEILYRLLEGSDVRKNQKNEWKRRGLSVR
Sbjct: 121 YTAALWLYNFHPKTLAGNIPSIADFGYFKDLPEILYRLLEGSDVRKNQKNEWKRRGLSVR 180

Query: 181 HGRFKQEKPKTRKKEIQSSTDREANISKAMEKSRIEKEKASGERKLRKVSMARKVMERFQ 240
           HGRFKQEKPKTRKKEIQSSTDREANISKAMEKSRIEKEKASGERKLRKVSMARKVMERFQ
Sbjct: 181 HGRFKQEKPKTRKKEIQSSTDREANISKAMEKSRIEKEKASGERKLRKVSMARKVMERFQ 240

Query: 241 ADSNFQLLHDRISDFFTDCLKSDLQFMNSGDFTKISLAAKWCPSIDSSFDRSTLLCESIA 300
           ADSNFQLLHDRISDFFTDCLKSDLQFMNSGDFTKISLAAKWCPSIDSSFDRSTLLCESIA
Sbjct: 241 ADSNFQLLHDRISDFFTDCLKSDLQFMNSGDFTKISLAAKWCPSIDSSFDRSTLLCESIA 300

Query: 301 RKIFPRELNPEYKEIEEAHYAYRVRDRLRTDVLVPLRKVLELPEVFIGANRWDSIPYNRV 360
           RKIFPRELNPEYKEIEEAHYAYRVRDRLRTDVLVPLRKVLELPEVFIGANRWDSIPYNRV
Sbjct: 301 RKIFPRELNPEYKEIEEAHYAYRVRDRLRTDVLVPLRKVLELPEVFIGANRWDSIPYNRV 360

Query: 361 ASVAMKNYKEKFMKHDGERFAQYLKDVKDGKTKIAAGALLPHEIILSLFDGQEDGGEVAE 420
           ASVAMKNYKEKFMKHDGERFAQYLKDVKDGKTKIAAGALLPHEIILSLFDGQEDGGEVAE
Sbjct: 361 ASVAMKNYKEKFMKHDGERFAQYLKDVKDGKTKIAAGALLPHEIILSLFDGQEDGGEVAE 420

Query: 421 LQWKRMVDDLLKKGKLRECIAVCDVSGSMMGIPMDVCVGLGLLVSELSEDPWKGKVITFS 480
           LQWKRMVDDLLKKGKLRECIAVCDVSGSMMGIPMDVCVGLGLLVSELSEDPWKGKVITFS
Sbjct: 421 LQWKRMVDDLLKKGKLRECIAVCDVSGSMMGIPMDVCVGLGLLVSELSEDPWKGKVITFS 480

Query: 481 ANPELHMIQGDSLKSKAEFVKSMDWGGNTDFQKVFDQILKVAVDGKLKEEQMIKRVFVFS 540
           ANPELHMIQGDSLKSKAEFVKSMDWGGNTDFQKVFDQILKVAVDGKLKEEQMIKRVFVFS
Sbjct: 481 ANPELHMIQGDSLKSKAEFVKSMDWGGNTDFQKVFDQILKVAVDGKLKEEQMIKRVFVFS 540

Query: 541 DMEFDQASQTSWETDYQVIVRKFTEKGYGSAVPQIVFWNLRDSRATPVPCNEKGVALVSG 600
           DMEFDQASQTSWETDYQVIVRKFTEKGYGSAVPQIVFWNLRDSRATPVP NEKGVALVSG
Sbjct: 541 DMEFDQASQTSWETDYQVIVRKFTEKGYGSAVPQIVFWNLRDSRATPVPSNEKGVALVSG 600

Query: 601 YSKNLMNLFLDGDGVIQPEAVMEKAISGNEYQKLVVLD 639
           YSKNLMNLFLDGDGVIQPEAVMEKAISGNEYQKLVVLD
Sbjct: 601 YSKNLMNLFLDGDGVIQPEAVMEKAISGNEYQKLVVLD 638

BLAST of CSPI04G20580.1 vs. TrEMBL
Match: B9GZA8_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0003s16360g PE=4 SV=1)

HSP 1 Score: 897.9 bits (2319), Expect = 7.2e-258
Identity = 461/655 (70.38%), Positives = 530/655 (80.92%), Query Frame = 1

Query: 1   MAPPNLLGPPELYHAAAPVSLQPTESTPSGDPFVDAMVANFNKTD-DSLPPMGFTENMSA 60
           MAPP+LLGPPE+     P   Q   +T   +PFVD MV NFNKT  + LP MG+TENMSA
Sbjct: 1   MAPPSLLGPPEI-KKPVPTPQQQAPTTVR-NPFVDLMVDNFNKTTVNQLPQMGYTENMSA 60

Query: 61  TFLSTGNPCLDFFFHVVPDTPANSLIDRLSLAWNHNPLMTLKLICNLRGVRGTGKSDKEG 120
           TFLS+GNPCLD FFHVVP+TP  SL  RL  AWNHNPL TLKLICNLRGVRGTGKSDKEG
Sbjct: 61  TFLSSGNPCLDLFFHVVPNTPPESLQKRLHSAWNHNPLTTLKLICNLRGVRGTGKSDKEG 120

Query: 121 YYTAALWLYNFHPKTLAGNIPSIADFGYFKDLPEILYRLLEGSDVRKNQKNEWKRRG--L 180
           +YT+A+WL+N HPKTLA NIPS+ADFGYFKDLPEILYRLLEG DVRK QK EW++R    
Sbjct: 121 FYTSAIWLHNNHPKTLACNIPSMADFGYFKDLPEILYRLLEGPDVRKIQKQEWRQRKGRK 180

Query: 181 SVRHGRFKQEKPKT--------RKKEIQSSTDR------EANISKAMEKSRIEKEKASGE 240
           + R   FK  +PKT        R K  +SS +          I     ++ +EKE AS  
Sbjct: 181 TGRRAGFKIGQPKTLAPFQRSKRPKNAKSSRNAGPSIPIHIRIQNEKRRAEMEKENASIA 240

Query: 241 RKLRKVSMARKVMERFQADSNFQLLHDRISDFFTDCLKSDLQFMNSGDFTKISLAAKWCP 300
           RK R+ +MA+KV+ER+  D +++ L++ +SDFF  CLK+D+Q +NS + TK+SLAAKWCP
Sbjct: 241 RKERRAAMAKKVIERYSHDPDYRFLYEGVSDFFAGCLKTDMQHLNSSNTTKVSLAAKWCP 300

Query: 301 SIDSSFDRSTLLCESIARKIFPRELNPEYKEIEEAHYAYRVRDRLRTDVLVPLRKVLELP 360
           SIDSSFDRSTLLCESIARK+FPRE  PEY+ IEEAHYAYRVRDRLR +VLVPLRKVLELP
Sbjct: 301 SIDSSFDRSTLLCESIARKVFPRESYPEYEGIEEAHYAYRVRDRLRKEVLVPLRKVLELP 360

Query: 361 EVFIGANRWDSIPYNRVASVAMKNYKEKFMKHDGERFAQYLKDVKDGKTKIAAGALLPHE 420
           EV+IGANRWDSIPYNRVASVAMK YK+KF KHD ERF QYL+DVK GKTKIAAGALLPHE
Sbjct: 361 EVYIGANRWDSIPYNRVASVAMKFYKKKFFKHDAERFRQYLEDVKAGKTKIAAGALLPHE 420

Query: 421 IILSLFDGQEDGGEVAELQWKRMVDDLLKKGKLRECIAVCDVSGSMMGIPMDVCVGLGLL 480
           II SL D  +DGGEVAELQWKR+VDDLL+KGK++ CIAVCDVSGSM G PM+V V LGLL
Sbjct: 421 IIESLND--DDGGEVAELQWKRIVDDLLQKGKMKNCIAVCDVSGSMSGTPMEVSVALGLL 480

Query: 481 VSELSEDPWKGKVITFSANPELHMIQGDSLKSKAEFVKSMDWGGNTDFQKVFDQILKVAV 540
           VSEL E+PWKGK+ITFS NP L M++GDSL  K EFV+SM+WG NT+FQKVFD IL+VAV
Sbjct: 481 VSELCEEPWKGKLITFSQNPMLQMVEGDSLLQKTEFVRSMEWGMNTNFQKVFDLILQVAV 540

Query: 541 DGKLKEEQMIKRVFVFSDMEFDQASQTSWETDYQVIVRKFTEKGYGSAVPQIVFWNLRDS 600
           +G L+E+QMIKRVFVFSDMEFDQAS   WETDYQVI RKFTEKGYG+ +P+IVFWNLRDS
Sbjct: 541 NGNLREDQMIKRVFVFSDMEFDQASCNPWETDYQVIARKFTEKGYGNVIPEIVFWNLRDS 600

Query: 601 RATPVPCNEKGVALVSGYSKNLMNLFLDGDGVIQPEAVMEKAISGNEYQKLVVLD 639
           RATPVP  +KGVALVSG+SKNLM LFLDGDG I PEAVM++AI+G EYQKLVVLD
Sbjct: 601 RATPVPGTQKGVALVSGFSKNLMKLFLDGDGEISPEAVMKEAIAGEEYQKLVVLD 651

BLAST of CSPI04G20580.1 vs. TrEMBL
Match: A0A067JHF5_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_23115 PE=4 SV=1)

HSP 1 Score: 873.6 bits (2256), Expect = 1.5e-250
Identity = 436/659 (66.16%), Positives = 521/659 (79.06%), Query Frame = 1

Query: 1   MAPPNLLGPPELYHAAAPVSLQPTESTPSGDPFVDAMVANFNKTD--DSLPPMGFTENMS 60
           MAP +LLGPPEL++ A      P  ++   DPF+D MVANFNK      LPPM +TEN S
Sbjct: 37  MAPTSLLGPPELHNPALLSKQSPQPTSTQADPFMDLMVANFNKPAVVSPLPPMSYTENRS 96

Query: 61  ATFLSTGNPCLDFFFHVVPDTPANSLIDRLSLAWNHNPLMTLKLICNLRGVRGTGKSDKE 120
           AT++S+GNPCLDFFFHVVPDTP  S+  RL+ AW  +PL TLKLICNLRGVRGTGKSDKE
Sbjct: 97  ATYISSGNPCLDFFFHVVPDTPPESIKQRLNEAWQQDPLTTLKLICNLRGVRGTGKSDKE 156

Query: 121 GYYTAALWLYNFHPKTLAGNIPSIADFGYFKDLPEILYRLLEGSDVRKNQKNEW--KRRG 180
           G+Y A +WL+ FHPKTLA N+  +ADFGYFKDLPEIL+RLLEG +VRK QK EW  ++RG
Sbjct: 157 GFYAAVIWLHQFHPKTLACNVAPMADFGYFKDLPEILFRLLEGFEVRKTQKAEWEQRKRG 216

Query: 181 LSVR--HGRFKQEKPKTR---------------KKEIQSSTDREANISKAMEKSRIEKEK 240
           L +R     F +  P+ R                K+ +    RE  I  AME+++IEKE+
Sbjct: 217 LGIRGKSSNFNRFSPRNRTFRGPFRGQSKLSKGSKQSKPLATREIRILNAMERNKIEKEE 276

Query: 241 ASGERKLRKVSMARKVMERFQADSNFQLLHDRISDFFTDCLKSDLQFMNSGDFTKISLAA 300
           AS  RK +++ MA+KV  R+  D +F+ L++RISDFF +CLK+D++++ S    KISLAA
Sbjct: 277 ASMSRKQKRICMAKKVFGRYSRDPDFRFLYERISDFFAECLKADVEYLKSLQTKKISLAA 336

Query: 301 KWCPSIDSSFDRSTLLCESIARKIFPRELNPEYKEIEEAHYAYRVRDRLRTDVLVPLRKV 360
           KWCPSIDSSFD+STLLCESIARK+FP+E  PEY+ IEEAHYAYR+RDRLR +VLVPLRKV
Sbjct: 337 KWCPSIDSSFDKSTLLCESIARKVFPKESYPEYEGIEEAHYAYRIRDRLRKEVLVPLRKV 396

Query: 361 LELPEVFIGANRWDSIPYNRVASVAMKNYKEKFMKHDGERFAQYLKDVKDGKTKIAAGAL 420
           LELPEV+IG N+W  IPYNRVASVAMK YKEKF+KHD ERF++YL+DVK GK+KIAAGAL
Sbjct: 397 LELPEVYIGYNKWGEIPYNRVASVAMKLYKEKFLKHDAERFSKYLEDVKSGKSKIAAGAL 456

Query: 421 LPHEIILSLFDGQEDGGEVAELQWKRMVDDLLKKGKLRECIAVCDVSGSMMGIPMDVCVG 480
           LPHEII +L DG  DGG+VAELQWKRMVDDL++KGKLR C+A+ DVSGSM G PM+V V 
Sbjct: 457 LPHEIIAALNDG--DGGQVAELQWKRMVDDLVEKGKLRNCMAISDVSGSMSGTPMEVSVA 516

Query: 481 LGLLVSELSEDPWKGKVITFSANPELHMIQGDSLKSKAEFVKSMDWGGNTDFQKVFDQIL 540
           LG+LVSELSEDPWKGK+ITFSA+P L M+ G+SL  K  FV+ M+WG NTDFQKVFD IL
Sbjct: 517 LGVLVSELSEDPWKGKLITFSADPTLQMVTGNSLLEKTRFVRRMEWGMNTDFQKVFDLIL 576

Query: 541 KVAVDGKLKEEQMIKRVFVFSDMEFDQASQTSWETDYQVIVRKFTEKGYGSAVPQIVFWN 600
           +VAV+GKLKE+QMIKR+FVFSDMEFDQAS  SWETDYQVI RKFT +GYG+ +PQIVFWN
Sbjct: 577 RVAVEGKLKEDQMIKRLFVFSDMEFDQASSRSWETDYQVIARKFTAEGYGNCIPQIVFWN 636

Query: 601 LRDSRATPVPCNEKGVALVSGYSKNLMNLFLDGDGVIQPEAVMEKAISGNEYQKLVVLD 639
           LRDSRATPVP  + GVALVSG+SKNLM LFLD DG I P +VME AI+G EYQKL V+D
Sbjct: 637 LRDSRATPVPATQDGVALVSGFSKNLMKLFLDEDGAIDPVSVMEAAIAGEEYQKLAVID 693

BLAST of CSPI04G20580.1 vs. TrEMBL
Match: G7JY31_MEDTR (Plant/T31B5-30 protein OS=Medicago truncatula GN=MTR_5g045160 PE=4 SV=2)

HSP 1 Score: 869.8 bits (2246), Expect = 2.1e-249
Identity = 440/669 (65.77%), Positives = 524/669 (78.33%), Query Frame = 1

Query: 1   MAPPNLLGPPELY------HAAAPVSLQPTEST-----PSGDPFVDAMVANFNKTDDSL- 60
           MA   L+GPPE+Y      +     + Q TE+T      + D F+D MVANFN    +  
Sbjct: 1   MAAVALVGPPEIYSLKSNPNPTTTTTAQTTETTVTTTTTTNDVFLDQMVANFNSLGRNRN 60

Query: 61  PPMGFTENMSATFLSTGNPCLDFFFHVVPDTPANSLIDRLSLAWNHNPLMTLKLICNLRG 120
           PPMG TENMS TFLSTGNPCLDFFFHVVPDTP+ +L++RL LAW+ NPL  LKL+CNLRG
Sbjct: 61  PPMGLTENMSPTFLSTGNPCLDFFFHVVPDTPSETLVERLKLAWSQNPLTALKLVCNLRG 120

Query: 121 VRGTGKSDKEGYYTAALWLYNFHPKTLAGNIPSIADFGYFKDLPEILYRLLEGSDVRKNQ 180
           VRGTGKS+KEG+Y AALW +  HPKTLA N+PS+ADFGYFKDLPEILYRLLEGS+VRK Q
Sbjct: 121 VRGTGKSNKEGFYAAALWFHENHPKTLATNVPSLADFGYFKDLPEILYRLLEGSEVRKTQ 180

Query: 181 KNEWK------------------RRGLSVRHGRFKQEKPKTRKKEIQSSTDREANISKAM 240
           K EW+                  RRG+  +       K   +  +    T++++ +++ +
Sbjct: 181 KEEWRERKSGSKRKSSSGSTPFLRRGMKKKQRHHHNNKNNNKDNKGWKGTEKDSIVTEEV 240

Query: 241 E-KSRIEKEKASGERKLRKVSMARKVMERFQADSNFQLLHDRISDFFTDCLKSDLQFMNS 300
             ++++EKE A   ++ +++++A+K+++R+  D NF+ LHD ISD F DCLK DL+F+ S
Sbjct: 241 AARAKVEKEGAHVLKEEKRIALAKKLVDRYTTDPNFKFLHDCISDHFADCLKKDLEFLKS 300

Query: 301 GDFTKISLAAKWCPSIDSSFDRSTLLCESIARKIFPRELNPEYKEIEEAHYAYRVRDRLR 360
           G   KISLAAKWCPS+DSSFDRSTLLCE+IA+KIFPRE   EY+ +EEAHYAYRVRDRLR
Sbjct: 301 GSPNKISLAAKWCPSVDSSFDRSTLLCETIAKKIFPRE---EYEGVEEAHYAYRVRDRLR 360

Query: 361 TDVLVPLRKVLELPEVFIGANRWDSIPYNRVASVAMKNYKEKFMKHDGERFAQYLKDVKD 420
            DVLVPLRKVLELPEVFIGAN+W  IPYNRVASVAMK YKEKF+KHD ERF +YL+DVK 
Sbjct: 361 KDVLVPLRKVLELPEVFIGANQWGLIPYNRVASVAMKFYKEKFLKHDKERFEKYLEDVKA 420

Query: 421 GKTKIAAGALLPHEIILSLFDGQEDGGEVAELQWKRMVDDLLKKGKLRECIAVCDVSGSM 480
           GKT IAAGALLPHEII SL D  EDGGEVAELQWKR+VDDLLKKGK+R C+AVCDVSGSM
Sbjct: 421 GKTTIAAGALLPHEIIESLDD--EDGGEVAELQWKRIVDDLLKKGKMRNCLAVCDVSGSM 480

Query: 481 MGIPMDVCVGLGLLVSELSEDPWKGKVITFSANPELHMIQGDSLKSKAEFVKSMDWGGNT 540
            G PM+VCV LGLLVSEL+E+PWKGKVITFS  P+LH+I+GD+LKSK +FV++MDWG NT
Sbjct: 481 HGTPMEVCVALGLLVSELNEEPWKGKVITFSREPQLHVIKGDNLKSKTQFVRNMDWGMNT 540

Query: 541 DFQKVFDQILKVAVDGKLKEEQMIKRVFVFSDMEFDQASQTSWETDYQVIVRKFTEKGYG 600
           DFQKVFD+IL VAV+G LKE+QMIKR+FVFSDMEFDQAS  SWETDYQ I RK+ EKGYG
Sbjct: 541 DFQKVFDRILDVAVNGNLKEDQMIKRIFVFSDMEFDQASANSWETDYQAITRKYREKGYG 600

Query: 601 SAVPQIVFWNLRDSRATPVPCNEKGVALVSGYSKNLMNLFLDGDGVIQPEAVMEKAISGN 639
           SAVPQIVFWNLRDS+ATPVP  +KGVALVSG+SKNL+ LF D DG I P   ME AI+G 
Sbjct: 601 SAVPQIVFWNLRDSKATPVPSTQKGVALVSGFSKNLLTLFFDNDGDISPVEAMEAAIAGP 660

BLAST of CSPI04G20580.1 vs. TrEMBL
Match: M5WER8_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa020333mg PE=4 SV=1)

HSP 1 Score: 867.5 bits (2240), Expect = 1.0e-248
Identity = 436/643 (67.81%), Positives = 513/643 (79.78%), Query Frame = 1

Query: 1   MAPPNLL-GPPELYHAAAPVSLQPTESTPSGDPFVDAMVANFN---KTDDSLPPMGFTEN 60
           MAPP+LL GPPE      P  +     T S DPFVD MVAN+N   K     PPMGFTEN
Sbjct: 1   MAPPSLLLGPPEF---RKPEPIAAATQTQSTDPFVDLMVANYNDSAKAPIIAPPMGFTEN 60

Query: 61  MSATFLSTGNPCLDFFFHVVPDTPANSLIDRLSLAWNHNPLMTLKLICNLRGVRGTGKSD 120
            SATFLS+GNPC+DFFFHVVP TPA+    +L LAW H+ L TLKLICNLRGVRGTGKSD
Sbjct: 61  RSATFLSSGNPCVDFFFHVVPSTPASYFNQQLPLAWAHDDLTTLKLICNLRGVRGTGKSD 120

Query: 121 KEGYYTAALWLYNFHPKTLAGNIPSIADFGYFKDLPEILYRLLEGSDVRKNQKNEWK-RR 180
           KEG+YTAA WL+  HPKTLA N+ S+A+FGYFKDLPEILYRLL+G DVRK QK EW  R+
Sbjct: 121 KEGFYTAAFWLHKHHPKTLACNVASLAEFGYFKDLPEILYRLLQGEDVRKTQKAEWSMRK 180

Query: 181 GLSVRHGRFKQEKPKTRKKEIQSSTDREANISKAMEKSRIEKEKASGERKLRKVSMARKV 240
           G + R GR                  REA I +AME++++EKEKAS  R+ +K SMA+K 
Sbjct: 181 GGACRIGR------------------REARIKRAMERAQLEKEKASSLRREKKSSMAQKA 240

Query: 241 MERFQADSNFQLLHDRISDFFTDCLKSDLQFMNSGDFTKISLAAKWCPSIDSSFDRSTLL 300
           + R+Q D +F+ L++R+SD F +CLKSD++  NS  + KI+LAAKWCPSIDSSFDR+TLL
Sbjct: 241 LGRYQRDPDFRFLYERVSDLFAECLKSDIENFNSNQYKKITLAAKWCPSIDSSFDRATLL 300

Query: 301 CESIARKIFPRELNPEYKEIEEAHYAYRVRDRLRTDVLVPLRKVLELPEVFIGANRWDSI 360
           CESIARK+FPRE  PEY+ +E+AHYAYRVRDRLR DVLVPLRKVLELPEV+IGAN+W SI
Sbjct: 301 CESIARKVFPRESYPEYEGVEDAHYAYRVRDRLRKDVLVPLRKVLELPEVYIGANQWGSI 360

Query: 361 PYNRVASVAMKNYKEKFMKHDGERFAQYLKDVKDGKTKIAAGALLPHEIILSLFDGQEDG 420
           PYNRVASVAMK YKEKF+KHD ERF +YL+DVK GK+ IAAGALLPHEII SL  G  DG
Sbjct: 361 PYNRVASVAMKFYKEKFLKHDEERFKKYLEDVKAGKSTIAAGALLPHEIIESLNHG--DG 420

Query: 421 GEVAELQWKRMVDDLLKKGKLRECIAVCDVSGSMMGIPMDVCVGLGLLVSELSEDPWKGK 480
           G+VAELQWKRMVDD+ K+GK+  C+AVCDVSGSM G PM+V V LGLLVSELSE+PWKGK
Sbjct: 421 GQVAELQWKRMVDDMQKQGKMNNCLAVCDVSGSMNGTPMEVSVALGLLVSELSEEPWKGK 480

Query: 481 VITFSANPELHMIQGDSLKSKAEFVKSMDWGGNTDFQKVFDQILKVAVDGKLKEEQMIKR 540
           VITFSA PELH+IQG  L SK EFV++M+WGGNT+FQKVFD +L+VAV G+LK E MIKR
Sbjct: 481 VITFSARPELHLIQGGDLMSKCEFVRTMEWGGNTNFQKVFDLLLQVAVKGRLKPEHMIKR 540

Query: 541 VFVFSDMEFDQASQTSWETDYQVIVRKFTEKGYGSAVPQIVFWNLRDSRATPVPCNEKGV 600
           +FVFSDMEFDQAS   WETDYQ I RK+ +KGYG+A+PQIVFWNLR S +TPVP  + GV
Sbjct: 541 IFVFSDMEFDQASTNRWETDYQTIQRKYNKKGYGNAIPQIVFWNLRHSLSTPVPSTQPGV 600

Query: 601 ALVSGYSKNLMNLFLDGDGVIQPEAVMEKAISGNEYQKLVVLD 639
           AL+SGYSKNLM LFLD DG ++P++VME+A+SG EYQKL+VLD
Sbjct: 601 ALLSGYSKNLMKLFLDNDGEVRPDSVMEQALSGEEYQKLLVLD 620

BLAST of CSPI04G20580.1 vs. TAIR10
Match: AT5G13210.1 (AT5G13210.1 Uncharacterised conserved protein UCP015417, vWA)

HSP 1 Score: 807.7 bits (2085), Expect = 4.9e-234
Identity = 422/681 (61.97%), Positives = 509/681 (74.74%), Query Frame = 1

Query: 1   MAPPNLLGPPELYHAAAPVSLQPTESTPSG--DPFVDAMVANFNKT----DDSLPPMGFT 60
           M+P  LLGPPEL     P SL P  +T SG  DPF+DAMV+NFN +    + + PPMG+T
Sbjct: 1   MSPSPLLGPPELRD---PNSLLPKPTTTSGPSDPFMDAMVSNFNNSARVNNVNSPPMGYT 60

Query: 61  ENMSATFLSTGNPCLDFFFHVVPDTPANSLIDRLSLAWNHNPLMTLKLICNLRGVRGTGK 120
           EN SAT+LS+GNPCLDFFFHVVP TP +SL   L  AW+H+ L TLKLICNLRGVRGTGK
Sbjct: 61  ENKSATYLSSGNPCLDFFFHVVPSTPKHSLEQWLQGAWDHDALTTLKLICNLRGVRGTGK 120

Query: 121 SDKEGYYTAALWLYNFHPKTLAGNIPSIADFGYFKDLPEILYRLLEGSDVRKNQKNE-WK 180
           SDKEG+YTAALWL+  HPKTLA N+ S++ FGYFKD PE+LYR+L+GS++RK QK+E +K
Sbjct: 121 SDKEGFYTAALWLHGRHPKTLACNLESLSQFGYFKDFPELLYRILQGSEIRKIQKSERFK 180

Query: 181 RRGLSVR----------HGRFK------QEKPKTRKKEIQSSTDREANISKAMEKSRIEK 240
           R+  ++           HGR          +P +++K + +   R AN   A  K++ EK
Sbjct: 181 RKSEALDRRAPYDGHCYHGRLYGGRGRGSSRPSSKRKPVATRALRVAN---AERKNQAEK 240

Query: 241 EKASGERKLRKVSMARKVMERFQADSNFQLLHDRISDFFTDCLKSDLQFMNSGDFTKISL 300
            +AS +RK +KVSM +    R+  D +++ LH+R+SD F + LK DL+F+ S    +ISL
Sbjct: 241 ARASLDRKKKKVSMGKDAFTRYSCDPDYRYLHERVSDLFANQLKKDLEFLTSDKPNEISL 300

Query: 301 AAKWCPSIDSSFDRSTLLCESIARKIFPRELNPEYKEIEEAHYAYRVRDRLRTDVLVPLR 360
           AAKWCPS+DSSFD++TLLCESIARKIF RE  PEY+ + EAHYAYRVRDRLR DVLVPLR
Sbjct: 301 AAKWCPSLDSSFDKATLLCESIARKIFTRESFPEYEGVVEAHYAYRVRDRLRKDVLVPLR 360

Query: 361 KVLELPEVFIGANRWDSIPYNRVASVAMKNYKEKFMKHDGERFAQYLKDVKDGKTKIAAG 420
           K L+LPEV++GA  WD +PYNRVASVAMK+YKE F+KHD ERF QYL D K GKTK+AAG
Sbjct: 361 KTLQLPEVYMGARNWDILPYNRVASVAMKSYKEIFLKHDAERFQQYLDDAKAGKTKVAAG 420

Query: 421 ALLPHEIILSLFDGQEDGGEVAELQWKRMVDDLLKKGKLRECIAVCDVSGSMMGIPMDVC 480
           A+LPHEII  L  G  DGG+VAELQWKR VDD+ +KG LR CIAVCDVSGSM G PM+VC
Sbjct: 421 AVLPHEIIRELDGG--DGGQVAELQWKRTVDDMKEKGSLRNCIAVCDVSGSMNGEPMEVC 480

Query: 481 VGLGLLVSELSEDPWKGKVITFSANPELHMIQGDSLKSKAEFVKSMDWGGNTDFQKVFDQ 540
           V LGLLVSELSE+PWKGK+ITFS NPELH+++GD L SK EFVK M WG NTDFQKVFD 
Sbjct: 481 VALGLLVSELSEEPWKGKLITFSQNPELHLVKGDDLYSKTEFVKKMQWGMNTDFQKVFDL 540

Query: 541 ILKVAVDGKLKEEQMIKRVFVFSDMEFDQASQTS--------------------WETDYQ 600
           IL VAV  KLK E+MIKRVFVFSDMEFDQA+ +S                    WETDY+
Sbjct: 541 ILGVAVQEKLKPEEMIKRVFVFSDMEFDQAASSSHYSRPGYAFLRQPPSNPSNGWETDYE 600

Query: 601 VIVRKFTEKGYGSAVPQIVFWNLRDSRATPVPCNEKGVALVSGYSKNLMNLFLDGDGVIQ 639
           VIVRK+ + GYG  VP+IVFWNLRDSRATPVP N+KGVALVSG+SKNLM +FL+ DG I 
Sbjct: 601 VIVRKYKQNGYGDVVPEIVFWNLRDSRATPVPGNKKGVALVSGFSKNLMKMFLEHDGEID 660

BLAST of CSPI04G20580.1 vs. TAIR10
Match: AT5G43400.1 (AT5G43400.1 Uncharacterised conserved protein UCP015417, vWA)

HSP 1 Score: 739.6 bits (1908), Expect = 1.7e-213
Identity = 377/662 (56.95%), Positives = 481/662 (72.66%), Query Frame = 1

Query: 6   LLGPPELYHAAAPVS-LQPTESTPSGDPFVDAMVANFNKTDDSLPPMGFTENMSATFLST 65
           LLGPP +   +  +  +   E+  S +  + +  A  N  +   PPMG TEN S TFLS+
Sbjct: 8   LLGPPSVAGNSPIIKPIHSPETHISDENTLISQTATLNLEEP--PPMGLTENFSPTFLSS 67

Query: 66  GNPCLDFFFHVVPDTPANSLIDRLSLAWNHNPLMTLKLICNLRGVRGTGKSDKEGYYTAA 125
           GNPCLDFFFH+VPDT  + LI RL+++W+H+PL TLKLICNLRGVRGTGKSDKEG+YTAA
Sbjct: 68  GNPCLDFFFHIVPDTSPDDLIQRLAISWSHDPLTTLKLICNLRGVRGTGKSDKEGFYTAA 127

Query: 126 LWLYNFHPKTLAGNIPSIADFGYFKDLPEILYRLLEGSDVRKNQKNEWKRRGLSVRHGRF 185
            WLY  HPKTLA N+P++ DFGYFKDLPEIL+R+LEG ++ + +   W++R      G+ 
Sbjct: 128 FWLYKNHPKTLALNVPALVDFGYFKDLPEILFRILEGQNMERGKNRVWRKRVQRKFKGK- 187

Query: 186 KQEKPKTRKKEIQSSTDREANISKAMEK--SRIEKEKASGERKLRKVSMARKVMERFQAD 245
                  R+K+ + S + E  I +  E+    ++K KA   RK R+   A+K + R+ +D
Sbjct: 188 -------REKKSEISGEMEDRILENAEEIGGSVDKVKARALRKQREFEKAKKAVTRYNSD 247

Query: 246 SNFQLLHDRISDFFTDCLKSDLQFMNSGDFTKISLAAKWCPSIDSSFDRSTLLCESIARK 305
           +N++LL DRI+D F   LKSDL+++NS   TKISLA+KWCPS+DSS+D++TL+CE+IAR+
Sbjct: 248 ANYRLLFDRIADLFAVLLKSDLKYLNSNGLTKISLASKWCPSVDSSYDKATLICEAIARR 307

Query: 306 IFPRELNPEYKEIEEAHYAYRVRDRLRTDVLVPLRKVLELPEVFIGANRWDSIPYNRVAS 365
           +FPRE   EY+ IEEAHYAYR+RDRLR +VLVPL K LE PE+F+ A  W+ + YNRV S
Sbjct: 308 MFPRE---EYEGIEEAHYAYRIRDRLRKEVLVPLHKALEFPELFMSAKEWNLLKYNRVPS 367

Query: 366 VAMKNYKEKFMKHDGERFAQYLKDVKDGKTKIAAGALLPHEIILSLFD--GQEDGGEVAE 425
           VAMKNYK+ F +HD ERF ++L+DVK GK KIAAGALLPH+II  L D  G E G EVAE
Sbjct: 368 VAMKNYKKLFEEHDSERFTEFLEDVKSGKKKIAAGALLPHQIINQLEDDSGSEVGAEVAE 427

Query: 426 LQWKRMVDDLLKKGKLRECIAVCDVSGSMMGIPMDVCVGLGLLVSELSEDPWKGKVITFS 485
           LQW RMVDDL KKGKL+  +AVCDVSGSM G PM+VCV LGLLVSELSE+PWKGKVITFS
Sbjct: 428 LQWARMVDDLAKKGKLKNSLAVCDVSGSMSGTPMEVCVALGLLVSELSEEPWKGKVITFS 487

Query: 486 ANPELHMIQGDSLKSKAEFVKSMDWGGNTDFQKVFDQILKVAVDGKLKEEQMIKRVFVFS 545
            NPELH++ G SL+ K +FV+ M+WG NTDFQ VFD+IL+VAV+  L ++QMIKR+FVFS
Sbjct: 488 ENPELHIVTGSSLREKTQFVREMEWGMNTDFQIVFDRILEVAVENNLTDDQMIKRLFVFS 547

Query: 546 DMEFDQA------------------------SQTSWETDYQVIVRKFTEKGYGSAVPQIV 605
           DMEFD A                        S+  WETDY+V+ RK+ EKG+ + VP++V
Sbjct: 548 DMEFDDAMANSHSEVSYHLSVEDRLKISKERSKEKWETDYEVVQRKYKEKGFQN-VPEMV 607

Query: 606 FWNLRDSRATPVPCNEKGVALVSGYSKNLMNLFLDGDGVIQPEAVMEKAISGNEYQKLVV 639
           FWNLRDS ATPV  N+KGVA+VSG+SKNL+ LFL+  G++ PE VM  AI G EY+KLVV
Sbjct: 608 FWNLRDSSATPVVANQKGVAMVSGFSKNLLTLFLEEGGIVNPEDVMWIAIKGEEYKKLVV 655

BLAST of CSPI04G20580.1 vs. TAIR10
Match: AT5G43390.1 (AT5G43390.1 Uncharacterised conserved protein UCP015417, vWA)

HSP 1 Score: 720.3 bits (1858), Expect = 1.0e-207
Identity = 370/660 (56.06%), Positives = 475/660 (71.97%), Query Frame = 1

Query: 6   LLGPPELYHAAAPVSLQPTESTPSGDPFVDAMVANFNKTDDSLPPMGFTENMSATFLSTG 65
           LLGPP +     PVS          D  V + +A  N  +   P MG TEN S TFL++G
Sbjct: 9   LLGPPSVAAMETPVS---------DDNSVISQIATLNLEE---PQMGLTENFSPTFLTSG 68

Query: 66  NPCLDFFFHVVPDTPANSLIDRLSLAWNHNPLMTLKLICNLRGVRGTGKSDKEGYYTAAL 125
           NPCLDFFFH+VPDTP++ LI RL+++W+H+PL TLKL+CNLRGVRGTGKSDKEG+YTAAL
Sbjct: 69  NPCLDFFFHIVPDTPSDDLIQRLAISWSHDPLTTLKLLCNLRGVRGTGKSDKEGFYTAAL 128

Query: 126 WLYNFHPKTLAGNIPSIADFGYFKDLPEILYRLLEGSDVRKNQKNEWKRRGLSVRHGRFK 185
           WLY  HPKTLA NIP++ DFGYFKDLPEIL R+LEG    + +   W++R       +FK
Sbjct: 129 WLYKNHPKTLALNIPTLVDFGYFKDLPEILLRILEGQQTERGKTRVWRKR----IQRKFK 188

Query: 186 QEKPKTRKKEIQSSTDREANISKAMEKSR--IEKEKASGERKLRKVSMARKVMERFQADS 245
            +     +K+   S D E  I +  E++   + K KA   RK R+   A+K ++R+ +D+
Sbjct: 189 GDS----EKKSTISGDMEDRILETAEETGGPVGKVKARALRKQREFEKAKKALDRYNSDA 248

Query: 246 NFQLLHDRISDFFTDCLKSDLQFMNSGDFTKISLAAKWCPSIDSSFDRSTLLCESIARKI 305
           N++LL D+I+D F + LKSDL+++N+ +  KISLA+KWCPS+DSS+D++TL+CE+IAR++
Sbjct: 249 NYRLLFDQIADLFAELLKSDLEYLNTDNLNKISLASKWCPSVDSSYDKTTLICEAIARRM 308

Query: 306 FPRELNPEYKE-IEEAHYAYRVRDRLRTDVLVPLRKVLELPEVFIGANRWDSIPYNRVAS 365
           F RE   EY+E IEE HYAYR+RDRLR +VLVPL K LELPEV + A  W+ + YNRV S
Sbjct: 309 FLRE---EYEEGIEEVHYAYRIRDRLRKEVLVPLHKALELPEVSMSAKEWNLLKYNRVPS 368

Query: 366 VAMKNYKEKFMKHDGERFAQYLKDVKDGKTKIAAGALLPHEIILSLFDGQEDGGEVAELQ 425
           +AM+NY  +F +HD ERF ++L+DVK GK K+AAGALLPH+II  L +  E G EVAELQ
Sbjct: 369 IAMQNYSSRFAEHDSERFTEFLEDVKSGKKKMAAGALLPHQIISQLLNDSE-GEEVAELQ 428

Query: 426 WKRMVDDLLKKGKLRECIAVCDVSGSMMGIPMDVCVGLGLLVSELSEDPWKGKVITFSAN 485
           W RMVDDL KKGKL+  +A+CDVSGSM G PM+VC+ LGLLVSEL+E+PWKGKVITFS N
Sbjct: 429 WARMVDDLAKKGKLKNSLAICDVSGSMAGTPMNVCIALGLLVSELNEEPWKGKVITFSEN 488

Query: 486 PELHMIQGDSLKSKAEFVKSMDWGGNTDFQKVFDQILKVAVDGKLKEEQMIKRVFVFSDM 545
           P+LH++ G SL+ K +FV+ MD+G NTDFQKVFD+IL+VAV+  L +EQMIKR+FVFSDM
Sbjct: 489 PQLHVVTGSSLREKTKFVREMDFGINTDFQKVFDRILEVAVENNLTDEQMIKRLFVFSDM 548

Query: 546 EFDQA------------------------SQTSWETDYQVIVRKFTEKGYGSAVPQIVFW 605
           EFD A                        S   WETDY+V+ RK+ EKG+ + VP+IVFW
Sbjct: 549 EFDDARVDSHSEMSDYASNLESDYESVPESFEKWETDYEVVQRKYKEKGFQN-VPEIVFW 608

Query: 606 NLRDSRATPVPCNEKGVALVSGYSKNLMNLFLDGDGVIQPEAVMEKAISGNEYQKLVVLD 639
           NLRDS ATPV   +KGVA+VSG+SKNL+ LFL+  G++ PE VM  AI G EYQKL V D
Sbjct: 609 NLRDSSATPVVSKQKGVAMVSGFSKNLLTLFLEEGGIVNPEDVMLLAIKGEEYQKLAVYD 643

BLAST of CSPI04G20580.1 vs. TAIR10
Match: AT3G24780.1 (AT3G24780.1 Uncharacterised conserved protein UCP015417, vWA)

HSP 1 Score: 715.3 bits (1845), Expect = 3.3e-206
Identity = 366/600 (61.00%), Positives = 446/600 (74.33%), Query Frame = 1

Query: 47  SLPPMGFTENMSATFLSTGNPCLDFFFHVVPDTPANSLIDRLSLAWNHNPLMTLKLICNL 106
           S P MG+TEN SAT+LS+GNPCLDFFFH+VP TP  SL  RL  AW+H+ L TLKLICNL
Sbjct: 106 SSPAMGYTENRSATYLSSGNPCLDFFFHIVPSTPKKSLEQRLEEAWDHDSLTTLKLICNL 165

Query: 107 RGVRGTGKSDKEGYYTAALWLYNFHPKTLAGNIPSIADFGYFKDLPEILYRLLEGSDVRK 166
           RGVRGTGKSDKEG+YTAALWL+  HPKTLA N+ S++ FGYFKD PEILYR+L+G ++R 
Sbjct: 166 RGVRGTGKSDKEGFYTAALWLHGRHPKTLACNLESLSKFGYFKDFPEILYRILQGPEIRS 225

Query: 167 NQKNE---------WKRRGLSVRHGR-FKQEKPKTRKKEIQSSTDREANISKAMEKSRIE 226
            QK +          +RR    R GR F   + + R    +S+  RE  ++ A  K++ E
Sbjct: 226 IQKTQRYDTIAAASLRRRSRFSRGGRGFGGGRSRGRHFLKRSAATRELRVANAERKNQEE 285

Query: 227 KEKASGERKLRKVSMARKVMERFQADSNFQLLHDRISDFFTDCLKSDLQFMNSGDFTKIS 286
           K +AS +RK +KVSMA+    ++  D N++ LH+R+S+ F + LK DL+F+ SG   KIS
Sbjct: 286 KARASLKRKQKKVSMAKAASTKYSNDPNYRFLHERVSELFANQLKRDLEFLTSGQPNKIS 345

Query: 287 LAAKWCPSIDSSFDRSTLLCESIARKIFPRELNPEYKEIEEAHYAYRVRDRLRTDVLVPL 346
           LAAKWCPS+DSSFD++TL+CESIARKIFP+E  PEY+ +E+AHYAYRVRDRLR  VLVPL
Sbjct: 346 LAAKWCPSLDSSFDKATLICESIARKIFPQESFPEYEGVEDAHYAYRVRDRLRKQVLVPL 405

Query: 347 RKVLELPEVFIGANRWDSIPYNRVASVAMKNYKEKFMKHDGERFAQYLKDVKDGKTKIAA 406
           RK L+LPEV++GA  W S+PYNRVASVAMK+YKE F+  D +RF QYL D K GKTKIAA
Sbjct: 406 RKTLQLPEVYMGARAWQSLPYNRVASVAMKSYKEVFLYRDEKRFQQYLNDAKTGKTKIAA 465

Query: 407 GALLPHEIILSLFDGQEDGGEVAELQWKRMVDDLLKKGKLRECIAVCDVSGSMMGIPMDV 466
           GA+LPHEII  L  G  DGG+VAELQWKRMVDDL +KG L  C+A+CDVSGSM G PM+V
Sbjct: 466 GAVLPHEIIRELNGG--DGGKVAELQWKRMVDDLKEKGSLTNCMAICDVSGSMNGEPMEV 525

Query: 467 CVGLGLLVSELSEDPWKGKVITFSANPELHMIQGDSLKSKAEFVKSMDWGGNTDFQKVFD 526
            V LGLLVSELSE+PWKGK+ITF  +PELH+++GD L+SK EFV+SM W  NTDFQKVFD
Sbjct: 526 SVALGLLVSELSEEPWKGKLITFRQSPELHLVKGDDLRSKTEFVESMQWDMNTDFQKVFD 585

Query: 527 QILKVAVDGKLKEEQMIKRVFVFSDMEFDQASQT-------------------------- 586
            ILKVAV+ KLK + MIKRVFVFSDMEFD+AS +                          
Sbjct: 586 LILKVAVESKLKPQDMIKRVFVFSDMEFDEASTSTSSFNKWRSSPPTPSNRWDTLSYSED 645

Query: 587 -------SWETDYQVIVRKFTEKGYGSAVPQIVFWNLRDSRATPVPCNEKGVALVSGYSK 604
                  +W+TDY+VIVRK+ EKGYG AVP+IVFWNLRDSR+TPV  N+KGVALVSG+SK
Sbjct: 646 DEDEENDAWQTDYKVIVRKYREKGYGEAVPEIVFWNLRDSRSTPVLGNKKGVALVSGFSK 703
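
The percentages in an HSP summary line follow directly from the matched/aligned counts (for example, 366/600 gives 61.00% identity). The sketch below is a minimal illustration, not part of the database output: the function name `parse_hsp_stats` and the returned dictionary keys are hypothetical, and the pattern is assumed to match only summary lines laid out like the ones on this page.

```python
import re

# Pattern for the "Identity = a/b (x%), Positives = c/d (y%)" summary line.
# "Pos\w*" is deliberately loose so that spelling variants of "Positives"
# in report output are still matched.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Pos\w*\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract counts from an HSP summary line and recompute the percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity_pct_reported": float(ident_pct),
        "identity_pct_computed": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct_reported": float(pos_pct),
        "positives_pct_computed": round(100 * int(pos) / int(pos_len), 2),
    }


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 366/600 (61.00%), Positives = 446/600 (74.33%), Query Frame = 1"
    )
    print(stats)
```

Comparing the reported and recomputed values is a quick sanity check when alignment summaries are copied between tools.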

BLAST of CSPI04G20580.1 vs. NCBI nr
Match: gi|449453862|ref|XP_004144675.1| (PREDICTED: uncharacterized protein LOC101205449 [Cucumis sativus])

HSP 1 Score: 1281.5 bits (3315), Expect = 0.0e+00
Identity = 637/638 (99.84%), Positives = 637/638 (99.84%), Query Frame = 1

Query: 1   MAPPNLLGPPELYHAAAPVSLQPTESTPSGDPFVDAMVANFNKTDDSLPPMGFTENMSAT 60
           MAPPNLLGPPELYHAAAPVSLQPTESTPSGDPFVDAMVANFNKTDDSLPPMGFTENMSAT
Sbjct: 1   MAPPNLLGPPELYHAAAPVSLQPTESTPSGDPFVDAMVANFNKTDDSLPPMGFTENMSAT 60

Query: 61  FLSTGNPCLDFFFHVVPDTPANSLIDRLSLAWNHNPLMTLKLICNLRGVRGTGKSDKEGY 120
           FLSTGNPCLDFFFHVVPDTPANSLIDRLSLAWNHNPLMTLKLICNLRGVRGTGKSDKEGY
Sbjct: 61  FLSTGNPCLDFFFHVVPDTPANSLIDRLSLAWNHNPLMTLKLICNLRGVRGTGKSDKEGY 120

Query: 121 YTAALWLYNFHPKTLAGNIPSIADFGYFKDLPEILYRLLEGSDVRKNQKNEWKRRGLSVR 180
           YTAALWLYNFHPKTLAGNIPSIADFGYFKDLPEILYRLLEGSDVRKNQKNEWKRRGLSVR
Sbjct: 121 YTAALWLYNFHPKTLAGNIPSIADFGYFKDLPEILYRLLEGSDVRKNQKNEWKRRGLSVR 180

Query: 181 HGRFKQEKPKTRKKEIQSSTDREANISKAMEKSRIEKEKASGERKLRKVSMARKVMERFQ 240
           HGRFKQEKPKTRKKEIQSSTDREANISKAMEKSRIEKEKASGERKLRKVSMARKVMERFQ
Sbjct: 181 HGRFKQEKPKTRKKEIQSSTDREANISKAMEKSRIEKEKASGERKLRKVSMARKVMERFQ 240

Query: 241 ADSNFQLLHDRISDFFTDCLKSDLQFMNSGDFTKISLAAKWCPSIDSSFDRSTLLCESIA 300
           ADSNFQLLHDRISDFFTDCLKSDLQFMNSGDFTKISLAAKWCPSIDSSFDRSTLLCESIA
Sbjct: 241 ADSNFQLLHDRISDFFTDCLKSDLQFMNSGDFTKISLAAKWCPSIDSSFDRSTLLCESIA 300

Query: 301 RKIFPRELNPEYKEIEEAHYAYRVRDRLRTDVLVPLRKVLELPEVFIGANRWDSIPYNRV 360
           RKIFPRELNPEYKEIEEAHYAYRVRDRLRTDVLVPLRKVLELPEVFIGANRWDSIPYNRV
Sbjct: 301 RKIFPRELNPEYKEIEEAHYAYRVRDRLRTDVLVPLRKVLELPEVFIGANRWDSIPYNRV 360

Query: 361 ASVAMKNYKEKFMKHDGERFAQYLKDVKDGKTKIAAGALLPHEIILSLFDGQEDGGEVAE 420
           ASVAMKNYKEKFMKHDGERFAQYLKDVKDGKTKIAAGALLPHEIILSLFDGQEDGGEVAE
Sbjct: 361 ASVAMKNYKEKFMKHDGERFAQYLKDVKDGKTKIAAGALLPHEIILSLFDGQEDGGEVAE 420

Query: 421 LQWKRMVDDLLKKGKLRECIAVCDVSGSMMGIPMDVCVGLGLLVSELSEDPWKGKVITFS 480
           LQWKRMVDDLLKKGKLRECIAVCDVSGSMMGIPMDVCVGLGLLVSELSEDPWKGKVITFS
Sbjct: 421 LQWKRMVDDLLKKGKLRECIAVCDVSGSMMGIPMDVCVGLGLLVSELSEDPWKGKVITFS 480

Query: 481 ANPELHMIQGDSLKSKAEFVKSMDWGGNTDFQKVFDQILKVAVDGKLKEEQMIKRVFVFS 540
           ANPELHMIQGDSLKSKAEFVKSMDWGGNTDFQKVFDQILKVAVDGKLKEEQMIKRVFVFS
Sbjct: 481 ANPELHMIQGDSLKSKAEFVKSMDWGGNTDFQKVFDQILKVAVDGKLKEEQMIKRVFVFS 540

Query: 541 DMEFDQASQTSWETDYQVIVRKFTEKGYGSAVPQIVFWNLRDSRATPVPCNEKGVALVSG 600
           DMEFDQASQTSWETDYQVIVRKFTEKGYGSAVPQIVFWNLRDSRATPVP NEKGVALVSG
Sbjct: 541 DMEFDQASQTSWETDYQVIVRKFTEKGYGSAVPQIVFWNLRDSRATPVPSNEKGVALVSG 600

Query: 601 YSKNLMNLFLDGDGVIQPEAVMEKAISGNEYQKLVVLD 639
           YSKNLMNLFLDGDGVIQPEAVMEKAISGNEYQKLVVLD
Sbjct: 601 YSKNLMNLFLDGDGVIQPEAVMEKAISGNEYQKLVVLD 638

BLAST of CSPI04G20580.1 vs. NCBI nr
Match: gi|659083104|ref|XP_008442184.1| (PREDICTED: uncharacterized protein LOC103486117 [Cucumis melo])

HSP 1 Score: 1187.6 bits (3071), Expect = 0.0e+00
Identity = 601/676 (88.91%), Positives = 620/676 (91.72%), Query Frame = 1

Query: 1   MAPPNLLGPPELYHAA--------------------APVSLQPTESTPSGDPFVDAMVAN 60
           MAPP+LLGPPELYHAA                    APVSLQPTESTPSG PFVDAM+AN
Sbjct: 1   MAPPSLLGPPELYHAASPVSLQPTESAPVSLQPTESAPVSLQPTESTPSGVPFVDAMLAN 60

Query: 61  FNK----TDDSLPPMGFTENMSATFLSTGNPCLDFFFHVVPDTPANSLIDRLSLAWNHNP 120
           FN     +DD+LPPMGFTENMSATFLSTGNPCLDFFFHVVPDTPANSLIDRLSLAWNHNP
Sbjct: 61  FNNINNHSDDNLPPMGFTENMSATFLSTGNPCLDFFFHVVPDTPANSLIDRLSLAWNHNP 120

Query: 121 LMTLKLICNLRGVRGTGKSDKEGYYTAALWLYNFHPKTLAGNIPSIADFGYFKDLPEILY 180
           LMTLKLICNLRGVRGTGKSDKEGYYTAALWLYNFHPKTLAGNIPSIADFGYFKDLPEILY
Sbjct: 121 LMTLKLICNLRGVRGTGKSDKEGYYTAALWLYNFHPKTLAGNIPSIADFGYFKDLPEILY 180

Query: 181 RLLEGSDVRKNQKNEW--------------KRRGLSVRHGRFKQEKPKTRKKEIQSSTDR 240
           RLLEGSDVRKNQK EW              +R GLSVR+G FKQEKPKTRKKEIQSS DR
Sbjct: 181 RLLEGSDVRKNQKKEWGERKGKSRKRLSSPRRGGLSVRYGSFKQEKPKTRKKEIQSSIDR 240

Query: 241 EANISKAMEKSRIEKEKASGERKLRKVSMARKVMERFQADSNFQLLHDRISDFFTDCLKS 300
           EANISKAMEKSRIEKEKAS ERKLRKVSMARKVMERFQ+D NFQLLHDRISDFFTDCLKS
Sbjct: 241 EANISKAMEKSRIEKEKASAERKLRKVSMARKVMERFQSDPNFQLLHDRISDFFTDCLKS 300

Query: 301 DLQFMNSGDFTKISLAAKWCPSIDSSFDRSTLLCESIARKIFPRELNPEYKEIEEAHYAY 360
           DLQFMNSGDFT+ISLAAKWCPS+DSSFDRSTLLCESIARK+FPRE +PEY+ IEEAHYAY
Sbjct: 301 DLQFMNSGDFTRISLAAKWCPSVDSSFDRSTLLCESIARKVFPRESDPEYEGIEEAHYAY 360

Query: 361 RVRDRLRTDVLVPLRKVLELPEVFIGANRWDSIPYNRVASVAMKNYKEKFMKHDGERFAQ 420
           RVRDRLR DVLVPLRKVLELPEV+IGANRWDSIPYNRVASVAMKNYKEKFMKHDGERFAQ
Sbjct: 361 RVRDRLRKDVLVPLRKVLELPEVYIGANRWDSIPYNRVASVAMKNYKEKFMKHDGERFAQ 420

Query: 421 YLKDVKDGKTKIAAGALLPHEIILSLFDGQEDGGEVAELQWKRMVDDLLKKGKLRECIAV 480
           YLKDVKDGKTKIAAGALLPHEII+SLFDGQEDGGEVAELQWKRMVDDLLKKGKLR+CIAV
Sbjct: 421 YLKDVKDGKTKIAAGALLPHEIIMSLFDGQEDGGEVAELQWKRMVDDLLKKGKLRDCIAV 480

Query: 481 CDVSGSMMGIPMDVCVGLGLLVSELSEDPWKGKVITFSANPELHMIQGDSLKSKAEFVKS 540
           CDVSGSM GIPMDVC+ LGLLVSELSEDPWKGKVITFSANPELH+IQGDSLKSKAEFVK+
Sbjct: 481 CDVSGSMEGIPMDVCIALGLLVSELSEDPWKGKVITFSANPELHVIQGDSLKSKAEFVKT 540

Query: 541 MDWGGNTDFQKVFDQILKVAVDGKLKEEQMIKRVFVFSDMEFDQASQTSWETDYQVIVRK 600
           M WG NTDFQKVFDQILKVAVDGKLKEEQMIKRVFVFSDMEFDQAS TSWETDYQVIVRK
Sbjct: 541 MHWGVNTDFQKVFDQILKVAVDGKLKEEQMIKRVFVFSDMEFDQASATSWETDYQVIVRK 600

Query: 601 FTEKGYGSAVPQIVFWNLRDSRATPVPCNEKGVALVSGYSKNLMNLFLDGDGVIQPEAVM 639
           FTEKGYGSAVPQIVFWNLRDSRATPVP  EKGVALVSGYSKNLMNLFLDGDGVIQPEAVM
Sbjct: 601 FTEKGYGSAVPQIVFWNLRDSRATPVPGKEKGVALVSGYSKNLMNLFLDGDGVIQPEAVM 660

BLAST of CSPI04G20580.1 vs. NCBI nr
Match: gi|224075499|ref|XP_002304655.1| (hypothetical protein POPTR_0003s16360g [Populus trichocarpa])

HSP 1 Score: 897.9 bits (2319), Expect = 1.0e-257
Identity = 461/655 (70.38%), Positives = 530/655 (80.92%), Query Frame = 1

Query: 1   MAPPNLLGPPELYHAAAPVSLQPTESTPSGDPFVDAMVANFNKTD-DSLPPMGFTENMSA 60
           MAPP+LLGPPE+     P   Q   +T   +PFVD MV NFNKT  + LP MG+TENMSA
Sbjct: 1   MAPPSLLGPPEI-KKPVPTPQQQAPTTVR-NPFVDLMVDNFNKTTVNQLPQMGYTENMSA 60

Query: 61  TFLSTGNPCLDFFFHVVPDTPANSLIDRLSLAWNHNPLMTLKLICNLRGVRGTGKSDKEG 120
           TFLS+GNPCLD FFHVVP+TP  SL  RL  AWNHNPL TLKLICNLRGVRGTGKSDKEG
Sbjct: 61  TFLSSGNPCLDLFFHVVPNTPPESLQKRLHSAWNHNPLTTLKLICNLRGVRGTGKSDKEG 120

Query: 121 YYTAALWLYNFHPKTLAGNIPSIADFGYFKDLPEILYRLLEGSDVRKNQKNEWKRRG--L 180
           +YT+A+WL+N HPKTLA NIPS+ADFGYFKDLPEILYRLLEG DVRK QK EW++R    
Sbjct: 121 FYTSAIWLHNNHPKTLACNIPSMADFGYFKDLPEILYRLLEGPDVRKIQKQEWRQRKGRK 180

Query: 181 SVRHGRFKQEKPKT--------RKKEIQSSTDR------EANISKAMEKSRIEKEKASGE 240
           + R   FK  +PKT        R K  +SS +          I     ++ +EKE AS  
Sbjct: 181 TGRRAGFKIGQPKTLAPFQRSKRPKNAKSSRNAGPSIPIHIRIQNEKRRAEMEKENASIA 240

Query: 241 RKLRKVSMARKVMERFQADSNFQLLHDRISDFFTDCLKSDLQFMNSGDFTKISLAAKWCP 300
           RK R+ +MA+KV+ER+  D +++ L++ +SDFF  CLK+D+Q +NS + TK+SLAAKWCP
Sbjct: 241 RKERRAAMAKKVIERYSHDPDYRFLYEGVSDFFAGCLKTDMQHLNSSNTTKVSLAAKWCP 300

Query: 301 SIDSSFDRSTLLCESIARKIFPRELNPEYKEIEEAHYAYRVRDRLRTDVLVPLRKVLELP 360
           SIDSSFDRSTLLCESIARK+FPRE  PEY+ IEEAHYAYRVRDRLR +VLVPLRKVLELP
Sbjct: 301 SIDSSFDRSTLLCESIARKVFPRESYPEYEGIEEAHYAYRVRDRLRKEVLVPLRKVLELP 360

Query: 361 EVFIGANRWDSIPYNRVASVAMKNYKEKFMKHDGERFAQYLKDVKDGKTKIAAGALLPHE 420
           EV+IGANRWDSIPYNRVASVAMK YK+KF KHD ERF QYL+DVK GKTKIAAGALLPHE
Sbjct: 361 EVYIGANRWDSIPYNRVASVAMKFYKKKFFKHDAERFRQYLEDVKAGKTKIAAGALLPHE 420

Query: 421 IILSLFDGQEDGGEVAELQWKRMVDDLLKKGKLRECIAVCDVSGSMMGIPMDVCVGLGLL 480
           II SL D  +DGGEVAELQWKR+VDDLL+KGK++ CIAVCDVSGSM G PM+V V LGLL
Sbjct: 421 IIESLND--DDGGEVAELQWKRIVDDLLQKGKMKNCIAVCDVSGSMSGTPMEVSVALGLL 480

Query: 481 VSELSEDPWKGKVITFSANPELHMIQGDSLKSKAEFVKSMDWGGNTDFQKVFDQILKVAV 540
           VSEL E+PWKGK+ITFS NP L M++GDSL  K EFV+SM+WG NT+FQKVFD IL+VAV
Sbjct: 481 VSELCEEPWKGKLITFSQNPMLQMVEGDSLLQKTEFVRSMEWGMNTNFQKVFDLILQVAV 540

Query: 541 DGKLKEEQMIKRVFVFSDMEFDQASQTSWETDYQVIVRKFTEKGYGSAVPQIVFWNLRDS 600
           +G L+E+QMIKRVFVFSDMEFDQAS   WETDYQVI RKFTEKGYG+ +P+IVFWNLRDS
Sbjct: 541 NGNLREDQMIKRVFVFSDMEFDQASCNPWETDYQVIARKFTEKGYGNVIPEIVFWNLRDS 600

Query: 601 RATPVPCNEKGVALVSGYSKNLMNLFLDGDGVIQPEAVMEKAISGNEYQKLVVLD 639
           RATPVP  +KGVALVSG+SKNLM LFLDGDG I PEAVM++AI+G EYQKLVVLD
Sbjct: 601 RATPVPGTQKGVALVSGFSKNLMKLFLDGDGEISPEAVMKEAIAGEEYQKLVVLD 651

BLAST of CSPI04G20580.1 vs. NCBI nr
Match: gi|1012037830|ref|XP_015953619.1| (PREDICTED: uncharacterized protein LOC107478027 [Arachis duranensis])

HSP 1 Score: 894.0 bits (2309), Expect = 1.5e-256
Identity = 444/641 (69.27%), Positives = 520/641 (81.12%), Query Frame = 1

Query: 6   LLGPPELYHAAAPVSLQPTESTPS------GDPFVDAMVANFNK--TDDSLPPMGFTENM 65
           L+GPPE+Y+      L PT +  +       DPF+D MV+ FN   T    PPMGFTEN 
Sbjct: 4   LIGPPEIYNPKPQSFLTPTSTATTTTPIAPSDPFIDVMVSKFNTITTIQPQPPMGFTENN 63

Query: 66  SATFLSTGNPCLDFFFHVVPDTPANSLIDRLSLAWNHNPLMTLKLICNLRGVRGTGKSDK 125
           SATFLS+GNPCLDFFFHVVPDTP +S+ +RL +AW HNPL TLKLICNLRGVRGTGKSD+
Sbjct: 64  SATFLSSGNPCLDFFFHVVPDTPPDSVSERLHVAWAHNPLTTLKLICNLRGVRGTGKSDR 123

Query: 126 EGYYTAALWLYNFHPKTLAGNIPSIADFGYFKDLPEILYRLLEGSDVRKNQKNEWKRRGL 185
           +G+YTAA WL++ HPKTLA N+PS+ADFGYFKDLPEILYRLLEGSDVR +QK  W     
Sbjct: 124 DGFYTAATWLFSNHPKTLAANVPSLADFGYFKDLPEILYRLLEGSDVRSDQKQRWLSVKR 183

Query: 186 SVRHGRFKQEKPKTRKKEIQSSTDREANISKAMEKSRIEKEKASGERKLRKVSMARKVME 245
           S +  R K+   K   K +Q  +D  A           EKEKA   R+ RK++MA+K+++
Sbjct: 184 SSKRNRLKRRPFKA--KPLQKVSDPVA-----------EKEKAHALREERKLAMAKKLLD 243

Query: 246 RFQADSNFQLLHDRISDFFTDCLKSDLQFMNSGDFTKISLAAKWCPSIDSSFDRSTLLCE 305
           R+ +D NF+LLHD +SD F DCL++DLQ +NSG  TKISLAAKWCPS+DSSFDRSTLLCE
Sbjct: 244 RYNSDENFRLLHDSVSDHFADCLENDLQNLNSGALTKISLAAKWCPSVDSSFDRSTLLCE 303

Query: 306 SIARKIFPRELNPEYKEIEEAHYAYRVRDRLRTDVLVPLRKVLELPEVFIGANRWDSIPY 365
           +IA +IFPR  NPEY+ IEEAHY YRVRDRLR DVLVPLRKVLELPEVF+GANRWDSIPY
Sbjct: 304 TIATRIFPRNGNPEYEGIEEAHYVYRVRDRLRKDVLVPLRKVLELPEVFMGANRWDSIPY 363

Query: 366 NRVASVAMKNYKEKFMKHDGERFAQYLKDVKDGKTKIAAGALLPHEIILSLFDGQEDGGE 425
           NRVASVAMK YKEKF+KHD ERF +YL+DVK GKT IAAGALLPHEII SL DG  DGGE
Sbjct: 364 NRVASVAMKLYKEKFLKHDKERFEKYLEDVKSGKTTIAAGALLPHEIIRSLGDG--DGGE 423

Query: 426 VAELQWKRMVDDLLKKGKLRECIAVCDVSGSMMGIPMDVCVGLGLLVSELSEDPWKGKVI 485
           VAELQW RMV D+L KGK++ C+AVCDVSGSM G+PM+V V LGLLVSEL+E+PWKGKVI
Sbjct: 424 VAELQWSRMVSDMLSKGKMKNCLAVCDVSGSMDGVPMEVSVALGLLVSELNEEPWKGKVI 483

Query: 486 TFSANPELHMIQGDSLKSKAEFVKSMDWGGNTDFQKVFDQILKVAVDGKLKEEQMIKRVF 545
           TFS  P+LH+I+G+ L+SKAEF++ M+WGGNTDFQ VFD+IL+VAV+GKLK +QMIKRVF
Sbjct: 484 TFSEEPKLHLIEGEDLRSKAEFIREMEWGGNTDFQAVFDRILEVAVNGKLKADQMIKRVF 543

Query: 546 VFSDMEFDQASQTSWETDYQVIVRKFTEKGYGSAVPQIVFWNLRDSRATPVPCNEKGVAL 605
           VFSDMEFDQAS   WETDYQ I+RK++EKGYGSAVPQIVFWNLRDSRATPVP  ++GVAL
Sbjct: 544 VFSDMEFDQASANPWETDYQAIIRKYSEKGYGSAVPQIVFWNLRDSRATPVPSTQQGVAL 603

Query: 606 VSGYSKNLMNLFLDGDGVIQPEAVMEKAISGNEYQKLVVLD 639
           VSG+SKNL++LF+D DG I PEA ME AI+G EYQKLVVLD
Sbjct: 604 VSGFSKNLLSLFMDNDGEISPEAAMETAIAGPEYQKLVVLD 629

BLAST of CSPI04G20580.1 vs. NCBI nr
Match: gi|743819271|ref|XP_011020843.1| (PREDICTED: uncharacterized protein LOC105123074 [Populus euphratica])

HSP 1 Score: 892.9 bits (2306), Expect = 3.3e-256
Identity = 456/656 (69.51%), Positives = 531/656 (80.95%), Query Frame = 1

Query: 1   MAPPNLLGPPELYHAAAPVSLQPTESTPSGDPFVDAMVANFNKTD-DSLPPMGFTENMSA 60
           MAPP+LLGPPE+       + Q   ST   +PFVD MV NFNKT  + LP MG+TENMSA
Sbjct: 1   MAPPSLLGPPEIKKPMP--TPQQEASTTVRNPFVDLMVDNFNKTTVNQLPQMGYTENMSA 60

Query: 61  TFLSTGNPCLDFFFHVVPDTPANSLIDRLSLAWNHNPLMTLKLICNLRGVRGTGKSDKEG 120
           TFLS+GNPCLD FFHVVP+TP  SL  RL  AWNHNPL TLKLICNLRGVRGTGKSDKEG
Sbjct: 61  TFLSSGNPCLDLFFHVVPNTPPESLKRRLHSAWNHNPLTTLKLICNLRGVRGTGKSDKEG 120

Query: 121 YYTAALWLYNFHPKTLAGNIPSIADFGYFKDLPEILYRLLEGSDVRKNQKNEWKRRG--L 180
           +YT+A+WL+N HPKTLA NIPS+ADFGYFKDLPEILYRLLEG DVRK QK EW++R    
Sbjct: 121 FYTSAIWLHNNHPKTLACNIPSMADFGYFKDLPEILYRLLEGPDVRKIQKQEWRQRKGRK 180

Query: 181 SVRHGRFKQEKPKT-------RKKEIQSSTDREAN--------ISKAMEKSRIEKEKASG 240
           + R   FK  +PKT       +K+   + + R A         I     ++ +EKE AS 
Sbjct: 181 TGRRAGFKIGQPKTPAPFQRNKKRPENAQSSRNAGPSIPIHIRIQNEKRRAEMEKENASI 240

Query: 241 ERKLRKVSMARKVMERFQADSNFQLLHDRISDFFTDCLKSDLQFMNSGDFTKISLAAKWC 300
            RK R+ +MA+KV+ER+  D +++ L++ +SDFF  CLK+D+Q +NS +  K+SLAAKWC
Sbjct: 241 ARKERRAAMAKKVIERYSHDPDYRFLYEGVSDFFAGCLKTDMQHLNSSNTRKVSLAAKWC 300

Query: 301 PSIDSSFDRSTLLCESIARKIFPRELNPEYKEIEEAHYAYRVRDRLRTDVLVPLRKVLEL 360
           PSIDSSFDRSTLLCESIARK+FPRE  PEY+ I+EAHYAYRVRDRLR +VLVPLRKVLEL
Sbjct: 301 PSIDSSFDRSTLLCESIARKVFPRESYPEYEGIKEAHYAYRVRDRLRKEVLVPLRKVLEL 360

Query: 361 PEVFIGANRWDSIPYNRVASVAMKNYKEKFMKHDGERFAQYLKDVKDGKTKIAAGALLPH 420
           PEV+IGANRWDSIPYNRVASVAMK YK+KF+KHD ERF QYL+DVK GKTKIAAGALLPH
Sbjct: 361 PEVYIGANRWDSIPYNRVASVAMKFYKKKFLKHDAERFRQYLEDVKAGKTKIAAGALLPH 420

Query: 421 EIILSLFDGQEDGGEVAELQWKRMVDDLLKKGKLRECIAVCDVSGSMMGIPMDVCVGLGL 480
           EII SL D  +DGGEV+ELQWKR+VDDLL+KGK++ CIAVCDVSGSM G PM+V V LGL
Sbjct: 421 EIIGSLND--DDGGEVSELQWKRIVDDLLQKGKMKNCIAVCDVSGSMSGTPMEVSVALGL 480

Query: 481 LVSELSEDPWKGKVITFSANPELHMIQGDSLKSKAEFVKSMDWGGNTDFQKVFDQILKVA 540
           LVSEL E+PWKGK+ITFS NP L M++GDSL  K EFV+SM+WG NT+FQKVFD IL+VA
Sbjct: 481 LVSELCEEPWKGKLITFSQNPMLQMVEGDSLLQKTEFVRSMEWGMNTNFQKVFDLILQVA 540

Query: 541 VDGKLKEEQMIKRVFVFSDMEFDQASQTSWETDYQVIVRKFTEKGYGSAVPQIVFWNLRD 600
           V+G L+E+QMIKRVFVFSDMEFD+AS   WETDYQVI RKFTEKGYG+ +P+IVFWNLRD
Sbjct: 541 VNGNLREDQMIKRVFVFSDMEFDRASCNPWETDYQVIARKFTEKGYGNVIPEIVFWNLRD 600

Query: 601 SRATPVPCNEKGVALVSGYSKNLMNLFLDGDGVIQPEAVMEKAISGNEYQKLVVLD 639
           SRATPVP  +KGVALVSG+SKNLM LFLDGDG I PEAVM++AI+G EYQKLVVLD
Sbjct: 601 SRATPVPGTQKGVALVSGFSKNLMKLFLDGDGEISPEAVMKEAIAGEEYQKLVVLD 652

The following BLAST results are available for this feature:
Match Name    E-value   Identity (%)  Description
YL728_MIMIV   4.4e-51   30.87         Uncharacterized protein L728 OS=Acanthamoeba polyphaga mimivirus GN=MIMI_L728 PE...

Match Name        E-value    Identity (%)  Description
A0A0A0L2K6_CUCSA  0.0e+00    99.84         Uncharacterized protein OS=Cucumis sativus GN=Csa_4G538590 PE=4 SV=1
B9GZA8_POPTR      7.2e-258   70.38         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0003s16360g PE=4 SV=1
A0A067JHF5_JATCU  1.5e-250   66.16         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_23115 PE=4 SV=1
G7JY31_MEDTR      2.1e-249   65.77         Plant/T31B5-30 protein OS=Medicago truncatula GN=MTR_5g045160 PE=4 SV=2
M5WER8_PRUPE      1.0e-248   67.81         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa020333mg PE=4 SV=1

Match Name   E-value    Identity (%)  Description
AT5G13210.1  4.9e-234   61.97         Uncharacterised conserved protein UCP015417, vWA
AT5G43400.1  1.7e-213   56.95         Uncharacterised conserved protein UCP015417, vWA
AT5G43390.1  1.0e-207   56.06         Uncharacterised conserved protein UCP015417, vWA
AT3G24780.1  3.3e-206   61.00         Uncharacterised conserved protein UCP015417, vWA

Match Name                         E-value    Identity (%)  Description
gi|449453862|ref|XP_004144675.1|   0.0e+00    99.84         PREDICTED: uncharacterized protein LOC101205449 [Cucumis sativus]
gi|659083104|ref|XP_008442184.1|   0.0e+00    88.91         PREDICTED: uncharacterized protein LOC103486117 [Cucumis melo]
gi|224075499|ref|XP_002304655.1|   1.0e-257   70.38         hypothetical protein POPTR_0003s16360g [Populus trichocarpa]
gi|1012037830|ref|XP_015953619.1|  1.5e-256   69.27         PREDICTED: uncharacterized protein LOC107478027 [Arachis duranensis]
gi|743819271|ref|XP_011020843.1|   3.3e-256   69.51         PREDICTED: uncharacterized protein LOC105123074 [Populus euphratica]

The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term       Definition
IPR002035  VWF_A
IPR011205  UCP015417_vWA
IPR024553  DUF2828
GO Assignments
This mRNA is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

This mRNA is a part of the following gene feature(s):

Feature Name  Unique Name   Type
CSPI04G20580  CSPI04G20580  gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name    Unique Name             Type
CSPI04G20580.1  CSPI04G20580.1-protein  polypeptide


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name           Unique Name            Type
CSPI04G20580.1.utr5p1  CSPI04G20580.1.utr5p1  five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature Name         Unique Name          Type
CSPI04G20580.1.cds1  CSPI04G20580.1.cds1  CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name           Unique Name            Type
CSPI04G20580.1.utr3p1  CSPI04G20580.1.utr3p1  three_prime_UTR


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description                                   Source   Source Term  Source Description  Alignment
IPR002035  von Willebrand factor, type A                     unknown  SSF53300     vWA-like            coord: 439..567; score: 8.5
IPR011205  Uncharacterised conserved protein UCP015417, vWA  PIR      PIRSF015417  T31B5_30_vWA        coord: 1..551; score: 2.6E
IPR024553  Domain of unknown function DUF2828                PFAM     PF11443      DUF2828             coord: 54..620; score: 9.0E
None       No IPR available                                  PANTHER  PTHR31373    FAMILY NOT NAMED    coord: 1..638; score:
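
The `coord: start..end` values above are inclusive residue ranges on the 638-residue polypeptide, so the length of each annotated region is `end - start + 1`. The following is a minimal sketch of that arithmetic; the helper name `span_length` is hypothetical, and it assumes 1-based inclusive coordinates written in the `coord: N..M` form shown on this page.

```python
import re

# Matches alignment coordinates written as "coord: 439..567".
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def span_length(text):
    """Return the number of residues covered by an inclusive coord range."""
    m = COORD.search(text)
    if m is None:
        raise ValueError("no coord range found in %r" % text)
    start, end = int(m.group(1)), int(m.group(2))
    if end < start:
        raise ValueError("end precedes start in %r" % text)
    # Both endpoints are counted, hence the +1.
    return end - start + 1


if __name__ == "__main__":
    for entry in ("coord: 439..567", "coord: 1..551", "coord: 54..620", "coord: 1..638"):
        print(entry, "->", span_length(entry), "residues")
```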