CSPI03G19520 (gene) Wild cucumber (PI 183967)

Name: CSPI03G19520
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Midasin
Location: Chr3 : 15311578 .. 15314422 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

CCCCTAGAAAAATATAATGGGAATTTCCAACTAATAACAAAAATAGAGATGGATAGGAAACACAACACACTATCCCATTCCCTCTCTCTTCCTTTTCTCCCACAAAATCAAAACCTATCTTCCTCTTCTTATTCTTCTTTCCTCCTCCTCCCTTGCCTTCAATCATACTGTTTCACCACAGAAATCAAAACCCCATTTGCCCTAATGGCTACTCTTCACCGGTAACCATCTCTGTTTTCATGGATTCTGACCCCCATTTTCGTACCACAAGCACCAACAGTACCAGTTCCACCGCCACCCCCTCCAGCGAGCTCTTCATTTGCTTCACTTCTCGCTTCTCCTCTTCTTCTTCTTCCTCCATGAAGATCTCTTCCAAGTCCATTCTCAGTCCTGGCCGTCCCCGTGAACCCTCCCAAATCTCCCTCTCTACTTCCCTCAGTCGCCGTCTCAAATCCAGCGGTAGTCTCAAGGGTGGTCAAGCCTCCCCCATGTTTCCCACTGGCAGGAAGAAGCGTGGCTGTGCTTTTGATAATCCCGAACCCTCTTCGCCCAAGGTTACTTGTATTGGTCAGGTTAGGGTTAAGACGAAGAAGCAGGGGAAGAAGATGAGGGCTAGATCGCAGAAACGGAGGACTAATTCGGAGGCCAGTTTTCGGAGATCGGAGAGTCTTGTTCAATCGTCGCAGGGTAATGGTAGTGACCAGCAATTCTCGTCGCATCATAATCACCATCTTCTTCGTCAGAACAGTAATAGTAATGCTGGAAATGGTTTCCAGCAGGAATGCTTGTCGCATCGGAACCAGCGGTGGGTGCATTTGCCGTTTACGATTTGTGAGGCGCTTAGGGCTTTTGGTGCTGAACTCAACTGCTTCTTGCCGTGCCATTCGTCTTGTTCTGGTAACAGGGAGAATAATAAGGAATCGAAGCCGGCAGAGAGGTCGTCGGAGAGTGAGAGTTCTTGTGGGACGGTGTTTGCGCGGTGGTTGGTGGCGGTGCAGGATGGAGACGGGAAGGGGAGAGAGATCGAGCTGGTGGTTGGAGATGAAGAAACTCGAACGGAGAAGGAAAATGGAAGCCAGAGACGGCATGTTTTTGAGGGATTAGACTTCAAAGATAAGAATGAGGCTGTGGAGGAGGAGCAATCTAGGATCAGCATTTGCATTCCTCCGAAGAATGCTTTATTGCTAATGAGATGTAGATCTGATCCGGTGAAAATGGCGGAGCTGGCGAAACGATTCTGTGAACCTCCTGCGCCGAAAGTGGATGAAGAAGACGAGGAAGGAGAAGATGAAGACAATGAAGCAAAAAAGAGAGAAAATGAAGTGAAAAGAGATGTATCTGTGCCTGTGTCTTCCATTGTTACTGTAAATAAGGAAGAGGAAGAAGTAGAGGAAGAGGAAGATGAAAGAAAAGTGGAGCAGCTCATTGTGAAGCTTGAAAACGAGGAAGAAATGAATGAAGAATGTGTTTCTGATGCAGATAAAGAAAAGGAAGAAGCTAATTTGGTTTTACAGGAAGAAGAACGAGAAGAAGAAGAAGACAATGAAGAAGAGACCATAGAAATGGCCACAGAAAACGAAATCGATGAGCAAAAAGATATTACTGTTGTAAATCAGCTTAATCAAGAACAAGCACTGGAAGAAAAAGAAGAGGATAAAACCGATCAAGTTAATCAGCAAGAAACAATGGCGATTCCAATTCCGCTTCTGATTCAGACCCACTGTGAACCCGAAATGGCTCAAGATGTAGAGAAGCTGGAATCTGTCGAAAAAGAGGAACCCAAGCTATCCCATGAAAGCGAACAAGACCAGAAAACAGAAGAAGACGAAAACCTTAGAGAAGATAAGGAAGAAGAAGAAGAGGAAGAAGGCGAAAACGGCGAAAACGGCGAAACCACCACTTCACCATCATTATCAGTAGAGACAGAACCAGTTTCAGACGAAACCGAAACTGAAGTTGATGTGAATAGGGAAGAAGAAGAAGAAGAAGAAGAAGAGAAA
ACGACGGATGAAGGAATCGGACCCGATGACGAAAACGACGTATTAGTGGGTCCAGAGGAGGAGGACCAGTCCAAGGAGCGAGAAACTCCGCTGCCGGAGCCGGAATCAGAACCGGAACCGGAGAGAAAAACACAAACAGAAACATCCGTTCTCCCAGATTGCTTGCTGTTAATGATGTACGAGCCAAAGCTATCAATGGAGGTATCGAAGGAGACATGGGTTTGCAGCGCAGACTTCATAAGATGCGTTCCGACCAGGGAGAAGAAGGCGATCGGCAAAGACCCACCGCCGCCTCCGCCACCGAAGAAACGAGAAACGAAGCCGACGGACACTACGCAGACAGCGGTTGTTCAGCCAGCGAGATGGTCGTGTTCGTTTCCAGCGGCAGCGGCAGCGGCGGCGATGATAGAACAGAAGCTAGTAAGAGCCAAGGGTTACGAGCCGTTTGTTCTTACTAGATGCAAGTCGGAGCCGATGAGATCTTCGGCTAAGCTGGCGCCAGATGCTTGCTGTTGGAAGGATCGCAAGCTCGAGCCACACCGTCCGGCTACCTTCGGCGTCGGCGCGGCTGAAGTTGGATTTTGACCATTTCACCCTTCTCAACTCTTAAGGTAAAAAAGAAAAGGACTAGAAAGAAATAGGAATTTGTATAAATAGTGTGGTAGCAATTTGTATTTTTCCAGAAAAAAAAAAGGTATTTTGCCATTGTAGGTTTTGCATAATTCTGCCTTTTTGCAATCTCCCCCCCCTCTCCATCACATGTTTTTCTTTTTGCCCCTCATATTTATTATATTGGAAAATTAGTTTTAAACGTTTATATGATCATATCATCAACTTTACCCCAA

mRNA sequence

ATGGATTCTGACCCCCATTTTCGTACCACAAGCACCAACAGTACCAGTTCCACCGCCACCCCCTCCAGCGAGCTCTTCATTTGCTTCACTTCTCGCTTCTCCTCTTCTTCTTCTTCCTCCATGAAGATCTCTTCCAAGTCCATTCTCAGTCCTGGCCGTCCCCGTGAACCCTCCCAAATCTCCCTCTCTACTTCCCTCAGTCGCCGTCTCAAATCCAGCGGTAGTCTCAAGGGTGGTCAAGCCTCCCCCATGTTTCCCACTGGCAGGAAGAAGCGTGGCTGTGCTTTTGATAATCCCGAACCCTCTTCGCCCAAGGTTACTTGTATTGGTCAGGTTAGGGTTAAGACGAAGAAGCAGGGGAAGAAGATGAGGGCTAGATCGCAGAAACGGAGGACTAATTCGGAGGCCAGTTTTCGGAGATCGGAGAGTCTTGTTCAATCGTCGCAGGGTAATGGTAGTGACCAGCAATTCTCGTCGCATCATAATCACCATCTTCTTCGTCAGAACAGTAATAGTAATGCTGGAAATGGTTTCCAGCAGGAATGCTTGTCGCATCGGAACCAGCGGTGGGTGCATTTGCCGTTTACGATTTGTGAGGCGCTTAGGGCTTTTGGTGCTGAACTCAACTGCTTCTTGCCGTGCCATTCGTCTTGTTCTGGTAACAGGGAGAATAATAAGGAATCGAAGCCGGCAGAGAGGTCGTCGGAGAGTGAGAGTTCTTGTGGGACGGTGTTTGCGCGGTGGTTGGTGGCGGTGCAGGATGGAGACGGGAAGGGGAGAGAGATCGAGCTGGTGGTTGGAGATGAAGAAACTCGAACGGAGAAGGAAAATGGAAGCCAGAGACGGCATGTTTTTGAGGGATTAGACTTCAAAGATAAGAATGAGGCTGTGGAGGAGGAGCAATCTAGGATCAGCATTTGCATTCCTCCGAAGAATGCTTTATTGCTAATGAGATGTAGATCTGATCCGGTGAAAATGGCGGAGCTGGCGAAACGATTCTGTGAACCTCCTGCGCCGAAAGTGGATGAAGAAGACGAGGAAGGAGAAGATGAAGACAATGAAGCAAAAAAGAGAGAAAATGAAGTGAAAAGAGATGTATCTGTGCCTGTGTCTTCCATTGTTACTGTAAATAAGGAAGAGGAAGAAGTAGAGGAAGAGGAAGATGAAAGAAAAGTGGAGCAGCTCATTGTGAAGCTTGAAAACGAGGAAGAAATGAATGAAGAATGTGTTTCTGATGCAGATAAAGAAAAGGAAGAAGCTAATTTGGTTTTACAGGAAGAAGAACGAGAAGAAGAAGAAGACAATGAAGAAGAGACCATAGAAATGGCCACAGAAAACGAAATCGATGAGCAAAAAGATATTACTGTTGTAAATCAGCTTAATCAAGAACAAGCACTGGAAGAAAAAGAAGAGGATAAAACCGATCAAGTTAATCAGCAAGAAACAATGGCGATTCCAATTCCGCTTCTGATTCAGACCCACTGTGAACCCGAAATGGCTCAAGATGTAGAGAAGCTGGAATCTGTCGAAAAAGAGGAACCCAAGCTATCCCATGAAAGCGAACAAGACCAGAAAACAGAAGAAGACGAAAACCTTAGAGAAGATAAGGAAGAAGAAGAAGAGGAAGAAGGCGAAAACGGCGAAAACGGCGAAACCACCACTTCACCATCATTATCAGTAGAGACAGAACCAGTTTCAGACGAAACCGAAACTGAAGTTGATGTGAATAGGGAAGAAGAAGAAGAAGAAGAAGAAGAGAAAACGACGGATGAAGGAATCGGACCCGATGACGAAAACGACGTATTAGTGGGTCCAGAGGAGGAGGACCAGTCCAAGGAGCGAGAAACTCCGCTGCCGGAGCCGGAATCAGAACCGGAACCGGAGAGAAAAACACAAACAGAAACATCCGTTCTCCCAGATTGCTTGCTGTTAATGATGTACGAGCCAAAGCTATCAATGGAGGTATCGAAGGAGACATGGGTTTGCAGCGCAGACTTCAT
AAGATGCGTTCCGACCAGGGAGAAGAAGGCGATCGGCAAAGACCCACCGCCGCCTCCGCCACCGAAGAAACGAGAAACGAAGCCGACGGACACTACGCAGACAGCGGTTGTTCAGCCAGCGAGATGGTCGTGTTCGTTTCCAGCGGCAGCGGCAGCGGCGGCGATGATAGAACAGAAGCTAGTAAGAGCCAAGGGTTACGAGCCGTTTGTTCTTACTAGATGCAAGTCGGAGCCGATGAGATCTTCGGCTAAGCTGGCGCCAGATGCTTGCTGTTGGAAGGATCGCAAGCTCGAGCCACACCGTCCGGCTACCTTCGGCGTCGGCGCGGCTGAAGTTGGATTTTGA

Coding sequence (CDS)

ATGGATTCTGACCCCCATTTTCGTACCACAAGCACCAACAGTACCAGTTCCACCGCCACCCCCTCCAGCGAGCTCTTCATTTGCTTCACTTCTCGCTTCTCCTCTTCTTCTTCTTCCTCCATGAAGATCTCTTCCAAGTCCATTCTCAGTCCTGGCCGTCCCCGTGAACCCTCCCAAATCTCCCTCTCTACTTCCCTCAGTCGCCGTCTCAAATCCAGCGGTAGTCTCAAGGGTGGTCAAGCCTCCCCCATGTTTCCCACTGGCAGGAAGAAGCGTGGCTGTGCTTTTGATAATCCCGAACCCTCTTCGCCCAAGGTTACTTGTATTGGTCAGGTTAGGGTTAAGACGAAGAAGCAGGGGAAGAAGATGAGGGCTAGATCGCAGAAACGGAGGACTAATTCGGAGGCCAGTTTTCGGAGATCGGAGAGTCTTGTTCAATCGTCGCAGGGTAATGGTAGTGACCAGCAATTCTCGTCGCATCATAATCACCATCTTCTTCGTCAGAACAGTAATAGTAATGCTGGAAATGGTTTCCAGCAGGAATGCTTGTCGCATCGGAACCAGCGGTGGGTGCATTTGCCGTTTACGATTTGTGAGGCGCTTAGGGCTTTTGGTGCTGAACTCAACTGCTTCTTGCCGTGCCATTCGTCTTGTTCTGGTAACAGGGAGAATAATAAGGAATCGAAGCCGGCAGAGAGGTCGTCGGAGAGTGAGAGTTCTTGTGGGACGGTGTTTGCGCGGTGGTTGGTGGCGGTGCAGGATGGAGACGGGAAGGGGAGAGAGATCGAGCTGGTGGTTGGAGATGAAGAAACTCGAACGGAGAAGGAAAATGGAAGCCAGAGACGGCATGTTTTTGAGGGATTAGACTTCAAAGATAAGAATGAGGCTGTGGAGGAGGAGCAATCTAGGATCAGCATTTGCATTCCTCCGAAGAATGCTTTATTGCTAATGAGATGTAGATCTGATCCGGTGAAAATGGCGGAGCTGGCGAAACGATTCTGTGAACCTCCTGCGCCGAAAGTGGATGAAGAAGACGAGGAAGGAGAAGATGAAGACAATGAAGCAAAAAAGAGAGAAAATGAAGTGAAAAGAGATGTATCTGTGCCTGTGTCTTCCATTGTTACTGTAAATAAGGAAGAGGAAGAAGTAGAGGAAGAGGAAGATGAAAGAAAAGTGGAGCAGCTCATTGTGAAGCTTGAAAACGAGGAAGAAATGAATGAAGAATGTGTTTCTGATGCAGATAAAGAAAAGGAAGAAGCTAATTTGGTTTTACAGGAAGAAGAACGAGAAGAAGAAGAAGACAATGAAGAAGAGACCATAGAAATGGCCACAGAAAACGAAATCGATGAGCAAAAAGATATTACTGTTGTAAATCAGCTTAATCAAGAACAAGCACTGGAAGAAAAAGAAGAGGATAAAACCGATCAAGTTAATCAGCAAGAAACAATGGCGATTCCAATTCCGCTTCTGATTCAGACCCACTGTGAACCCGAAATGGCTCAAGATGTAGAGAAGCTGGAATCTGTCGAAAAAGAGGAACCCAAGCTATCCCATGAAAGCGAACAAGACCAGAAAACAGAAGAAGACGAAAACCTTAGAGAAGATAAGGAAGAAGAAGAAGAGGAAGAAGGCGAAAACGGCGAAAACGGCGAAACCACCACTTCACCATCATTATCAGTAGAGACAGAACCAGTTTCAGACGAAACCGAAACTGAAGTTGATGTGAATAGGGAAGAAGAAGAAGAAGAAGAAGAAGAGAAAACGACGGATGAAGGAATCGGACCCGATGACGAAAACGACGTATTAGTGGGTCCAGAGGAGGAGGACCAGTCCAAGGAGCGAGAAACTCCGCTGCCGGAGCCGGAATCAGAACCGGAACCGGAGAGAAAAACACAAACAGAAACATCCGTTCTCCCAGATTGCTTGCTGTTAATGATGTACGAGCCAAAGCTATCAATGGAGGTATCGAAGGAGACATGGGTTTGCAGCGCAGACTTCAT
AAGATGCGTTCCGACCAGGGAGAAGAAGGCGATCGGCAAAGACCCACCGCCGCCTCCGCCACCGAAGAAACGAGAAACGAAGCCGACGGACACTACGCAGACAGCGGTTGTTCAGCCAGCGAGATGGTCGTGTTCGTTTCCAGCGGCAGCGGCAGCGGCGGCGATGATAGAACAGAAGCTAGTAAGAGCCAAGGGTTACGAGCCGTTTGTTCTTACTAGATGCAAGTCGGAGCCGATGAGATCTTCGGCTAAGCTGGCGCCAGATGCTTGCTGTTGGAAGGATCGCAAGCTCGAGCCACACCGTCCGGCTACCTTCGGCGTCGGCGCGGCTGAAGTTGGATTTTGA
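
The relationship between the coding sequence above and the protein query used in the BLAST reports can be checked with a short sketch. This is a minimal illustration, assuming the standard nuclear genetic code and reading frame 1; the names `CODON_TABLE` and `translate` are hypothetical helpers written for this example and are not part of the database or of any BLAST tool.

```python
# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 24 nucleotides of the CDS listed above.
print(translate("ATGGATTCTGACCCCCATTTTCGT"))  # MDSDPHFR
```

The translated prefix matches the start of the BLAST query sequence (MDSDPHFR...), and the final codons of the CDS (GGA TTT TGA) give the terminal residues G and F followed by a stop.
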
BLAST of CSPI03G19520 vs. TrEMBL
Match: A0A0A0L789_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G236550 PE=4 SV=1)

HSP 1 Score: 1416.4 bits (3665), Expect = 0.0e+00
Identity = 775/781 (99.23%), Positives = 779/781 (99.74%), Query Frame = 1

Query: 1   MDSDPHFRTTSTNSTSSTATPSSELFICFTSRFSSSSSSSMKISSKSILSPGRPREPSQI 60
           MDSDPHFRTTSTNSTSSTATPSSELFICFTSRFSSSSSSSMKISSKSILSPGRPREPSQI
Sbjct: 1   MDSDPHFRTTSTNSTSSTATPSSELFICFTSRFSSSSSSSMKISSKSILSPGRPREPSQI 60

Query: 61  SLSTSLSRRLKSSGSLKGGQASPMFPTGRKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQG 120
           SLSTSLSRRLKSSGSLKGGQASPMFPTGRKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQG
Sbjct: 61  SLSTSLSRRLKSSGSLKGGQASPMFPTGRKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQG 120

Query: 121 KKMRARSQKRRTNSEASFRRSESLVQSSQGNGSDQQFSSHHNHHLLRQNSNSNAGNGFQQ 180
           KKMRARSQKRRTNSEASFRRSESLVQSSQGNGSDQQFSSHHNHHLLRQNSNSNAGNGFQQ
Sbjct: 121 KKMRARSQKRRTNSEASFRRSESLVQSSQGNGSDQQFSSHHNHHLLRQNSNSNAGNGFQQ 180

Query: 181 ECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCSGNRENNKESKPAERSSESESS 240
           ECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCSGNRENNKESKPAERSSESESS
Sbjct: 181 ECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCSGNRENNKESKPAERSSESESS 240

Query: 241 CGTVFARWLVAVQDGDGKGREIELVVGDEETRTEKENGSQRRHVFEGLDFKDKNEAVEEE 300
           CGTVFARWLVAVQDGDGKGREIELVVGDEETRTEKENGSQRRHVFEGLDFKDKNEAVEEE
Sbjct: 241 CGTVFARWLVAVQDGDGKGREIELVVGDEETRTEKENGSQRRHVFEGLDFKDKNEAVEEE 300

Query: 301 QSRISICIPPKNALLLMRCRSDPVKMAELAKRFCEPPAPKVDEEDEEGEDEDNEAKKREN 360
           +SRISICIPPKNALLLMRCRSDPVKMAELAKRFCEPPAPKVDEEDEEGEDEDNEAKKR+N
Sbjct: 301 ESRISICIPPKNALLLMRCRSDPVKMAELAKRFCEPPAPKVDEEDEEGEDEDNEAKKRQN 360

Query: 361 EVKRDVSVPVSSIVTVNKEEEEVEEEEDERKVEQLIVKLENEEEMNEECVSDADKEKEEA 420
           EVKRDVSVPVSSIVTVNKEEEEV+EEEDERKVEQLIVKLENEEEMNEECVSDADKEKEEA
Sbjct: 361 EVKRDVSVPVSSIVTVNKEEEEVKEEEDERKVEQLIVKLENEEEMNEECVSDADKEKEEA 420

Query: 421 NLVLQEEEREEEEDNEEETIEMATENEIDEQKDITVVNQLNQEQALEEKEEDKTDQVNQQ 480
           NLVLQEEEREEEEDNEEETIEMATENEIDEQKDITVVNQLNQEQALEEKEEDKTDQVNQQ
Sbjct: 421 NLVLQEEEREEEEDNEEETIEMATENEIDEQKDITVVNQLNQEQALEEKEEDKTDQVNQQ 480

Query: 481 ETMAIPIPLLIQTHCEPEMAQDVEKLESVEKEEPKLSHESEQDQKTEEDENLREDKEEEE 540
           ETMAIPIPLLIQTHCEPEMAQDVEKLESVEKEEPKLSHESEQDQKTEEDENLREDKEEEE
Sbjct: 481 ETMAIPIPLLIQTHCEPEMAQDVEKLESVEKEEPKLSHESEQDQKTEEDENLREDKEEEE 540

Query: 541 EEEGENGENGETTTSPSLSVETEPVSDETETEVDVNREEEEEEEEEKTTDEGIGPDDEND 600
           EEEGENGENGETTTSPSLSVETEPVSDETETEVDVNREEEEEEEEEKTTDEGIGPDDEND
Sbjct: 541 EEEGENGENGETTTSPSLSVETEPVSDETETEVDVNREEEEEEEEEKTTDEGIGPDDEND 600

Query: 601 VLVGPEEEDQSKERETPLPEPESEPEPERKTQTETSVLPDCLLLMMYEPKLSMEVSKETW 660
           VLVGPEEEDQSKE ETP PEPESEP+PERKTQTETSVLPDCLLLMMYEPKLSMEVSKETW
Sbjct: 601 VLVGPEEEDQSKEGETPPPEPESEPKPERKTQTETSVLPDCLLLMMYEPKLSMEVSKETW 660

Query: 661 VCSADFIRCVPTREKKAIGKDPPPPPPPKKRETKPTDTTQTAVVQPARWSCSFPAAAAAA 720
           VCSADFIRCVPTREKKAIGKDPPPPPPPKKRETKPTDTTQTAVVQPARWSCSFPAAAAAA
Sbjct: 661 VCSADFIRCVPTREKKAIGKDPPPPPPPKKRETKPTDTTQTAVVQPARWSCSFPAAAAAA 720

Query: 721 AMIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACCWKDRKLEPHRPATFGVGAAEVG 780
           AMIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACCWKDRKLEPHRPATFGVGAAEVG
Sbjct: 721 AMIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACCWKDRKLEPHRPATFGVGAAEVG 780

Query: 781 F 782
           F
Sbjct: 781 F 781
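
The percentages in each HSP summary line follow directly from the reported counts: matches divided by the aligned length, expressed as a percentage. A minimal sketch of that arithmetic, using a hypothetical helper named `percent` and assuming rounding to two decimal places as displayed in the reports:

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches / aligned_length as a percentage, rounded to two decimals."""
    return round(100.0 * matches / aligned_length, 2)


# Counts taken from the first TrEMBL hit (A0A0A0L789_CUCSA).
print(percent(775, 781))  # identity: 99.23
print(percent(779, 781))  # positives: 99.74
```

The same calculation reproduces the figures for the more distant hits, for example 395/820 for the Prunus persica match.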

BLAST of CSPI03G19520 vs. TrEMBL
Match: M5X0H9_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001903mg PE=4 SV=1)

HSP 1 Score: 493.0 bits (1268), Expect = 6.5e-136
Identity = 395/820 (48.17%), Positives = 500/820 (60.98%), Query Frame = 1

Query: 1   MDSDPHFRTTSTNSTS-STATPSSELFICFTSRFSSSSSSSMKISSKSILSPGRPREPSQ 60
           M+SD   RT S +S S +T+T +SELFICFT+  S  SSSSMK+SSKSILSPGR REPSQ
Sbjct: 1   MESDRPHRTKSNSSISGTTSTTTSELFICFTT--SRLSSSSMKLSSKSILSPGRAREPSQ 60

Query: 61  ISLSTSLSRRLKSSGSLKGGQASPMFPTG---RKKRGCAFDNPEPSSPKVTCIGQVRVKT 120
           ISLS+SLSRRL++SGS+KGGQASPMFP+     KKRGCAF+NPEPSSPKVTCIGQVRVKT
Sbjct: 61  ISLSSSLSRRLRTSGSIKGGQASPMFPSNGGTSKKRGCAFENPEPSSPKVTCIGQVRVKT 120

Query: 121 KKQGKKMRARSQKRRTN-SEASFRRSESLVQSSQGNGSDQQF-----SSHHNHHLLRQNS 180
           KKQGKKMR  S+ +R+  SEASFR+ E   QS+    S  Q      +S +N   L   S
Sbjct: 121 KKQGKKMRIISRSKRSRGSEASFRKPEQNQQSTNNTASQSQELYNRDNSSNNFQGLHFQS 180

Query: 181 NSNAGNGFQQECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCSGNRENN---KE 240
           +    N  QQECL HRNQRWVHLP TICEALRAFG+E NC +P  SSC  + +NN   KE
Sbjct: 181 HQ-INNNNQQECLRHRNQRWVHLPLTICEALRAFGSEFNCLIPNRSSCLASDDNNNKEKE 240

Query: 241 SKPAERSSESESSCGTVFARWLVAVQDGDGKGREIELVVGDEETRTEKENG-----SQRR 300
                RS    SSCG VFARW VA+QDGDGKGREIEL+VG+++ RTE+        SQRR
Sbjct: 241 ENKGVRSESGGSSCGAVFARWFVALQDGDGKGREIELMVGEDQERTERSTNSSSGHSQRR 300

Query: 301 HVFEGLDFKDK--NEAV--EEEQSRISICIPPKNALLLMRCRSDPVKMAELAKRFCEPPA 360
            VFEG++FK++  NE+V  EEE   +SIC+PPKNALLLMRCRSDPVKMA LA RF E PA
Sbjct: 301 QVFEGIEFKEERLNESVMEEEEAGGVSICVPPKNALLLMRCRSDPVKMAALANRFWEMPA 360

Query: 361 PKVDEEDEEGEDEDNEAKKRENEVKRDVSVPVSSIVTVNKEEEEVEEEEDERKVEQLIVK 420
              DEE E+ E+++++                       K ++ VEE+  +  +E++   
Sbjct: 361 APQDEEVEDEEEKEDKG-------------------LTEKAQDFVEEQGTDEVLEKVQNG 420

Query: 421 LENE----EEMNEECVSDADKE---KEEANLVLQEEEREEE--EDNEEETIEMATENEID 480
           LE E    + + E+ V D ++    +E   LVL+E+E E+E  ++N E+  ++      D
Sbjct: 421 LETEVAEGDGVCEKWVCDGEEHEDLEEVEKLVLEEKEDEKEGLDENPEKRQQL-----YD 480

Query: 481 EQKDITVVNQLNQEQALEEKEEDKTDQVNQQETMAIPIPLLIQTHCEPEMAQDVEKLESV 540
           E ++I    +  QE  LEE+EE + D V QQ         L +  C  ++  D E LE  
Sbjct: 481 EVEEIEEKAECQQEAELEEQEEQELD-VTQQ--------ALSEECCVLDVVADPEMLEFE 540

Query: 541 EKEEPKLSHESEQDQKTEEDENLREDKEEEEEEEGENGENGETTTSPSLSVETEPVSDET 600
           E E     HE E    TE+++  RE+++EEE  E       +     +  V++E + +E 
Sbjct: 541 ENE-----HECE---ATEQEQEQREEEKEEEVRE------VKLPIPSNECVKSEELEEEE 600

Query: 601 ETEVDVNREEEEEEEEEKTTDEGIGPDDENDVLVGPEEEDQSKERETPLPEPESEPEPER 660
           +TE +V  +E  EEE E  T     P  EN                     P+++ +   
Sbjct: 601 KTEAEV-ADESTEEETETVTQYRPEPVSEN---------------------PKNQLDSGS 660

Query: 661 KTQTETSVLPDCLLLMMYEPKLSMEVSKETWVCSADFIRCVPTREKKAIGKDPPPPPPPK 720
           K   + SVLPDCLLLMM EPKLSMEVSKETWVC+ DFIRC+P R  K +      P   K
Sbjct: 661 KRAVQNSVLPDCLLLMMCEPKLSMEVSKETWVCTTDFIRCLPERHVKKV----DAPDEAK 720

Query: 721 KR---ETKPTDT-TQTAVVQPARWSCSFPAAA---AAAAMIEQKLVRAKGYEPFVLTRCK 780
           KR   ++ P        V+QP R SCSFP  A   + A MI QKLV +  YEPFVLTRCK
Sbjct: 721 KRVNIDSNPAAAPAAQPVIQPPRSSCSFPVQAGPVSMATMIGQKLVGSTAYEPFVLTRCK 744

Query: 781 SEPMRSSAKL-APDACCWKDRKLEPHRPATFGVGAAEVGF 782
           SEPMRS+ KL A + C WK+RK+EPHR A  GVGAA VGF
Sbjct: 781 SEPMRSAGKLPAAETCFWKNRKMEPHRRAAMGVGAAGVGF 744

BLAST of CSPI03G19520 vs. TrEMBL
Match: A0A061F8W3_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_031658 PE=4 SV=1)

HSP 1 Score: 491.5 bits (1264), Expect = 1.9e-135
Identity = 400/827 (48.37%), Positives = 503/827 (60.82%), Query Frame = 1

Query: 1   MDSDPHFRTTSTNSTSS---TATPSSELFICFTSRFSSSSSSSMKISSKSILSPGRPREP 60
           MD +   R+TS N++SS   T + +SELFICFTSR SS   SSMK+SSKSILSPGR RE 
Sbjct: 1   MDPERPHRSTSINNSSSSSNTTSTTSELFICFTSRLSS---SSMKLSSKSILSPGRTRES 60

Query: 61  SQISLSTSLSRRLKSSGSLKGGQASPMFPTGRKKRGCAFDNPEPSSPKVTCIGQVRVKTK 120
           SQISLS+SLSRRLKS+GS+KGGQASPMFPT  KKRGCAF+NPEPSSPKVTCIGQVRVKTK
Sbjct: 61  SQISLSSSLSRRLKSNGSMKGGQASPMFPTNGKKRGCAFENPEPSSPKVTCIGQVRVKTK 120

Query: 121 KQGKKMRARSQKRRTNSEASFRRSESLVQSSQGNGSDQQFSSHHNHHLLRQNSNSNAGNG 180
           KQGKK +A   KRR   E SFR+ +    ++  N  D      +N      N+N +    
Sbjct: 121 KQGKKFKACRSKRR--GEVSFRKVDHNNANNGSNSLDTSSCQDYNMGHFLSNNNHHHQQQ 180

Query: 181 FQQECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCSGNRENNKESKPAERSSES 240
            QQEC     ++WVHLP TICEALRAFGAE NCFLPC SSC  N+ + +E       S  
Sbjct: 181 QQQEC-----KKWVHLPLTICEALRAFGAEFNCFLPCRSSCMANQRDKEERTGGSGGSNG 240

Query: 241 E---SSCGTVFARWLVAVQDGDGKGREIELVVG--DEETRTEKE--NGSQRRHVFEGLDF 300
               SSCG VFARWLVAVQ+G+GK REIELVVG  D+E R   E    SQRRHVFE ++ 
Sbjct: 241 NGNGSSCGAVFARWLVAVQEGEGKEREIELVVGGEDDERRESSEMMRSSQRRHVFEDIEI 300

Query: 301 KD-KNEAVEEEQSRISICIPPKNALLLMRCRSDPVKMAELAKRFCEPPAPKVDEEDEEGE 360
            D  NE V +E++R+SICIPPKNALLLMRCRSDPVKMA LA +F E P PK DEE+EE E
Sbjct: 301 NDCGNENVGDEEARVSICIPPKNALLLMRCRSDPVKMAALANKFWETPVPK-DEEEEEEE 360

Query: 361 DEDNEAKKRENEVKRDVSVPVSSIVTVNKEEEEVEEEEDERKV-----EQLIVKLENE-- 420
           +E+ E  + ++E                 E+EE EEEE++R V     E   VK E E  
Sbjct: 361 EEEEEGAENKSE-----------------EKEEEEEEENQRDVVEGEREGRRVKFEQEME 420

Query: 421 ----EEMNEECVSDADKEKEEANLVLQEEEREEEEDNEEETIEMATENEIDEQ------K 480
                E+++  VS    E++E    + E E E   + E E++ +  E E+ E+      K
Sbjct: 421 HQEVSEVSQMFVSCEATEEQE----IPEAEAEAVAETEAESVFVGDEAELVEETLERSLK 480

Query: 481 DITVVNQLNQEQALEEKEEDKTDQVNQQETMAIPIPLLIQTHCEPEMAQDVEKLESVEKE 540
           + T++   +QEQ  E +E+ +    N++    +P+ L        E  Q  E ++  ++E
Sbjct: 481 EETIIECQDQEQENEVEEDQQASTTNEEFLSEVPLHL--------EKLQREENVQGSDQE 540

Query: 541 -EPKLSHESEQDQKTEEDENLREDKEEEEEEEGENGENGETTTSPSLSVETEPVSDETET 600
            E  L  E ++++   E+EN+   K EEE EE EN E GE        VE + +++E E 
Sbjct: 541 NEDGLEGEQQEEEVEAEEENVL-GKVEEECEENEN-EGGE-------EVEDQAIAEEAEE 600

Query: 601 EVDVNREEEEEEEEEKTTDEGIGPDDENDVLVGPEEEDQSKERETPLPEPESEPEPERKT 660
           E + +  EE+E E   TT E                E Q  E   P P  ES     +++
Sbjct: 601 EEESSTVEEKEAE---TTQE--------------RSELQCLEAREPDPGDES-----KES 660

Query: 661 QTETSVLPDCLLLMMYEPKLSMEVSKETWVCSADFIRCVPTREKKAIGKDPPPPPPPKKR 720
           +++ ++LPDCLLLMM EPKLSMEVSKETWVCS DFIR VP ++K+   K       PK+R
Sbjct: 661 ESQQNLLPDCLLLMMCEPKLSMEVSKETWVCSTDFIRWVPEKKKQPAVKQKDGGDEPKRR 720

Query: 721 ---ETKPTDTTQTAVVQPARWSCSFPAA-------------AAAAAMIEQKLV-RAKGYE 780
              ++KP       ++QP R SCSFPAA              + A MIEQKLV  +KGYE
Sbjct: 721 LCIDSKPA----PMLLQPPRSSCSFPAAPPMAKAANGAGGGGSMATMIEQKLVGGSKGYE 749

Query: 781 PFVLTRCKSEPMRSSAKLAPDACCWKDRKLEPHRPATFGVGAAEVGF 782
           PFVLTRCKSEPMRSSAKL+PDAC WK+RKLE   PAT GVGAA VGF
Sbjct: 781 PFVLTRCKSEPMRSSAKLSPDACFWKNRKLE---PATLGVGAAGVGF 749

BLAST of CSPI03G19520 vs. TrEMBL
Match: B9I1E3_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0011s09990g PE=4 SV=2)

HSP 1 Score: 490.0 bits (1260), Expect = 5.5e-135
Identity = 387/826 (46.85%), Positives = 494/826 (59.81%), Query Frame = 1

Query: 5   PHFRTTSTNS--TSSTATPSSELFICFTSRFSSSSSSSMKISSKSILSPGRPREPSQISL 64
           PH R+ S NS  TS+  + +SELFICFTSR SS   SSMK+SSKSILSPGR R+ SQISL
Sbjct: 13  PH-RSNSNNSSSTSNNNSNTSELFICFTSRLSS---SSMKLSSKSILSPGRHRDSSQISL 72

Query: 65  STSLSRRLKSSGSLKGGQASPMFPTGRKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQGKK 124
           S SLSRRL+SSGS+KGGQASPMFPT  KKRGCAF+NPEPSSPKVTCIGQVRVKTKKQGKK
Sbjct: 73  SNSLSRRLRSSGSMKGGQASPMFPTNGKKRGCAFENPEPSSPKVTCIGQVRVKTKKQGKK 132

Query: 125 MRARSQKRRTNSEASFRRSESLVQSSQGNGSDQQFSSHHNHHLLRQNSNSNAGNGFQQEC 184
           +R RS++R    E SFRR +          +   F   +NHH L  N   N     QQE 
Sbjct: 133 LRTRSKRR---GEISFRRVDQ---------NSNTFEGSNNHHDLINNQFLNQQQQ-QQEG 192

Query: 185 LSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCSGNRENNKESKPAERSSES-ESSC 244
           LSHRNQRWVH P TICEALRAFGAE NCFLPC SSC  + +  +E+  A  S+ +  SSC
Sbjct: 193 LSHRNQRWVHFPVTICEALRAFGAEFNCFLPCRSSCMASEKEKEENTAAAGSNNNGSSSC 252

Query: 245 GTVFARWLVAVQDGDGKGREIELVVGDE--ETRTEKENGSQRRHVFEGLDFKDK------ 304
           G VFARWLVAVQ+G+GKG+EIELVVG+E  E   ++   S RRH+FE ++FK++      
Sbjct: 253 GAVFARWLVAVQEGEGKGKEIELVVGEEVVEEERDERRRSYRRHIFEDIEFKEEEGHVFE 312

Query: 305 --NEAVEEEQSRISICIPPKNALLLMRCRSDPVKMAELAKRFCEPPAPKVDEEDEEGEDE 364
             N  ++EE++R+SICIPPKNALLLMRCRSDPVKMA LA +F E PAP    +DEE E+E
Sbjct: 313 GGNAGLQEEEARVSICIPPKNALLLMRCRSDPVKMAALANKFWESPAP----QDEEDEEE 372

Query: 365 DNEAKKRENEVKRDVSVPVSSIVTVNKEEEEVEEEEDERKVEQLIVKLENEEEMNEECVS 424
           DNE    E E  R++   V   + +  + E    +E+E KVEQ I+  + ++    + ++
Sbjct: 373 DNE----EGEKDRNLGAEVDKFINIENKSEVKASQEEEIKVEQEIIIEQKQDLTVSDKLA 432

Query: 425 DADKEKEEANLVLQEEER----EEEEDNEEETIEMATENEIDEQKDITVVNQLNQEQALE 484
             +  +E   ++ + EE     E  ED++E  I    +N     +++ +V Q       E
Sbjct: 433 FCETIEEHYQIIQETEESLVILEAGEDSQE--IGSTDDNIDGVLQEVNLVKQ-------E 492

Query: 485 EKEEDKTDQVNQQETMAIPIPLLIQTHCEPEMAQDVEKLESVEKEEPKLSHESEQDQKTE 544
           E+E +    +N Q T +        T     +  D       E  +P     +E + K  
Sbjct: 493 EEESETPGVMNLQPTSS--------TQETVSLCSDESSSHDQEIVDPAALMNNENEYKV- 552

Query: 545 EDENLREDKEEEEEEEGENGENGETTTSPSLSVETEPVSDETETE-VDVNREEEEEEEEE 604
               ++E++E+ +EE     E  +     S  +E   VS   E E + V  ++ +++E E
Sbjct: 553 ----VQENEEDNQEERVFQAEQEQVVQGLSDDIEENSVSVRFEQETLQVAVQDLQDQEPE 612

Query: 605 KTTDEGIGPDDENDVLVGPEEEDQSKERETPLPEPESEPEPERKTQTETSV--------- 664
             +   +   +        EEE ++ E ET L E E E +P+     +T V         
Sbjct: 613 SLSVAELQVQE-------TEEEKETTENETELAEEEPE-DPKTHVNGQTGVKSREGDNSQ 672

Query: 665 --LPDCLLLMMYEPKLSMEVSKETWVCSADFIRCVPTREK---KAIGKDPPPPPPPKKR- 724
             LPDCLLLMM EPKLSMEVSKETWVCS DFIR +P   +   K  GKD      PKKR 
Sbjct: 673 PLLPDCLLLMMCEPKLSMEVSKETWVCSTDFIRWLPEHSRPVSKTNGKD-----EPKKRV 732

Query: 725 --ETKPTD-----TTQTAVVQPARWSCSFPAAAAA--------AAMIEQKLVRAKGYEPF 782
             + KP           ++ QP R SCS+PA   A        + MIEQKLV AK YEPF
Sbjct: 733 SIDIKPAQVYNNGNNSNSLQQPRRSSCSYPAKPPARCAGTESMSTMIEQKLVGAKAYEPF 778

BLAST of CSPI03G19520 vs. TrEMBL
Match: A0A0L9ULB0_PHAAN (Uncharacterized protein OS=Phaseolus angularis GN=LR48_Vigan05g113200 PE=4 SV=1)

HSP 1 Score: 473.0 bits (1216), Expect = 7.0e-130
Identity = 394/811 (48.58%), Positives = 475/811 (58.57%), Query Frame = 1

Query: 14  STSSTATPSSELFICFTSRFSSSSSSSMKISSKSILSPGRPREPSQISLSTSLSRRLKSS 73
           STSS ++ +SELF+CFTSR SSSS   MK+SSKSILSP R R+P QISLS+SLSRRLKS+
Sbjct: 11  STSSNSSSTSELFVCFTSRLSSSS---MKLSSKSILSPSRSRDPPQISLSSSLSRRLKSN 70

Query: 74  GSLKGGQASPMFPTGRKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQGKKMRARSQKRRTN 133
           GS+KGGQASPMFPT  K+RGC F+NPEPSSPKVTCIGQVRVKTKKQGKK+RARS++R   
Sbjct: 71  GSMKGGQASPMFPTAGKRRGCGFENPEPSSPKVTCIGQVRVKTKKQGKKIRARSKRR--- 130

Query: 134 SEASFRRSESLVQSSQGNGSDQQFSSHHNHHLLRQNSNSNAGNGFQ--QECLSHRNQRWV 193
            EASFR+ E    ++ GN +        N  L RQNS      GFQ  Q CL HRNQRWV
Sbjct: 131 GEASFRKGEQGCANANGNANP-------NADLTRQNSQ-----GFQHHQNCLKHRNQRWV 190

Query: 194 HLPFTICEALRAFGAELNCFLPCHSSC-SGNRENNKESKPAERSSESESSCGTVFARWLV 253
           HLP TICEALR    E +CF PC SSC S  +E  K           E SCG    RWLV
Sbjct: 191 HLPLTICEALR----EFSCFFPCRSSCMSSEKEKEKGGGVEGGGLVREGSCGNGLGRWLV 250

Query: 254 AVQDGDGKGREIELVVGDEETRTEKENG----SQRRHVFEGLDF--------KDKNEAV- 313
           A+QDGDGKGR IELV+ +EE    ++ G    SQRRHVFE +D         + KN+ V 
Sbjct: 251 ALQDGDGKGRGIELVM-EEEMEDGRDTGERSHSQRRHVFEDIDVDLVVGEEEEKKNDEVV 310

Query: 314 --EEEQSRISICIPPKNALLLMRCRSDPVKMAELAKRFCEPPA--PKVDEEDEEGE--DE 373
             EEE++R+SICIPPKNALLLMRCRSDPVKMA LA RF E P    K  EE++EGE  DE
Sbjct: 311 GEEEEKARVSICIPPKNALLLMRCRSDPVKMAALANRFWESPVHKNKCQEEEKEGEEHDE 370

Query: 374 DNEAKKRENEVKRDVSVPVSSIVTVNKEEEEVEEEEDERKVEQLIVKLENEEEMNEECVS 433
           ++   + + EV+ D        + V ++E+ ++ +EDE++V         +EE+  E   
Sbjct: 371 EDSGDEDDQEVEDD---DTEQPIKVKEDEQPIKVKEDEQQV---------QEEVERETKE 430

Query: 434 DADKEKEEANLVLQEEEREEE----EDNEEETIEMATENEIDEQKDITVVNQLNQEQALE 493
           D   E+E   +V+  EE EEE    E  E E++E     E+ E K+              
Sbjct: 431 DTICERETETMVVVGEEEEEEQVWNESYEVESVENIESRELHEVKE-------------- 490

Query: 494 EKEEDKTDQVNQQETMAIPIPLLIQTHCEPEMAQDVEKLESVEKEEPKLSHESEQDQKTE 553
             +E+K DQ N+                       +EK E++   E   +H    + KTE
Sbjct: 491 --DEEKGDQANE---------------------NTIEKGETLTHPE---AHSDLDNLKTE 550

Query: 554 EDE-NLREDKEE-EEEEEGENGENGETTTSPSLSVETEPVSDETETEVDV----NREEEE 613
           E E NLRE KEE  +EE  EN E+ E +++P     +EP +D+ E E +       E   
Sbjct: 551 EKEVNLREGKEEVGKEEVRENNESSELSSTPETFTASEPENDDGEPETEAVTTETSEGST 610

Query: 614 EEEEEKTTDEGIGPDDENDVLVGPEEEDQSKERETPLPEPESEPEPERKTQTETSVLPDC 673
           EEEEEK T E    D   D    P  + QSKE E             +    E   LP+C
Sbjct: 611 EEEEEKVTTETESQDPTRD----PNSDPQSKEGE----------NGSKCEDRERETLPEC 670

Query: 674 LLLMMYEPKLSMEVSKETWVCSADFIRCVPTREKKA-IGKDPPPPPPPKKRETKPTDTTQ 733
           LLLMM EPKLSMEVSKETWVCS DFIR +P R   A  GK        K   TKP     
Sbjct: 671 LLLMMCEPKLSMEVSKETWVCSTDFIRWLPERPAAAGAGKRLTGETFTK---TKPKPKPS 729

Query: 734 TAVVQPARWSCSFPAAAAA-----AAMIEQKLVRAK---GYEPFVLTRCKSEPMRSSAKL 782
             + QP R SCSFPA   A     A MIEQKLV +K   GYEPFVLTRCKSEPMRSSAKL
Sbjct: 731 PPLAQPPRSSCSFPAFGGAAGVSMATMIEQKLVGSKSGNGYEPFVLTRCKSEPMRSSAKL 729

BLAST of CSPI03G19520 vs. TAIR10
Match: AT3G15095.1 (AT3G15095.1 unknown protein)

HSP 1 Score: 213.0 bits (541), Expect = 6.6e-55
Identity = 234/582 (40.21%), Positives = 306/582 (52.58%), Query Frame = 1

Query: 5   PHFRTTSTNSTSSTATPSSELFICFTSRFSSSSSSSMKISSKSILSPGRPREPSQISLST 64
           PH  ++  +S+++ +  S++LFICFTSRF  SSSSSM++SSKSI SP R        L+T
Sbjct: 7   PHRSSSINSSSNNNSGSSTDLFICFTSRF--SSSSSMRLSSKSIHSPAR-----SACLTT 66

Query: 65  SLSRRLKSSGSLKGGQA----SPMFPT--GRKKRGCAFDNP--------EPSSPKVTCIG 124
           SLSRRL++SGSLK   A    SPMF    GRK+ G  ++N         EPSSPKVTCIG
Sbjct: 67  SLSRRLRTSGSLKNASAGVLNSPMFGANGGRKRSGSGYENSNNNNNNNIEPSSPKVTCIG 126

Query: 125 QVRVKTKKQ-GKKMRARSQKRRTNSEASFRRSESLVQSSQGNGSDQQFSSHHNHHLLRQN 184
           QVRVKT+K   KKMRARS  RR   E SFRRS   V  + G G   +F +  N       
Sbjct: 127 QVRVKTRKHVKKKMRARS--RRKGGENSFRRS---VDQNDGGGG-CRFKASEN------- 186

Query: 185 SNSNAGNGFQQECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCSGNRENNKESK 244
                              R VHLP TICE+LR+FG+ELNCF PC SSC+ N  ++ + +
Sbjct: 187 -------------------RLVHLPVTICESLRSFGSELNCFFPCRSSCTEN--SHGDGR 246

Query: 245 PAERSSE-------SESSCGTVFARWLVAVQD-GDGKGREIELVVGDEETRTEKENGSQR 304
            AE +++         +SCG VF RW VAV++   GK REIELVVG E+   E    S+R
Sbjct: 247 RAESNNDGCGGGGGGSNSCGAVFTRWFVAVEETSGGKRREIELVVGGEDEVEEDRRRSRR 306

Query: 305 RHVFEGLDFKD---KNEAVE--EEQSRISICIPPKNALLLMRCRSDPVKMAELAKRFCEP 364
           RHVFEGLD  +   K E  E  EE  R+SIC PPKNALLLMRCRSDPVK+A LA R  E 
Sbjct: 307 RHVFEGLDLSEIEMKTEKKERGEEVGRMSICSPPKNALLLMRCRSDPVKVAALANRVRER 366

Query: 365 PAPKVD-EEDEEGEDEDNEAKKRENEVKRDVSVP---VSSIVTVNKEEEEVEEEEDERKV 424
                D    EE EDE     + E E K+ + +    +S   TV  EE  V   E E + 
Sbjct: 367 QLSLNDGVYTEEEEDERRRRFELEIEDKKRIDLCEKWISGETTVETEEVSVAVAEAEAEA 426

Query: 425 E---QLIVKLENEEEMNEECVSDA-DKEKEEANLVLQEEEREEE-------EDNEEETIE 484
           E    L      EEE   + V D+  +E++EA+ +L   E E E       ED     IE
Sbjct: 427 EAEAPLPSNPATEEEERVKVVEDSIVEEEQEASKILDSFEEEIEATIMKKIEDEIRNAIE 486

Query: 485 MATENEIDEQKDITVVNQLNQEQALEEKE---------EDKTDQVNQQETMAIPIPLLIQ 524
              E ++ E +++ VV     E+  E KE         E++++Q N++     P P ++ 
Sbjct: 487 --EEEKLAEMEELAVVAVAETEEVEESKEVVPDCIPQNEERSEQGNREPD---PSPEVVM 542

BLAST of CSPI03G19520 vs. NCBI nr
Match: gi|778680362|ref|XP_004146243.2| (PREDICTED: glutamic acid-rich protein [Cucumis sativus])

HSP 1 Score: 1416.4 bits (3665), Expect = 0.0e+00
Identity = 775/781 (99.23%), Positives = 779/781 (99.74%), Query Frame = 1

Query: 1   MDSDPHFRTTSTNSTSSTATPSSELFICFTSRFSSSSSSSMKISSKSILSPGRPREPSQI 60
           MDSDPHFRTTSTNSTSSTATPSSELFICFTSRFSSSSSSSMKISSKSILSPGRPREPSQI
Sbjct: 1   MDSDPHFRTTSTNSTSSTATPSSELFICFTSRFSSSSSSSMKISSKSILSPGRPREPSQI 60

Query: 61  SLSTSLSRRLKSSGSLKGGQASPMFPTGRKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQG 120
           SLSTSLSRRLKSSGSLKGGQASPMFPTGRKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQG
Sbjct: 61  SLSTSLSRRLKSSGSLKGGQASPMFPTGRKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQG 120

Query: 121 KKMRARSQKRRTNSEASFRRSESLVQSSQGNGSDQQFSSHHNHHLLRQNSNSNAGNGFQQ 180
           KKMRARSQKRRTNSEASFRRSESLVQSSQGNGSDQQFSSHHNHHLLRQNSNSNAGNGFQQ
Sbjct: 121 KKMRARSQKRRTNSEASFRRSESLVQSSQGNGSDQQFSSHHNHHLLRQNSNSNAGNGFQQ 180

Query: 181 ECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCSGNRENNKESKPAERSSESESS 240
           ECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCSGNRENNKESKPAERSSESESS
Sbjct: 181 ECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCSGNRENNKESKPAERSSESESS 240

Query: 241 CGTVFARWLVAVQDGDGKGREIELVVGDEETRTEKENGSQRRHVFEGLDFKDKNEAVEEE 300
           CGTVFARWLVAVQDGDGKGREIELVVGDEETRTEKENGSQRRHVFEGLDFKDKNEAVEEE
Sbjct: 241 CGTVFARWLVAVQDGDGKGREIELVVGDEETRTEKENGSQRRHVFEGLDFKDKNEAVEEE 300

Query: 301 QSRISICIPPKNALLLMRCRSDPVKMAELAKRFCEPPAPKVDEEDEEGEDEDNEAKKREN 360
           +SRISICIPPKNALLLMRCRSDPVKMAELAKRFCEPPAPKVDEEDEEGEDEDNEAKKR+N
Sbjct: 301 ESRISICIPPKNALLLMRCRSDPVKMAELAKRFCEPPAPKVDEEDEEGEDEDNEAKKRQN 360

Query: 361 EVKRDVSVPVSSIVTVNKEEEEVEEEEDERKVEQLIVKLENEEEMNEECVSDADKEKEEA 420
           EVKRDVSVPVSSIVTVNKEEEEV+EEEDERKVEQLIVKLENEEEMNEECVSDADKEKEEA
Sbjct: 361 EVKRDVSVPVSSIVTVNKEEEEVKEEEDERKVEQLIVKLENEEEMNEECVSDADKEKEEA 420

Query: 421 NLVLQEEEREEEEDNEEETIEMATENEIDEQKDITVVNQLNQEQALEEKEEDKTDQVNQQ 480
           NLVLQEEEREEEEDNEEETIEMATENEIDEQKDITVVNQLNQEQALEEKEEDKTDQVNQQ
Sbjct: 421 NLVLQEEEREEEEDNEEETIEMATENEIDEQKDITVVNQLNQEQALEEKEEDKTDQVNQQ 480

Query: 481 ETMAIPIPLLIQTHCEPEMAQDVEKLESVEKEEPKLSHESEQDQKTEEDENLREDKEEEE 540
           ETMAIPIPLLIQTHCEPEMAQDVEKLESVEKEEPKLSHESEQDQKTEEDENLREDKEEEE
Sbjct: 481 ETMAIPIPLLIQTHCEPEMAQDVEKLESVEKEEPKLSHESEQDQKTEEDENLREDKEEEE 540

Query: 541 EEEGENGENGETTTSPSLSVETEPVSDETETEVDVNREEEEEEEEEKTTDEGIGPDDEND 600
           EEEGENGENGETTTSPSLSVETEPVSDETETEVDVNREEEEEEEEEKTTDEGIGPDDEND
Sbjct: 541 EEEGENGENGETTTSPSLSVETEPVSDETETEVDVNREEEEEEEEEKTTDEGIGPDDEND 600

Query: 601 VLVGPEEEDQSKERETPLPEPESEPEPERKTQTETSVLPDCLLLMMYEPKLSMEVSKETW 660
           VLVGPEEEDQSKE ETP PEPESEP+PERKTQTETSVLPDCLLLMMYEPKLSMEVSKETW
Sbjct: 601 VLVGPEEEDQSKEGETPPPEPESEPKPERKTQTETSVLPDCLLLMMYEPKLSMEVSKETW 660

Query: 661 VCSADFIRCVPTREKKAIGKDPPPPPPPKKRETKPTDTTQTAVVQPARWSCSFPAAAAAA 720
           VCSADFIRCVPTREKKAIGKDPPPPPPPKKRETKPTDTTQTAVVQPARWSCSFPAAAAAA
Sbjct: 661 VCSADFIRCVPTREKKAIGKDPPPPPPPKKRETKPTDTTQTAVVQPARWSCSFPAAAAAA 720

Query: 721 AMIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACCWKDRKLEPHRPATFGVGAAEVG 780
           AMIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACCWKDRKLEPHRPATFGVGAAEVG
Sbjct: 721 AMIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACCWKDRKLEPHRPATFGVGAAEVG 780

Query: 781 F 782
           F
Sbjct: 781 F 781

BLAST of CSPI03G19520 vs. NCBI nr
Match: gi|659111998|ref|XP_008456014.1| (PREDICTED: glutamic acid-rich protein isoform X1 [Cucumis melo])

HSP 1 Score: 1277.3 bits (3304), Expect = 0.0e+00
Identity = 725/788 (92.01%), Positives = 744/788 (94.42%), Query Frame = 1

Query: 1   MDSDPHFRTTSTNSTSSTATPSSELFICFTSRFSSSSSSSMKISSKSILSPGRPREPSQI 60
           MD D HFRTTSTNSTSSTATPSSELFICFTSRF  SSSSSMKISSKSILSPGR REPSQI
Sbjct: 1   MDPDRHFRTTSTNSTSSTATPSSELFICFTSRF--SSSSSMKISSKSILSPGRHREPSQI 60

Query: 61  SLSTSLSRRLKSSGSLKGGQASPMFPTGRKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQG 120
           SLSTSLSRRLKSSGSLKGGQASPMFPTGRKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQG
Sbjct: 61  SLSTSLSRRLKSSGSLKGGQASPMFPTGRKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQG 120

Query: 121 KKMRARSQKRRTNSEASFRRSESLVQSSQGNGSDQQFSSHHNHHLLRQNSNSNAGNGFQQ 180
           KKMRARSQKRRTNSEASFRRSES+VQSSQ N +DQQFSSHHNHHLLRQNSNSNAGNGFQQ
Sbjct: 121 KKMRARSQKRRTNSEASFRRSESVVQSSQVNSNDQQFSSHHNHHLLRQNSNSNAGNGFQQ 180

Query: 181 ECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCSGNRENNKESKPAERSSESESS 240
           ECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCSGNRENNKE KPAERSSESESS
Sbjct: 181 ECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCSGNRENNKEPKPAERSSESESS 240

Query: 241 CGTVFARWLVAVQDGDGKGREIELVVGDEETRTEKENGSQRRHVFEGLDFKDKNEAVEEE 300
           CGTVFARWLVAVQDGDGKGREIELVVGDEETRTEKENGSQRRHVFEGLDFKDKNEAVEEE
Sbjct: 241 CGTVFARWLVAVQDGDGKGREIELVVGDEETRTEKENGSQRRHVFEGLDFKDKNEAVEEE 300

Query: 301 QSRISICIPPKNALLLMRCRSDPVKMAELAKRFCEPPAPKVDEED-EEGEDEDNEAKKRE 360
           +SRISICIPPKNALLLMRCRSDPVKMAELAKRFCEPPAPKVDEED EEGEDEDNEAKKR+
Sbjct: 301 ESRISICIPPKNALLLMRCRSDPVKMAELAKRFCEPPAPKVDEEDEEEGEDEDNEAKKRK 360

Query: 361 NEVKRDVSVPVSSIVTVNKEEEEVEE---EEDERKVEQLIVKLENEEEMNEECVSDADKE 420
           NEVKRDVSVPVSSI+TVNKEEEE EE   EEDERKVEQ +VKLENEEE+NEE VSD DKE
Sbjct: 361 NEVKRDVSVPVSSIITVNKEEEEEEEEEKEEDERKVEQFVVKLENEEEVNEESVSDEDKE 420

Query: 421 KEEANLVLQEEEREEEEDNEEETIEMATENEIDEQKDITVVNQLNQEQALEEKEEDKTDQ 480
           KEEANLVLQEE+R EE+DNEEETIEMATEN+ ++++DITVVNQLNQEQALEEKEEDKTDQ
Sbjct: 421 KEEANLVLQEEQR-EEKDNEEETIEMATENDDEQKQDITVVNQLNQEQALEEKEEDKTDQ 480

Query: 481 VNQQETMAIPIPLLIQTHCEPEMAQDVEKLESVEKEEPKLSHESEQDQKTEEDENLREDK 540
           VNQQETMAIPIP  IQTHCEPEMAQD EKLESVEKEE KLSHESEQDQKTEEDE LRE+K
Sbjct: 481 VNQQETMAIPIP--IQTHCEPEMAQDAEKLESVEKEESKLSHESEQDQKTEEDEILREEK 540

Query: 541 EEEEEEEGENGENGETTTSPSLSVETEPVSDETETEVDVNR--EEEEEEEEEKTTDEGIG 600
           EEEEEEE E GENGE  TSPSLSVET+PV DETETEVD  R  EEEEEEEEEK TDEGIG
Sbjct: 541 EEEEEEE-EEGENGENPTSPSLSVETKPVLDETETEVDGKREEEEEEEEEEEKATDEGIG 600

Query: 601 PDDEND-VLVGPEEEDQSKERETPLPEPESEPEPERKTQTETSVLPDCLLLMMYEPKLSM 660
           PDDEN+  LVGPEEEDQSKERETP PEP  EPEPE KTQTETSVLPDCLLLMMYEPKLSM
Sbjct: 601 PDDENNGALVGPEEEDQSKERETPPPEP--EPEPEGKTQTETSVLPDCLLLMMYEPKLSM 660

Query: 661 EVSKETWVCSADFIRCVPTREKKAIGKDPPPPPPPKKRETKPTDTTQTAVVQPARWSCSF 720
           EVSKETWVCSADFIRCVPTREKK +G+DPPPPPPPKKRETKPTDT QT VVQPARWSCSF
Sbjct: 661 EVSKETWVCSADFIRCVPTREKKTVGRDPPPPPPPKKRETKPTDTMQTTVVQPARWSCSF 720

Query: 721 PAAAAAAAMIEQKLVRAKGYEPFVLTRCKSEPMRSSAKLAPDACCWKDRKLEPHRPATFG 780
           PAAAAAAAMIEQKL RAKGYEPFVLTRCKSEPMRSSAKLAPDAC WKDRKLEPHRPATFG
Sbjct: 721 PAAAAAAAMIEQKLARAKGYEPFVLTRCKSEPMRSSAKLAPDACFWKDRKLEPHRPATFG 780

Query: 781 VGAAEVGF 782
           VGAAEVGF
Sbjct: 781 VGAAEVGF 780

BLAST of CSPI03G19520 vs. NCBI nr
Match: gi|659112000|ref|XP_008456015.1| (PREDICTED: glutamic acid-rich protein isoform X2 [Cucumis melo])

HSP 1 Score: 1034.2 bits (2673), Expect = 1.1e-298
Identity = 606/662 (91.54%), Positives = 623/662 (94.11%), Query Frame = 1

Query: 1   MDSDPHFRTTSTNSTSSTATPSSELFICFTSRFSSSSSSSMKISSKSILSPGRPREPSQI 60
           MD D HFRTTSTNSTSSTATPSSELFICFTSRF  SSSSSMKISSKSILSPGR REPSQI
Sbjct: 1   MDPDRHFRTTSTNSTSSTATPSSELFICFTSRF--SSSSSMKISSKSILSPGRHREPSQI 60

Query: 61  SLSTSLSRRLKSSGSLKGGQASPMFPTGRKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQG 120
           SLSTSLSRRLKSSGSLKGGQASPMFPTGRKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQG
Sbjct: 61  SLSTSLSRRLKSSGSLKGGQASPMFPTGRKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQG 120

Query: 121 KKMRARSQKRRTNSEASFRRSESLVQSSQGNGSDQQFSSHHNHHLLRQNSNSNAGNGFQQ 180
           KKMRARSQKRRTNSEASFRRSES+VQSSQ N +DQQFSSHHNHHLLRQNSNSNAGNGFQQ
Sbjct: 121 KKMRARSQKRRTNSEASFRRSESVVQSSQVNSNDQQFSSHHNHHLLRQNSNSNAGNGFQQ 180

Query: 181 ECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCSGNRENNKESKPAERSSESESS 240
           ECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCSGNRENNKE KPAERSSESESS
Sbjct: 181 ECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCSGNRENNKEPKPAERSSESESS 240

Query: 241 CGTVFARWLVAVQDGDGKGREIELVVGDEETRTEKENGSQRRHVFEGLDFKDKNEAVEEE 300
           CGTVFARWLVAVQDGDGKGREIELVVGDEETRTEKENGSQRRHVFEGLDFKDKNEAVEEE
Sbjct: 241 CGTVFARWLVAVQDGDGKGREIELVVGDEETRTEKENGSQRRHVFEGLDFKDKNEAVEEE 300

Query: 301 QSRISICIPPKNALLLMRCRSDPVKMAELAKRFCEPPAPKVDEED-EEGEDEDNEAKKRE 360
           +SRISICIPPKNALLLMRCRSDPVKMAELAKRFCEPPAPKVDEED EEGEDEDNEAKKR+
Sbjct: 301 ESRISICIPPKNALLLMRCRSDPVKMAELAKRFCEPPAPKVDEEDEEEGEDEDNEAKKRK 360

Query: 361 NEVKRDVSVPVSSIVTVNKEEEEVEE---EEDERKVEQLIVKLENEEEMNEECVSDADKE 420
           NEVKRDVSVPVSSI+TVNKEEEE EE   EEDERKVEQ +VKLENEEE+NEE VSD DKE
Sbjct: 361 NEVKRDVSVPVSSIITVNKEEEEEEEEEKEEDERKVEQFVVKLENEEEVNEESVSDEDKE 420

Query: 421 KEEANLVLQEEEREEEEDNEEETIEMATENEIDEQKDITVVNQLNQEQALEEKEEDKTDQ 480
           KEEANLVLQEE+R EE+DNEEETIEMATEN+ ++++DITVVNQLNQEQALEEKEEDKTDQ
Sbjct: 421 KEEANLVLQEEQR-EEKDNEEETIEMATENDDEQKQDITVVNQLNQEQALEEKEEDKTDQ 480

Query: 481 VNQQETMAIPIPLLIQTHCEPEMAQDVEKLESVEKEEPKLSHESEQDQKTEEDENLREDK 540
           VNQQETMAIPIP  IQTHCEPEMAQD EKLESVEKEE KLSHESEQDQKTEEDE LRE+K
Sbjct: 481 VNQQETMAIPIP--IQTHCEPEMAQDAEKLESVEKEESKLSHESEQDQKTEEDEILREEK 540

Query: 541 EEEEEEEGENGENGETTTSPSLSVETEPVSDETETEVDVNR--EEEEEEEEEKTTDEGIG 600
           EEEEEEE E GENGE  TSPSLSVET+PV DETETEVD  R  EEEEEEEEEK TDEGIG
Sbjct: 541 EEEEEEE-EEGENGENPTSPSLSVETKPVLDETETEVDGKREEEEEEEEEEEKATDEGIG 600

Query: 601 PDDEND-VLVGPEEEDQSKERETPLPEPESEPEPERKTQTETSVLPDCLLLMMYEPKLSM 656
           PDDEN+  LVGPEEEDQSKERETP PEP  EPEPE KTQTETSVLPDCLLLMMYEPKLSM
Sbjct: 601 PDDENNGALVGPEEEDQSKERETPPPEP--EPEPEGKTQTETSVLPDCLLLMMYEPKLSM 654

BLAST of CSPI03G19520 vs. NCBI nr
Match: gi|1009162037|ref|XP_015899218.1| (PREDICTED: trichohyalin [Ziziphus jujuba])

HSP 1 Score: 523.9 bits (1348), Expect = 5.0e-145
Identity = 406/818 (49.63%), Positives = 507/818 (61.98%), Query Frame = 1

Query: 5   PHFRTTSTNSTSSTATPSSELFICFTSRFSSSSSSSMKISSKSILSPGRPREPSQISLST 64
           PH RTTS+NS SS +T  SELFICFTSR SSSS   MKISSKSILSPGR REPSQISLS+
Sbjct: 6   PH-RTTSSNSNSSGST--SELFICFTSRLSSSS---MKISSKSILSPGRAREPSQISLSS 65

Query: 65  SLSRRLKSSGSLKGGQASPMFPTGRKKRGCAFDNPEPSSPKVTCIGQVRVKTKKQGKKMR 124
           SLSRRLK+SGS+KGGQASPMFPTG KK+GCAFDNPEPSSPKVTCIGQVRVKTKKQGKKMR
Sbjct: 66  SLSRRLKNSGSIKGGQASPMFPTGGKKKGCAFDNPEPSSPKVTCIGQVRVKTKKQGKKMR 125

Query: 125 ARSQKRRTNSEASFRRSESLVQSSQGNGSDQQFSSHHNHHLLRQNSNSNAGNGFQQECLS 184
            RS++R  +SE SFR+SE   QS  G   + Q  +H N+H   Q  N       Q ECL 
Sbjct: 126 IRSKRR--SSEPSFRKSEQNSQSISGKQQENQ--NHENNHQGFQFQNLQ-----QPECLP 185

Query: 185 H-RNQRWVHLPFTICEALRAFGAELNCFLPCHSSCSGNRENNKESKPAERS--SESESSC 244
           H RNQRWVHLP TICEALRAFGAE NCFLPC SSC  + E +KE K  ERS   E+ SSC
Sbjct: 186 HHRNQRWVHLPLTICEALRAFGAEFNCFLPCRSSCMTS-EKDKEEKGGERSVADENGSSC 245

Query: 245 GTVFARWLVAVQDGDGKGREIELVVGDEETRTEKENGSQRRHVFEGLDFKD--KNEAVEE 304
           G VFARWLVA+QDGDGKGREIELVVG+EE  TE+ + S+RR + EG++ K+  K   +E+
Sbjct: 246 GAVFARWLVALQDGDGKGREIELVVGEEERTTERRSSSRRRQMLEGIEIKEESKESGMED 305

Query: 305 EQSRISICIPPKNALLLMRCRSDPVKMAELAKRFCE-PPAPKVDE-EDEEGEDEDNEAKK 364
           E+ R+SICIPPKNALLLMRCRSDPVKMA LA RF E P APK +E EDEE + +D + ++
Sbjct: 306 EEGRVSICIPPKNALLLMRCRSDPVKMAALANRFWESPAAPKNEEGEDEEEDGDDEDGRR 365

Query: 365 RENEVKRDVSVPVSSIVTVNKE----------EEEVEEEEDERKVEQLIVKLENEEEMNE 424
            +  V+ D    V + V   KE          EEE EEEE   + E+     E +E+  E
Sbjct: 366 NKEAVQDDERGLVKAEVVDRKEVAEQQANLAMEEEEEEEEVAEEAEENPESSELDEQQQE 425

Query: 425 ECVSDADKEKEEANLVLQEEEREEEEDNEEETIEMATENEIDEQKDITVVNQLNQEQALE 484
              S  D  +E    V +E+ + ++ED EEE  E   E E++  + + V  +   + +L 
Sbjct: 426 IVSSGEDPSEESEGEVEEEKPKGKKEDEEEE--EEEEEQELEANQQLAVEKESVFDVSLS 485

Query: 485 EKEEDKTDQVNQQETMAIPIPLLIQTHCEPEMAQDVEKLESVEKEEPKLSHESEQDQKTE 544
           E   D  + ++++E                  AQ VEK  S E++ P             
Sbjct: 486 EILVDAENVLDEEE----------------NAAQLVEKENSYEQKPPH------------ 545

Query: 545 EDENLREDKEEEEEEEGENGENGETTTSPSLSVETEPVSDETETEVDVNREEEEEEEEEK 604
                +E+++E E +E       + ++ P  SVE EPV  +    V+   ++ E E E K
Sbjct: 546 -----QEEEKERELKEERRASVSKYSSEPVKSVEQEPVEKDNIHGVEEGEDKAENEIEAK 605

Query: 605 TTDEGIGPDDEND-----VLVGPEEEDQSKERETPLPEPESEPEPERKTQTETSVLPDCL 664
             +      +E +     V+    + +        +    SEP+ ER++   + VLPDCL
Sbjct: 606 FAEAAGSTKEEKETREETVIHQKSKPEYPNNTHEEVEVAGSEPKVERES---SPVLPDCL 665

Query: 665 LLMMYEPKLSMEVSKETWVCSADFIRCVPTRE-KKAIGKDPPPPPPPKKRE--------- 724
            LMM EPKLSMEVSKETWVCS DF+R +P R+  +  G D P      +++         
Sbjct: 666 RLMMCEPKLSMEVSKETWVCSTDFVRWLPERKVNQRNGLDQPKKHQHHQQQQQLSIINNE 725

Query: 725 ----TKP-TDTTQTAVVQPARWSCSFPAAAAAAAM---IEQKLVRAKGYEPFVLTRCKSE 782
                +P +      ++QP R SCSFPA AA A+M   +E+KLV +KGYEPFVLTRCKSE
Sbjct: 726 DSNSVRPCSKQLANQLMQPPRSSCSFPAPAAPASMATVVEEKLVGSKGYEPFVLTRCKSE 769

BLAST of CSPI03G19520 vs. NCBI nr
Match: gi|694318091|ref|XP_009341280.1| (PREDICTED: uncharacterized protein LOC103933313 [Pyrus x bretschneideri])

HSP 1 Score: 494.2 bits (1271), Expect = 4.2e-136
Identity = 401/840 (47.74%), Positives = 498/840 (59.29%), Query Frame = 1

Query: 2   DSDPHFRTTSTNSTSSTATPSSELFICFTSRFSSSSSSSMKISSKSILSPGRPR--EPSQ 61
           D  PH RT S++ST+STAT  SELFICFT+  S  SSSSMK+SSKS+LSPGR R  +P+Q
Sbjct: 4   DQRPH-RTKSSSSTASTAT--SELFICFTT--SRLSSSSMKLSSKSLLSPGRARGADPNQ 63

Query: 62  ISLSTSLSRRLKSSGSLKGGQASPMFPTG--RKKRGCAFDNPEPSSPKVTCIGQVRVKTK 121
           ISLS+SLSRRLK+SGS+KGGQASPMFP G   +KRGCAF+NPEPSSPKVTCIGQVRVKTK
Sbjct: 64  ISLSSSLSRRLKTSGSMKGGQASPMFPNGGTNRKRGCAFENPEPSSPKVTCIGQVRVKTK 123

Query: 122 KQGKKMRARSQKRRTNSEASFRRSESLVQSSQGNGSD---QQFSSHH--NHHLLRQNSNS 181
           KQGKKMR  ++ RR  +EASFRR+ES   ++     +     F S H  NH +   +SN 
Sbjct: 124 KQGKKMRIITRSRRRGTEASFRRTESSNNNAATQSEELFNNNFQSLHFPNHQINGSSSNR 183

Query: 182 NAGNGFQQECLSHRNQRWVHLPFTICEALRAFGAELNCFLPCHSSCSGNRENNKESKPAE 241
           N     Q+EC+  RNQRWVHLP TICEALRAFG+E NC  P  SSC    +  +ES  + 
Sbjct: 184 NN----QRECMRQRNQRWVHLPLTICEALRAFGSEFNCLFPNRSSCLSTDKEKEESSSST 243

Query: 242 RSSESES----SCGTVFARWLVAVQDGDGKGREIELVVGDEETRTEKENGS-----QRRH 301
           +   SES    SCG VFARW VA+QDGDGKGR+IELVVGD++ RTE+ + S     QRRH
Sbjct: 244 KEVRSESGGSSSCGAVFARWFVALQDGDGKGRQIELVVGDDQERTERGSESSGGRSQRRH 303

Query: 302 VFEGLDFKDK--NE---AVEEEQSRISICIPPKNALLLMRCRSDPVKMAELAKRFCEPPA 361
           VFEG++FK++  NE   AVEEE  R+SIC+PPKNALLLMRCRSDPVKMA LA RF E PA
Sbjct: 304 VFEGIEFKEERFNEGGGAVEEEGGRVSICVPPKNALLLMRCRSDPVKMAALANRFWEMPA 363

Query: 362 PKVDEEDEEGEDEDNEAKKRENEVKRDV--SVPVSSIVTVNKEEEEVEEEEDERKVEQLI 421
            + +EED+E ++E +   K E    R+V   + + S V    E E  E E+ E   E+  
Sbjct: 364 AEENEEDQEDDNEGSNV-KAEKVAGREVVEKMGMGSEVEAAAEAESAEREDGE--CEEWF 423

Query: 422 VKLENEEEMNEECVSDADKEKEEANLVLQEEEREEEEDNEEETIEMATE-NEIDEQKDIT 481
              E EE        D+++  E     + EE +E+ ++N E+ ++++ E  +I E+    
Sbjct: 424 --REGEEH------GDSEELVESLGFEVDEELKEDLDENSEKCLQISEEVRDIKEKVQCQ 483

Query: 482 VVNQLNQEQALEEKEEDKTDQVNQQETMAIPIPLLIQTHCEPEMAQDVEKLESVEKEEPK 541
           V     +E+  E+ EE K+D    Q         + +  C  E+  D E LE        
Sbjct: 484 V-----EEEFFEQDEEQKSDVTGLQ--------AVSEDLCSTEVVVDPENLE-------- 543

Query: 542 LSHESEQDQKTEEDENLREDKEEEEEEEGENGENGETTTSPSL-SVETEPVSDETE-TEV 601
           L  +++Q+Q         E+ EE EEE  E          PS+ SV++E V  E E TE 
Sbjct: 544 LEEQAQQEQ---------EEDEEREEEVSE-----AKIPEPSIDSVQSEEVEGEEEKTEA 603

Query: 602 DVNREEEEEEEEEKTTDEGIGPDDENDVLVGPEEEDQSKERETPLPEPESEPEPER---- 661
           +V  E  EEE E    D                          P P+PE EPEPE     
Sbjct: 604 EVAEESTEEETETVMADR-------------------------PEPDPEPEPEPEHPNMH 663

Query: 662 -----KTQTETSVLPDCLLLMMYEPKLSMEVSKETWVCSADFIRCVPTRE---------- 721
                K   + SVLPDCLLLMM EPKLSMEVSKETWVCS DFIRC+P R           
Sbjct: 664 LGTGSKRVGQNSVLPDCLLLMMCEPKLSMEVSKETWVCSTDFIRCLPERHVKKIDVPNEA 723

Query: 722 KKAIGKDPPPPPPPKKRE--------TKPTDTTQTAVVQPARWSCSFP---AAAAAAAMI 781
           KK +  D  PP  P  ++        +  +   Q   +QP R SCSFP      + A MI
Sbjct: 724 KKRVSIDSKPPLAPVVQQPITMQLPRSSCSFLVQLIRMQPPRSSCSFPVQEGPVSMATMI 763
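
The HSP summary lines above report identity and positives as counts over the aligned length, followed by a rounded percentage. As a minimal sketch, assuming the summary line keeps the layout shown on this page, the percentages can be recomputed from the counts as a consistency check. The helper name `parse_hsp_stats` is hypothetical and not part of the source database or of any BLAST tool.

```python
import re

# Hypothetical helper for illustration only.
# Matches lines such as:
#   Identity = 606/662 (91.54%), Positives = 623/662 (94.11%), Query Frame = 1
# The optional "i" also accepts the misspelling "Postives" seen in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return the counts from an HSP summary line plus recomputed percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identical, ident_len, _, positive, pos_len, _ = match.groups()
    return {
        "identical": int(identical),
        "positive": int(positive),
        "aligned_length": int(ident_len),
        # Percentages are recomputed from the counts, not copied from the text.
        "identity_pct": round(100 * int(identical) / int(ident_len), 2),
        "positive_pct": round(100 * int(positive) / int(pos_len), 2),
    }


stats = parse_hsp_stats(
    "Identity = 406/818 (49.63%), Positives = 507/818 (61.98%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positive_pct"])
```

Recomputing from the counts rather than trusting the printed percentage makes it easy to spot lines that were damaged during export.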

The following BLAST results are available for this feature:
Match Name        E-value   Identity  Description
A0A0A0L789_CUCSA  0.0e+00   99.23     Uncharacterized protein OS=Cucumis sativus GN=Csa_3G236550 PE=4 SV=1
M5X0H9_PRUPE      6.5e-136  48.17     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001903mg PE=4 SV=1
A0A061F8W3_THECC  1.9e-135  48.37     Uncharacterized protein OS=Theobroma cacao GN=TCM_031658 PE=4 SV=1
B9I1E3_POPTR      5.5e-135  46.85     Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0011s09990g PE=4 SV=2
A0A0L9ULB0_PHAAN  7.0e-130  48.58     Uncharacterized protein OS=Phaseolus angularis GN=LR48_Vigan05g113200 PE=4 SV=1

Match Name   E-value  Identity  Description
AT3G15095.1  6.6e-55  40.21     unknown protein

Match Name                         E-value   Identity  Description
gi|778680362|ref|XP_004146243.2|   0.0e+00   99.23     PREDICTED: glutamic acid-rich protein [Cucumis sativus]
gi|659111998|ref|XP_008456014.1|   0.0e+00   92.01     PREDICTED: glutamic acid-rich protein isoform X1 [Cucumis melo]
gi|659112000|ref|XP_008456015.1|   1.1e-298  91.54     PREDICTED: glutamic acid-rich protein isoform X2 [Cucumis melo]
gi|1009162037|ref|XP_015899218.1|  5.0e-145  49.63     PREDICTED: trichohyalin [Ziziphus jujuba]
gi|694318091|ref|XP_009341280.1|   4.2e-136  47.74     PREDICTED: uncharacterized protein LOC103933313 [Pyrus x bretschneideri]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0010207 photosystem II assembly
cellular_component GO:0005575 cellular_component
cellular_component GO:0009507 chloroplast
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI03G19520.1  CSPI03G19520.1  mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term  IPR Description   Source   Source Term    Source Description          Alignment
None      No IPR available  unknown  Coil           Coil                        coord: 566..586, score: -; coord: 376..446, score: -; coord: 341..361, score: -; coord: 460..480, score: -; coord: 518..545, scor
None      No IPR available  PANTHER  PTHR33448      FAMILY NOT NAMED            coord: 1..154, score: 1.7E-242; coord: 178..473, score: 1.7E-242; coord: 536..781, score: 1.7E
None      No IPR available  PANTHER  PTHR33448:SF1  CHLOROPLAST PROTEIN HCF243  coord: 1..154, score: 1.7E-242; coord: 536..781, score: 1.7E-242; coord: 178..473, score: 1.7E