CSPI01G29250 (gene) Wild cucumber (PI 183967)

Name: CSPI01G29250
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Protein of Unknown Function (DUF239)
Location: Chr1 : 23708033 .. 23711504 (+)
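The location field can be read as chromosome, start, end and strand. The following is a minimal sketch of parsing it and computing the genomic span, assuming 1-based inclusive coordinates (an assumption; the record does not state its coordinate convention). The names `parse_location` and `span_length` are hypothetical helpers, not part of any database API.

```python
import re

# Matches strings such as "Chr1 : 23708033 .. 23711504 (+)".
LOCATION_RE = re.compile(r"^(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text):
    """Split a location string into (chromosome, start, end, strand)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end, strand = parse_location("Chr1 : 23708033 .. 23711504 (+)")
print(chrom, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the feature spans 3472 bases on the forward strand of Chr1.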
The following sequences are available for this feature:

Gene sequence (with intron)

GGGAAAATGACAAGAAAGTGGGGCAGTTTTCTTGAGGGGGGGAGAGAGAGAGATTGAATGCTTTTTAGAGAAGTGCGCGTGTGGGTTTTTGAGAGAGAGAGAGAGAGAGAGGCTCTCTGTGAAGTGGGTTTTTGGTGGTAGTGGCCATCCGCCATTGTTAGTTCTTCAGCTGAGCTTCTTCTTCTTCTCATTCTTCTGTTTTTGGGGTTTTCTTCATTTGGGGTGTAGTGGAGGAAATTGCTGTGGATCTTTGATTTTGAGAGTGTTTTTGGACATGGGTTCTGCTCGATTTAGCAGATGCAGGACCGTGGAAGCTCTTGTCGTCGTTTTTTCTGTTTTGGGGATGGTTTCTTTGTGCTGTGGGACGAGGTTGGAATCAGGTTCTCGCCAGAAGCTCGAGGTGCAAAAACACCTCAGGCGTTTGAACAAACCTGCTGTTAAAACCATTGAGGTTTGTTTGTTTTCTATTCAAATGGGGCTTTGATTCTGGTGGTTCTTCCTCTGTTTGGTTTCTGGGAAAGTGGGGGAAAATGTGGGGAAATGGGTGTTTTTCTTCATTTTCTCAGGGACCAAGCAAGGATGGATGTGTATAACTTCCTTCCCATTTTTGCTCGAATGGGATTTTCTTCCTGGGAAAATGTTTGGCTTCCTTCATTTTCTCTGGAACCAAACAAGAACGGATACATTCTACCCAAATGGGGCTTTTCTTCCAGTGTCACTCAGTTTGGTTCCTGAGAAAATGTGGGAAAATGCTTTGTTTCCTTAGTTGTCTCCTTCACATTTTCTCTTGAAATCTCTTTATCTTAACACCATATATTGTTGTCTCTCTTTGTTAAACTTCACCGAAAACTTGGCTGAAGATGAAGTTTATAACCTTTTACAATTTAGAACAAAGAGGGTTTTATGGGTTTTTATGTTGAGAAAGCTCTTTGATATTGTTTTCATGTTTGTTCTAATGGTTCAACCAGAGTCCAGATGGGGACCTAATTGACTGTGTTCACATGTCTCACCAACCTGCATTTGACCATCCTTTCCTCAAAGACCATAAAATCCAGGTTTGTTCTAGTTTGAGCTGATAATTTTCTTTCATTTTCTCCTCAAAGTCTTAACTCACAAAAGTTTGTCCACGGTTCTTTTTGGTTTTTCAGATGAGGCCAAGTTTTCATCCAGAAGGGTTATTTGATGAGAACAAAGTAGCTGAGAAAGCCAGTGAAAAACCAAAACCAATAAACCAACTGTGGCATGTTAATGGAAAATGTCCCGAAGGAACTATCCCCATTAGAAGAACAAAACATGAGGATGTTTTGAGAGCAAGTTCAGTCAAAAGATACGGAAGGAAGAAGCACAGATCAACTCCAATACCCCCAAGGTCTGCAGAGCCTGATCTCATAAACCAAAGTGGTCATCAGGTAATCATCGTATTTCTCCTTTTACTTTATCTCTTTAATGTTCTTTTTTTCAGCTCTGATTTTGTTCTTTTTGGGATTGTCTAGCATGCGATAGCTTATGTGGAAGGAGATAAGTATTATGGAGCTAAAGCAACTATGAATGTGTGGGAGCCCAGTATACAGCAGCCTAATGAGTTTAGCTTATCACAGCTTTGGATATTGGGTGGTTCTTTTGGTGAAGATCTCAATAGCATTGAAGCTGGTTGGCAGGTATCATCATCATCTTCATTATCTTCATGTTACTGACTCAAAATGATTGAACTTATAAATGAGAAAAATGTCAATAACTTCTTATTCTTTCTTAACATTATTAATTTCTTGCCATTATTCCCCTTCCATGTATCAGGTTAGTCCTGATCTCTACGGCGATAACAATACTAGACTCTTCACATACTGGACGGTGAGCCCTGCGATACACATTCGAACATTTTTCCAGCGGGTTTTTTGGATTGTCATTGTTTCTTTTGCTTACTTGTCTCTTTTTTTTTTTCCAGAGTGATGCATATCAAGCTACAGGCTGTTACAACCTCCTATGCTCAGGCTTTATTCAAA
TCAACAGCGATATTGCAATGGGAGCGAGCATCTCTCCTGTCTCAGCATACAGAAATTCACAATACGATATTAGTATACTTGTCTGGAAGGTAAAAGCCAAAACCTGACCGCTTTATTTTTAAATAGCATGATAGTGTTTCTAAAACAAGGAGAAAAATCCAATACTTACGAAAATAATAGTACCAATGAAAGATGGCATGTGGGGGAAGAATCAGATGATGGTTGTTTGTCTCAATCCCAATTGATCACTCAAGACATTTCAAATCTTTTGGCCATTTCAATAAATACCTTGCTGCAATAAATGAGACACTGGTAGGGAGGGTGGGGGGAGATTTGAAACGAGAGTGCATTATTGTGGTTCTCATGCTGCTGCTTAGAATCTCTTGGCCTTAACCATGACAGGTTGGCCTTAGGGGCCACCAAAAGAGATAAACGTAAGGGGTGGAGATTTGAAATAGCTGGCTGGAAAAATGACCAGAGCATTAGGAGAATGGTTAACACACATAATGGCTATTGATATTGATTAGTTGAAGTGTATATAAACTTGTTTGACCCACCGTGTCTCTTGACAAGACACCATAACCAAGTCTCAGGCTCTTTTCGAGTCCGTGCTTGGCAGATCTGAAATGAAAACCACTTTAAACTAATAGTTTTGTTTGTATTAGATTGGTTTCTCGGATAATATTTACATAAATGGTGATTGATCTATGCAGGATCCAAAAGAAGGGCATTGGTGGATGCAATTTGGAAATGGTTATGTGATGGGATATTGGCCTTCATTTTTATTCTCATATTTAGCAGACAGTGCTTCCATGATAGAATGGGGAGGGGAAGTTGTGAACTCAGAGCCAAATGGAGAACACACTTCAACTCAAATGGGGAGTGGCCATTTCCCAGATGAAGGATTTGGTAAAGCAAGCTATTTTAGAAACATTCAAGTTGTCGATGGATCCAACAATCTGAAACCCCCCAAAGGCATTGGCACATTCACAGAACAACCTGATTGTTATGATGTGCAAACAGGTAGCAATGGGGATTGGGGCCACTTCTTTTACTATGGAGGCCCCGGTAGAAACGCTAATTGCCAATGAAAAAGATGAAATGAATGACCATTCTACCCTCCATGTCCTTCACTCACTGTAAACTACCTCTCTTAGGCTTAGTTAGAGGCTATATGTGACATTTTCTCACTGGTGGAAAGTTTTGGCTCTATGGGTCTCTTTTCTTTTTTTCTTGTTTTTTTTTTAGAGGAAAAAGATTCTCTCCATGTAAGTGGTGGGGTGTAGGGGAAAGAAAGGGCCAAAGTTTCTTAAAAGCCTGAAAAGAAACAGGGGCCTTTTGTTATGACTTGATAATGATGAATGTGTATTTTATTAGTTTGTTCAGTATACTCCTCTCCCTTGGAGGTGCTTTATTAATGAAGAATGGGCTTTTTTATATTTTTATTTTGTGGACTGTGAGGAATATTGGTG

mRNA sequence

ATGGGTTCTGCTCGATTTAGCAGATGCAGGACCGTGGAAGCTCTTGTCGTCGTTTTTTCTGTTTTGGGGATGGTTTCTTTGTGCTGTGGGACGAGGTTGGAATCAGGTTCTCGCCAGAAGCTCGAGGTGCAAAAACACCTCAGGCGTTTGAACAAACCTGCTGTTAAAACCATTGAGAGTCCAGATGGGGACCTAATTGACTGTGTTCACATGTCTCACCAACCTGCATTTGACCATCCTTTCCTCAAAGACCATAAAATCCAGATGAGGCCAAGTTTTCATCCAGAAGGGTTATTTGATGAGAACAAAGTAGCTGAGAAAGCCAGTGAAAAACCAAAACCAATAAACCAACTGTGGCATGTTAATGGAAAATGTCCCGAAGGAACTATCCCCATTAGAAGAACAAAACATGAGGATGTTTTGAGAGCAAGTTCAGTCAAAAGATACGGAAGGAAGAAGCACAGATCAACTCCAATACCCCCAAGGTCTGCAGAGCCTGATCTCATAAACCAAAGTGGTCATCAGCATGCGATAGCTTATGTGGAAGGAGATAAGTATTATGGAGCTAAAGCAACTATGAATGTGTGGGAGCCCAGTATACAGCAGCCTAATGAGTTTAGCTTATCACAGCTTTGGATATTGGGTGGTTCTTTTGGTGAAGATCTCAATAGCATTGAAGCTGGTTGGCAGGTTAGTCCTGATCTCTACGGCGATAACAATACTAGACTCTTCACATACTGGACGAGTGATGCATATCAAGCTACAGGCTGTTACAACCTCCTATGCTCAGGCTTTATTCAAATCAACAGCGATATTGCAATGGGAGCGAGCATCTCTCCTGTCTCAGCATACAGAAATTCACAATACGATATTAGTATACTTGTCTGGAAGGATCCAAAAGAAGGGCATTGGTGGATGCAATTTGGAAATGGTTATGTGATGGGATATTGGCCTTCATTTTTATTCTCATATTTAGCAGACAGTGCTTCCATGATAGAATGGGGAGGGGAAGTTGTGAACTCAGAGCCAAATGGAGAACACACTTCAACTCAAATGGGGAGTGGCCATTTCCCAGATGAAGGATTTGGTAAAGCAAGCTATTTTAGAAACATTCAAGTTGTCGATGGATCCAACAATCTGAAACCCCCCAAAGGCATTGGCACATTCACAGAACAACCTGATTGTTATGATGTGCAAACAGGTAGCAATGGGGATTGGGGCCACTTCTTTTACTATGGAGGCCCCGGTAGAAACGCTAATTGCCAATGA

Coding sequence (CDS)

ATGGGTTCTGCTCGATTTAGCAGATGCAGGACCGTGGAAGCTCTTGTCGTCGTTTTTTCTGTTTTGGGGATGGTTTCTTTGTGCTGTGGGACGAGGTTGGAATCAGGTTCTCGCCAGAAGCTCGAGGTGCAAAAACACCTCAGGCGTTTGAACAAACCTGCTGTTAAAACCATTGAGAGTCCAGATGGGGACCTAATTGACTGTGTTCACATGTCTCACCAACCTGCATTTGACCATCCTTTCCTCAAAGACCATAAAATCCAGATGAGGCCAAGTTTTCATCCAGAAGGGTTATTTGATGAGAACAAAGTAGCTGAGAAAGCCAGTGAAAAACCAAAACCAATAAACCAACTGTGGCATGTTAATGGAAAATGTCCCGAAGGAACTATCCCCATTAGAAGAACAAAACATGAGGATGTTTTGAGAGCAAGTTCAGTCAAAAGATACGGAAGGAAGAAGCACAGATCAACTCCAATACCCCCAAGGTCTGCAGAGCCTGATCTCATAAACCAAAGTGGTCATCAGCATGCGATAGCTTATGTGGAAGGAGATAAGTATTATGGAGCTAAAGCAACTATGAATGTGTGGGAGCCCAGTATACAGCAGCCTAATGAGTTTAGCTTATCACAGCTTTGGATATTGGGTGGTTCTTTTGGTGAAGATCTCAATAGCATTGAAGCTGGTTGGCAGGTTAGTCCTGATCTCTACGGCGATAACAATACTAGACTCTTCACATACTGGACGAGTGATGCATATCAAGCTACAGGCTGTTACAACCTCCTATGCTCAGGCTTTATTCAAATCAACAGCGATATTGCAATGGGAGCGAGCATCTCTCCTGTCTCAGCATACAGAAATTCACAATACGATATTAGTATACTTGTCTGGAAGGATCCAAAAGAAGGGCATTGGTGGATGCAATTTGGAAATGGTTATGTGATGGGATATTGGCCTTCATTTTTATTCTCATATTTAGCAGACAGTGCTTCCATGATAGAATGGGGAGGGGAAGTTGTGAACTCAGAGCCAAATGGAGAACACACTTCAACTCAAATGGGGAGTGGCCATTTCCCAGATGAAGGATTTGGTAAAGCAAGCTATTTTAGAAACATTCAAGTTGTCGATGGATCCAACAATCTGAAACCCCCCAAAGGCATTGGCACATTCACAGAACAACCTGATTGTTATGATGTGCAAACAGGTAGCAATGGGGATTGGGGCCACTTCTTTTACTATGGAGGCCCCGGTAGAAACGCTAATTGCCAATGA
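The coding sequence above corresponds to the protein used as the BLAST query. As an illustrative sketch (not the database's own pipeline), the snippet below translates a nucleotide string with the standard genetic code and applies it to the first 27 and last 18 nucleotides of the CDS. The function name `translate` and the table construction are assumptions made for this example.

```python
from itertools import product

# Standard genetic code, codons enumerated with base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


cds_start = "ATGGGTTCTGCTCGATTTAGCAGATGC"  # first 27 nt of the CDS
cds_end = "AACGCTAATTGCCAATGA"             # last 18 nt of the CDS
print(translate(cds_start))
print(translate(cds_end))
```

The first fragment translates to MGSARFSRC and the last to NANCQ followed by a stop, which agrees with the start and end of the query protein shown in the alignments.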
BLAST of CSPI01G29250 vs. TrEMBL
Match: A0A0A0LZQ5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G599510 PE=4 SV=1)

HSP 1 Score: 896.0 bits (2314), Expect = 1.8e-257
Identity = 422/422 (100.00%), Positives = 422/422 (100.00%), Query Frame = 1

Query: 1   MGSARFSRCRTVEALVVVFSVLGMVSLCCGTRLESGSRQKLEVQKHLRRLNKPAVKTIES 60
           MGSARFSRCRTVEALVVVFSVLGMVSLCCGTRLESGSRQKLEVQKHLRRLNKPAVKTIES
Sbjct: 1   MGSARFSRCRTVEALVVVFSVLGMVSLCCGTRLESGSRQKLEVQKHLRRLNKPAVKTIES 60

Query: 61  PDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPSFHPEGLFDENKVAEKASEKPKPINQLWH 120
           PDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPSFHPEGLFDENKVAEKASEKPKPINQLWH
Sbjct: 61  PDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPSFHPEGLFDENKVAEKASEKPKPINQLWH 120

Query: 121 VNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAY 180
           VNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAY
Sbjct: 121 VNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAY 180

Query: 181 VEGDKYYGAKATMNVWEPSIQQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNN 240
           VEGDKYYGAKATMNVWEPSIQQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNN
Sbjct: 181 VEGDKYYGAKATMNVWEPSIQQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNN 240

Query: 241 TRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK 300
           TRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
Sbjct: 241 TRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK 300

Query: 301 EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGEHTSTQMGSGHFPDE 360
           EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGEHTSTQMGSGHFPDE
Sbjct: 301 EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGEHTSTQMGSGHFPDE 360

Query: 361 GFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQTGSNGDWGHFFYYGGPGRNAN 420
           GFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQTGSNGDWGHFFYYGGPGRNAN
Sbjct: 361 GFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQTGSNGDWGHFFYYGGPGRNAN 420

Query: 421 CQ 423
           CQ
Sbjct: 421 CQ 422
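
The percentages on each HSP summary line are the match counts divided by the alignment length. A small sketch of recomputing them from such a line follows; `hsp_percentages` is a hypothetical helper, and the pattern is written to accept either spelling of the positives label.

```python
import re

# Captures "Identity = a/b" and "Positives = c/d" (or the "Postives" spelling).
HSP_RE = re.compile(r"Identity = (\d+)/(\d+).*?Pos\w*tives = (\d+)/(\d+)")


def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )


line = "Identity = 359/422 (85.07%), Positives = 383/422 (90.76%), Query Frame = 1"
print(hsp_percentages(line))
```

For the Theobroma cacao hit, 359 identical positions over a 422-column alignment gives 85.07% identity, and 383 conservative matches gives 90.76% positives.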

BLAST of CSPI01G29250 vs. TrEMBL
Match: A0A061GPD6_THECC (Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_038694 PE=4 SV=1)

HSP 1 Score: 760.0 bits (1961), Expect = 1.5e-216
Identity = 359/422 (85.07%), Positives = 383/422 (90.76%), Query Frame = 1

Query: 1   MGSARFSRCRTVEA-LVVVFSVLGMVSLCCGTRLESGSRQKLEVQKHLRRLNKPAVKTIE 60
           M    FSR  T    L+VVF +LG +SL C  R    SRQ+L+VQKHL RLNKPAVKTIE
Sbjct: 1   MADVHFSRGWTRRGVLLVVFCLLGSISLSCAAR-PGVSRQRLQVQKHLNRLNKPAVKTIE 60

Query: 61  SPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPSFHPEGLFDENKVAEKASEKPKPINQLW 120
           SPDGD+IDCVH+SHQPAFDHPFLKDHKIQMRP++H EGLFDENKV+EK      PI QLW
Sbjct: 61  SPDGDIIDCVHISHQPAFDHPFLKDHKIQMRPNYHREGLFDENKVSEKPKPHSNPITQLW 120

Query: 121 HVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIA 180
           HVNGKCPEGTIPIRRTK +DVLRASSVKRYGRKKHR+ P  PRSA+PDLIN+SGHQHAIA
Sbjct: 121 HVNGKCPEGTIPIRRTKEQDVLRASSVKRYGRKKHRAIP-QPRSADPDLINESGHQHAIA 180

Query: 181 YVEGDKYYGAKATMNVWEPSIQQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDN 240
           YVEGDKYYGAKAT+NVWEP IQQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDN
Sbjct: 181 YVEGDKYYGAKATINVWEPKIQQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDN 240

Query: 241 NTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDP 300
           NTRLFTYWTSDAYQATGCYNLLCSGFIQINS+IAMGASISPVSAYRNSQYDISILVWKDP
Sbjct: 241 NTRLFTYWTSDAYQATGCYNLLCSGFIQINSEIAMGASISPVSAYRNSQYDISILVWKDP 300

Query: 301 KEGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGEHTSTQMGSGHFPD 360
           KEGHWWMQFGN YV+GYWPSFLFSYLADSASMIEWGGEVVNSEP+G HTSTQMGSG FP+
Sbjct: 301 KEGHWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEPDGHHTSTQMGSGRFPE 360

Query: 361 EGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQTGSNGDWGHFFYYGGPGRNA 420
           EGFGK+SYFRNIQVVDGSNNLK PKG+GTFTEQ +CYDVQTGSNGDWGH+FYYGGPG+N 
Sbjct: 361 EGFGKSSYFRNIQVVDGSNNLKAPKGLGTFTEQSNCYDVQTGSNGDWGHYFYYGGPGKNP 420

Query: 421 NC 422
           NC
Sbjct: 421 NC 420

BLAST of CSPI01G29250 vs. TrEMBL
Match: A0A0D2RR33_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_009G000100 PE=4 SV=1)

HSP 1 Score: 753.8 bits (1945), Expect = 1.1e-214
Identity = 358/426 (84.04%), Positives = 388/426 (91.08%), Query Frame = 1

Query: 1   MGSARFSRCRTVEALVVVFS-VLGMVSLCCGTRLESGSRQKLEVQKHLRRLNKPAVKTIE 60
           MG   FSR R  + +V++F   L ++SL C  RL S SRQKL+VQ HL RLNKPAVKTI+
Sbjct: 1   MGDVHFSRERAGKVVVLIFFWTLSLISLSCAARL-SVSRQKLQVQNHLNRLNKPAVKTIQ 60

Query: 61  SPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPSFHPEGLFDENKVAEKASEKPK----PI 120
           SPDGD+IDCVH++ QPAFDHPFLKDHKIQMRPS+HPEGLFDE+KV++  +EKPK    PI
Sbjct: 61  SPDGDIIDCVHLARQPAFDHPFLKDHKIQMRPSYHPEGLFDESKVSD--TEKPKKGSNPI 120

Query: 121 NQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQ 180
            QLWH+NGKCPEGTIPIRRTK EDVLRASSVK YGRKKHR+TP  PRSA+PDLIN+SGHQ
Sbjct: 121 TQLWHMNGKCPEGTIPIRRTKEEDVLRASSVKSYGRKKHRATP-QPRSADPDLINESGHQ 180

Query: 181 HAIAYVEGDKYYGAKATMNVWEPSIQQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDL 240
           HAIAYVEGDKYYGAKAT+NVWEP IQQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDL
Sbjct: 181 HAIAYVEGDKYYGAKATINVWEPKIQQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDL 240

Query: 241 YGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILV 300
           YGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINS+IAMGASISPVSAYRNSQYDISILV
Sbjct: 241 YGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSEIAMGASISPVSAYRNSQYDISILV 300

Query: 301 WKDPKEGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGEHTSTQMGSG 360
           WKDPKEGHWWMQFGN YV+GYWPSFLFSYLADSASMIEWGGEVVNSEP+G HTSTQMGSG
Sbjct: 301 WKDPKEGHWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEPDGRHTSTQMGSG 360

Query: 361 HFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQTGSNGDWGHFFYYGGP 420
            FP+EGFGK+SYFRNIQVVD SNNLK PKG+GTFTEQ +CYDVQTGSNGDWGH+FYYGGP
Sbjct: 361 RFPEEGFGKSSYFRNIQVVDDSNNLKAPKGLGTFTEQSNCYDVQTGSNGDWGHYFYYGGP 420

Query: 421 GRNANC 422
           G+N NC
Sbjct: 421 GKNPNC 422

BLAST of CSPI01G29250 vs. TrEMBL
Match: A0A0B0PIB1_GOSAR (tRNA-splicing ligase RtcB OS=Gossypium arboreum GN=F383_29250 PE=4 SV=1)

HSP 1 Score: 751.9 bits (1940), Expect = 4.2e-214
Identity = 358/426 (84.04%), Positives = 387/426 (90.85%), Query Frame = 1

Query: 1   MGSARFSRCRTVEALVVVFS-VLGMVSLCCGTRLESGSRQKLEVQKHLRRLNKPAVKTIE 60
           MG   FSR +  + +V+ F  +L ++SL C  RL S SRQKL+VQ HL RLNKPAVKTI+
Sbjct: 1   MGDVHFSREKAGKGVVLFFFWMLSLISLSCAARL-SVSRQKLQVQNHLNRLNKPAVKTIQ 60

Query: 61  SPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPSFHPEGLFDENKVAEKASEKPK----PI 120
           SPDGD+IDCVH+S QPAFDHPFLKDHKIQMRPS+HPEGLFDENKV++  +EKPK    PI
Sbjct: 61  SPDGDIIDCVHLSRQPAFDHPFLKDHKIQMRPSYHPEGLFDENKVSD--TEKPKKGSNPI 120

Query: 121 NQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQ 180
            QLWH+NGKCP+GTIPIRRTK EDVLRASSVKRYGRKKHR+ P  PRSA+PDLIN+SGHQ
Sbjct: 121 TQLWHMNGKCPDGTIPIRRTKEEDVLRASSVKRYGRKKHRAIP-QPRSADPDLINESGHQ 180

Query: 181 HAIAYVEGDKYYGAKATMNVWEPSIQQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDL 240
           HAIAYVEGDKYYGAKAT+NVWEP IQQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDL
Sbjct: 181 HAIAYVEGDKYYGAKATINVWEPKIQQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDL 240

Query: 241 YGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILV 300
           YGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINS IAMGASISPVSAYRNSQYDISILV
Sbjct: 241 YGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSAIAMGASISPVSAYRNSQYDISILV 300

Query: 301 WKDPKEGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGEHTSTQMGSG 360
           WKDPKEGHWWMQFGN YV+GYWPSFLFSYLADSASMIEWGGEVVNSEP+G HTSTQMGSG
Sbjct: 301 WKDPKEGHWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEPDGHHTSTQMGSG 360

Query: 361 HFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQTGSNGDWGHFFYYGGP 420
            FP+EGFGK+SYFRNIQVVD SNNLK PKG+GTFTEQ +CYDVQTGSNGDWGH+FYYGGP
Sbjct: 361 RFPEEGFGKSSYFRNIQVVDDSNNLKAPKGLGTFTEQSNCYDVQTGSNGDWGHYFYYGGP 420

Query: 421 GRNANC 422
           G+N NC
Sbjct: 421 GKNPNC 422

BLAST of CSPI01G29250 vs. TrEMBL
Match: B9GLD1_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0001s04100g PE=4 SV=2)

HSP 1 Score: 751.1 bits (1938), Expect = 7.2e-214
Identity = 353/422 (83.65%), Positives = 384/422 (91.00%), Query Frame = 1

Query: 2   GSARFSRCRTVEALVVVFSVLGMVSLCCGTRLESGSRQKLEVQKHLRRLNKPAVKTIESP 61
           G   FSR R    LV+VF +  +VSL C  RL S SRQKLEVQKHL RLNKPAVK+IESP
Sbjct: 8   GCVHFSRSRL---LVLVFCLCSLVSLSCAARL-SVSRQKLEVQKHLDRLNKPAVKSIESP 67

Query: 62  DGDLIDCVHMSHQPAFDHPFLKDHKIQMRPSFHPEG-LFDENKVAEKASEKPKPINQLWH 121
           DGD+IDCVHMSHQPAFDHP+LKDHKIQMRP +HPEG +FD+NKV+ ++ E+  PI Q WH
Sbjct: 68  DGDIIDCVHMSHQPAFDHPYLKDHKIQMRPGYHPEGRVFDDNKVSTESKERTNPITQSWH 127

Query: 122 VNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAY 181
           VNGKCPEGTIPIRRTK +DVLRASSVKRYG+KKHR+ P  PRSA+PDL+N+SGHQHAIAY
Sbjct: 128 VNGKCPEGTIPIRRTKKDDVLRASSVKRYGKKKHRAIP-QPRSADPDLVNESGHQHAIAY 187

Query: 182 VEGDKYYGAKATMNVWEPSIQQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNN 241
           VEGDKYYGAKAT+NVWEP IQQPNEFSLSQLWILGGSFG+DLNSIEAGWQVSPDLYGDNN
Sbjct: 188 VEGDKYYGAKATLNVWEPKIQQPNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNN 247

Query: 242 TRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK 301
           TRLFTYWTSDAYQATGCYNLLCSGFIQINS+IAMGASISPVS YRNSQYDISILVWKDPK
Sbjct: 248 TRLFTYWTSDAYQATGCYNLLCSGFIQINSEIAMGASISPVSGYRNSQYDISILVWKDPK 307

Query: 302 EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGEHTSTQMGSGHFPDE 361
           EGHWWMQFGN YV+GYWPSFLFSYLADSASMIEWGGEVVNSEP+G+HTSTQMGSG FP+E
Sbjct: 308 EGHWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEPDGQHTSTQMGSGRFPEE 367

Query: 362 GFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQTGSNGDWGHFFYYGGPGRNAN 421
           GFGK+SYFRN+QVVD SNNLK PKGIGTFTEQ +CYDV TG+NGDWGH+FYYGGPGRN N
Sbjct: 368 GFGKSSYFRNVQVVDASNNLKAPKGIGTFTEQSNCYDVLTGNNGDWGHYFYYGGPGRNEN 424

Query: 422 CQ 423
           CQ
Sbjct: 428 CQ 424

BLAST of CSPI01G29250 vs. TAIR10
Match: AT3G13510.1 (AT3G13510.1 Protein of Unknown Function (DUF239))

HSP 1 Score: 711.8 bits (1836), Expect = 2.4e-205
Identity = 333/421 (79.10%), Positives = 365/421 (86.70%), Query Frame = 1

Query: 1   MGSARFSRCRTVEALVVVFSVLGMVSLCCGTRLESGSRQKLEVQKHLRRLNKPAVKTIES 60
           MG+  FS  +      V   V  M+SL C       SRQK EV+KHL RLNKP VKTI+S
Sbjct: 1   MGAEHFSLVKFNRGFFVCLWV--MLSLSCAAASYGSSRQKFEVKKHLNRLNKPPVKTIQS 60

Query: 61  PDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPSFHPEGLFDENKVAEKASEKPKPINQLWH 120
           PDGD+IDC+ +S QPAFDHPFLKDHKIQMRPS+HPEGLFD+NKV+ +   K   I QLWH
Sbjct: 61  PDGDIIDCIPISKQPAFDHPFLKDHKIQMRPSYHPEGLFDDNKVSAEPKGKETHIPQLWH 120

Query: 121 VNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAY 180
             GKC EGTIP+RRT+ +DVLRASSVKRYG+KKHRS PIP +SAEPDLINQ+GHQHAIAY
Sbjct: 121 RYGKCTEGTIPMRRTREDDVLRASSVKRYGKKKHRSVPIP-KSAEPDLINQNGHQHAIAY 180

Query: 181 VEGDKYYGAKATMNVWEPSIQQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNN 240
           VEGDKYYGAKAT+NVWEP IQ  NEFSLSQ+W+LGGSFG+DLNSIEAGWQVSPDLYGDNN
Sbjct: 181 VEGDKYYGAKATLNVWEPKIQNTNEFSLSQIWLLGGSFGQDLNSIEAGWQVSPDLYGDNN 240

Query: 241 TRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK 300
           TRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVS YRNSQYDISIL+WKDPK
Sbjct: 241 TRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSGYRNSQYDISILIWKDPK 300

Query: 301 EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGEHTSTQMGSGHFPDE 360
           EGHWWMQFGNGYV+GYWPSFLFSYL +SASMIEWGGEVVNS+  G HT TQMGSGHFP+E
Sbjct: 301 EGHWWMQFGNGYVLGYWPSFLFSYLTESASMIEWGGEVVNSQSEGHHTWTQMGSGHFPEE 360

Query: 361 GFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQTGSNGDWGHFFYYGGPGRNAN 420
           GF KASYFRNIQVVDGSNNLK PKG+GTFTE+ +CYDVQTGSN DWGH+FYYGGPG+N N
Sbjct: 361 GFSKASYFRNIQVVDGSNNLKAPKGLGTFTEKSNCYDVQTGSNDDWGHYFYYGGPGKNKN 418

Query: 421 C 422
           C
Sbjct: 421 C 418

BLAST of CSPI01G29250 vs. TAIR10
Match: AT1G55360.1 (AT1G55360.1 Protein of Unknown Function (DUF239))

HSP 1 Score: 702.6 bits (1812), Expect = 1.5e-202
Identity = 331/421 (78.62%), Positives = 366/421 (86.94%), Query Frame = 1

Query: 2   GSARFSRCRTVEALVVVFSVLGMVSLCCGTRLESGSRQKLEVQKHLRRLNKPAVKTIESP 61
           G    S  +     +V   + G  SL    R    S+QK EV+KHL RLNKPAVK+I+S 
Sbjct: 3   GVVHLSTAKLARGFLVCLCLWGFFSLSYAAR-SGVSKQKFEVKKHLNRLNKPAVKSIQSS 62

Query: 62  DGDLIDCVHMSHQPAFDHPFLKDHKIQMRPSFHPEGLFDENKV-AEKASEKPKPINQLWH 121
           DGD+IDCV +S QPAFDHPFLKDHKIQM+P++HPEGLFD+NKV A K++EK   I QLWH
Sbjct: 63  DGDVIDCVPISKQPAFDHPFLKDHKIQMKPNYHPEGLFDDNKVSAPKSNEKEGHIPQLWH 122

Query: 122 VNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAY 181
             GKC EGTIP+RRTK +DVLRASSVKRYG+KK RS P+P +SAEPDLINQSGHQHAIAY
Sbjct: 123 RYGKCSEGTIPMRRTKEDDVLRASSVKRYGKKKRRSVPLP-KSAEPDLINQSGHQHAIAY 182

Query: 182 VEGDKYYGAKATMNVWEPSIQQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNN 241
           VEGDKYYGAKAT+NVWEP IQQ NEFSLSQ+W+LGGSFG+DLNSIEAGWQVSPDLYGDNN
Sbjct: 183 VEGDKYYGAKATINVWEPKIQQQNEFSLSQIWLLGGSFGQDLNSIEAGWQVSPDLYGDNN 242

Query: 242 TRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK 301
           TRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVS YRNSQYDISIL+WKDPK
Sbjct: 243 TRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSGYRNSQYDISILIWKDPK 302

Query: 302 EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGEHTSTQMGSGHFPDE 361
           EGHWWMQFGNGYV+GYWPSFLFSYL +SASMIEWGGEVVNS+ +G+HTSTQMGSG FP+E
Sbjct: 303 EGHWWMQFGNGYVLGYWPSFLFSYLTESASMIEWGGEVVNSQSDGQHTSTQMGSGKFPEE 362

Query: 362 GFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQTGSNGDWGHFFYYGGPGRNAN 421
           GF KASYFRNIQVVDGSNNLK PKG+GTFTEQ +CYDVQTGSN DWGH+FYYGGPG+N  
Sbjct: 363 GFSKASYFRNIQVVDGSNNLKAPKGLGTFTEQSNCYDVQTGSNDDWGHYFYYGGPGKNQK 421

BLAST of CSPI01G29250 vs. TAIR10
Match: AT5G56530.1 (AT5G56530.1 Protein of Unknown Function (DUF239))

HSP 1 Score: 700.7 bits (1807), Expect = 5.6e-202
Identity = 328/422 (77.73%), Positives = 361/422 (85.55%), Query Frame = 1

Query: 1   MGSARFSRCRTVEALVVVFSVLGMVSLCCGTRLESGSRQKLEVQKHLRRLNKPAVKTIES 60
           M +A FS+ R     +V F   G++SL C  RL S SRQ  EV KHL RLNKPAVK+I+S
Sbjct: 1   MAAAHFSKERVFRGFLVWFCFWGLMSLTCAGRL-SVSRQNFEVHKHLNRLNKPAVKSIQS 60

Query: 61  PDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPSFHPEGLFDENKVAEKASEKPKPINQLWH 120
           PDGD+IDCVH+S QPAFDHPFLKDHKIQM PS+ PE LF E+KV+EK  E   PI QLWH
Sbjct: 61  PDGDIIDCVHISKQPAFDHPFLKDHKIQMGPSYTPESLFGESKVSEKPKESVNPITQLWH 120

Query: 121 VNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAY 180
            NG C EGTIP+RRTK EDVLRASSVKRYG+KKH S P+P RSA+PDLINQSGHQHAIAY
Sbjct: 121 QNGVCSEGTIPVRRTKKEDVLRASSVKRYGKKKHLSVPLP-RSADPDLINQSGHQHAIAY 180

Query: 181 VEGDKYYGAKATMNVWEPSIQQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNN 240
           VEG K+YGAKAT+NVWEP +Q  NEFSLSQLWILGGSFG+DLNSIEAGWQVSPDLYGDNN
Sbjct: 181 VEGGKFYGAKATINVWEPKVQSSNEFSLSQLWILGGSFGQDLNSIEAGWQVSPDLYGDNN 240

Query: 241 TRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK 300
           TRLFTYWTSDAYQATGCYNLLCSGFIQINS IAMGASISPVS + N QYDISI +WKDPK
Sbjct: 241 TRLFTYWTSDAYQATGCYNLLCSGFIQINSQIAMGASISPVSGFHNPQYDISITIWKDPK 300

Query: 301 EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGEHTSTQMGSGHFPDE 360
           EGHWWMQFG+GYV+GYWPSFLFSYLADSAS++EWGGEVVN E +G HT+TQMGSG FPDE
Sbjct: 301 EGHWWMQFGDGYVLGYWPSFLFSYLADSASIVEWGGEVVNMEEDGHHTTTQMGSGQFPDE 360

Query: 361 GFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQTGSNGDWGHFFYYGGPGRNAN 420
           GF KASYFRNIQVVD SNNLK PKG+ TFTE+ +CYDV+ G N DWGH+FYYGGPGRN N
Sbjct: 361 GFTKASYFRNIQVVDSSNNLKEPKGLNTFTEKSNCYDVEVGKNDDWGHYFYYGGPGRNPN 420

Query: 421 CQ 423
           CQ
Sbjct: 421 CQ 420

BLAST of CSPI01G29250 vs. TAIR10
Match: AT2G44210.2 (AT2G44210.2 Protein of Unknown Function (DUF239))

HSP 1 Score: 552.4 bits (1422), Expect = 2.5e-157
Identity = 257/440 (58.41%), Positives = 326/440 (74.09%), Query Frame = 1

Query: 16  VVVFSVLGMVSLCCGTRLESGSR--QKLEVQKHLRRLNKPAVKTI--------------- 75
           V  F  L M  +     + SG      L+++ HL+RLNKPA+K+I               
Sbjct: 5   VSFFLALVMTVVILAPSVVSGENGFSDLKIRTHLKRLNKPALKSIKVNSTVILERKLHKS 64

Query: 76  ---------------ESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPSFHPEGLFDENK 135
                          +SPDGD+IDCV ++ QPAF HP L +H +QM PS +PE +F E+K
Sbjct: 65  FILLLFSGNNFEFLKQSPDGDMIDCVPITDQPAFAHPLLINHTVQMWPSLNPESVFSESK 124

Query: 136 VAEKA-SEKPKPINQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPR 195
           V+ K  +++   I+QLWHVNGKCP+ TIPIRRT+ +D+ RASSV+ YG K  +S P P  
Sbjct: 125 VSSKTKNQQSNAIHQLWHVNGKCPKNTIPIRRTRRQDLYRASSVENYGMKNQKSIPKPKS 184

Query: 196 SAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSIQQPNEFSLSQLWILGGSFGEDL 255
           S  P+++ Q+GHQHAI YVE   +YGAKA +NVW+P ++ PNEFSL+Q+W+LGG+F  DL
Sbjct: 185 SEPPNVLTQNGHQHAIMYVEDGVFYGAKAKINVWKPDVEMPNEFSLAQIWVLGGNFNSDL 244

Query: 256 NSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVS 315
           NSIEAGWQVSP LYGDN TRLFTYWTSDAYQ TGCYNLLCSGF+QIN +IAMG SISP+S
Sbjct: 245 NSIEAGWQVSPQLYGDNRTRLFTYWTSDAYQGTGCYNLLCSGFVQINREIAMGGSISPLS 304

Query: 316 AYRNSQYDISILVWKDPKEGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSE 375
            Y NSQYDI+IL+WKDPKEGHWW+QFG  Y++GYWP+ LFSYL++SASMIEWGGEVVNS+
Sbjct: 305 NYGNSQYDITILIWKDPKEGHWWLQFGEKYIIGYWPASLFSYLSESASMIEWGGEVVNSQ 364

Query: 376 -PNGEHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQTG 422
              G+HT+TQMGSG F +EG+GKASYF+N+QVVDGSN L+ P+ +  FT+Q +CY+V++G
Sbjct: 365 SEEGQHTTTQMGSGRFAEEGWGKASYFKNVQVVDGSNELRNPENLQVFTDQENCYNVKSG 424

BLAST of CSPI01G29250 vs. TAIR10
Match: AT1G10750.1 (AT1G10750.1 Protein of Unknown Function (DUF239))

HSP 1 Score: 490.0 bits (1260), Expect = 1.5e-138
Identity = 224/380 (58.95%), Positives = 287/380 (75.53%), Query Frame = 1

Query: 43  VQKHLRRLNKPAVKTIESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPSFHPEGLFDEN 102
           + +HLR++NKP++KTI SPDGD+IDCV + HQPAFDHP L+  K  + P   P G     
Sbjct: 105 INQHLRKINKPSIKTIHSPDGDIIDCVLLHHQPAFDHPSLRGQK-PLDPPERPRG----- 164

Query: 103 KVAEKASEKPKPINQLWHVNGK-CPEGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPP 162
               +   +PK   QLW + G+ CPEGT+PIRRTK ED+LRA+SV  +G+K         
Sbjct: 165 --HNRRGLRPKSF-QLWGMEGETCPEGTVPIRRTKEEDILRANSVSSFGKKL-------- 224

Query: 163 RSAEPDLINQSGHQHAIAYVEGDKYYGAKATMNVWEPSIQQPNEFSLSQLWILGGSFGED 222
           R    D  + +GH+HA+ YV G+KYYGAKA++NVW P +Q   EFSLSQ+WI+ GSFG D
Sbjct: 225 RHYRRDT-SSNGHEHAVGYVSGEKYYGAKASINVWAPQVQNQYEFSLSQIWIISGSFGND 284

Query: 223 LNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPV 282
           LN+IEAGWQVSP+LYGDN  R FTYWT+DAYQATGCYNLLCSGF+Q NS+IA+GA+ISP 
Sbjct: 285 LNTIEAGWQVSPELYGDNYPRFFTYWTNDAYQATGCYNLLCSGFVQTNSEIAIGAAISPS 344

Query: 283 SAYRNSQYDISILVWKDPKEGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNS 342
           S+Y+  Q+DI++L+WKDPK G+WW++FG+G ++GYWPSFLF++L + ASM+++GGE+VNS
Sbjct: 345 SSYKGGQFDITLLIWKDPKHGNWWLEFGSGILVGYWPSFLFTHLKEHASMVQYGGEIVNS 404

Query: 343 EPNGEHTSTQMGSGHFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQTG 402
            P G HTSTQMGSGHF +EGF K+SYFRNIQVVD  NNL P   +    + P+CYD+Q G
Sbjct: 405 SPFGAHTSTQMGSGHFAEEGFTKSSYFRNIQVVDWDNNLVPSPNLRVLADHPNCYDIQGG 464

Query: 403 SNGDWGHFFYYGGPGRNANC 422
           SN  WG +FYYGGPG+N  C
Sbjct: 465 SNRAWGSYFYYGGPGKNPKC 466

BLAST of CSPI01G29250 vs. NCBI nr
Match: gi|449452648|ref|XP_004144071.1| (PREDICTED: uncharacterized protein LOC101217988 [Cucumis sativus])

HSP 1 Score: 896.0 bits (2314), Expect = 2.6e-257
Identity = 422/422 (100.00%), Positives = 422/422 (100.00%), Query Frame = 1

Query: 1   MGSARFSRCRTVEALVVVFSVLGMVSLCCGTRLESGSRQKLEVQKHLRRLNKPAVKTIES 60
           MGSARFSRCRTVEALVVVFSVLGMVSLCCGTRLESGSRQKLEVQKHLRRLNKPAVKTIES
Sbjct: 1   MGSARFSRCRTVEALVVVFSVLGMVSLCCGTRLESGSRQKLEVQKHLRRLNKPAVKTIES 60

Query: 61  PDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPSFHPEGLFDENKVAEKASEKPKPINQLWH 120
           PDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPSFHPEGLFDENKVAEKASEKPKPINQLWH
Sbjct: 61  PDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPSFHPEGLFDENKVAEKASEKPKPINQLWH 120

Query: 121 VNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAY 180
           VNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAY
Sbjct: 121 VNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAY 180

Query: 181 VEGDKYYGAKATMNVWEPSIQQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNN 240
           VEGDKYYGAKATMNVWEPSIQQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNN
Sbjct: 181 VEGDKYYGAKATMNVWEPSIQQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNN 240

Query: 241 TRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK 300
           TRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
Sbjct: 241 TRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK 300

Query: 301 EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGEHTSTQMGSGHFPDE 360
           EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGEHTSTQMGSGHFPDE
Sbjct: 301 EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGEHTSTQMGSGHFPDE 360

Query: 361 GFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQTGSNGDWGHFFYYGGPGRNAN 420
           GFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQTGSNGDWGHFFYYGGPGRNAN
Sbjct: 361 GFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQTGSNGDWGHFFYYGGPGRNAN 420

Query: 421 CQ 423
           CQ
Sbjct: 421 CQ 422

BLAST of CSPI01G29250 vs. NCBI nr
Match: gi|659100275|ref|XP_008451013.1| (PREDICTED: uncharacterized protein LOC103492421 [Cucumis melo])

HSP 1 Score: 884.8 bits (2285), Expect = 6.0e-254
Identity = 414/422 (98.10%), Positives = 419/422 (99.29%), Query Frame = 1

Query: 1   MGSARFSRCRTVEALVVVFSVLGMVSLCCGTRLESGSRQKLEVQKHLRRLNKPAVKTIES 60
           MGSARFSRCRT+EALVV+FSVLG+VSLCCGTRLESGSRQKLEVQKHLRRLNKPAVKTIES
Sbjct: 1   MGSARFSRCRTMEALVVIFSVLGIVSLCCGTRLESGSRQKLEVQKHLRRLNKPAVKTIES 60

Query: 61  PDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPSFHPEGLFDENKVAEKASEKPKPINQLWH 120
           PDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPS+HPEGLFDENKVAEKASEKP PINQLWH
Sbjct: 61  PDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPSYHPEGLFDENKVAEKASEKPNPINQLWH 120

Query: 121 VNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAY 180
            NGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAY
Sbjct: 121 ANGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIAY 180

Query: 181 VEGDKYYGAKATMNVWEPSIQQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNN 240
           VEGDKYYGAKATMNVWEPSIQQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNN
Sbjct: 181 VEGDKYYGAKATMNVWEPSIQQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDNN 240

Query: 241 TRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK 300
           TRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK
Sbjct: 241 TRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDPK 300

Query: 301 EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGEHTSTQMGSGHFPDE 360
           EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGEHTSTQMGSGHFPDE
Sbjct: 301 EGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGEHTSTQMGSGHFPDE 360

Query: 361 GFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQTGSNGDWGHFFYYGGPGRNAN 420
           GFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQTGSNGDWGHFFYYGGPGRN N
Sbjct: 361 GFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQTGSNGDWGHFFYYGGPGRNPN 420

Query: 421 CQ 423
           C+
Sbjct: 421 CK 422

BLAST of CSPI01G29250 vs. NCBI nr
Match: gi|747084593|ref|XP_011089714.1| (PREDICTED: uncharacterized protein LOC105170589 [Sesamum indicum])

HSP 1 Score: 761.9 bits (1966), Expect = 5.8e-217
Identity = 355/425 (83.53%), Positives = 390/425 (91.76%), Query Frame = 1

Query: 1   MGSARFS---RCRTVEALVVVFSVLGMVSLCCGTRLESGSRQKLEVQKHLRRLNKPAVKT 60
           MGS RFS   R R +EA++++  +  ++SL C  RL S SRQKL+VQKHL RLNKP +KT
Sbjct: 1   MGSVRFSNGHRRRKLEAVLLLLCLCELISLSCARRL-SASRQKLQVQKHLNRLNKPPIKT 60

Query: 61  IESPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPSFHPEGLFDENKVAEKASEKPKPINQ 120
           IESPDGD+IDCVH+SHQPAFDHPFLKDHKIQMRPS+HPEGLFDENK++EK  E+  PI Q
Sbjct: 61  IESPDGDIIDCVHISHQPAFDHPFLKDHKIQMRPSYHPEGLFDENKISEKPEERTNPITQ 120

Query: 121 LWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHA 180
           LWH+NGKCPE TIPIRRTK EDVLRASSVKRYG+KKHRS P  PRSA+PDLINQSGHQHA
Sbjct: 121 LWHMNGKCPEDTIPIRRTKKEDVLRASSVKRYGKKKHRSIP-KPRSADPDLINQSGHQHA 180

Query: 181 IAYVEGDKYYGAKATMNVWEPSIQQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYG 240
           IAYVEGDKYYGAKAT+NVWEP IQQPNEFSLSQ+W+LGGSFGEDLNSIEAGWQVSPDLYG
Sbjct: 181 IAYVEGDKYYGAKATINVWEPKIQQPNEFSLSQIWVLGGSFGEDLNSIEAGWQVSPDLYG 240

Query: 241 DNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWK 300
           DNNTRLFTYWTSDAYQATGCYNLLCSGFIQ+NS+IAMGASISPVSA+RNSQYDISILVWK
Sbjct: 241 DNNTRLFTYWTSDAYQATGCYNLLCSGFIQVNSEIAMGASISPVSAFRNSQYDISILVWK 300

Query: 301 DPKEGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGEHTSTQMGSGHF 360
           DPKEG+WWMQFG+ YV+GYWPSFLFSYLADSASMIEWGGEVVNSEP+G+HTSTQMGSGHF
Sbjct: 301 DPKEGNWWMQFGSDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEPDGQHTSTQMGSGHF 360

Query: 361 PDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQTGSNGDWGHFFYYGGPGR 420
           P+EGFGK+SYFRNIQVVD SNNLK PK +GTF EQ +CYDVQTGSNGDWGH+FYYGGPGR
Sbjct: 361 PEEGFGKSSYFRNIQVVDSSNNLKAPKELGTFAEQSNCYDVQTGSNGDWGHYFYYGGPGR 420

Query: 421 NANCQ 423
           N NCQ
Sbjct: 421 NPNCQ 423

BLAST of CSPI01G29250 vs. NCBI nr
Match: gi|590580379|ref|XP_007014051.1| (Uncharacterized protein isoform 1 [Theobroma cacao])

HSP 1 Score: 760.0 bits (1961), Expect = 2.2e-216
Identity = 359/422 (85.07%), Positives = 383/422 (90.76%), Query Frame = 1

Query: 1   MGSARFSRCRTVEA-LVVVFSVLGMVSLCCGTRLESGSRQKLEVQKHLRRLNKPAVKTIE 60
           M    FSR  T    L+VVF +LG +SL C  R    SRQ+L+VQKHL RLNKPAVKTIE
Sbjct: 1   MADVHFSRGWTRRGVLLVVFCLLGSISLSCAAR-PGVSRQRLQVQKHLNRLNKPAVKTIE 60

Query: 61  SPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPSFHPEGLFDENKVAEKASEKPKPINQLW 120
           SPDGD+IDCVH+SHQPAFDHPFLKDHKIQMRP++H EGLFDENKV+EK      PI QLW
Sbjct: 61  SPDGDIIDCVHISHQPAFDHPFLKDHKIQMRPNYHREGLFDENKVSEKPKPHSNPITQLW 120

Query: 121 HVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQHAIA 180
           HVNGKCPEGTIPIRRTK +DVLRASSVKRYGRKKHR+ P  PRSA+PDLIN+SGHQHAIA
Sbjct: 121 HVNGKCPEGTIPIRRTKEQDVLRASSVKRYGRKKHRAIP-QPRSADPDLINESGHQHAIA 180

Query: 181 YVEGDKYYGAKATMNVWEPSIQQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDN 240
           YVEGDKYYGAKAT+NVWEP IQQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDN
Sbjct: 181 YVEGDKYYGAKATINVWEPKIQQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDLYGDN 240

Query: 241 NTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILVWKDP 300
           NTRLFTYWTSDAYQATGCYNLLCSGFIQINS+IAMGASISPVSAYRNSQYDISILVWKDP
Sbjct: 241 NTRLFTYWTSDAYQATGCYNLLCSGFIQINSEIAMGASISPVSAYRNSQYDISILVWKDP 300

Query: 301 KEGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGEHTSTQMGSGHFPD 360
           KEGHWWMQFGN YV+GYWPSFLFSYLADSASMIEWGGEVVNSEP+G HTSTQMGSG FP+
Sbjct: 301 KEGHWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEPDGHHTSTQMGSGRFPE 360

Query: 361 EGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQTGSNGDWGHFFYYGGPGRNA 420
           EGFGK+SYFRNIQVVDGSNNLK PKG+GTFTEQ +CYDVQTGSNGDWGH+FYYGGPG+N 
Sbjct: 361 EGFGKSSYFRNIQVVDGSNNLKAPKGLGTFTEQSNCYDVQTGSNGDWGHYFYYGGPGKNP 420

Query: 421 NC 422
           NC
Sbjct: 421 NC 420
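
The Identity and Positives percentages in the HSP header above are plain ratios of matching columns to aligned columns. As a minimal sketch (the function name hsp_percent is illustrative and not part of any BLAST tool), the figures for the Theobroma cacao hit can be reproduced like this:

```python
def hsp_percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the aligned length, rounded to 2 places."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


# Counts taken from the HSP header: Identity = 359/422, Positives = 383/422
identity = hsp_percent(359, 422)
positives = hsp_percent(383, 422)
print(f"Identity = {identity:.2f}%, Positives = {positives:.2f}%")
```

The aligned length (422 here) counts alignment columns, including gap columns, so it can differ from the length of either sequence.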

BLAST of CSPI01G29250 vs. NCBI nr
Match: gi|823224334|ref|XP_012444936.1| (PREDICTED: uncharacterized protein LOC105769079 [Gossypium raimondii])

HSP 1 Score: 753.8 bits (1945), Expect = 1.6e-214
Identity = 358/426 (84.04%), Positives = 388/426 (91.08%), Query Frame = 1

Query: 1   MGSARFSRCRTVEALVVVFS-VLGMVSLCCGTRLESGSRQKLEVQKHLRRLNKPAVKTIE 60
           MG   FSR R  + +V++F   L ++SL C  RL S SRQKL+VQ HL RLNKPAVKTI+
Sbjct: 1   MGDVHFSRERAGKVVVLIFFWTLSLISLSCAARL-SVSRQKLQVQNHLNRLNKPAVKTIQ 60

Query: 61  SPDGDLIDCVHMSHQPAFDHPFLKDHKIQMRPSFHPEGLFDENKVAEKASEKPK----PI 120
           SPDGD+IDCVH++ QPAFDHPFLKDHKIQMRPS+HPEGLFDE+KV++  +EKPK    PI
Sbjct: 61  SPDGDIIDCVHLARQPAFDHPFLKDHKIQMRPSYHPEGLFDESKVSD--TEKPKKGSNPI 120

Query: 121 NQLWHVNGKCPEGTIPIRRTKHEDVLRASSVKRYGRKKHRSTPIPPRSAEPDLINQSGHQ 180
            QLWH+NGKCPEGTIPIRRTK EDVLRASSVK YGRKKHR+TP  PRSA+PDLIN+SGHQ
Sbjct: 121 TQLWHMNGKCPEGTIPIRRTKEEDVLRASSVKSYGRKKHRATP-QPRSADPDLINESGHQ 180

Query: 181 HAIAYVEGDKYYGAKATMNVWEPSIQQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDL 240
           HAIAYVEGDKYYGAKAT+NVWEP IQQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDL
Sbjct: 181 HAIAYVEGDKYYGAKATINVWEPKIQQPNEFSLSQLWILGGSFGEDLNSIEAGWQVSPDL 240

Query: 241 YGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSDIAMGASISPVSAYRNSQYDISILV 300
           YGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINS+IAMGASISPVSAYRNSQYDISILV
Sbjct: 241 YGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSEIAMGASISPVSAYRNSQYDISILV 300

Query: 301 WKDPKEGHWWMQFGNGYVMGYWPSFLFSYLADSASMIEWGGEVVNSEPNGEHTSTQMGSG 360
           WKDPKEGHWWMQFGN YV+GYWPSFLFSYLADSASMIEWGGEVVNSEP+G HTSTQMGSG
Sbjct: 301 WKDPKEGHWWMQFGNDYVLGYWPSFLFSYLADSASMIEWGGEVVNSEPDGRHTSTQMGSG 360

Query: 361 HFPDEGFGKASYFRNIQVVDGSNNLKPPKGIGTFTEQPDCYDVQTGSNGDWGHFFYYGGP 420
            FP+EGFGK+SYFRNIQVVD SNNLK PKG+GTFTEQ +CYDVQTGSNGDWGH+FYYGGP
Sbjct: 361 RFPEEGFGKSSYFRNIQVVDDSNNLKAPKGLGTFTEQSNCYDVQTGSNGDWGHYFYYGGP 420

Query: 421 GRNANC 422
           G+N NC
Sbjct: 421 GKNPNC 422
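
In the alignment blocks above, the middle line marks each column: a letter means the two residues are identical, "+" means a conservative (positive-scoring) substitution, and a space means neither. The sketch below counts these for one block; the function name count_midline is hypothetical, and the input is the final six-column block of the Gossypium raimondii alignment with the "Query:"/"Sbjct:" labels and coordinates already stripped.

```python
from typing import Tuple


def count_midline(query: str, midline: str, subject: str) -> Tuple[int, int, int]:
    """Count identities, positives and aligned columns for one alignment block.

    Identities are letters on the midline; positives are identities plus '+'.
    """
    if not (len(query) == len(midline) == len(subject)):
        raise ValueError("query, midline and subject must have equal length")
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Last block of the alignment above (columns 421..426 of the HSP)
ident, pos, length = count_midline("GRNANC", "G+N NC", "GKNPNC")
print(ident, pos, length)
```

Summing these counts over every block of an HSP should give the numerators and denominator shown in its Identity and Positives header line.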

The following BLAST results are available for this feature:
Match Name         E-value    Identity  Description
A0A0A0LZQ5_CUCSA   1.8e-257   100.00    Uncharacterized protein OS=Cucumis sativus GN=Csa_1G599510 PE=4 SV=1
A0A061GPD6_THECC   1.5e-216   85.07     Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_038694 PE=4 SV=1
A0A0D2RR33_GOSRA   1.1e-214   84.04     Uncharacterized protein OS=Gossypium raimondii GN=B456_009G000100 PE=4 SV=1
A0A0B0PIB1_GOSAR   4.2e-214   84.04     tRNA-splicing ligase RtcB OS=Gossypium arboreum GN=F383_29250 PE=4 SV=1
B9GLD1_POPTR       7.2e-214   83.65     Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0001s04100g PE=4 SV=2

Match Name    E-value    Identity  Description
AT3G13510.1   2.4e-205   79.10     Protein of Unknown Function (DUF239)
AT1G55360.1   1.5e-202   78.62     Protein of Unknown Function (DUF239)
AT5G56530.1   5.6e-202   77.73     Protein of Unknown Function (DUF239)
AT2G44210.2   2.5e-157   58.41     Protein of Unknown Function (DUF239)
AT1G10750.1   1.5e-138   58.95     Protein of Unknown Function (DUF239)

Match Name                          E-value    Identity  Description
gi|449452648|ref|XP_004144071.1|    2.6e-257   100.00    PREDICTED: uncharacterized protein LOC101217988 [Cucumis sativus]
gi|659100275|ref|XP_008451013.1|    6.0e-254   98.10     PREDICTED: uncharacterized protein LOC103492421 [Cucumis melo]
gi|747084593|ref|XP_011089714.1|    5.8e-217   83.53     PREDICTED: uncharacterized protein LOC105170589 [Sesamum indicum]
gi|590580379|ref|XP_007014051.1|    2.2e-216   85.07     Uncharacterized protein isoform 1 [Theobroma cacao]
gi|823224334|ref|XP_012444936.1|    1.6e-214   84.04     PREDICTED: uncharacterized protein LOC105769079 [Gossypium raimondii]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR004314   Neprosin
IPR025521   Neprosin_propep

GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0008152       metabolic process
biological_process   GO:0008150       biological_process
cellular_component   GO:0005575       cellular_component
molecular_function   GO:0016874       ligase activity
molecular_function   GO:0003674       molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
CSPI01G29250.1   CSPI01G29250.1   mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term    IPR Description                     Source    Source Term      Source Description    Alignment
IPR004314   Domain of unknown function DUF239   PFAM      PF03080          DUF239                coord: 193..415; score: 1.9
IPR025521   Domain of unknown function DUF4409  PFAM      PF14365          DUF4409               coord: 57..180; score: 3.1
None        No IPR available                    PANTHER   PTHR31589        FAMILY NOT NAMED      coord: 1..421; score: (none listed)
None        No IPR available                    PANTHER   PTHR31589:SF24   SUBFAMILY NOT NAMED   coord: 1..421; score: (none listed)