CmoCh05G003960 (gene) Cucurbita moschata (Rifu)

Name: CmoCh05G003960
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: Plant protein of unknown function (DUF247)
Location: Cmo_Chr05 : 1782681 .. 1785869 (+)
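The genomic span implied by the location can be checked with simple arithmetic. The sketch below is illustrative only: the helper name `span_length` is hypothetical, and it assumes the coordinates are 1-based and inclusive, a common genome-browser convention that this record does not state explicitly.

```python
def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates (an assumption)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field of this record.
gene_length = span_length(1782681, 1785869)
print(gene_length)  # 3189
```

Under a 0-based, half-open convention the same coordinates would give 3188 instead, so the convention matters when comparing against the sequence length.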
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGCTCTCCGCCTCTTCCAACACAAATGAGATCAGTCGAGCAGAAATAGAAAATGAAACTGGTACTTATGATCCAGTGGAAGCATCCATTAATCGTATTCTTCAACAATCTATTTCATTCTGTTCTGAAGGCGTTACCATTTACAAGGTTCCAGAACTTTTACGCAGCATCAAGCCAGAAGTTTATATGCCCACTACCATCTCTATCGGCCCTTTGCACTCTGATAGAAAAGATCTTGGAGCTAATTCATTCAAATCCATATTCCTTCGACTCTTCCTTGATTTCACCCAGCTCTCCGTGAACACGATTGTAGAAACAGTCCGAAATTTGGAACAAAGAGCTCGCTCATGTTATGCAAAATCCATGGAGATGAGCAGGGATGAATTTGTGGAGCTTCTGGTCTTGGATGCGTATTTCGTGGTCATGCATCTCATCCAATCGACGTGCTCACATTTGGATAGCCCAAAGATGACGGATTTCTTGCAATTCAATTATGAAACATCTCGTGATTTGATGCTGCTTGAAAACCAGCTTCCCTTCTTCCTTCTCCAATCTTTATACGACCTGGTCCTCCACTCAAAACCTTCACTGAATGATAAGAGTTTCATTCAAATTGTTAGTAATTATTTCTGCAACAACGATGACCAAGAGTTGCTTTTTATTGATATCAATTTACCTTTGGCTGCCAAAGTGGATCATTTTCTTGATTTAGTAAGAATACGTAAGCCTTGTTTGAATGGGGATATAACGTTCACTTCCTCAGGCATATTTTGGCCACCGAATGCCACCGAGCTTCACAAATGCGGCGTTATTTTTAGGACGGGGAGTGTCATAAAATTTAGTGACCAAGGTGGCTTTCTAGACTTCCGGAAATCAAGATATACGACGATTTTGAAAGGCGTATGAGGAACCTTATAGCTTATGAGCAATGCTATATTGGGTATGAGGATAAGAACTCGATGAGCAACTTTGCTGCGTTTATGCAGTTCTTGGTCCAGACAGACCAAGACGTCAAATTGCTAATTAAGGGAGGGATCATAGACAACAACTTGGGTAGTGTCAAAGAGGTTACCCAATTATTCAACAACCTCGGTAAGCACGTTTACCCTGGAGTCAACTACTACAGTTTTTATTGCAAAACAATGAAAGATTATTGCAAGCACCGCTGCCATCGTTGGATGACGTCGTTGCGGCGCAACTATTTCAGCACGCCATGGCTATGTGCTTCCTCCATTGCAGCCATCTTCCTCCTTGCCCTCACTTATACAAACCATCATAGCTATAGTCACTGGATTCAAACAAACTTCTTAATTGGAATTCAACAACTTATGTTTAAGTTGGGTCTATTAAATTTATATTCCAACTATCATGCTATTTTCGGTGTAATTGAGAACTCCATCTATTTGGATTCATAACTATGTGAAACAAACGCAGAAACAATGGTAATCCCAAATGAATCTAGGAAATATAAAAAGAAACTATACTTTGTACTTTCTTCTGACGAAAAAAAAATAAATAAAGCTTTTGATTTTGTTTATTTTATTTTAGTAAGATTGAATAAGAAACAATACTTTGTACTTTCTTCTTATGGAAAATAATAATAATAATAAATAAAGCTTTTGATTTTGTTTATTTTATTTTAGTAAGATTGAATAATTTTTTAGCAACAGTGTTATATCCTTCCAAAAATATGTTAATACAAGAAGAAAAGTTAGATAGATTTGTAAAATTCCACATCCATTAAAGTGGGGAATAAAACATTTCTTATAAATATGTGGAGACCCCTCTCTAACAGACGCGTTTTAAAATCGTGAGGCTAACGGAGATATGTAACGAGCTAAAATGGACAATATGTACTAGTATTTGGCTTTAGCTGTTACAAGATTGAATAACTTTTAGATCAATAAACCGTGGTCATCATATGCTTATACTAATCAATGAACATAGTCAGTCTCGTAGCAAACAAGACTAGTGCATTCATGACAACAAACCCTCTC
TACAAACAACAAAGTCTATATTGTCCAGCAGTTGGAATAGAGGACCCCAATTTCTCAATCATATGGAATGGAAGACCCAACGTTTCATTGTCTAACCGTTATCTATGAAAATTTGATACAGGAATCTTTTAGATTCATGAATACTTTTGGTACAGTCATCACTTCGATTTATGAATCTATTTGGATTTATTTGGTTCATCCCCGTAGTCGTTTATAAATCATCTTTATTTATGTATGAGCGTTAGTATTATTATACTTATATAAAGGTCTATCTAGTATTGTTATGAGATAAAAAATCATTGTCATTTTGTATTTATCTAAAGGAAATTGAGAGGAATTTTATAAGTGAAATTTTGTAATAGGAAAGTGAGAGATATATTTATAAACATTTTGGTTAGAATGATTGGCTAACACTTGGATACAATAAATTTCTTTTTTTCTAAATTAGGTAAAATTTTCTCTCTTTATTCATTATGTCAGTAGGTGTGTGCCTATGACATTCTTGTTTCCGCTTTACTCCTCCATGCGATCATTTATGTAAGGCCCATCAATACAACTGTCTTAGGATATGTCGAAATAAGTGTTGAGTTTAAAGATAATTGATCAATAAAAAAAAAATTGATGGGACAACTATTGAGTTGGTCTCATAAATCTATCTTTCTTCTCTTTCAATTTTTTTCAGAATCAAGTCAAATTTCTTATTTTTGAATTTGAGGATTAAAATGCTAAACAGTTGCATTAAAGACATTTGAGGATCTAAAAGTATTCCAATCTTGTAATAGCCAAGTCCATCGATATTGTCTACTTTAACTTGTTGTGAATTGTCGTGAGTCTCACGCTTTTAAAACGCTTCTGTTACGGAGAGGTTTCCATACTCAAATAAAGAATGGTTGGTTCCATCATGCTTCAACATTGATGAAAATGATTTTATTATTCCTCCATTCATAAGTTAGCTCCAGAAGATTAAGAAATTAGAACAAATATCCAATATGTCTTGTCACTGAGTTATTGATATGGGCAGAAAAAATGTGAAATTACTTATAAAAGCTCAAATCATAATCCAAGGATGTGGATAAAAAAGTTTCATAATAGTTTAACAATCTTTCTAAATTTGTCACCGTCCAAAATAAGTTATATTCATAATCATAGTCTAACTTCATTTTAGGGCCATATGCATGAAGTAACATAA

mRNA sequence

ATGGAGCTCTCCGCCTCTTCCAACACAAATGAGATCAGTCGAGCAGAAATAGAAAATGAAACTGGTACTTATGATCCAGTGGAAGCATCCATTAATCGTATTCTTCAACAATCTATTTCATTCTGTTCTGAAGGCGTTACCATTTACAAGGTTCCAGAACTTTTACGCAGCATCAAGCCAGAAGTTTATATGCCCACTACCATCTCTATCGGCCCTTTGCACTCTGATAGAAAAGATCTTGGAGCTAATTCATTCAAATCCATATTCCTTCGACTCTTCCTTGATTTCACCCAGCTCTCCGTGAACACGATTGTAGAAACAGTCCGAAATTTGGAACAAAGAGCTCGCTCATGTTATGCAAAATCCATGGAGATGAGCAGGGATGAATTTGTGGAGCTTCTGGTCTTGGATGCGTATTTCGTGGTCATGCATCTCATCCAATCGACGTGCTCACATTTGGATAGCCCAAAGATGACGGATTTCTTGCAATTCAATTATGAAACATCTCGTGATTTGATGCTGCTTGAAAACCAGCTTCCCTTCTTCCTTCTCCAATCTTTATACGACCTGGTCCTCCACTCAAAACCTTCACTGAATGATAAGAGTTTCATTCAAATTGTTAGTAATTATTTCTGCAACAACGATGACCAAGAGTTGCTTTTTATTGATATCAATTTACCTTTGGCTGCCAAAGTGGATCATTTTCTTGATTTAGTAAGAATACCTTATGAGCAATGCTATATTGGGTATGAGGATAAGAACTCGATGAGCAACTTTGCTGCGTTTATGCAGTTCTTGGTCCAGACAGACCAAGACGTCAAATTGCTAATTAAGGGAGGGATCATAGACAACAACTTGGGTAGTGTCAAAGAGGGCCATATGCATGAAGTAACATAA

Coding sequence (CDS)

ATGGAGCTCTCCGCCTCTTCCAACACAAATGAGATCAGTCGAGCAGAAATAGAAAATGAAACTGGTACTTATGATCCAGTGGAAGCATCCATTAATCGTATTCTTCAACAATCTATTTCATTCTGTTCTGAAGGCGTTACCATTTACAAGGTTCCAGAACTTTTACGCAGCATCAAGCCAGAAGTTTATATGCCCACTACCATCTCTATCGGCCCTTTGCACTCTGATAGAAAAGATCTTGGAGCTAATTCATTCAAATCCATATTCCTTCGACTCTTCCTTGATTTCACCCAGCTCTCCGTGAACACGATTGTAGAAACAGTCCGAAATTTGGAACAAAGAGCTCGCTCATGTTATGCAAAATCCATGGAGATGAGCAGGGATGAATTTGTGGAGCTTCTGGTCTTGGATGCGTATTTCGTGGTCATGCATCTCATCCAATCGACGTGCTCACATTTGGATAGCCCAAAGATGACGGATTTCTTGCAATTCAATTATGAAACATCTCGTGATTTGATGCTGCTTGAAAACCAGCTTCCCTTCTTCCTTCTCCAATCTTTATACGACCTGGTCCTCCACTCAAAACCTTCACTGAATGATAAGAGTTTCATTCAAATTGTTAGTAATTATTTCTGCAACAACGATGACCAAGAGTTGCTTTTTATTGATATCAATTTACCTTTGGCTGCCAAAGTGGATCATTTTCTTGATTTAGTAAGAATACCTTATGAGCAATGCTATATTGGGTATGAGGATAAGAACTCGATGAGCAACTTTGCTGCGTTTATGCAGTTCTTGGTCCAGACAGACCAAGACGTCAAATTGCTAATTAAGGGAGGGATCATAGACAACAACTTGGGTAGTGTCAAAGAGGGCCATATGCATGAAGTAACATAA
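To relate the coding sequence to the protein residues used as the BLAST query, the CDS can be translated codon by codon. The following is a minimal sketch that assumes the standard genetic code; the names `CODON_TABLE` and `translate` are illustrative and do not come from the database. Only the first 33 bases of the CDS are used as input here.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order (assumed applicable).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks unrecognised codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGAGCTCTCCGCCTCTTCCAACACAAATGAG"  # first 33 bases of the CDS
print(translate(cds_start))  # MELSASSNTNE
```

Passing the full CDS string to `translate` in the same way would give the complete predicted protein.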
BLAST of CmoCh05G003960 vs. Swiss-Prot
Match: Y3720_ARATH (UPF0481 protein At3g47200 OS=Arabidopsis thaliana GN=At3g47200 PE=2 SV=1)

HSP 1 Score: 83.6 bits (205), Expect = 4.1e-15
Identity = 66/202 (32.67%), Positives = 104/202 (51.49%), Query Frame = 1

Query: 44  EGVTIYKVPELLRSIKPEVYMPTTISIGPLHSDRKDLGA-NSFKSIFLRLFLDFTQ---L 103
           E   I++VPE   ++ P+ Y P  +SIGP H   K L      K   L+LFLD  +   +
Sbjct: 44  ESCCIFRVPESFVALNPKAYKPKVVSIGPYHYGEKHLQMIQQHKPRLLQLFLDEAKKKDV 103

Query: 104 SVNTIVETVRNLEQRARSCYAKSMEMSRDEFVELLVLDAYFVVM-HLIQSTCSHLDSPKM 163
             N +V+ V +LE + R  Y++ ++   D  + ++VLD  F++M  LI S    L    +
Sbjct: 104 EENVLVKAVVDLEDKIRKSYSEELKTGHD-LMFMMVLDGCFILMVFLIMSGNIELSEDPI 163

Query: 164 TDFLQFNYETSRDLMLLENQLPFFLLQSLYDLVLHSKPSLNDKSFIQIVSNYFCNNDDQE 223
                       DL+LLENQ+PFF+LQ+LY   + SK  ++     +I  ++F N  D+E
Sbjct: 164 FSIPWLLSSIQSDLLLLENQVPFFVLQTLY---VGSKIGVS-SDLNRIAFHFFKNPIDKE 223

Query: 224 LLFIDINLPLAAKVDHFLDLVR 241
             + + +    AK  H LDL+R
Sbjct: 224 GSYWEKHRNYKAK--HLLDLIR 238
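The identity and positives percentages in each HSP header are the reported counts divided by the alignment length. A small sketch of that arithmetic, using the counts from the hit above, is shown here; the helper name `percent` is hypothetical, and rounding to two decimals is an assumption chosen to match the displayed values.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Share of aligned columns that match, as a percentage rounded to 2 decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100 * matches / aligned_length, 2)


# Counts from the Y3720_ARATH HSP: 66 identical and 104 positive columns out of 202.
print(percent(66, 202))   # 32.67
print(percent(104, 202))  # 51.49
```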

BLAST of CmoCh05G003960 vs. Swiss-Prot
Match: Y3264_ARATH (Putative UPF0481 protein At3g02645 OS=Arabidopsis thaliana GN=At3g02645 PE=3 SV=1)

HSP 1 Score: 62.0 bits (149), Expect = 1.3e-08
Identity = 41/157 (26.11%), Positives = 85/157 (54.14%), Query Frame = 1

Query: 46  VTIYKVPELLRSIKPEVYMPTTISIGPLHSDRKDLGA-NSFKSIFLRLFLD-FTQLSVNT 105
           V+I+ VP+ L    P+ Y P  +SIGP H  + +L     +K +  R   + +     + 
Sbjct: 43  VSIFNVPKALMCSHPDSYTPHRVSIGPYHCLKPELHEMERYKLMIARKIRNQYNSFRFHD 102

Query: 106 IVETVRNLEQRARSCYAKSMEMSRDEFVELLVLDAYFVVMHLIQSTCSHLDSPKMTDFLQ 165
           +VE ++++E + R+CY K +  + +  + ++ +D+ F++  L   +   +++  + + + 
Sbjct: 103 LVEKLQSMEIKIRACYHKYIGFNGETLLWIMAVDSSFLIEFLKIYSFRKVET--LINRVG 162

Query: 166 FNYETSRDLMLLENQLPFFLLQSLYDLVLHSKPSLND 201
            N E  RD+M++ENQ+P F+L+   +  L S  S +D
Sbjct: 163 HN-EILRDIMMIENQIPLFVLRKTLEFQLESTESADD 196

BLAST of CmoCh05G003960 vs. TrEMBL
Match: A0A0A0LPK8_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G381700 PE=4 SV=1)

HSP 1 Score: 133.3 bits (334), Expect = 5.0e-28
Identity = 92/256 (35.94%), Positives = 144/256 (56.25%), Query Frame = 1

Query: 1   MELSA-SSNTNEISRAEIENETGTYDPVEASIN----RILQQSISFCSEGVTIYKVPELL 60
           ME++    + N  +++  E     YD +  S+N    R + +S S  S+  +IY VP+LL
Sbjct: 1   MEINVYDESNNNTTKSRDEEIKVIYDRMVGSVNQSMFREISRSASSFSKERSIYMVPKLL 60

Query: 61  RSIKPEVYMPTTISIGPLHSDR--KDLGANSFKSIFLRLFLDFTQLSVNTIVETVRNLEQ 120
           R   P+ Y P  ISIGPLH  R   DL     K  ++  FL   +L  N +++   + E+
Sbjct: 61  RKGNPKAYSPQVISIGPLHYYRTQNDL-IKEKKGSYVLNFLTVAKLDWNEMIKKFLSWEE 120

Query: 121 RARSCYAKSMEMSRDEFVELLVLDAYFVVMHLIQSTCSHLDSPKMTDFLQFNYETSRDLM 180
           RAR+ Y +++EM RDEF++LL+ D+ FVVM++I S  +       +   +F+    +DL+
Sbjct: 121 RARNYYVETIEMKRDEFIQLLIYDSCFVVMYVIGSMVAEFRDLDTSFLWRFSNGIFKDLL 180

Query: 181 LLENQLPFFLLQSLYDLVLHSKPSLNDKSFIQIVSNYFCN-NDDQELL---FIDINLPLA 240
           LLENQLPFFLL  LY+L   ++PSL D SFI+++  YF    +    +   + DI+   A
Sbjct: 181 LLENQLPFFLLNHLYNLCASAQPSLKDISFIELLRGYFSKVREGMSYVKEGYFDID---A 240

Query: 241 AKVDHFLDLVRIPYEQ 246
           + V+H +D +RI   Q
Sbjct: 241 SAVNHLVDFLRIHLTQ 252

BLAST of CmoCh05G003960 vs. TrEMBL
Match: R0H4C7_9BRAS (Uncharacterized protein OS=Capsella rubella GN=CARUB_v10006919mg PE=4 SV=1)

HSP 1 Score: 115.5 bits (288), Expect = 1.1e-22
Identity = 77/207 (37.20%), Positives = 117/207 (56.52%), Query Frame = 1

Query: 38  SISFCSEGVTIYKVPELLRSIKPEVYMPTTISIGPLHSDRKDLGA-NSFKSIFLRLFLDF 97
           S+S       IYKVP  LR + P+ Y P  +S GP H  +++L A    K  +LR F+  
Sbjct: 265 SLSSLLRPCCIYKVPNKLRRLNPDAYTPRLVSFGPFHRGKEELQAMEEHKHRYLRSFISR 324

Query: 98  TQLSVNTIVETVRNLEQRARSCYAKSMEMSRDEFVELLVLDAYFVVMHLIQSTCSHLDSP 157
           T  S+  IV   R+ EQ ARSCYA+ ++++ DEFVE+LV+D  F+V  L++S    L   
Sbjct: 325 TNSSLEDIVRVGRSWEQNARSCYAEDVKLNSDEFVEMLVVDGSFLVELLLRSHYPRLRGE 384

Query: 158 KMTDF--LQFNYETSRDLMLLENQLPFFLLQSLYDLVLHSKPSLNDKSFIQIVSNYF-CN 217
           K   F  L    +  RD++L+ENQLPFF+++ ++ L+L         S IQ+   +F C+
Sbjct: 385 KDRIFGNLMMITDVCRDMILIENQLPFFVVKEIF-LLLFIYYQQGTPSIIQLAQRHFRCS 444

Query: 218 NDDQELLFIDINLPLAAKVDHFLDLVR 241
                L  ID N  + ++ +HF+DL+R
Sbjct: 445 -----LSRIDDN-KIISEPEHFVDLLR 464

BLAST of CmoCh05G003960 vs. TrEMBL
Match: G7KJY8_MEDTR (DUF247 domain protein OS=Medicago truncatula GN=MTR_6g009170 PE=4 SV=1)

HSP 1 Score: 113.6 bits (283), Expect = 4.1e-22
Identity = 72/235 (30.64%), Positives = 123/235 (52.34%), Query Frame = 1

Query: 31  INRILQQSISFCSEGVTIYKVPELLRSIKPEVYMPTTISIGPLHSDRKDL-GANSFKSIF 90
           IN +L  +  F   GV IYKVP  +R++  + Y PT +SIGP H D   L      K I+
Sbjct: 10  INALLDSAEPFSMVGVCIYKVPSAIRTLNEKAYTPTLVSIGPFHHDHPQLQNMERHKLIY 69

Query: 91  LRLFLDFTQLSVNTIVETVRNLEQRARSCYAKSMEMSRDEFVELLVLDAYFVV--MHLIQ 150
           L+ FL  T   ++T+V  +++   R +SCY++++  S +E V+L+++D+ F++       
Sbjct: 70  LKAFLQRTNACLDTLVSNIKSNLSRFKSCYSETLPFSDNELVKLILIDSCFIIQLFWTYY 129

Query: 151 STCSHLDSPKMTDFLQFNYETSRDLMLLENQLPFFLLQSLYDLVLHSK----PSLNDKSF 210
                L  P + D +      + DL+LLENQLPFF+++ +Y L   S     P      F
Sbjct: 130 YDDGFLFKPWLDDGI------ALDLLLLENQLPFFVIEEIYKLSSSSTNASVPKTTIPGF 189

Query: 211 IQIVSNYFCNNDDQELLFIDINLPLAAKVDHFLDLVRIPYEQCYIGYEDKNSMSN 259
           +++   YF +++   L F + ++     + HF DL+RI + Q  I     N+ ++
Sbjct: 190 LELTIKYFYSSNKSNLFFDNGDI----SIMHFTDLIRIFHLQLPIEIRPSNNATD 234

BLAST of CmoCh05G003960 vs. TrEMBL
Match: M5XHJ6_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa018248mg PE=4 SV=1)

HSP 1 Score: 111.3 bits (277), Expect = 2.0e-21
Identity = 73/227 (32.16%), Positives = 129/227 (56.83%), Query Frame = 1

Query: 24  YDPVEASINRILQQSISFCSEGVTIYKVPELLRSIKPEVYMPTTISIGPLHSDRKDLGA- 83
           +DP+  S++ +L + +   S    IY+VP+ LR +  + Y P  +SIGPLH  ++ L A 
Sbjct: 6   HDPLVTSMSEVLDR-LPPLSPSCCIYRVPKRLRRVSEQAYTPQVVSIGPLHHGKEALKAM 65

Query: 84  NSFKSIFLRLFLDFTQLSVNTIVETVRNLEQRARSCYAKSMEMSRDEFVELLVLDAYFVV 143
              K+ +L+ FL  T +S+   ++ +R  E + RSCYA+++  SRDEFV ++++DA F++
Sbjct: 66  EELKNRYLQDFLRRTNVSLEYFIKKIRAQEAKLRSCYAETIGFSRDEFVRIILVDAAFII 125

Query: 144 MHLIQSTCSHL--DSPKMTDFLQFNYETSRDLMLLENQLPFFLLQSLYDLVLHSKPSLND 203
             L++    HL  ++ ++ +  +   +   D+ +LENQLPFF+L+ L+D   H +  +  
Sbjct: 126 EVLLRFCYKHLRVENDRIFNQPRMLEDVWPDMRMLENQLPFFILEDLFD---HERNIVGI 185

Query: 204 KSFIQIVSNYF----CNNDDQELLFIDINLPLAAKVDHFLDLVRIPY 244
           ++ I  +S +F     +  D E     I  P   +V+HF+D VR  Y
Sbjct: 186 QTTIIDLSYHFFKTLMHMKDMEDTLTRIRPP--HQVEHFVDFVRKLY 226

BLAST of CmoCh05G003960 vs. TrEMBL
Match: A0A0D3DJY2_BRAOL (Uncharacterized protein OS=Brassica oleracea var. oleracea PE=4 SV=1)

HSP 1 Score: 110.9 bits (276), Expect = 2.7e-21
Identity = 73/219 (33.33%), Positives = 122/219 (55.71%), Query Frame = 1

Query: 37  QSISFCSEGVTIYKVPELLRSIKPEVYMPTTISIGPLHSDRKDLGA-NSFKSIFLRLFLD 96
           +S+S  S    IYKVP  LR + P+VY P  +S GP H  ++DL A    K  +L+ FL 
Sbjct: 18  ESLSSLSNQCCIYKVPNKLRRLNPDVYSPRLVSFGPFHRGKEDLQAMEEHKYRYLQSFLP 77

Query: 97  FTQLSVNTIVETVRNLEQRARSCYAKSMEMSRDEFVELLVLDAYFVVMHLIQSTCSHL-- 156
               S+  +V   R  E+ ARSCYA+ ++++ DEFV++LV+D  F+V  +++S   HL  
Sbjct: 78  RASFSLEDLVRVARTWEEDARSCYAEDVKLNSDEFVKMLVVDGSFLVELILRSRYPHLIT 137

Query: 157 DSPKMTDFLQFNYETSRDLMLLENQLPFFLLQSLYDLVL----HSKPSLNDKSFIQIVSN 216
           ++ ++        +  RD++L+ENQLPFF+++  ++L+        PS+ D     + S+
Sbjct: 138 ENDRIFGKPWMITDVCRDMILIENQLPFFIVKGFFNLLTPYYQQGTPSILD----MVKSH 197

Query: 217 YFCNNDDQELLFIDINLPLAAKVDHFLDLVRIPYEQCYI 249
           + C      L  ID N+   ++ +HF+D +R     CY+
Sbjct: 198 FSC-----FLSNIDDNM-CDSEPEHFVDYLR----SCYL 222

BLAST of CmoCh05G003960 vs. TAIR10
Match: AT4G31980.1 (AT4G31980.1 unknown protein)

HSP 1 Score: 110.2 bits (274), Expect = 2.3e-24
Identity = 73/216 (33.80%), Positives = 116/216 (53.70%), Query Frame = 1

Query: 39  ISFCSEGVTIYKVPELLRSIKPEVYMPTTISIGPLHSDRKDLGA-NSFKSIFLRLFLDFT 98
           +S  S    IYKVP  LR + P+ Y P  +S GPLH  +++L A    K  +L  F+  T
Sbjct: 286 LSSLSTKCCIYKVPNKLRRLNPDAYTPRLVSFGPLHRGKEELQAMEDQKYRYLLSFIPRT 345

Query: 99  QLSVNTIVETVRNLEQRARSCYAKSMEMSRDEFVELLVLDAYFVVMHLIQSTCSHL--DS 158
             S+  +V   R  EQ ARSCYA+ +++  DEFVE+LV+D  F+V  L++S    L  ++
Sbjct: 346 NSSLEDLVRLARTWEQNARSCYAEDVKLHSDEFVEMLVVDGSFLVELLLRSHYPRLRGEN 405

Query: 159 PKMTDFLQFNYETSRDLMLLENQLPFFLLQSLYDLVLHSKPSLNDKSFIQIVS---NYFC 218
            ++        +  RD++L+ENQLPFF+++ ++ L+L+        S IQ+     +YF 
Sbjct: 406 DRIFGNSMMITDVCRDMILIENQLPFFVVKEIFLLLLNYYQQ-GTPSIIQLAQRHFSYFL 465

Query: 219 NNDDQELLFIDINLPLAAKVDHFLDLVRIPYEQCYI 249
           +  D E            + +HF+DL+R     CY+
Sbjct: 466 SRIDDE--------KFITEPEHFVDLLR----SCYL 488

BLAST of CmoCh05G003960 vs. TAIR10
Match: AT3G47200.1 (AT3G47200.1 Plant protein of unknown function (DUF247))

HSP 1 Score: 83.6 bits (205), Expect = 2.3e-16
Identity = 66/202 (32.67%), Positives = 104/202 (51.49%), Query Frame = 1

Query: 44  EGVTIYKVPELLRSIKPEVYMPTTISIGPLHSDRKDLGA-NSFKSIFLRLFLDFTQ---L 103
           E   I++VPE   ++ P+ Y P  +SIGP H   K L      K   L+LFLD  +   +
Sbjct: 44  ESCCIFRVPESFVALNPKAYKPKVVSIGPYHYGEKHLQMIQQHKPRLLQLFLDEAKKKDV 103

Query: 104 SVNTIVETVRNLEQRARSCYAKSMEMSRDEFVELLVLDAYFVVM-HLIQSTCSHLDSPKM 163
             N +V+ V +LE + R  Y++ ++   D  + ++VLD  F++M  LI S    L    +
Sbjct: 104 EENVLVKAVVDLEDKIRKSYSEELKTGHD-LMFMMVLDGCFILMVFLIMSGNIELSEDPI 163

Query: 164 TDFLQFNYETSRDLMLLENQLPFFLLQSLYDLVLHSKPSLNDKSFIQIVSNYFCNNDDQE 223
                       DL+LLENQ+PFF+LQ+LY   + SK  ++     +I  ++F N  D+E
Sbjct: 164 FSIPWLLSSIQSDLLLLENQVPFFVLQTLY---VGSKIGVS-SDLNRIAFHFFKNPIDKE 223

Query: 224 LLFIDINLPLAAKVDHFLDLVR 241
             + + +    AK  H LDL+R
Sbjct: 224 GSYWEKHRNYKAK--HLLDLIR 238

BLAST of CmoCh05G003960 vs. TAIR10
Match: AT3G47210.1 (AT3G47210.1 Plant protein of unknown function (DUF247))

HSP 1 Score: 82.0 bits (201), Expect = 6.7e-16
Identity = 70/220 (31.82%), Positives = 106/220 (48.18%), Query Frame = 1

Query: 44  EGVTIYKVPELLRSIKPEVYMPTTISIGPLHSDRKDLGA-NSFKSIFLRLFLDFTQLSVN 103
           E   I++VP+    + PE Y P  +SIGP H  RK L      K  FL LFL    +  +
Sbjct: 91  ESCCIFRVPKSFAEMNPEAYKPKVVSIGPYHHGRKHLEMIQQHKLRFLHLFLRTASVDRD 150

Query: 104 TIVETVRNLEQRARSCYAKSMEMSRDEFVELLVLDAYFVVMHLI---------QSTCSHL 163
            +   V + E   R  Y++ +E S  E V +++LD  F++M L+         +S    L
Sbjct: 151 VLFNAVVDWEDEIRKSYSEGLEGSPHELVYMMILDGCFILMLLLIVSRKIELYESEDPIL 210

Query: 164 DSPKMTDFLQFNYETSRDLMLLENQLPFFLLQSLYDLVLHSKP-SLNDKSFIQIVSNYFC 223
             P +   +Q       DL+LLENQ+PFF+LQ+L+D      P  LN  +F     ++F 
Sbjct: 211 TIPWILPSIQ------SDLLLLENQVPFFVLQTLFDKSEIGVPGDLNRMAF-----SFFN 270

Query: 224 NNDDQELLFIDINLPLAAKVDHFLDLVRIPYEQCYIGYED 253
            + D+   +   +    AK  H LDL+R+ +     GYED
Sbjct: 271 LSMDKPERYWVKHRNFNAK--HLLDLIRMSFLP-MDGYED 296

BLAST of CmoCh05G003960 vs. TAIR10
Match: AT3G50160.1 (AT3G50160.1 Plant protein of unknown function (DUF247))

HSP 1 Score: 79.7 bits (195), Expect = 3.3e-15
Identity = 49/175 (28.00%), Positives = 90/175 (51.43%), Query Frame = 1

Query: 44  EGVTIYKVPELLRSIKPEVYMPTTISIGPLHSDRKDL-GANSFKSIFLRLFLDFTQLSVN 103
           + + IY+VP  L+    + YMP  +SIGP H   K L      K   + + +   +  + 
Sbjct: 102 DNLCIYRVPPYLQENDTKSYMPQIVSIGPYHHGHKHLMPMERHKWRAVNMVMARAKHDIE 161

Query: 104 TIVETVRNLEQRARSCYAKSMEMSRDEFVELLVLDAYFVVMHLIQST------CSHLDSP 163
             ++ ++ LE++AR+CY   + M+R+EF+E+LVLD  F++  + + T        +  + 
Sbjct: 162 MYIDAMKELEEKARACYQGPINMNRNEFIEMLVLDGVFII-EIFKGTSEGFQEIGYAPND 221

Query: 164 KMTDFLQFNYETSRDLMLLENQLPFFLLQSLYDLVLHSKPSLNDKSFIQIVSNYF 212
            +           RD+++LENQLP+ +L+ L  L    +P + DK  +Q+   +F
Sbjct: 222 PVFGMRGLMQSIRRDMVMLENQLPWSVLKGLLQL---QRPDVLDKVNVQLFQPFF 272

BLAST of CmoCh05G003960 vs. TAIR10
Match: AT3G50180.1 (AT3G50180.1 Plant protein of unknown function (DUF247))

HSP 1 Score: 76.3 bits (186), Expect = 3.7e-14
Identity = 52/179 (29.05%), Positives = 87/179 (48.60%), Query Frame = 1

Query: 48  IYKVPELLRSIKPEVYMPTTISIGPLHSDRKDLGA-NSFKSIFLRLFLDFTQLSVNTIVE 107
           IYKVP  L     + Y P T+S+GP H  R+   +    K   + + L  T   +   ++
Sbjct: 180 IYKVPHYLHGNDKKSYFPQTVSLGPYHHGRQQTQSMECHKWRAVNMVLKRTNQGIEVFLD 239

Query: 108 TVRNLEQRARSCYAKSMEMSRDEFVELLVLDAYFVVMHLIQSTCSHLDSPKMTDFLQFNY 167
            +  LE++AR+CY  S+ +S +EF E+L+LD  F+ + L+Q             FL+  Y
Sbjct: 240 AMIELEEKARACYEGSIVLSSNEFTEMLLLDGCFI-LELLQGVNE--------GFLKLGY 299

Query: 168 ETS--------------RDLMLLENQLPFFLLQSLYDLVLHSKPSLNDKSFIQIVSNYF 212
           + +              RD+++LENQLP F+L  L +L      + N    +++V  +F
Sbjct: 300 DHNDPVFAVRGSMHSIQRDMIMLENQLPLFVLNRLLEL---QPGTQNQTGLVELVVRFF 346

BLAST of CmoCh05G003960 vs. NCBI nr
Match: gi|659088876|ref|XP_008445209.1| (PREDICTED: UPF0481 protein At3g47200-like [Cucumis melo])

HSP 1 Score: 135.2 bits (339), Expect = 1.9e-28
Identity = 88/247 (35.63%), Positives = 138/247 (55.87%), Query Frame = 1

Query: 8   NTNEISRAEIENETGTYDPVEASINRILQQSISFC---SEGVTIYKVPELLRSIKPEVYM 67
           + N  +++  E     YD +  S+N  + + IS     S   +IY VP+LLR+  P+ Y 
Sbjct: 9   SNNNATKSRDEEIKEIYDRMVESVNHSISREISISTSFSRERSIYMVPKLLRNGNPKAYS 68

Query: 68  PTTISIGPLHSDR--KDLGANSFKSIFLRLFLDFTQLSVNTIVETVRNLEQRARSCYAKS 127
           P  ISIGPLH  R   DL     K  ++  FL   +L  N ++      E+RAR+ Y ++
Sbjct: 69  PQVISIGPLHYYRTQNDLTIKEKKGSYVLNFLTVAKLGWNEMINKFLCWEERARNYYVET 128

Query: 128 MEMSRDEFVELLVLDAYFVVMHLIQSTCSHLDSPKMTDFLQFNYETSRDLMLLENQLPFF 187
           ++M RDEF++LL+ D+ FVVM++I S  +       +   +F+    +DL+LLENQLPFF
Sbjct: 129 IKMERDEFIQLLIYDSCFVVMYIIGSMVAEFRDLDTSFLWRFSNGIFKDLLLLENQLPFF 188

Query: 188 LLQSLYDLVLHSKPSLNDKSFIQIVSNYFCN-NDDQELL---FIDINLPLAAKVDHFLDL 246
           LL  LY+L   ++PSL D SFI+++  YF    +    +   ++DI+   A +V+H +D 
Sbjct: 189 LLHHLYNLCAFAQPSLKDISFIELLRGYFTEVREGMSYVNEGYLDID---ANEVNHLVDF 248

BLAST of CmoCh05G003960 vs. NCBI nr
Match: gi|449442176|ref|XP_004138858.1| (PREDICTED: UPF0481 protein At3g47200-like [Cucumis sativus])

HSP 1 Score: 133.3 bits (334), Expect = 7.2e-28
Identity = 92/256 (35.94%), Positives = 144/256 (56.25%), Query Frame = 1

Query: 1   MELSA-SSNTNEISRAEIENETGTYDPVEASIN----RILQQSISFCSEGVTIYKVPELL 60
           ME++    + N  +++  E     YD +  S+N    R + +S S  S+  +IY VP+LL
Sbjct: 1   MEINVYDESNNNTTKSRDEEIKVIYDRMVGSVNQSMFREISRSASSFSKERSIYMVPKLL 60

Query: 61  RSIKPEVYMPTTISIGPLHSDR--KDLGANSFKSIFLRLFLDFTQLSVNTIVETVRNLEQ 120
           R   P+ Y P  ISIGPLH  R   DL     K  ++  FL   +L  N +++   + E+
Sbjct: 61  RKGNPKAYSPQVISIGPLHYYRTQNDL-IKEKKGSYVLNFLTVAKLDWNEMIKKFLSWEE 120

Query: 121 RARSCYAKSMEMSRDEFVELLVLDAYFVVMHLIQSTCSHLDSPKMTDFLQFNYETSRDLM 180
           RAR+ Y +++EM RDEF++LL+ D+ FVVM++I S  +       +   +F+    +DL+
Sbjct: 121 RARNYYVETIEMKRDEFIQLLIYDSCFVVMYVIGSMVAEFRDLDTSFLWRFSNGIFKDLL 180

Query: 181 LLENQLPFFLLQSLYDLVLHSKPSLNDKSFIQIVSNYFCN-NDDQELL---FIDINLPLA 240
           LLENQLPFFLL  LY+L   ++PSL D SFI+++  YF    +    +   + DI+   A
Sbjct: 181 LLENQLPFFLLNHLYNLCASAQPSLKDISFIELLRGYFSKVREGMSYVKEGYFDID---A 240

Query: 241 AKVDHFLDLVRIPYEQ 246
           + V+H +D +RI   Q
Sbjct: 241 SAVNHLVDFLRIHLTQ 252

BLAST of CmoCh05G003960 vs. NCBI nr
Match: gi|727512971|ref|XP_010432793.1| (PREDICTED: putative UPF0481 protein At3g02645 isoform X2 [Camelina sativa])

HSP 1 Score: 120.2 bits (300), Expect = 6.3e-24
Identity = 85/278 (30.58%), Positives = 146/278 (52.52%), Query Frame = 1

Query: 37  QSISFCSEGVTIYKVPELLRSIKPEVYMPTTISIGPLHSDRKDLGA-NSFKSIFLRLFLD 96
           +S+S  S   +IYKVP  LR + P+ Y P  +S G  H  +++L A    K  +L+ F+ 
Sbjct: 19  ESLSSLSAQCSIYKVPNKLRKLNPDAYTPRIVSFGTFHRGKEELQAMEEHKYRYLKSFIP 78

Query: 97  FTQLSVNTIVETVRNLEQRARSCYAKSMEMSRDEFVELLVLDAYFVVMHLIQSTCSHLDS 156
            T  S+  +V   R  EQ AR+CYA+ ++++ DEFVE+LV+D  F+V  L++S    L  
Sbjct: 79  RTNFSLEDLVRIARTWEQHARNCYAEDVKLTTDEFVEMLVVDGSFLVELLLRSRNPLLRD 138

Query: 157 PKMTDF--LQFNYETSRDLMLLENQLPFFLLQSLYDLVLHSKPSLNDKSFIQIVSN---- 216
            K   F       +  RD++L+ENQLPFF+++ L+ L+ + +  +  +     V N    
Sbjct: 139 EKDRIFGKPMMIADVCRDMLLIENQLPFFVVKGLFLLLYYQQVPIRLEETTLNVDNAPEA 198

Query: 217 ------------YFCNNDDQELLFID--INLPLAAKVDHFLDLVR--IPYEQCYIGYEDK 276
                          ++   ++ F+D  + +P     D    L R  I +EQC+  + DK
Sbjct: 199 TELHTAGVRFKPAETSSCLLDITFVDGLLKIPTVFIDDLTESLYRNIIVFEQCH--FSDK 258

Query: 277 NSMSNFAAFMQFLVQTDQDVKLLIKGGIIDNNLGSVKE 292
           N + ++   +   V++ +D +LLI+ GI+ NNLG+ ++
Sbjct: 259 NFL-HYTTLLGCFVKSPKDAELLIRSGILVNNLGNAED 293

BLAST of CmoCh05G003960 vs. NCBI nr
Match: gi|727512973|ref|XP_010432794.1| (PREDICTED: UPF0481 protein At3g47200-like isoform X1 [Camelina sativa])

HSP 1 Score: 117.5 bits (293), Expect = 4.1e-23
Identity = 78/237 (32.91%), Positives = 129/237 (54.43%), Query Frame = 1

Query: 18  ENETGTYDPVEASINRILQQSISFCSEGVTIYKVPELLRSIKPEVYMPTTISIGPLHSDR 77
           E +    D ++A ++ +   +I  C     IYKVP  LR + P+ Y P  +S GPLH  +
Sbjct: 5   EGDDALIDSIKAKLDSLSSLTIHCC-----IYKVPNKLRRLNPDAYTPRLVSFGPLHRGK 64

Query: 78  KDLGA-NSFKSIFLRLFLDFTQLSVNTIVETVRNLEQRARSCYAKSMEMSRDEFVELLVL 137
           ++L A    K  +L+ F+  T  ++  +V  VR  EQ+ARSCYA+ ++++ DEFVE+LV+
Sbjct: 65  EELQAMEEHKYRYLQSFILRTDCTLEELVRVVRTWEQKARSCYAEDVKLNSDEFVEMLVV 124

Query: 138 DAYFVVMHLIQSTCSHLDSPKMTDF--LQFNYETSRDLMLLENQLPFFLLQSLYDLVLHS 197
           D  F+V  L++S    L   K + F       +  RDL+L+ENQLPFF+++ ++ L+L  
Sbjct: 125 DGSFLVELLLRSHYPQLRGEKDSIFGNSMMITDVCRDLILIENQLPFFVVKEIF-LLLFV 184

Query: 198 KPSLNDKSFIQIVSNY---FCNNDDQELLFIDINLPLAAKVDHFLDLVRIPYEQCYI 249
                  S +++   +   F +N D E+        L ++  HF+DL+R     CY+
Sbjct: 185 YYQQGTPSILKLAQRHFSCFLSNIDDEM--------LTSEPKHFVDLLR----SCYL 223

BLAST of CmoCh05G003960 vs. NCBI nr
Match: gi|645281380|ref|XP_008245592.1| (PREDICTED: UPF0481 protein At3g47200-like [Prunus mume])

HSP 1 Score: 116.7 bits (291), Expect = 6.9e-23
Identity = 86/250 (34.40%), Positives = 138/250 (55.20%), Query Frame = 1

Query: 9   TNEISRAEIENETGTYDPVEASINRILQQSISFCSEGVTIYKVPELLRSIKPEVYMPTTI 68
           +N+ +  ++EN    Y P+E S+ + L + +S  S    IY+VPE LR +    Y P  +
Sbjct: 4   SNQAAPVDVENP---YIPLETSMRQELGR-LSPLSSSCCIYRVPERLRLVNENAYTPQVV 63

Query: 69  SIGPLHSDRKDLGA-NSFKSIFLRLFLDFTQLSVNTIVETVRNLEQRARSCYAKSMEMSR 128
           SIGPLH  +K L A    K  +L+ FL  T++S+   ++ +R+ E R R+CYA+++  S 
Sbjct: 64  SIGPLHHGKKGLEAMEEHKKRYLQEFLCRTKVSLEDCIKKIRDQETRLRNCYAETIGFSS 123

Query: 129 DEFVELLVLDAYFVVMHLIQSTCSHLDSPKMTDFLQFN-----YETSRDLMLLENQLPFF 188
           DEFV ++++DA F++  L++S   +  +P+  +   FN     ++   D+ LLENQLPFF
Sbjct: 124 DEFVRIILVDAAFIIELLLKS---NFRTPRKENDRIFNKPVMFHDIMTDMQLLENQLPFF 183

Query: 189 LLQSLYDL---VLHSKPSLND---KSFIQIVSNYFCNND-------DQELLFIDINLPLA 240
           +L+ L+ L    L S    +D    +F+  +S  F  +        DQE  FI   L   
Sbjct: 184 ILEDLFYLQEGTLSSDSDTSDDRLSTFVTDLSLKFFTSTMILMYQADQE--FILKRLSST 243

The following BLAST results are available for this feature:
Match Name   E-value  Identity (%)  Description
Y3720_ARATH  4.1e-15  32.67         UPF0481 protein At3g47200 OS=Arabidopsis thaliana GN=At3g47200 PE=2 SV=1
Y3264_ARATH  1.3e-08  26.11         Putative UPF0481 protein At3g02645 OS=Arabidopsis thaliana GN=At3g02645 PE=3 SV=...

Match Name        E-value  Identity (%)  Description
A0A0A0LPK8_CUCSA  5.0e-28  35.94         Uncharacterized protein OS=Cucumis sativus GN=Csa_2G381700 PE=4 SV=1
R0H4C7_9BRAS      1.1e-22  37.20         Uncharacterized protein OS=Capsella rubella GN=CARUB_v10006919mg PE=4 SV=1
G7KJY8_MEDTR      4.1e-22  30.64         DUF247 domain protein OS=Medicago truncatula GN=MTR_6g009170 PE=4 SV=1
M5XHJ6_PRUPE      2.0e-21  32.16         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa018248mg PE=4 SV=1
A0A0D3DJY2_BRAOL  2.7e-21  33.33         Uncharacterized protein OS=Brassica oleracea var. oleracea PE=4 SV=1

Match Name   E-value  Identity (%)  Description
AT4G31980.1  2.3e-24  33.80         unknown protein
AT3G47200.1  2.3e-16  32.67         Plant protein of unknown function (DUF247)
AT3G47210.1  6.7e-16  31.82         Plant protein of unknown function (DUF247)
AT3G50160.1  3.3e-15  28.00         Plant protein of unknown function (DUF247)
AT3G50180.1  3.7e-14  29.05         Plant protein of unknown function (DUF247)

Match Name                        E-value  Identity (%)  Description
gi|659088876|ref|XP_008445209.1|  1.9e-28  35.63         PREDICTED: UPF0481 protein At3g47200-like [Cucumis melo]
gi|449442176|ref|XP_004138858.1|  7.2e-28  35.94         PREDICTED: UPF0481 protein At3g47200-like [Cucumis sativus]
gi|727512971|ref|XP_010432793.1|  6.3e-24  30.58         PREDICTED: putative UPF0481 protein At3g02645 isoform X2 [Camelina sativa]
gi|727512973|ref|XP_010432794.1|  4.1e-23  32.91         PREDICTED: UPF0481 protein At3g47200-like isoform X1 [Camelina sativa]
gi|645281380|ref|XP_008245592.1|  6.9e-23  34.40         PREDICTED: UPF0481 protein At3g47200-like [Prunus mume]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR004158  DUF247_pln
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh05G003960.1  CmoCh05G003960.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description                            Source   Source Term    Source Description   Alignment
IPR004158  Protein of unknown function DUF247, plant  PFAM     PF03140        DUF247               coord: 243..290, score: 6.1E-9; coord: 48..241, score: 6.3
None       No IPR available                           PANTHER  PTHR31549      FAMILY NOT NAMED     coord: 4..241, score: 4.8
None       No IPR available                           PANTHER  PTHR31549:SF2  SUBFAMILY NOT NAMED  coord: 4..241, score: 4.8

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None