Cp4.1LG03g10910 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG03g10910
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: OB-fold-like isoform 1
Location: Cp4.1LG03 : 11433739 .. 11438402 (+)
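The location field above can be turned into a sequence identifier, coordinates, strand, and span length. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive (the page does not say so), and the helper names `parse_location` and `span_length` are hypothetical, not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'SEQID : START .. END (STRAND)' into its four parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of the genomic span, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG03 : 11433739 .. 11438402 (+)")
print(seqid, strand, span_length(start, end))  # Cp4.1LG03 + 4664
```

Under that assumption the gene spans 4664 bases on the forward strand of Cp4.1LG03.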
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
CGTAGCTTGGGAAAGTTGGAGACTGGTTTTCAGCTCAAGAAAATCCGGCATTCTGAACCCTGCTAAACCCCTACCACCGGCGGAGGTCTTAAAAGATGGGGAAGAAGAAGCATAAGCGACCGGAATCGGAGTCGGAGGCACTTGACGGAGGCGATTTTGAAGCTCGGCATGAAACTGAGCTCATTAATGGATGCTCTTCTGAGACGAAGAAGAAGAAGAAGAAGAGGAAGAAGCAGAATGTTGAGAATGAGGAAATCGAGGGGGAGAAGAGGAAACCGATTTCAAAACCTACAGTGAGCATGGCAGTATCCGGTTCCATCATTGATAACGCTCAATCCCTTGAGCTTGCCACCAGAGTACGCATTTTCTATTTTATGAGCGAGTGTTTTTCGAGTTCTTCCCCCTTTCCCATGTCCCGTGGAGGATGATATTTGTTATGCCTTCTCTTTGAATGTTTTGATTAGTTCTCTTTTCTCTATGCATCTCTACTACTGAAATGGAGTGATGTGATTAATTATGAAACAACTTATGGACTTCTTTCCAGTTGGCTGGTCAAATCGCTCGAGCGGCAACCATATTCCGAATTGACGAGGTCTCCATCTTTTTCATTTGTTTCATCCTCTTCTTTGCTTCAAATTTTCTAGAACAGTTATTGGCATCATCACTGGAATTGTTCAACTCAGGAAGATAGTTTTTGTATGTATCAGGTGGTGGTATTTGACAGTGGAAGGAGTTCAATGACTGGCTCGGATGTTGCGGCAGCTAACAATTCAGATGAGGATGAAAGTGGCGCTGCTTTTCTTATAAGGATCTTGAAGTATCTTGAGACTCCACAATACTTGAGAAAAGCTCTTTTCCCAAAGCACAACAATTTAAGATTTGTGGTCCGGCCTAAACATTCCTTGGTTGCTTTAATCAAAGAAGTAATATGTTCTGTTTTTCTTCTGACCTTTTTATTGTGAATTTGTTATAGGGGATGTTGCCCCCGCTTGATGCCCCTCACCATTTGCGCAAGCATGAATGGGGTCCCTATCGAGAAGGTAAATTTTGAAATCTCCTTATGGGTTTTCTATAAAATTTGCATCTGAATCTTTGCCCTGCATGTGAGGATATTGTTCCCTTTCTTGTTCCAAGCACACCTTTGAAACTATAGCTTTTAGTTGCATTAATCTCGAGGATCGTCGTCGTACTTTTGAACTTTGGGCTATACAACAATTCCTGGAGATTCATTTTTCTCAATGTATAAGCTTCTCTTTACCGCTTCTCAGAATCTATAAAGATATTGCTAGTGATTTTGTTCATCTTCTGTCTGATAATTTCTTTAATGCTGGAGTAACATAGTTTGGAGAAGTTTTCCATTCCACATTTGCAACTGTCGCTTGAAAAATTCAAGTTTCTTCTATTCATTTTGGAAGACTCTTGAAGATGAGTCTAGGACTGGGACTATATAATTACATGCATATCTATCTGGCTATATCTTTGGATGACTTCTGATTTATTATGTTGCCATCCTCGGCTACTGCCTTCTTTTTTTTTTTTTTCACTTTTCATTTTGGACTTCATTTTGCATGTTAAAATTAGGTGTTACCTTGAAAGAAAGAGCTCCAGATGCTAGAGGAACATTAGTTGATGTTGGTTTGAGTAAGGTAAAGCCAAATTTTCTCGAGTATATAGAATGGAAACACAATTTTTTTCCTACGTTGAGCTTGAGCTATGCTACCTGTTGGTTTTTAGATGAATTGGAAAGTGTACGCTAATTCAATTTCTCTTTGATTTTTTTGAAGTTAAATGCTTCCAACAACCATATTTTACTTAACTGGGTGTAGTTCGTTGGTTTAATAATTTGATTTTAAATTGTGCAGAACGTGGTAGTTGATGAAATACTTGAACCCGGAAGAAGAGTAACAGTTGCTATGGGAACAGATCGCAATTTACTTGCTGGTATTGTCTGATGCTAGTACATATTTGAATAAAATATACAAATGTACAAATTGACAGTT
GTTTTAATCCCTGCTAGTCTATAGTAGATGCCTTGATTAGTATTGGCTCTATGCTTGTACTTTTAATGTAATGCTTCAAGCCCACCACTAGCAGATATTGTCCTCCCACCACTAGCAGATATTGTCCTCCCACCACTAGTAGATATTGTCCTCTTTGAGTTTTTCCTTTCGGGCTTTTCCTCAAGGTTTTTAAAATGTCTCTACTAGGGAGAGGTTTCCACACCTTTATAAAGGATGATTCGTTCTTCTCTCCAACCGATGTAGGATCTCACATTTAAAGTGTGCAACACTTATTGATGTCTACAAGTGTTATTTAAAAACTGGTAAAAAGAGAATATGGACACCAAAAATTAGAGTTTTGTAACTTAGCTGCAGTGAAAATACATTATTAGACTATATCTATTAATTGTTTTCCGAAAAACCAATAAACTAGTGACTGCTGGCTAAGCTCGTTCAGCTGTCATCTGCCACAATCCTAGTTCAGTCTCCACTTTCATGCGCACTTAATTGTCGTCATACATGTGTTAGTCATGCTAATGCAAGTTTGTCATAAATGGAAAATGAGTATTTGCTCATGGATTTCAAAATCCTAAACTTCATTTGTCCAGATCTACCCCGCCAGGTTGTCTCATCTTCAAAGCCTGTGGAAGAAGAACTTTACTGGGGTTATAGAGTGCGATATGCTTCTTCTTTGAGTGCAGTTTTTAAGGAGAGTTCATACGAGGTATATGCTTATAACTTTTCGTGAATCTTCTAAAAATTAATATCATTTTCTTCACATTTTATTGCTGTTTTGTTGCAAGGGTGGTTATGATCACTTGATTGGAACCTCAGAGCATGGAACAATAATTAAGTCCTCGGAACTAATACTACCTTCTTTCAGGTATTTTGCTTAAGTCCACCGCCATGTGACCTGGGTAGTTCGGTTTCAAACTGATACCTTTAGCACATGTTGCTCGTTGATTTCTTGTTACCCTTGTGCTAATTAATTGTGTTTAAATATAAAAATGAAATGGTTGTTGTTTTTCTTTTTGTTCCATTTTTCTCGTCATTTACAATGTCATATAACAGAAGTTAGCATCTTTAATGTCTTTGAACTATATTCAGTGGGTTGAACTCAAATCTTGAGAATTTTTACCATAGCTGTACGAATACCCCTTTCATCATAGCCACCAAACTTACAGAAGCATATGCTGGCTTTATTAACATTATAGTTTGTAATGGAAAAATGTGTCTAAGAGGGGCTTTTAAAGAACAGTATTAAAGGCATCTTGCCAATTAAATGTATGGCGAGCCTCTGTTTTATGCATTTTCAACAGACGATGTAAAAAGGTTCCATTTTTTTAAAGAACTTATATGCATTTTCTCACATGAACAATTCTCTGTCCGAGCCAGATTGTTTCAATTGACAAGCATCTTTTCTTTTCTTTTTTTTTTTTTTTTTTTTGTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGGTTTTTCTGCCTTTTGAGTTGATTAATTGTTAATCATTGGCAGACATCTTTTAATTGCCTTTGGTGGACTTGCCGGTTTGGAGGAGAGCATTGAGGAAGATAACAACTTCAAGGTGTTCATCTTAATAACTTCACGGACATAGTACCATAGGCTAATCTTTCAATGAAATGATAGTTCTTAAAAGCACTCATCTAGTAATCTTTAAAGCATTTGATAGTTATAAAATGAACTAAATAAAAATTATGTTTCTGTCTTTGGATTACTTAAGTAAATAGTTAATACTAACATGTCGTGACTGTTTCGATCGGTTAGAGTAAAAATGCTTGTGAGATCTTTTCTTCGTACTTGAATACATGTCCACTGCAAGGAAGCCGAACAATTCGAACAGAGGTACTCCACCATTA
TCATCTCCTCTATTTTCTTTTTCATTCCTGCATATTCTCTCATTTAAGCTTGCTTGGTCTTTAGGAAGCAATACTCATCTCTCTCCAGTACTTCCAAGAACCGATAAGCAGGGCATTGCAGATTTCAGCAGATTAGATTTGATTGTTAGAAGCTTCAAAATCCAAGATCCTGCCAATGTGCTCGCATGGAGCCGCTTTCCCACATGGCTTGAACTTGGGCAGCTGATTCAGTTGTGCTCACAAAAGGGTTTAAAGATACGTCCGTCCCTTCTATGTTGTAACTTTAATTGTCCTTCAAATTTGGACGCTTTAATTCATATATATGTTTTGTTTTGAACTTCTAGGCCTTCAGAATATGAACAATATGTGGGTTATTGAAATGGAACCCATTTTCATCTATTGTGTCTGTTACAGATCTCCAATCAGAAATGTCTCCATAACTCAAGAACAAATGAAAAAGAAGAAAGCATAGGGTGGCTATTGCCAAATTGCCGTCTGTCTGATAAATTTTAGAGTGAACTATGGGGAGTCTTTAGAATGAACCCATCTTCGCAGTCCAAAGTCTTCACATTTTTTATAAATGCTCTAGTCGTAGCTCTTAAGATTAAAGCTATGATACCAACTGTCACGGTTGTACTTTTTCAGCCGTGCAGCGTCGTGATCT

mRNA sequence

CGTAGCTTGGGAAAGTTGGAGACTGGTTTTCAGCTCAAGAAAATCCGGCATTCTGAACCCTGCTAAACCCCTACCACCGGCGGAGGTCTTAAAAGATGGGGAAGAAGAAGCATAAGCGACCGGAATCGGAGTCGGAGGCACTTGACGGAGGCGATTTTGAAGCTCGGCATGAAACTGAGCTCATTAATGGATGCTCTTCTGAGACGAAGAAGAAGAAGAAGAAGAGGAAGAAGCAGAATGTTGAGAATGAGGAAATCGAGGGGGAGAAGAGGAAACCGATTTCAAAACCTACAGTGAGCATGGCAGTATCCGGTTCCATCATTGATAACGCTCAATCCCTTGAGCTTGCCACCAGATTGGCTGGTCAAATCGCTCGAGCGGCAACCATATTCCGAATTGACGAGGAAGATAGTTTTTGTATGTATCAGGTGGTGGTATTTGACAGTGGAAGGAGTTCAATGACTGGCTCGGATGTTGCGGCAGCTAACAATTCAGATGAGGATGAAAGTGGCGCTGCTTTTCTTATAAGGATCTTGAAGTATCTTGAGACTCCACAATACTTGAGAAAAGCTCTTTTCCCAAAGCACAACAATTTAAGATTTGTGGGGATGTTGCCCCCGCTTGATGCCCCTCACCATTTGCGCAAGCATGAATGGGGTCCCTATCGAGAAGGTGTTACCTTGAAAGAAAGAGCTCCAGATGCTAGAGGAACATTAGTTGATGTTGGTTTGAGTAAGAACGTGGTAGTTGATGAAATACTTGAACCCGGAAGAAGAGTAACAGTTGCTATGGGAACAGATCGCAATTTACTTGCTGATCTACCCCGCCAGGTTGTCTCATCTTCAAAGCCTGTGGAAGAAGAACTTTACTGGGGTTATAGAGTGCGATATGCTTCTTCTTTGAGTGCAGTTTTTAAGGAGAGTTCATACGAGGGTGGTTATGATCACTTGATTGGAACCTCAGAGCATGGAACAATAATTAAGTCCTCGGAACTAATACTACCTTCTTTCAGACATCTTTTAATTGCCTTTGGTGGACTTGCCGGTTTGGAGGAGAGCATTGAGGAAGATAACAACTTCAAGAGTAAAAATGCTTGTGAGATCTTTTCTTCGTACTTGAATACATGTCCACTGCAAGGAAGCCGAACAATTCGAACAGAGGAAGCAATACTCATCTCTCTCCAGTACTTCCAAGAACCGATAAGCAGGGCATTGCAGATTTCAGCAGATTAGATTTGATTGTTAGAAGCTTCAAAATCCAAGATCCTGCCAATGTGCTCGCATGGAGCCGCTTTCCCACATGGCTTGAACTTGGGCAGCTGATTCAGTTGTGCTCACAAAAGGGTTTAAAGATACGTCCGTCCCTTCTATGTTGTAACTTTAATTGTCCTTCAAATTTGGACGCTTTAATTCATATATATGTTTTGTTTTGAACTTCTAGGCCTTCAGAATATGAACAATATGTGGGTTATTGAAATGGAACCCATTTTCATCTATTGTGTCTGTTACAGATCTCCAATCAGAAATGTCTCCATAACTCAAGAACAAATGAAAAAGAAGAAAGCATAGGGTGGCTATTGCCAAATTGCCGTCTGTCTGATAAATTTTAGAGTGAACTATGGGGAGTCTTTAGAATGAACCCATCTTCGCAGTCCAAAGTCTTCACATTTTTTATAAATGCTCTAGTCGTAGCTCTTAAGATTAAAGCTATGATACCAACTGTCACGGTTGTACTTTTTCAGCCGTGCAGCGTCGTGATCT

Coding sequence (CDS)

ATGGGGAAGAAGAAGCATAAGCGACCGGAATCGGAGTCGGAGGCACTTGACGGAGGCGATTTTGAAGCTCGGCATGAAACTGAGCTCATTAATGGATGCTCTTCTGAGACGAAGAAGAAGAAGAAGAAGAGGAAGAAGCAGAATGTTGAGAATGAGGAAATCGAGGGGGAGAAGAGGAAACCGATTTCAAAACCTACAGTGAGCATGGCAGTATCCGGTTCCATCATTGATAACGCTCAATCCCTTGAGCTTGCCACCAGATTGGCTGGTCAAATCGCTCGAGCGGCAACCATATTCCGAATTGACGAGGAAGATAGTTTTTGTATGTATCAGGTGGTGGTATTTGACAGTGGAAGGAGTTCAATGACTGGCTCGGATGTTGCGGCAGCTAACAATTCAGATGAGGATGAAAGTGGCGCTGCTTTTCTTATAAGGATCTTGAAGTATCTTGAGACTCCACAATACTTGAGAAAAGCTCTTTTCCCAAAGCACAACAATTTAAGATTTGTGGGGATGTTGCCCCCGCTTGATGCCCCTCACCATTTGCGCAAGCATGAATGGGGTCCCTATCGAGAAGGTGTTACCTTGAAAGAAAGAGCTCCAGATGCTAGAGGAACATTAGTTGATGTTGGTTTGAGTAAGAACGTGGTAGTTGATGAAATACTTGAACCCGGAAGAAGAGTAACAGTTGCTATGGGAACAGATCGCAATTTACTTGCTGATCTACCCCGCCAGGTTGTCTCATCTTCAAAGCCTGTGGAAGAAGAACTTTACTGGGGTTATAGAGTGCGATATGCTTCTTCTTTGAGTGCAGTTTTTAAGGAGAGTTCATACGAGGGTGGTTATGATCACTTGATTGGAACCTCAGAGCATGGAACAATAATTAAGTCCTCGGAACTAATACTACCTTCTTTCAGACATCTTTTAATTGCCTTTGGTGGACTTGCCGGTTTGGAGGAGAGCATTGAGGAAGATAACAACTTCAAGAGTAAAAATGCTTGTGAGATCTTTTCTTCGTACTTGAATACATGTCCACTGCAAGGAAGCCGAACAATTCGAACAGAGGAAGCAATACTCATCTCTCTCCAGTACTTCCAAGAACCGATAAGCAGGGCATTGCAGATTTCAGCAGATTAG
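In this record the CDS appears to be one contiguous stretch of the mRNA, so the untranslated regions can be recovered by locating the CDS inside the mRNA string. The following is a minimal sketch under that assumption; `split_utrs` is a hypothetical helper, and the short sequences in the usage lines are toy stand-ins for the full sequences listed on this page.

```python
def split_utrs(mrna, cds):
    """Return (five_prime_utr, three_prime_utr) by locating cds within mrna.

    Assumes the CDS occurs in the mRNA as a single contiguous substring.
    """
    start = mrna.find(cds)
    if start < 0:
        raise ValueError("CDS not found as a contiguous substring of the mRNA")
    end = start + len(cds)
    return mrna[:start], mrna[end:]

# Toy example: 5 bases of 5' UTR, a 9-base CDS, 5 bases of 3' UTR.
five, three = split_utrs("CGTAG" + "ATGGGGTAG" + "ATTTG", "ATGGGGTAG")
print(len(five), len(three))  # 5 5
```

Applied to the full mRNA and CDS strings, the same call gives the UTR sequences and their lengths.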

Protein sequence

MGKKKHKRPESESEALDGGDFEARHETELINGCSSETKKKKKKRKKQNVENEEIEGEKRKPISKPTVSMAVSGSIIDNAQSLELATRLAGQIARAATIFRIDEEDSFCMYQVVVFDSGRSSMTGSDVAAANNSDEDESGAAFLIRILKYLETPQYLRKALFPKHNNLRFVGMLPPLDAPHHLRKHEWGPYREGVTLKERAPDARGTLVDVGLSKNVVVDEILEPGRRVTVAMGTDRNLLADLPRQVVSSSKPVEEELYWGYRVRYASSLSAVFKESSYEGGYDHLIGTSEHGTIIKSSELILPSFRHLLIAFGGLAGLEESIEEDNNFKSKNACEIFSSYLNTCPLQGSRTIRTEEAILISLQYFQEPISRALQISAD
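The protein sequence is consistent with a translation of the CDS under the standard genetic code. The sketch below shows that translation on the first and last codons of the CDS; it is an illustration, not the pipeline the database used, and `translate` is a hypothetical helper that assumes an upper-case A/C/G/T sequence read in frame 1.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGGGAAGAAGAAGCATAAGCGACCGGAATCG"))  # MGKKKHKRPES
print(translate("GCAGATTAG"))                          # AD
```

The two outputs match the start (MGKKKHKRPES) and end (AD) of the protein sequence listed above.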
BLAST of Cp4.1LG03g10910 vs. Swiss-Prot
Match: CI114_MOUSE (Putative methyltransferase C9orf114 homolog OS=Mus musculus GN=D2Wsu81e PE=1 SV=1)

HSP 1 Score: 256.9 bits (655), Expect = 3.4e-67
Identity = 156/343 (45.48%), Positives = 207/343 (60.35%), Query Frame = 1

Query: 36  ETKKKKKKRKKQNVENEEIEGEKRKPISKPTVSMAVSGSIIDNAQSLELATRLAGQIARA 95
           E ++ +++  K+  E EE   ++       T+S+A+ GSI+DNAQS EL T LAGQIARA
Sbjct: 45  ERQRAQEEEAKRQEEEEEAAAQRSNQGRPYTLSVALPGSILDNAQSPELRTYLAGQIARA 104

Query: 96  ATIFRIDEEDSFCMYQVVVFDSGRSSMTGSDVAAANNSDEDESGAAFLIRILKYLETPQY 155
            TIF +DE        +VVFD      T S         +       L RIL+YLE PQY
Sbjct: 105 CTIFCVDE--------IVVFDE-EGQDTKSVEGEFRGVGKKGQACVQLARILQYLECPQY 164

Query: 156 LRKALFPKHNNLRFVGMLPPLDAPHHLRKHEWGPYREGVTLKERAPDARGTLVDVGLSKN 215
           LRKA FPKH +L+F G+L PLD+PHH+R+ E   +REGV +        G+LV+ G+ K 
Sbjct: 165 LRKAFFPKHQDLQFAGILNPLDSPHHMRQDEESEFREGVVVDRPTKAGHGSLVNCGMKKE 224

Query: 216 VVVDEILEPGRRVTVAMGTDRNLLADLPR------QVVSSSKP-VEEELYWGYRVRYASS 275
           V +D+ L+PG RVTV +   +     LP        VVSS  P  +  LYWGY VR AS 
Sbjct: 225 VKIDKKLDPGLRVTVRLNQQQ-----LPECKTYKGTVVSSQDPRTKAGLYWGYTVRLASC 284

Query: 276 LSAVFKESSYEGGYDHLIGTSEHGTIIKSSELILPSFRHLLIAFGGLAGLEESIEEDNNF 335
           LSAVF E+ ++ GYD  IGTSE G+ + S++  LPSFRH L+ FGGL GLE +++ D N 
Sbjct: 285 LSAVFAEAPFQDGYDLTIGTSERGSDVASAQ--LPSFRHALVVFGGLQGLEAAVDADPNL 344

Query: 336 KSKNACEIFSSYLNTCPLQGSRTIRTEEAILISLQYFQEPISR 372
           +  +   +F  Y+NTC  QGSRTIRTEEAILISL   Q  +++
Sbjct: 345 EVADPSVLFDFYVNTCLSQGSRTIRTEEAILISLAALQPGLTQ 371
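Each HSP summary reports identical and similar (positive-scoring) positions as a count over the alignment length, followed by a percentage. As a quick sanity check, the percentages can be recomputed from the counts; `percent` below is a hypothetical helper, and rounding to two decimals is an assumption based on how the values are printed here.

```python
def percent(count, alignment_length):
    """Percentage of aligned columns, rounded to two decimals."""
    return round(100.0 * count / alignment_length, 2)

# Counts taken from the CI114_MOUSE HSP above: 156 identical and
# 207 positive-scoring positions over 343 aligned columns.
print(percent(156, 343))  # 45.48
print(percent(207, 343))  # 60.35
```

The alignment length (343) counts columns including gaps, which is why it can differ from the number of query residues covered.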

BLAST of Cp4.1LG03g10910 vs. Swiss-Prot
Match: CI114_HUMAN (Putative methyltransferase C9orf114 OS=Homo sapiens GN=C9orf114 PE=1 SV=3)

HSP 1 Score: 251.9 bits (642), Expect = 1.1e-65
Identity = 160/379 (42.22%), Positives = 219/379 (57.78%), Query Frame = 1

Query: 2   GKKKHKRPESESEALDGGDFEARHETELINGCSSETKKKKKKRKKQN-----VENEEIEG 61
           G+K+   P    + ++   ++ + + E       +  KK ++++ Q      +E EE   
Sbjct: 5   GRKRPCGPGEHGQRIEWRKWKQQKKEEKKKWKDLKLMKKLERQRAQEEQAKRLEEEEAAA 64

Query: 62  EKRKPISKPTVSMAVSGSIIDNAQSLELATRLAGQIARAATIFRIDEEDSFCMYQVVVFD 121
           EK       T+S+A+ GSI+DNAQS EL T LAGQIARA  IF +DE        +VVFD
Sbjct: 65  EKEDRGRPYTLSVALPGSILDNAQSPELRTYLAGQIARACAIFCVDE--------IVVFD 124

Query: 122 S-GRSSMTGSDVAAANNSDEDESGAAFLIRILKYLETPQYLRKALFPKHNNLRFVGMLPP 181
             G+ + T           +       L RIL+YLE PQYLRKA FPKH +L+F G+L P
Sbjct: 125 EEGQDAKTVE--GEFTGVGKKGQACVQLARILQYLECPQYLRKAFFPKHQDLQFAGLLNP 184

Query: 182 LDAPHHLRKHEWGPYREGVTLKERAPDARGTLVDVGLSKNVVVDEILEPGRRVTVAMGTD 241
           LD+PHH+R+ E   +REG+ +        G+ V+ G+ K V +D+ LEPG RVTV +   
Sbjct: 185 LDSPHHMRQDEESEFREGIVVDRPTRPGHGSFVNCGMKKEVKIDKNLEPGLRVTVRLNQQ 244

Query: 242 RNL-LADLPRQVVSSSKP-VEEELYWGYRVRYASSLSAVFKESSYEGGYDHLIGTSEHGT 301
           ++        +VVSS  P  +  LYWGY VR AS LSAVF E+ ++ GYD  IGTSE G+
Sbjct: 245 QHPDCKTYHGKVVSSQDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDGYDLTIGTSERGS 304

Query: 302 IIKSSELILPSFRHLLIAFGGLAGLEESIEEDNNFKSKNACEIFSSYLNTCPLQGSRTIR 361
            + S++L  P+FRH L+ FGGL GLE   + D N +      +F  Y+NTCP QGSRTIR
Sbjct: 305 DVASAQL--PNFRHALVVFGGLQGLEAGADADPNLEVAEPSVLFDLYVNTCPGQGSRTIR 364

Query: 362 TEEAILISLQYFQEPISRA 373
           TEEAILISL   Q  + +A
Sbjct: 365 TEEAILISLAALQPGLIQA 371

BLAST of Cp4.1LG03g10910 vs. Swiss-Prot
Match: YMP6_CAEEL (Putative methyltransferase B0361.6 OS=Caenorhabditis elegans GN=B0361.6 PE=3 SV=1)

HSP 1 Score: 172.2 bits (435), Expect = 1.1e-41
Identity = 127/347 (36.60%), Positives = 185/347 (53.31%), Query Frame = 1

Query: 39  KKKKKRKKQNVENEEIEGEKRKPISKP------TVSMAVSGSIIDNAQSLELATRLAGQI 98
           K +KKRK +    +E E  K+  I K       T+S+AV G  ++NAQS EL T +AGQI
Sbjct: 34  KDEKKRKNEEKIIKEAEEAKKAKIEKVDHTPPFTISIAVPGQFLNNAQSAELRTYMAGQI 93

Query: 99  ARAATIFRIDE----EDSFCMYQVVVFDSGRSSMTGSDVAAANNSDEDESGAAFLIRILK 158
           ARAAT++R+DE    ++S  M    V      +  G+ + A  N +    G  +L +IL+
Sbjct: 94  ARAATLYRVDEIIIYDESCRMTDEAVNAYYNGTWQGNLIPAETNYE----GCFYLAKILE 153

Query: 159 YLETPQYLRKALFPKHNNLRFVGMLPPLDAPHHLRKHEWG-PYREGVTLKERAPDARGTL 218
           YLE PQYLRK LFP    L+  G+L PLDA HHL+  E    +REGV LK+R+ D RG +
Sbjct: 154 YLECPQYLRKDLFPIQKPLKNAGLLNPLDAQHHLKYDEKTLRFREGVVLKKRSKDGRGPI 213

Query: 219 VDVGLSKNVVVDE---ILEPGRRVTVAMGTDRNLL--ADLPRQVVSSSKPVEEE--LYWG 278
            ++GL K   +D     L P  RVTV +   +NL     L R  ++S   V  E  LYWG
Sbjct: 214 CNIGLEKEFEIDSDAVQLPPYTRVTVEI---KNLTEQCKLYRGSITSGATVTRETGLYWG 273

Query: 279 YRVRYASSLSAVFKESSYEGGYDHLIGTSEHGTIIKSSELILPSFRHLLIAFGGLAGLEE 338
           Y VR  + L  V +       +D + G S  G +    ++ + +   +L+ FGG+AG++ 
Sbjct: 274 YSVRLMTGLQKVLQAKK----FDIVAGVSPRGKLASQMDVCILNKPKILLVFGGVAGVDA 333

Query: 339 SIEEDNNFKSKNACEIFSSYLNTCPL-QGSRTIRTEEAILISLQYFQ 367
           ++E +   + + A + F   + T  L  GSR+ R EE +L  L   Q
Sbjct: 334 AVESEELAEWRRAEDAFDVLIRTTSLSNGSRSERVEENVLSVLAQVQ 369

BLAST of Cp4.1LG03g10910 vs. Swiss-Prot
Match: Y1688_HALSA (Uncharacterized protein VNG_1688C OS=Halobacterium salinarum (strain ATCC 700922 / JCM 11081 / NRC-1) GN=VNG_1688C PE=4 SV=2)

HSP 1 Score: 84.7 bits (208), Expect = 2.3e-15
Identity = 90/323 (27.86%), Positives = 138/323 (42.72%), Query Frame = 1

Query: 66  TVSMAVSGSIIDNAQSLELATRLAGQIARAATIFRIDEEDSFCMYQVVVFDSGRSSMTGS 125
           T S+ V  S++  A+    ATR  G +ARAA +FRID        +VVVF          
Sbjct: 2   TRSVLVPSSLVREAEDKREATRKLGYVARAAAVFRID--------RVVVFP--------- 61

Query: 126 DVAAANNSDEDESGAAFLIRILKYLETPQYLRKALFPKHNNLRFVGMLPPLDAPHHLRKH 185
                +   E + G  F+  +L+Y  TP YLRK  F   + L + G+LPP      LR  
Sbjct: 62  -----DEDGERQWGGGFVETVLRYAATPPYLRKEAFDTRDELAYAGVLPP------LRLS 121

Query: 186 EW--------GPYREGVTLKERAPDARGTLVDVGLSKNVVVDE----ILEPGRRVTVAMG 245
            W        G  R+G+ + +   + R   V+ G+   + + E     +  G RVT+ + 
Sbjct: 122 SWTGSDSSGSGSLRQGI-VTQVGSEGR-VRVNCGMQHPISLHEPPGMAVSEGERVTIRVS 181

Query: 246 TDRNLLADLPRQVVSSSKPVEEELYWGYRVRYASSLSAVFKESSYEGGYDHLIGTSEHGT 305
           + R + A L          V++ L  G+ V       A+ +  +        I TS HG 
Sbjct: 182 SRRPVRAKL----------VDDPLP-GFSVERTGLGDALDRSDA-----GVRIATSRHGE 241

Query: 306 IIKSSELILPSFRH------LLIAFGG--------LAGLEESIEEDNNFKSKNACEIFSS 363
            +  S   L  +R       + +AFG         L    +++ E     S +A   F +
Sbjct: 242 PL--SVASLGGYRERIARDGVTVAFGAPERGLPPMLGVSADAVNESVTDSSADAPARFDA 276

BLAST of Cp4.1LG03g10910 vs. Swiss-Prot
Match: Y1612_HALMA (Uncharacterized protein rrnAC1612 OS=Haloarcula marismortui (strain ATCC 43049 / DSM 3752 / JCM 8966 / VKM B-1809) GN=rrnAC1612 PE=4 SV=2)

HSP 1 Score: 78.2 bits (191), Expect = 2.2e-13
Identity = 90/320 (28.12%), Positives = 140/320 (43.75%), Query Frame = 1

Query: 66  TVSMAVSGSIIDNAQSLELATRLAGQIARAATIFRIDEEDSFCMYQVVVFDSGRSSMTGS 125
           T S+ V  S+   A+    ATR  G +ARAA ++R+D        ++ V+          
Sbjct: 2   TTSVLVPSSLAREAEDRREATRKLGYVARAAAVYRVD--------RLTVYP--------- 61

Query: 126 DVAAANNSDEDESGA---AFLIRILKYLETPQYLRKALFPKHNNLRFVGMLPPLDAPHHL 185
                   D D +G     F+  +L+Y  TP +LRK ++ K + L +VG+LPPL      
Sbjct: 62  --------DPDGAGKWEDGFVETVLRYAATPPHLRKEMWGKRDELEYVGVLPPLRVRSQT 121

Query: 186 --RKHEWGPYREGVTLKERAPDARGTLVDVG----LSKNVVVDEILEPGRRVTVAMGTDR 245
                  G  R+G+ + E   D R   V+ G    +S  V  D  +E G RVTV      
Sbjct: 122 GSGSEGSGSLRQGI-VTEVGADGR-VRVNCGMQHPISLPVPADMDVEQGERVTVR----- 181

Query: 246 NLLADLPRQVVSSSKPVEEELY----WGYRVRYASSLSAVFKESSYEGGYDHLIGTSEHG 305
                     VSS +PV  +L      G+ V  A   +A+ ++ +        I +S +G
Sbjct: 182 ----------VSSRRPVRAKLVDAPTTGFDVVAADLDAALSRDDA-----GLTIASSRYG 241

Query: 306 TIIKSSELILPSFRH-----LLIAFG----GLAG-LEESIEEDNNFKSKNACEIFSSYLN 363
             + S+ L   + R      + +AFG    GL   L+ + +     ++ +  E F  +LN
Sbjct: 242 EPVTSTRLGQLAERRDAEGGMTVAFGAPERGLPSILDVAPDAVGGDQTSDEPEGFDLWLN 274

BLAST of Cp4.1LG03g10910 vs. TrEMBL
Match: A0A0A0LAE4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G149370 PE=4 SV=1)

HSP 1 Score: 641.0 bits (1652), Expect = 9.4e-181
Identity = 336/377 (89.12%), Positives = 347/377 (92.04%), Query Frame = 1

Query: 1   MGKKKHKRPESESEALDGGDFEARHETELINGCSSETKKKKKKRKKQNVENEEIEGEKRK 60
           MGKKKHKRPE ESEALD  DFEA HETELINGCS E KKKKKK+K   VEN  IE EKRK
Sbjct: 1   MGKKKHKRPEQESEALDRDDFEANHETELINGCSPEKKKKKKKKK---VENGSIEAEKRK 60

Query: 61  PISKPTVSMAVSGSIIDNAQSLELATRLAGQIARAATIFRIDEEDSFCMYQVVVFDSGRS 120
           PISKPTVS+AVSGSIIDNAQSLELATRLAGQIARAATIFRI+E        VVVFDSGRS
Sbjct: 61  PISKPTVSIAVSGSIIDNAQSLELATRLAGQIARAATIFRINE--------VVVFDSGRS 120

Query: 121 SMTGSDVAAANNSDEDESGAAFLIRILKYLETPQYLRKALFPKHNNLRFVGMLPPLDAPH 180
           S TGS+VAAANNSDEDESGAAFLIRILKYLETPQYLRKALFPKHNNLRFVGMLPPLDAPH
Sbjct: 121 STTGSEVAAANNSDEDESGAAFLIRILKYLETPQYLRKALFPKHNNLRFVGMLPPLDAPH 180

Query: 181 HLRKHEWGPYREGVTLKERAPDARGTLVDVGLSKNVVVDEILEPGRRVTVAMGTDRNLLA 240
           HLRKHEWGPYREGVTLKERAPDA+GT VDVGLSKNVVVDEILEPG RVTVAMGTDRNL +
Sbjct: 181 HLRKHEWGPYREGVTLKERAPDAKGTSVDVGLSKNVVVDEILEPGTRVTVAMGTDRNLFS 240

Query: 241 DLPRQVVSSSKPVEEELYWGYRVRYASSLSAVFKESSYEGGYDHLIGTSEHGTIIKSSEL 300
           DLPRQVVSSSKPVEE LYWGYRVRYASSLSAVFKESSYEGGYDHLIGTSEHG +IKSSEL
Sbjct: 241 DLPRQVVSSSKPVEEGLYWGYRVRYASSLSAVFKESSYEGGYDHLIGTSEHGMVIKSSEL 300

Query: 301 ILPSFRHLLIAFGGLAGLEESIEEDNNFKSKNACEIFSSYLNTCPLQGSRTIRTEEAILI 360
            LP FRHLLIAFGGLAGLEESIEEDNNFKSKNA EIFSSYLNTCPLQGSRTIRTEEAI I
Sbjct: 301 TLPPFRHLLIAFGGLAGLEESIEEDNNFKSKNAHEIFSSYLNTCPLQGSRTIRTEEAIFI 360

Query: 361 SLQYFQEPISRALQISA 378
           SLQYFQEPI++A+QI+A
Sbjct: 361 SLQYFQEPINKAMQIAA 366

BLAST of Cp4.1LG03g10910 vs. TrEMBL
Match: A5ASC7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_012192 PE=4 SV=1)

HSP 1 Score: 485.3 bits (1248), Expect = 6.6e-134
Identity = 264/375 (70.40%), Positives = 301/375 (80.27%), Query Frame = 1

Query: 1   MGKKKHKRPESESEALDGGDFEARHETELINGCSSETKKKKKKRKKQNVENEEIEGEKRK 60
           MGKKK +R + E+E  +     A +ETELING  S +KKKK K  K   +  +I      
Sbjct: 1   MGKKKKRRSDFEAETENN---TAENETELING-DSRSKKKKNKTHKDKYQATDI------ 60

Query: 61  PISKPTVSMAVSGSIIDNAQSLELATRLAGQIARAATIFRIDEEDSFCMYQVVVFDSGRS 120
               PT+++AV GSII NAQSLELATRLAGQ+ARAATIFRIDE        VVVFD   +
Sbjct: 61  ----PTLTIAVPGSIIHNAQSLELATRLAGQVARAATIFRIDE--------VVVFDCKST 120

Query: 121 SMTGSDVAAANNSDEDESGAAFLIRILKYLETPQYLRKALFPKHNNLRFVGMLPPLDAPH 180
           S   S VA  + SDE+E+GAAFLIRIL+YLETPQYLRK LFPKHN+L+FVGMLPPLDAPH
Sbjct: 121 SGDNSTVATPDASDENETGAAFLIRILRYLETPQYLRKILFPKHNSLKFVGMLPPLDAPH 180

Query: 181 HLRKHEWGPYREGVTLKERAPDARGTLVDVGLSKNVVVDEILEPGRRVTVAMGTDRNLLA 240
           HLRKHEWGPYREGVTLKERAP + GTLVDVGL+KNVV+D++LEPG RVTVAMGT+RNL A
Sbjct: 181 HLRKHEWGPYREGVTLKERAPSSVGTLVDVGLNKNVVIDQVLEPGIRVTVAMGTNRNLDA 240

Query: 241 DLPRQVVSSSKPVEE-ELYWGYRVRYASSLSAVFKESSYEGGYDHLIGTSEHGTIIKSSE 300
           D   QVVSSSKP EE   YWGY+VRYAS++S+VFKE  ++GGYDHLIGTSEHG I+KSSE
Sbjct: 241 DFVHQVVSSSKPREEVGTYWGYKVRYASNISSVFKECPFKGGYDHLIGTSEHGLIVKSSE 300

Query: 301 LILPSFRHLLIAFGGLAGLEESIEEDNNFKSKNACEIFSSYLNTCPLQGSRTIRTEEAIL 360
           L +PSFRHLLIAFGGLAGLEES+EED++ K KN  EIF SYLNTCP QGSRTIRTEEAIL
Sbjct: 301 LDIPSFRHLLIAFGGLAGLEESVEEDHSLKGKNVREIFDSYLNTCPNQGSRTIRTEEAIL 353

Query: 361 ISLQYFQEPISRALQ 375
           ISLQYFQEPI+RALQ
Sbjct: 361 ISLQYFQEPINRALQ 353

BLAST of Cp4.1LG03g10910 vs. TrEMBL
Match: D7T6U1_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_05s0020g02950 PE=4 SV=1)

HSP 1 Score: 484.6 bits (1246), Expect = 1.1e-133
Identity = 264/375 (70.40%), Positives = 300/375 (80.00%), Query Frame = 1

Query: 1   MGKKKHKRPESESEALDGGDFEARHETELINGCSSETKKKKKKRKKQNVENEEIEGEKRK 60
           MGKKK +R + E+E  +     A +ETELING  S +KKKK K  K   +  +I      
Sbjct: 1   MGKKKKRRSDFEAETENN---TAENETELING-DSRSKKKKNKTHKDKYQATDI------ 60

Query: 61  PISKPTVSMAVSGSIIDNAQSLELATRLAGQIARAATIFRIDEEDSFCMYQVVVFDSGRS 120
               PT+++AV GSII NAQSLELATRLAGQIARAATIFRIDE        VVVFD   +
Sbjct: 61  ----PTLTIAVPGSIIHNAQSLELATRLAGQIARAATIFRIDE--------VVVFDCKST 120

Query: 121 SMTGSDVAAANNSDEDESGAAFLIRILKYLETPQYLRKALFPKHNNLRFVGMLPPLDAPH 180
           S   S VA  + SDE+E+G AFLIRIL+YLETPQYLRK LFPKHN+L+FVGMLPP+DAPH
Sbjct: 121 SGDDSTVATPDASDENETGPAFLIRILRYLETPQYLRKTLFPKHNSLKFVGMLPPVDAPH 180

Query: 181 HLRKHEWGPYREGVTLKERAPDARGTLVDVGLSKNVVVDEILEPGRRVTVAMGTDRNLLA 240
           HLRKHEWGPYREGVTLKERAP + GTLVDVGL+KNVV+D++LEPG RVTVAMGT+RNL A
Sbjct: 181 HLRKHEWGPYREGVTLKERAPSSVGTLVDVGLNKNVVIDQVLEPGIRVTVAMGTNRNLDA 240

Query: 241 DLPRQVVSSSKPVEE-ELYWGYRVRYASSLSAVFKESSYEGGYDHLIGTSEHGTIIKSSE 300
           D   QVVSSSKP EE   YWGY+VRYAS++S+VFKE  ++GGYDHLIGTSEHG I+KSSE
Sbjct: 241 DFVHQVVSSSKPREEVGTYWGYKVRYASNISSVFKECPFKGGYDHLIGTSEHGLIVKSSE 300

Query: 301 LILPSFRHLLIAFGGLAGLEESIEEDNNFKSKNACEIFSSYLNTCPLQGSRTIRTEEAIL 360
           L +PSFRHLLIAFGGLAGLEES+EEDN+ K KN  EIF SYLNTCP QGSRTIRTEEAIL
Sbjct: 301 LDIPSFRHLLIAFGGLAGLEESVEEDNSLKGKNVREIFDSYLNTCPNQGSRTIRTEEAIL 353

Query: 361 ISLQYFQEPISRALQ 375
           ISLQYFQEPI+RALQ
Sbjct: 361 ISLQYFQEPINRALQ 353

BLAST of Cp4.1LG03g10910 vs. TrEMBL
Match: B9RTY3_RICCO (Protein C9orf114, putative OS=Ricinus communis GN=RCOM_0913800 PE=4 SV=1)

HSP 1 Score: 474.9 bits (1221), Expect = 8.9e-131
Identity = 260/380 (68.42%), Positives = 305/380 (80.26%), Query Frame = 1

Query: 1   MGKKKHKRPESESEALDGGDFEARHETE---LINGCSSETKKKKKKRKKQNVENEEIEGE 60
           MGKKK +   +E+EA    +    HE     ++NG S   KKKKKK K++N   +E E E
Sbjct: 1   MGKKKKR---AEAEAQTETETVENHEPVNDVVVNGDSDRKKKKKKKEKERNERKKE-ENE 60

Query: 61  KRKPISKPTVSMAVSGSIIDNAQSLELATRLAGQIARAATIFRIDEEDSFCMYQVVVFDS 120
            ++     T+S+AV GSIIDNAQSLELATRLAGQIARAATIFRIDE        VVVFD+
Sbjct: 61  SKETA---TISIAVPGSIIDNAQSLELATRLAGQIARAATIFRIDE--------VVVFDN 120

Query: 121 GRSSMTG--SDVAAANNSDEDESGAAFLIRILKYLETPQYLRKALFPKHNNLRFVGMLPP 180
             SS+    + +   NNSDE+ESGAAFLIRIL+YLETPQYLRKALFP+ N+LRFVG+LPP
Sbjct: 121 ESSSVKEDRTTMITGNNSDENESGAAFLIRILRYLETPQYLRKALFPRLNSLRFVGLLPP 180

Query: 181 LDAPHHLRKHEWGPYREGVTLKERAPDARGTLVDVGLSKNVVVDEILEPGRRVTVAMGTD 240
           LDAPHHLRKHEW P+REGVTLKE+AP++ GTLVDVGLSKNVV+D+++EPG RVTV MGTD
Sbjct: 181 LDAPHHLRKHEWAPFREGVTLKEKAPNSIGTLVDVGLSKNVVIDQVVEPGIRVTVEMGTD 240

Query: 241 RNLLADLPRQVVSSSKPVEEE-LYWGYRVRYASSLSAVFKESSYEGGYDHLIGTSEHGTI 300
           RNL ++LPRQVVS SKP EE  +YWGYRVRYAS++S VF +  Y+GGYDHL+GTSEHG I
Sbjct: 241 RNLDSELPRQVVSLSKPREEAGMYWGYRVRYASNISTVFNDCPYKGGYDHLVGTSEHGQI 300

Query: 301 IKSSELILPSFRHLLIAFGGLAGLEESIEEDNNFKSKNACEIFSSYLNTCPLQGSRTIRT 360
           I +S+L LP+FRHLLIAFGGLAGLEESIEEDN+ K KN  E+F+SYLNTCP QGSRTIRT
Sbjct: 301 INASKLSLPTFRHLLIAFGGLAGLEESIEEDNSLKGKNVREVFNSYLNTCPHQGSRTIRT 360

Query: 361 EEAILISLQYFQEPISRALQ 375
           EEAI ISLQYFQEPI+RALQ
Sbjct: 361 EEAIFISLQYFQEPINRALQ 365

BLAST of Cp4.1LG03g10910 vs. TrEMBL
Match: A0A067G5A9_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g017050mg PE=4 SV=1)

HSP 1 Score: 474.2 bits (1219), Expect = 1.5e-130
Identity = 258/380 (67.89%), Positives = 300/380 (78.95%), Query Frame = 1

Query: 1   MGKKKHK---RPESESEALDGGDFEARHETELINGCSSETKKKKKKRKKQNVENEEIEGE 60
           MG KK +    PE + EA  G + E+++E  L NG SS    KKKK++K++  N++    
Sbjct: 1   MGNKKKRGGLEPELK-EAATGENHESQNELSLANGDSSSCDNKKKKKRKRDQLNDDA--- 60

Query: 61  KRKPISKPTVSMAVSGSIIDNAQSLELATRLAGQIARAATIFRIDEEDSFCMYQVVVFDS 120
              PI  PTVS+AV GSIIDN QSLELATRLAGQIARA TIFRIDE        VVVFD+
Sbjct: 61  ---PIEVPTVSVAVPGSIIDNTQSLELATRLAGQIARAVTIFRIDE--------VVVFDN 120

Query: 121 GRSSMTGSDVAAANNS---DEDESGAAFLIRILKYLETPQYLRKALFPKHNNLRFVGMLP 180
             SS   S  +AAN S   DE+ESGAAFL+R+L+YLETPQYLRKALF  H++LRFVGMLP
Sbjct: 121 KSSSDNYSRSSAANRSNRSDENESGAAFLVRLLQYLETPQYLRKALFSMHSSLRFVGMLP 180

Query: 181 PLDAPHHLRKHEWGPYREGVTLKERAPDARGTLVDVGLSKNVVVDEILEPGRRVTVAMGT 240
           PLDAPHHLRKHEW P+REGVTLKE AP++ GTLVDVGL+K+VVVD++L+PG RVTVAMGT
Sbjct: 181 PLDAPHHLRKHEWAPFREGVTLKENAPNSVGTLVDVGLNKHVVVDQVLDPGVRVTVAMGT 240

Query: 241 DRNLLADLPRQVVSSSKPVEEELYWGYRVRYASSLSAVFKESSYEGGYDHLIGTSEHGTI 300
           +RNL AD PRQVV  SKP E  +YWGY+VRYA ++S+VFK  SY+GGYDHLIGTSEHG I
Sbjct: 241 NRNLDADSPRQVVPPSKPKESGMYWGYKVRYAPNISSVFKNCSYKGGYDHLIGTSEHGDI 300

Query: 301 IKSSELILPSFRHLLIAFGGLAGLEESIEEDNNFKSKNACEIFSSYLNTCPLQGSRTIRT 360
           + SS+L LP+FRHLLIAFGGLAGLEESIEED+  K KNA E+F SY NTCP QGSRTIRT
Sbjct: 301 VNSSDLTLPTFRHLLIAFGGLAGLEESIEEDDGLKRKNAREVFHSYFNTCPHQGSRTIRT 360

Query: 361 EEAILISLQYFQEPISRALQ 375
           EEAI ISLQYFQEPISRAL+
Sbjct: 361 EEAIFISLQYFQEPISRALR 365

BLAST of Cp4.1LG03g10910 vs. TAIR10
Match: AT5G19300.1 (AT5G19300.1 Nucleic acid-binding, OB-fold-like (InterPro:IPR016027), Protein of unknown function DUF171 (InterPro:IPR003750))

HSP 1 Score: 440.7 bits (1132), Expect = 9.4e-124
Identity = 246/383 (64.23%), Positives = 291/383 (75.98%), Query Frame = 1

Query: 3   KKKHKRPESESEALDGGDFEARHETEL-INGCSSETKKKKKKRKKQNVENEEIEGEKRK- 62
           KKK+K  +  S      D E   E ++ ++G S E K KKK++ K   E  E+  EK K 
Sbjct: 30  KKKNKNKKKRSHE----DTEIEPEQKMSLDGDSKEEKIKKKRKNKNQEEEPELVTEKTKV 89

Query: 63  --------PISKPTVSMAVSGSIIDNAQSLELATRLAGQIARAATIFRIDEEDSFCMYQV 122
                      + TVS+A++GSII N QSLELATRLAGQIARAATIFRIDE        +
Sbjct: 90  QEEEKGNVEEGRATVSIAIAGSIIHNTQSLELATRLAGQIARAATIFRIDE--------I 149

Query: 123 VVFDSGRSSMTGSDVAAANNSDEDESGAAFLIRILKYLETPQYLRKALFPKHNNLRFVGM 182
           VVFD+  SS   S  AA N SD +ESGA+FL+RILKYLETPQYLRK+LFPK N+LR+VGM
Sbjct: 150 VVFDNKSSSEIES--AATNASDSNESGASFLVRILKYLETPQYLRKSLFPKQNDLRYVGM 209

Query: 183 LPPLDAPHHLRKHEWGPYREGVTLKERAPDARGTLVDVGLSKNVVVDEILEPGRRVTVAM 242
           LPPLDAPHHLRKHEW  YREGVTL E+AP++ GTLVDVGLSK+VVVD++L PG RVTVAM
Sbjct: 210 LPPLDAPHHLRKHEWEQYREGVTLSEKAPNSEGTLVDVGLSKSVVVDQVLGPGIRVTVAM 269

Query: 243 GTDRNLLADLPRQVVSSSKPVEEE-LYWGYRVRYASSLSAVFKESSYEGGYDHLIGTSEH 302
           GTD +L  DL RQ+V  SKP EE  +YWGY+VRYAS LS+VFKE  +EGGYD+LIGTSEH
Sbjct: 270 GTDHDL--DLVRQIVPPSKPREEAGMYWGYKVRYASQLSSVFKECPFEGGYDYLIGTSEH 329

Query: 303 GTIIKSSELILPSFRHLLIAFGGLAGLEESIEEDNNFKSKNACEIFSSYLNTCPLQGSRT 362
           G +I SSEL +P+FRHLLIAFGGLAGLEESIE+DN +K KN  ++F+ YLNTCP QGSRT
Sbjct: 330 GLVISSSELKIPTFRHLLIAFGGLAGLEESIEDDNQYKGKNVRDVFNVYLNTCPHQGSRT 389

Query: 363 IRTEEAILISLQYFQEPISRALQ 375
           IR EEA+ ISLQYFQEPISRA++
Sbjct: 390 IRAEEAMFISLQYFQEPISRAVR 396

BLAST of Cp4.1LG03g10910 vs. NCBI nr
Match: gi|778678539|ref|XP_004134144.2| (PREDICTED: putative methyltransferase C9orf114 [Cucumis sativus])

HSP 1 Score: 641.0 bits (1652), Expect = 1.3e-180
Identity = 336/377 (89.12%), Positives = 347/377 (92.04%), Query Frame = 1

Query: 1   MGKKKHKRPESESEALDGGDFEARHETELINGCSSETKKKKKKRKKQNVENEEIEGEKRK 60
           MGKKKHKRPE ESEALD  DFEA HETELINGCS E KKKKKK+K   VEN  IE EKRK
Sbjct: 1   MGKKKHKRPEQESEALDRDDFEANHETELINGCSPEKKKKKKKKK---VENGSIEAEKRK 60

Query: 61  PISKPTVSMAVSGSIIDNAQSLELATRLAGQIARAATIFRIDEEDSFCMYQVVVFDSGRS 120
           PISKPTVS+AVSGSIIDNAQSLELATRLAGQIARAATIFRI+E        VVVFDSGRS
Sbjct: 61  PISKPTVSIAVSGSIIDNAQSLELATRLAGQIARAATIFRINE--------VVVFDSGRS 120

Query: 121 SMTGSDVAAANNSDEDESGAAFLIRILKYLETPQYLRKALFPKHNNLRFVGMLPPLDAPH 180
           S TGS+VAAANNSDEDESGAAFLIRILKYLETPQYLRKALFPKHNNLRFVGMLPPLDAPH
Sbjct: 121 STTGSEVAAANNSDEDESGAAFLIRILKYLETPQYLRKALFPKHNNLRFVGMLPPLDAPH 180

Query: 181 HLRKHEWGPYREGVTLKERAPDARGTLVDVGLSKNVVVDEILEPGRRVTVAMGTDRNLLA 240
           HLRKHEWGPYREGVTLKERAPDA+GT VDVGLSKNVVVDEILEPG RVTVAMGTDRNL +
Sbjct: 181 HLRKHEWGPYREGVTLKERAPDAKGTSVDVGLSKNVVVDEILEPGTRVTVAMGTDRNLFS 240

Query: 241 DLPRQVVSSSKPVEEELYWGYRVRYASSLSAVFKESSYEGGYDHLIGTSEHGTIIKSSEL 300
           DLPRQVVSSSKPVEE LYWGYRVRYASSLSAVFKESSYEGGYDHLIGTSEHG +IKSSEL
Sbjct: 241 DLPRQVVSSSKPVEEGLYWGYRVRYASSLSAVFKESSYEGGYDHLIGTSEHGMVIKSSEL 300

Query: 301 ILPSFRHLLIAFGGLAGLEESIEEDNNFKSKNACEIFSSYLNTCPLQGSRTIRTEEAILI 360
            LP FRHLLIAFGGLAGLEESIEEDNNFKSKNA EIFSSYLNTCPLQGSRTIRTEEAI I
Sbjct: 301 TLPPFRHLLIAFGGLAGLEESIEEDNNFKSKNAHEIFSSYLNTCPLQGSRTIRTEEAIFI 360

Query: 361 SLQYFQEPISRALQISA 378
           SLQYFQEPI++A+QI+A
Sbjct: 361 SLQYFQEPINKAMQIAA 366

BLAST of Cp4.1LG03g10910 vs. NCBI nr
Match: gi|659076473|ref|XP_008438700.1| (PREDICTED: uncharacterized protein C9orf114 [Cucumis melo])

HSP 1 Score: 639.0 bits (1647), Expect = 5.1e-180
Identity = 335/377 (88.86%), Positives = 346/377 (91.78%), Query Frame = 1

Query: 1   MGKKKHKRPESESEALDGGDFEARHETELINGCSSETKKKKKKRKKQNVENEEIEGEKRK 60
           MGKKKHKRPE ESEALD  DFEA  ETELINGCS E KKKKKK+K   VEN  IE EKRK
Sbjct: 1   MGKKKHKRPEPESEALDRDDFEANRETELINGCSPEKKKKKKKKK---VENGSIEAEKRK 60

Query: 61  PISKPTVSMAVSGSIIDNAQSLELATRLAGQIARAATIFRIDEEDSFCMYQVVVFDSGRS 120
           PISKPTVS+AVSGSIIDNAQSLELATRLAGQIARAATIFRIDE        VVVFDSGRS
Sbjct: 61  PISKPTVSIAVSGSIIDNAQSLELATRLAGQIARAATIFRIDE--------VVVFDSGRS 120

Query: 121 SMTGSDVAAANNSDEDESGAAFLIRILKYLETPQYLRKALFPKHNNLRFVGMLPPLDAPH 180
           S TGS++AAANNSDEDESGAAFLIRILKYLETPQYLRKALFPKHNNLRFVGMLPPLDAPH
Sbjct: 121 STTGSEIAAANNSDEDESGAAFLIRILKYLETPQYLRKALFPKHNNLRFVGMLPPLDAPH 180

Query: 181 HLRKHEWGPYREGVTLKERAPDARGTLVDVGLSKNVVVDEILEPGRRVTVAMGTDRNLLA 240
           HLRKHEWGPYREGVTLKERAPDA+GT VDVGLSKNVVVDEILEPG RVTVAMGTDRNL +
Sbjct: 181 HLRKHEWGPYREGVTLKERAPDAKGTSVDVGLSKNVVVDEILEPGTRVTVAMGTDRNLFS 240

Query: 241 DLPRQVVSSSKPVEEELYWGYRVRYASSLSAVFKESSYEGGYDHLIGTSEHGTIIKSSEL 300
           DLPRQVVSSSKPVEE LYWGYRVRYASSLSAVFKESSYEGGYDHLIGTSEHG +IKSSEL
Sbjct: 241 DLPRQVVSSSKPVEEGLYWGYRVRYASSLSAVFKESSYEGGYDHLIGTSEHGMVIKSSEL 300

Query: 301 ILPSFRHLLIAFGGLAGLEESIEEDNNFKSKNACEIFSSYLNTCPLQGSRTIRTEEAILI 360
            LP FRHLLIAFGGLAGLEESIEEDNNFKSKNA EIFSSYLNTCPLQGSRTIRTEEAI I
Sbjct: 301 TLPCFRHLLIAFGGLAGLEESIEEDNNFKSKNAHEIFSSYLNTCPLQGSRTIRTEEAIFI 360

Query: 361 SLQYFQEPISRALQISA 378
           SLQYFQEPI++A+QI+A
Sbjct: 361 SLQYFQEPINKAIQIAA 366

BLAST of Cp4.1LG03g10910 vs. NCBI nr
Match: gi|147790065|emb|CAN75987.1| (hypothetical protein VITISV_012192 [Vitis vinifera])

HSP 1 Score: 485.3 bits (1248), Expect = 9.4e-134
Identity = 264/375 (70.40%), Positives = 301/375 (80.27%), Query Frame = 1

Query: 1   MGKKKHKRPESESEALDGGDFEARHETELINGCSSETKKKKKKRKKQNVENEEIEGEKRK 60
           MGKKK +R + E+E  +     A +ETELING  S +KKKK K  K   +  +I      
Sbjct: 1   MGKKKKRRSDFEAETENN---TAENETELING-DSRSKKKKNKTHKDKYQATDI------ 60

Query: 61  PISKPTVSMAVSGSIIDNAQSLELATRLAGQIARAATIFRIDEEDSFCMYQVVVFDSGRS 120
               PT+++AV GSII NAQSLELATRLAGQ+ARAATIFRIDE        VVVFD   +
Sbjct: 61  ----PTLTIAVPGSIIHNAQSLELATRLAGQVARAATIFRIDE--------VVVFDCKST 120

Query: 121 SMTGSDVAAANNSDEDESGAAFLIRILKYLETPQYLRKALFPKHNNLRFVGMLPPLDAPH 180
           S   S VA  + SDE+E+GAAFLIRIL+YLETPQYLRK LFPKHN+L+FVGMLPPLDAPH
Sbjct: 121 SGDNSTVATPDASDENETGAAFLIRILRYLETPQYLRKILFPKHNSLKFVGMLPPLDAPH 180

Query: 181 HLRKHEWGPYREGVTLKERAPDARGTLVDVGLSKNVVVDEILEPGRRVTVAMGTDRNLLA 240
           HLRKHEWGPYREGVTLKERAP + GTLVDVGL+KNVV+D++LEPG RVTVAMGT+RNL A
Sbjct: 181 HLRKHEWGPYREGVTLKERAPSSVGTLVDVGLNKNVVIDQVLEPGIRVTVAMGTNRNLDA 240

Query: 241 DLPRQVVSSSKPVEE-ELYWGYRVRYASSLSAVFKESSYEGGYDHLIGTSEHGTIIKSSE 300
           D   QVVSSSKP EE   YWGY+VRYAS++S+VFKE  ++GGYDHLIGTSEHG I+KSSE
Sbjct: 241 DFVHQVVSSSKPREEVGTYWGYKVRYASNISSVFKECPFKGGYDHLIGTSEHGLIVKSSE 300

Query: 301 LILPSFRHLLIAFGGLAGLEESIEEDNNFKSKNACEIFSSYLNTCPLQGSRTIRTEEAIL 360
           L +PSFRHLLIAFGGLAGLEES+EED++ K KN  EIF SYLNTCP QGSRTIRTEEAIL
Sbjct: 301 LDIPSFRHLLIAFGGLAGLEESVEEDHSLKGKNVREIFDSYLNTCPNQGSRTIRTEEAIL 353

Query: 361 ISLQYFQEPISRALQ 375
           ISLQYFQEPI+RALQ
Sbjct: 361 ISLQYFQEPINRALQ 353

BLAST of Cp4.1LG03g10910 vs. NCBI nr
Match: gi|225432580|ref|XP_002277845.1| (PREDICTED: putative methyltransferase C9orf114 [Vitis vinifera])

HSP 1 Score: 484.6 bits (1246), Expect = 1.6e-133
Identity = 264/375 (70.40%), Positives = 300/375 (80.00%), Query Frame = 1

Query: 1   MGKKKHKRPESESEALDGGDFEARHETELINGCSSETKKKKKKRKKQNVENEEIEGEKRK 60
           MGKKK +R + E+E  +     A +ETELING  S +KKKK K  K   +  +I      
Sbjct: 1   MGKKKKRRSDFEAETENN---TAENETELING-DSRSKKKKNKTHKDKYQATDI------ 60

Query: 61  PISKPTVSMAVSGSIIDNAQSLELATRLAGQIARAATIFRIDEEDSFCMYQVVVFDSGRS 120
               PT+++AV GSII NAQSLELATRLAGQIARAATIFRIDE        VVVFD   +
Sbjct: 61  ----PTLTIAVPGSIIHNAQSLELATRLAGQIARAATIFRIDE--------VVVFDCKST 120

Query: 121 SMTGSDVAAANNSDEDESGAAFLIRILKYLETPQYLRKALFPKHNNLRFVGMLPPLDAPH 180
           S   S VA  + SDE+E+G AFLIRIL+YLETPQYLRK LFPKHN+L+FVGMLPP+DAPH
Sbjct: 121 SGDDSTVATPDASDENETGPAFLIRILRYLETPQYLRKTLFPKHNSLKFVGMLPPVDAPH 180

Query: 181 HLRKHEWGPYREGVTLKERAPDARGTLVDVGLSKNVVVDEILEPGRRVTVAMGTDRNLLA 240
           HLRKHEWGPYREGVTLKERAP + GTLVDVGL+KNVV+D++LEPG RVTVAMGT+RNL A
Sbjct: 181 HLRKHEWGPYREGVTLKERAPSSVGTLVDVGLNKNVVIDQVLEPGIRVTVAMGTNRNLDA 240

Query: 241 DLPRQVVSSSKPVEE-ELYWGYRVRYASSLSAVFKESSYEGGYDHLIGTSEHGTIIKSSE 300
           D   QVVSSSKP EE   YWGY+VRYAS++S+VFKE  ++GGYDHLIGTSEHG I+KSSE
Sbjct: 241 DFVHQVVSSSKPREEVGTYWGYKVRYASNISSVFKECPFKGGYDHLIGTSEHGLIVKSSE 300

Query: 301 LILPSFRHLLIAFGGLAGLEESIEEDNNFKSKNACEIFSSYLNTCPLQGSRTIRTEEAIL 360
           L +PSFRHLLIAFGGLAGLEES+EEDN+ K KN  EIF SYLNTCP QGSRTIRTEEAIL
Sbjct: 301 LDIPSFRHLLIAFGGLAGLEESVEEDNSLKGKNVREIFDSYLNTCPNQGSRTIRTEEAIL 353

Query: 361 ISLQYFQEPISRALQ 375
           ISLQYFQEPI+RALQ
Sbjct: 361 ISLQYFQEPINRALQ 353

BLAST of Cp4.1LG03g10910 vs. NCBI nr
Match: gi|743785814|ref|XP_011026090.1| (PREDICTED: putative methyltransferase C9orf114 isoform X1 [Populus euphratica])

HSP 1 Score: 476.9 bits (1226), Expect = 3.4e-131
Identity = 264/373 (70.78%), Positives = 302/373 (80.97%), Query Frame = 1

Query: 4   KKHKRPESESEALDGGDFEARHETELINGCSSETKKKKKKRKKQNVENEEIEGEKRKPIS 63
           KK K+ E+E++A    D  A +E EL NG  S  KKKKKK K++NV ++E+   K K I 
Sbjct: 3   KKQKKAEAETDAERVEDDRAENELELTNG-DSHKKKKKKKNKERNVSDKEVI--KAKEI- 62

Query: 64  KPTVSMAVSGSIIDNAQSLELATRLAGQIARAATIFRIDEEDSFCMYQVVVFDSGRSSMT 123
            PTVS+A+SGSII+NAQSLELATRLAGQIARAATIFRIDE        VVVFD+ +SS  
Sbjct: 63  -PTVSVAISGSIINNAQSLELATRLAGQIARAATIFRIDE--------VVVFDN-KSSYE 122

Query: 124 GSD--VAAANNSDEDESGAAFLIRILKYLETPQYLRKALFPKHNNLRFVGMLPPLDAPHH 183
             D  +   N SDE+ESGAAF +RIL+YLETPQYLRKALFPKH NLRFVGMLPPLDAPHH
Sbjct: 123 KEDRTLTTDNYSDENESGAAFFVRILRYLETPQYLRKALFPKHCNLRFVGMLPPLDAPHH 182

Query: 184 LRKHEWGPYREGVTLKERAPDARGTLVDVGLSKNVVVDEILEPGRRVTVAMGTDRNLLAD 243
           LRKHEW P+REGVTL E+ P++  TLVDVGLSKNV ++++LEPG RVTVAMGT+RNL +D
Sbjct: 183 LRKHEWAPFREGVTLNEKVPNSGETLVDVGLSKNVSINQVLEPGIRVTVAMGTNRNLDSD 242

Query: 244 LPRQVVSSSKPVEEE-LYWGYRVRYASSLSAVFKESSYEGGYDHLIGTSEHGTIIKSSEL 303
            PRQVVS  KP EE  LYWGYRVRYAS++S+VFK+  Y+GGYDHLIGTSEHG II SSEL
Sbjct: 243 SPRQVVSLLKPREEAGLYWGYRVRYASNISSVFKDCPYKGGYDHLIGTSEHGLIINSSEL 302

Query: 304 ILPSFRHLLIAFGGLAGLEESIEEDNNFKSKNACEIFSSYLNTCPLQGSRTIRTEEAILI 363
            LP+FRHLLIAFGGLAGLEESIEED+N K KN  E+F SYLNTCP QGSRTIRTEEAI I
Sbjct: 303 SLPAFRHLLIAFGGLAGLEESIEEDSNLKGKNVREVFDSYLNTCPHQGSRTIRTEEAIFI 361

Query: 364 SLQYFQEPISRAL 374
           SLQYFQEPI+RAL
Sbjct: 363 SLQYFQEPINRAL 361

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
CI114_MOUSE	3.4e-67	45.48	Putative methyltransferase C9orf114 homolog OS=Mus musculus GN=D2Wsu81e PE=1 SV=...
CI114_HUMAN	1.1e-65	42.22	Putative methyltransferase C9orf114 OS=Homo sapiens GN=C9orf114 PE=1 SV=3
YMP6_CAEEL	1.1e-41	36.60	Putative methyltransferase B0361.6 OS=Caenorhabditis elegans GN=B0361.6 PE=3 SV=...
Y1688_HALSA	2.3e-15	27.86	Uncharacterized protein VNG_1688C OS=Halobacterium salinarum (strain ATCC 700922...
Y1612_HALMA	2.2e-13	28.13	Uncharacterized protein rrnAC1612 OS=Haloarcula marismortui (strain ATCC 43049 /...
Match Name	E-value	Identity (%)	Description
A0A0A0LAE4_CUCSA	9.4e-181	89.12	Uncharacterized protein OS=Cucumis sativus GN=Csa_3G149370 PE=4 SV=1
A5ASC7_VITVI	6.6e-134	70.40	Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_012192 PE=4 SV=1
D7T6U1_VITVI	1.1e-133	70.40	Putative uncharacterized protein OS=Vitis vinifera GN=VIT_05s0020g02950 PE=4 SV=...
B9RTY3_RICCO	8.9e-131	68.42	Protein C9orf114, putative OS=Ricinus communis GN=RCOM_0913800 PE=4 SV=1
A0A067G5A9_CITSI	1.5e-130	67.89	Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g017050mg PE=4 SV=1
Match Name	E-value	Identity (%)	Description
AT5G19300.1	9.4e-124	64.23	Nucleic acid-binding, OB-fold-like (InterPro:IPR016027), Protein of ...
Match Name	E-value	Identity (%)	Description
gi|778678539|ref|XP_004134144.2|	1.3e-180	89.12	PREDICTED: putative methyltransferase C9orf114 [Cucumis sativus]
gi|659076473|ref|XP_008438700.1|	5.1e-180	88.86	PREDICTED: uncharacterized protein C9orf114 [Cucumis melo]
gi|147790065|emb|CAN75987.1|	9.4e-134	70.40	hypothetical protein VITISV_012192 [Vitis vinifera]
gi|225432580|ref|XP_002277845.1|	1.6e-133	70.40	PREDICTED: putative methyltransferase C9orf114 [Vitis vinifera]
gi|743785814|ref|XP_011026090.1|	3.4e-131	70.78	PREDICTED: putative methyltransferase C9orf114 isoform X1 [Populus euphratica]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR012340	NA-bd_OB-fold
IPR003750	Put_MeTrfase
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0009561	megagametogenesis
biological_process	GO:0006996	organelle organization
biological_process	GO:0009560	embryo sac egg cell differentiation
biological_process	GO:0000741	karyogamy
biological_process	GO:0006626	protein targeting to mitochondrion
biological_process	GO:0032259	methylation
biological_process	GO:0008150	biological_process
cellular_component	GO:0005575	cellular_component
cellular_component	GO:0005622	intracellular
molecular_function	GO:0003674	molecular_function
molecular_function	GO:0008168	methyltransferase activity

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cp4.1LG03g10910.1	Cp4.1LG03g10910.1	mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR003750	Putative RNA methyltransferase	PANTHER	PTHR12150	UNCHARACTERIZED	coord: 38..378; score: 6.8E
IPR003750	Putative RNA methyltransferase	PFAM	PF02598	Methyltrn_RNA_3	coord: 66..364; score: 2.7
IPR012340	Nucleic acid-binding, OB-fold	unknown	SSF50249	Nucleic acid-binding proteins	coord: 179..260; score: 1.3
None	No IPR available	unknown	Coil	Coil	coord: 35..55; scor

The following gene(s) are paralogous to this gene:
Gene	Paralogue	Organism	Block
Cp4.1LG03g10910	Cp4.1LG08g04320	Cucurbita pepo (Zucchini)	cpecpeB482