Cp4.1LG03g11810 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG03g11810
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionPlant protein of unknown function (DUF247)
LocationCp4.1LG03: 10806003 .. 10813848 (+)
RNA-Seq ExpressionCp4.1LG03g11810
SyntenyCp4.1LG03g11810
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGACTCGGTGATTCTCTCGATGTCTTCTCGGGGCCGACATTTCAATTTGGACGATGCCGGCGGAAACTCAGTCATGGAGTTAGATGGCGTCAATGGCTTTAGGAAGAAGAAGATGACGATCCGCTTCATAAATTTATCGATTACGCGAGGCCTTTGCTATTATTTGAAGATGAAGAAAACTTCGATCCTAATGTTAATGGAACGGAGACCAATGCGGCCGGTTGGAGTTTGATCGCCTTCCGGTTCCTCAGAACTTGTATCGCCTACTTGAGTTCTGTTACCGCTGCGATTTTGCTATCTGAGCTCTCGCAGGTACCGCGAATTCGAACTCCTCTAAGGTTTCTTTCTTCGACTATAGATACTTTGAGCAAAAGTTTATGTTAATTGCCTGATTTCCAATCAAAGGAAGATGCAACAGTGAGGACTGCCTGAATTCTTGATGTTTCACAACTGGATTAAAGATTTTACCGTAATTGATTCCAGGAGTTCAATGGAGTCTCTTGGTTACAAGTTTGAATTTGTAGTAATACGCCGTCTGAGTTCCTTTTTAATATGTACACCCATTTATTACTAAGTGTTCATAACAACGAGAGGGTGGAATAAGGGCCCGTGTTTTATTCGAAATGAGGGATGAATACTTAATACTCTGTCATATCTTGGTTCTAGGCTTTTGAAATTGATGCGTCCTTAGAGTTGCTTGATCCAATTGTAGTCCTGTTGTCTTTTTAAGTATTAGGCAATGGTGGCCAACCTCGAGTGATCCAGATTGCACCCTGACCGCACATATTGGTTTTTAAGTATTAGTATCCTCTCAACTGTCCTTTGCGATTCGGTATCTCATCTGCTGAGAGGCTAGTCAACTGTCAATTATGAAATTATCCATGTGAGTAATGATTGGTTGCCACCCAAGCTTCATAATCTGAATTGGGAATAAAAAATTTAATGGCGATGAAGGCATTCATTTCTTCTCTAGGAGGAGAATGCATTCTGAGCTTCTTAATCTCTCCAGTCAAGTTTGTGTGATTAACGTATATAGGAAGAATAAAATTTTGCCAAAGCATAAAATTGTTTACACTGATGTAACCTGATTGACCCGATTTTCTAGTGGTGGGAGTTGGGATTAGCCATGAAAGCTCTGGTGACAAGAGATGTTCTGGGATGAAAGAAATTTAGCTGAAGTAAAACATGGTGCATAATGCAGCATCTTGTACAGCTGCATTTATGATGTTGGTGTTCTTTTGTTTTTCCCCTTCACTAAAATGGTCAATGCTTAAACTTTAGAGGCTGTAGTTTCACAAACGGATGGATTTTGGATATCCAATCATGAACTGACTGCCTTATCTTCAATATTTGCAGGATTGAATCTATTGGTCCACTGGAAATTTATGAGAAGATTAATGGCTTACGGATGATGCAAATCAGTCTTGTTGATAATGATGATTTCAATTTAAAGTTTATCGTGGGGTGAACAGGGGTTACTAGCCAATACTATAAGGTGAATCTTTGAGTTTGATGTTAACTATTTCAAGAATATTCTTGCATGATTATATCAACGTAACAGTTCTCATTTGAACAGTGTTGGTTTACTTGCACTTGATAGACCATAAATTGCAACTGTAATTGAGAATGGCATTGGAACAAGTGATGATCTTTGTCTTGAATATGTTAGTGCAACACAGTTATATTTGGTGCCTTGCATTCATCATAAGGAGCAAGGGAGTTGTAGAACCCCACCTCTCACCACACCATGCACACACAAAATTAAGAGACTTCTCTGTTTCTTGGATCCCGTCAATATGCAAATATTCCACAGCACAGGTCTCTGTTTTAGCACAGAATATAAACCAAGCTTCAAGAACGCTTGGCACGTCGTATCCTACACTTAGTACGTCTTTATGAAGATAAATCTTAATAGCCATCTGTTTGTGAAGTTTACAGTCTTTTGTGGTCGACCTTCAAGACAAGATGACTGGCATTAGCTTATATGGTTAGGTTTTATGAGTCTTGACATTTATATAGCTCTTCAAAGAACAAAAACTTCTCATCTCCTATACTTCCACTCATTTTTGTTCTTTGTGTCTTCTCTTGTAAAGATCGTTAGGAAGGGTAAGCGTTGGATATACAGTATATAGTGTCCTGACATGCACCATAAACAAGAATCGGTGAGCTAATTCAAACTATTTACTCCGTTATCTAATAAATAAATTGGAAGTCTTGGACAATTTGAAATGCTTATATACATAAAATAATAACTTGTCGTTGATTTGAGGATTTTCTCCTTTTAGATATTTTGGCGATTTCTTCATGGCATTTCTAATTTGATTTGAGTCTTTTGCTTGTGAAGCTTGGAGGCTTTATGGATTGAGAATCATGTTGGAGCTTCTTTTGTCAACTTTAGCTGCTTGCCAACATTGTTTTCTCCTTGTCATCTTAAACTTTCACGACTTTCTGATCTTACGAAGCAATACTCATGGTACAAAGGTGAAGCTTTTTGTTTTTCCTTCTCGTTCTTTCTTTGTCTATAGTATGTTATGTGTGATGAGGAACTTTTCAAAATGGAAAGGTTTCAGATTTCTTTCCATTTACCAGGACCACTAGAGAGCTCTAATCGCTCATCCCCAATAGGCTACTTCGCTGAAAGGATTTATGGGAGCCGTACATCAATATCCTTTGGAGATCAAGTGAGGCAAAGCTCCGTACCTTTGTTGCTTAGAAGCAGCAAAGTAGGGATTCCTCATTCTTATAACGTAATATATAAACAACAAACGTTAGGAGAGAGACAAATGACCAATGAAGTTTCTCTTTTTGCAACCTTTTTTGGGAATTTAGGGAAAATTAGGCTTGATAACGATTTTGTTACTGATTTCTGTTTTTGAAAATTATGTTTGGTTCCTCTTAATTTCTTTATTATGGATTTAGAAACATAATAGTGAATTGAAGGAGGAAGATCGGCAGATGTGGAAATAATGACTATTCTGTTAATGATCCACTTTCATGGGAGATTACTTTGCACTCAAGTGCAAGTGTGAATAATGCTTCATTCAGGAGGAACTATCTTTCTAACAGTGGCTGCTGCAAGACGGCAAAAGGTATTGTAAAGACCTACAGGTATTATTCATTCTCCATTTCAATGTCTTGAACTGCAATACCCAATTAAATTCATAACACAGGCATTTTAATGATATTTTTGTCCTTTTTTTTTTCATATTTTATGATGCATTATAATCATTTATCTAAAGATTTGGACGTTGAAATATTCTCGCCCTTTGTGGTTAGAAATTTAATGGAAACAAAAGTGTAAGATCCTAGTTATTATAAAGGTTGAGTGTAGACTAAAGAAATATTCTTTAATCCATTACACATTTTTGGTTATGTAGACCTTTCAAATAGGTTCATGACCCCTATACTTGACCTTTTTGATAGCATTAATTCTAAGTCCAAGGTCTAGTTTTGAGCCAAAAAAGTATAGATAGCGTGACACAAATTAATAAATAATAGTCTCTCAATTTGAACTAGAATCTAAAACCAAACATTTTAACGTTTCATGTTCTGATCAAGCTTTTAAATATCGATATTGCTAGAAATGTTCAAATCATTTTGATTCTTTTATGATTAATATTTGATCTAAACTAGCGATATTTTACTGTATGTCATTCATTTTTGTTTCGATTTAAGATTTGTATTAGTCTCAAATATAATGTTTTAACATTGTTAAAAGGCAAGTAAGATAGGAGTCTTGTGACTTTTAGAATTGCACCACCAATTCTTGAATTATATTTGAAGCAATGAATTTGTTAATTCTGTTGCTCTTTAAGCAAAAGGTTAATATGCTTTCGGTGATTATTGTGAAGTTCATGTGTTTCATGTTCATTTCTACTGTGTCTGTTTATCCAATTCCCCAAGCCACCCCTTGATTCTATTTCTGCCTCCATAACCTTATATATTACACCTACATCCTTCTTAATTCATTTCAAAATCACCTGTTTTTCTTTCTTTCTTTCTTTCCTTAATTATTTTTTCTGTCCATCTTCTTTGCAGGGCATCACCAGATTTTAAGCCCTCTTAAGGTATGTTCACCAATATTCCCTTTTTTGTTAGCAAATTATGTGTGAAATTTATGATTGCTTGTATAAATAGAAGATTAATATCACTTAAAAGTACCTATATTTATAGCTTAGGATTTTCAAGTCCTAATTTTTTACACAACTAAAACTATTTAATAGGTGTTACATTTTAGGAAATTTATTTTGTGTAGTATAGGTTGAAATTATGATTAAAAAATATTAAAAGTACTAAAATAGGTCTAATGTGTTGATGATGAAAGTCCTACATCCCTAATTTAGGGAATGATCATGAGTTTATGATCAAAGAATACTTTCTCCTTGGGCTAATTTAGGAATGATCGTGGGTTTATAATTAAATAATACTCTCTTAATTGGAATGAGACCGTTTGGGAAGCCCAAAGCAAAGCCATGAGAGCTTATGTTCAAAGTGGGTAATATCATATGATTGTAGAGAGTCTATGAGACCTTCTGGGAAGTCTAAAGCAAAGCCATGAGAGCTTATGTATACGATTGTAGAGAGTCTATGAGATCTTCTGTGAAGCCCAAAGCAAAGCTATGAGAGCTTATGTATACAATTGTAGAGAGTCTATGAGACCTTCTGAGAAGCCCAAAGCAAAGCCACGAGAGCTTATGTTCAAAGTGGGCAATATCATACGATTGTAGAGAGTCGTGTTCGTGTAACATAATGGAAGTTTTGAAAATTTAAAGACAAATAAATCAAAGTTCAAAGCACACCAACCAAAATATTATTTAAACCTAATTTTATTCATTTGGGAAGGTAAACTGATATTTTTAAAAAAAATAAAATTATGTTAAATTGTATAGTAAATTGATATTCTTCCATTTTTACTTTTTCAATGGTCTTTCAACCTTCTGTGTTTAGTGAATTTTGTTTATGTATATAGTTGAATTTGTATATTAGAGATTGAAATCTCTAACTTTTCTTTTTATCGACTTAGTTTAAATTAAAGTTGAAATTAATATTTAATACTAAATCTTTAACAAGAAATCTCATACCAATATAAATGAGTTTAAATTACAAATAACATATTAAATGAAATTATTAAATAGATTAAAGAATAAATTATTAAATTAATCAAAACAAAAATGTTATATAAATTTAAAATATCAAGTAAATTACACTTTTGACAACTTTGATCCTCGTGTTAAATTAGTTCTTATGTTTTAAATTTTGAATATCAAATCCCTAATTTAAAATGTTAATTACTTGACATTTTCTAAACAAATAATCATAATAATAAAATAATAATAATAAATCTTTCTCTTCTCTCACCCTTCCCTTGCTACTCGGAGAAAAGCCTTCTACCAATGTAGATAGTATTCTTAAATTATAAATCCATCGTCATATTTCTGAATTATAAACTCATGATCATTCCTTAAATTAACCGACATGGACGTTCTCCAAGAGATTTCGTACCAATGAAGATAGTATTTTTGGATTATAAATCCATGATAATTTTCTAAATTAATCGATGTGAACTTTCATCGTCCAACAGCTCCTCTTTCAACCTCCCCCTTTGGTTAACTATTTGGTGCCGATGATGCACCCACCCCTCTTCCTCCCGCATGAGAAATCAATCTGGGAATTAAAAACAAGATAAAGAAGATGAATAAGAAGAAAGAAATGACGGAGGGGAGTAGCGCGGCTTTGCCGCAGGAGGGTCAGAGGAGGGAGAGGATGAAGGAGGACGGCGGTGGGGTGGCAATGGCAGCCATGAGAAGGGAAAGGGAGGAAGAAAGAAAGAAGTGTTGGATGATGAAAGTCCCAGGTCGACTAATTTAGAGAATAATTATGAATTTATATTCAAAGAATATTCTTTCTATTTGTGCAAGGCCTTTGGAAGCCTAAAGCCCAAAGCAAAGTTACAAAAAAGTGCCTAATCATCTATTAAGAAAGAAGAAGGAGAGAAGTTAAGTATAAAATAATAATAATAATTACTAAATCTAGAAAAAAGTTTTTTTTTTCTTATAGTAAAAAAGAGGTCCCACTGTTAAATGGTTGAATTTGAAATTTTAAATCTTAAATACTACCAAATATTTTAAAATACTTATAATTAATTTATTATTATTAATTAATTAACTTTTATCAATACTTTTGAATGTCATGATTAATTCCATAACTAAATGCAAGTTTAATTTATCTCCATAAAAAATCGATAATATTCTCAAACACAGCCATCCTTAATGTTAATATTTTGTTAAATATTTGTGTGTGTTCATGAATTCAGTAACAACATTATCAACAAAATATTAAAAATAACGAATTATTTTTATTGTTTGTATATTTCTATGATTTTTCATATTTAAAATCTTGATCTTCATCTATTTACTGTTTTTTTCTCAATGAAACTATTGTTCGTTGTTTGTGCTCACAATTTGAATGAATTTGATCAATAATATTTACCATTTACCTTGCCTCTGTTCCGGAAATAAAGAGCCTCTTTCTTGGCCGTCGCCGGGAAGATGGATTCATCAAGAGCATTGTCTCATTCGATTGATGTTCCGGCGACCTCACAAGGAAGCTCTGAGGAAGAATCTCTCCTATCTTCCATTGAAGGAAAATTGGAAGCCTTCTGTTCATCCATTACCATCTTCAGAGCTCCAAATGAAATCAGTATCGAAGATAGAAACGTCTTCGTCCCCGCCAAAGTCTCAATCGGTCCTTTCCACCACGGCGCGCCACATCTCGAATCAATGGAAAACCTCAAGTGGTGCTACTTGTCCGCTTTCTTGAAGAACAATCCGTCCGTCGATTTACAATATCTTGTTGAACTCGTTGTTAAATCTGAGAGCCGATTGAGAAAATGCTATGAGGAGGAGTTTTATGGTTTCGACAGTGATAAGTTTTCGCAGATTATGTTGCTTGATTGCTGCTTCATTCTCGAGCTGCTTTTGCGATTCTCGAAAAAGAGGCTCAGGCGACGGAATGACTCTGTTTTCACTACTCCTGGTTTGCTCTTCGATTTGAGATGCGACTTGATGTTGCTTGAAAATCAGATTCCCTACTTCCTTCTCAAAGACGTTTATGAAAATGTGCAAGATCCGACCGAGGAAAATATGTCTCTCAATGACCTAACCTTCCGATTCTTCAAAACTTTGGTTGTTGGAGATCGGCAATTAGTTTACGACAATTTCACGGTGGAAGCAGATCATCTACTCGAAATGGTTCACTCTTGTTTTCTCTCCACCTATCCTCGAGTGGAGACGAACGACAAATCGAAGTCGAGAGAATTACCTAGTGCGTCGAAGCTTAAAACTGCGGGAATCAAAATCAAGAACGCCAGATCTTCAAAGAGCTTATTAGACATCAAATTTCAGAACGGCGTCCTCGAAATTCCACCTCTCAAGGTGTATCAGAAGACAGAGGTGATTCTAAGGAATCTCGTAGCGTATGAGATCCATCAATCCGGAAGCGACCGGCAAGTGAAATCGTACATCAATTTCATGAGCCACCTTCTCCAGTCTGATCAAGACGTGAAGATTCTCTATAGAAGGAAAATCCTAATCGATCAGGAAGACGACGAGGAGCAGATTATTCGAAATCTGAAATGGATGAGCGAGAAGGAGAGCTTATCGGGAACGTACTTTGCCGGCATTGTTCAGAAATTAAACGAGAAGCCGGACCGATGCGTCGCACGGTGGCGGAAGTTGAGAAGGAATCCAGTGGCCATCGGCATCGTCGCCGTTTTGGTGGTGGTTGTGATCTTCGTCGCGGCCTTCTTCTCTGCATTTTCAGTACTTCAGCGTCGTTACAAATGA

mRNA sequence

ATGGACTCGGAAGAAGAAGATGACGATCCGCTTCATAAATTTATCGATTACGCGAGGCCTTTGCTATTATTTGAAGATGAAGAAAACTTCGATCCTAATGTTAATGGAACGGAGACCAATGCGGCCGGTTGGAGTTTGATCGCCTTCCGTTTATCGTGGGGTGAACAGGGGTTACTAGCCAATACTATAAGGGTAAGCGTTGGATATACAGTATATAGTGTCCTGACATGCACCATAAACAAGAATCGCTTGGAGGCTTTATGGATTGAGAATCATGTTGGAGCTTCTTTTGTCAACTTTAGCTGCTTGCCAACATTGTTTTCTCCTTGTCATCTTAAACTTTCACGACTTTCTGATCTTACGAAGCAATACTCATGGTACAAAGGACCACTAGAGAGCTCTAATCGCTCATCCCCAATAGGCTACTTCGCTGAAAGGATTTATGGGAGCCGTACATCAATATCCTTTGGAGATCAAGTGAGGCAAAGCTCCGTACCTTTGTTGCTTAGAAGCAGCAAAGTAGGGATTCCTCATTCTTATAACATTACTTTGCACTCAAGTGCAAGTGTGAATAATGCTTCATTCAGGAGGAACTATCTTTCTAACAGTGGCTGCTGCAAGACGGCAAAAGGGCATCACCAGATTTTAAGCCCTCTTAAGATGGATTCATCAAGAGCATTGTCTCATTCGATTGATGTTCCGGCGACCTCACAAGGAAGCTCTGAGGAAGAATCTCTCCTATCTTCCATTGAAGGAAAATTGGAAGCCTTCTGTTCATCCATTACCATCTTCAGAGCTCCAAATGAAATCAGTATCGAAGATAGAAACGTCTTCGTCCCCGCCAAAGTCTCAATCGGTCCTTTCCACCACGGCGCGCCACATCTCGAATCAATGGAAAACCTCAAGTGGTGCTACTTGTCCGCTTTCTTGAAGAACAATCCGTCCGTCGATTTACAATATCTTGTTGAACTCGTTGTTAAATCTGAGAGCCGATTGAGAAAATGCTATGAGGAGGAGTTTTATGGTTTCGACAGTGATAAGTTTTCGCAGATTATGTTGCTTGATTGCTGCTTCATTCTCGAGCTGCTTTTGCGATTCTCGAAAAAGAGGCTCAGGCGACGGAATGACTCTGTTTTCACTACTCCTGGTTTGCTCTTCGATTTGAGATGCGACTTGATGTTGCTTGAAAATCAGATTCCCTACTTCCTTCTCAAAGACGTTTATGAAAATGTGCAAGATCCGACCGAGGAAAATATGTCTCTCAATGACCTAACCTTCCGATTCTTCAAAACTTTGGTTGTTGGAGATCGGCAATTAGTTTACGACAATTTCACGGTGGAAGCAGATCATCTACTCGAAATGGTTCACTCTTGTTTTCTCTCCACCTATCCTCGAGTGGAGACGAACGACAAATCGAAGTCGAGAGAATTACCTAGTGCGTCGAAGCTTAAAACTGCGGGAATCAAAATCAAGAACGCCAGATCTTCAAAGAGCTTATTAGACATCAAATTTCAGAACGGCGTCCTCGAAATTCCACCTCTCAAGGTGTATCAGAAGACAGAGGTGATTCTAAGGAATCTCGTAGCGTATGAGATCCATCAATCCGGAAGCGACCGGCAAGTGAAATCGTACATCAATTTCATGAGCCACCTTCTCCAGTCTGATCAAGACGTGAAGATTCTCTATAGAAGGAAAATCCTAATCGATCAGGAAGACGACGAGGAGCAGATTATTCGAAATCTGAAATGGATGAGCGAGAAGGAGAGCTTATCGGGAACGTACTTTGCCGGCATTGTTCAGAAATTAAACGAGAAGCCGGACCGATGCGTCGCACGGTGGCGGAAGTTGAGAAGGAATCCAGTGGCCATCGGCATCGTCGCCGTTTTGGTGGTGGTTGTGATCTTCGTCGCGGCCTTCTTCTCTGCATTTTCAGTACTTCAGCGTCGTTACAAATGA

Coding sequence (CDS)

ATGGACTCGGAAGAAGAAGATGACGATCCGCTTCATAAATTTATCGATTACGCGAGGCCTTTGCTATTATTTGAAGATGAAGAAAACTTCGATCCTAATGTTAATGGAACGGAGACCAATGCGGCCGGTTGGAGTTTGATCGCCTTCCGTTTATCGTGGGGTGAACAGGGGTTACTAGCCAATACTATAAGGGTAAGCGTTGGATATACAGTATATAGTGTCCTGACATGCACCATAAACAAGAATCGCTTGGAGGCTTTATGGATTGAGAATCATGTTGGAGCTTCTTTTGTCAACTTTAGCTGCTTGCCAACATTGTTTTCTCCTTGTCATCTTAAACTTTCACGACTTTCTGATCTTACGAAGCAATACTCATGGTACAAAGGACCACTAGAGAGCTCTAATCGCTCATCCCCAATAGGCTACTTCGCTGAAAGGATTTATGGGAGCCGTACATCAATATCCTTTGGAGATCAAGTGAGGCAAAGCTCCGTACCTTTGTTGCTTAGAAGCAGCAAAGTAGGGATTCCTCATTCTTATAACATTACTTTGCACTCAAGTGCAAGTGTGAATAATGCTTCATTCAGGAGGAACTATCTTTCTAACAGTGGCTGCTGCAAGACGGCAAAAGGGCATCACCAGATTTTAAGCCCTCTTAAGATGGATTCATCAAGAGCATTGTCTCATTCGATTGATGTTCCGGCGACCTCACAAGGAAGCTCTGAGGAAGAATCTCTCCTATCTTCCATTGAAGGAAAATTGGAAGCCTTCTGTTCATCCATTACCATCTTCAGAGCTCCAAATGAAATCAGTATCGAAGATAGAAACGTCTTCGTCCCCGCCAAAGTCTCAATCGGTCCTTTCCACCACGGCGCGCCACATCTCGAATCAATGGAAAACCTCAAGTGGTGCTACTTGTCCGCTTTCTTGAAGAACAATCCGTCCGTCGATTTACAATATCTTGTTGAACTCGTTGTTAAATCTGAGAGCCGATTGAGAAAATGCTATGAGGAGGAGTTTTATGGTTTCGACAGTGATAAGTTTTCGCAGATTATGTTGCTTGATTGCTGCTTCATTCTCGAGCTGCTTTTGCGATTCTCGAAAAAGAGGCTCAGGCGACGGAATGACTCTGTTTTCACTACTCCTGGTTTGCTCTTCGATTTGAGATGCGACTTGATGTTGCTTGAAAATCAGATTCCCTACTTCCTTCTCAAAGACGTTTATGAAAATGTGCAAGATCCGACCGAGGAAAATATGTCTCTCAATGACCTAACCTTCCGATTCTTCAAAACTTTGGTTGTTGGAGATCGGCAATTAGTTTACGACAATTTCACGGTGGAAGCAGATCATCTACTCGAAATGGTTCACTCTTGTTTTCTCTCCACCTATCCTCGAGTGGAGACGAACGACAAATCGAAGTCGAGAGAATTACCTAGTGCGTCGAAGCTTAAAACTGCGGGAATCAAAATCAAGAACGCCAGATCTTCAAAGAGCTTATTAGACATCAAATTTCAGAACGGCGTCCTCGAAATTCCACCTCTCAAGGTGTATCAGAAGACAGAGGTGATTCTAAGGAATCTCGTAGCGTATGAGATCCATCAATCCGGAAGCGACCGGCAAGTGAAATCGTACATCAATTTCATGAGCCACCTTCTCCAGTCTGATCAAGACGTGAAGATTCTCTATAGAAGGAAAATCCTAATCGATCAGGAAGACGACGAGGAGCAGATTATTCGAAATCTGAAATGGATGAGCGAGAAGGAGAGCTTATCGGGAACGTACTTTGCCGGCATTGTTCAGAAATTAAACGAGAAGCCGGACCGATGCGTCGCACGGTGGCGGAAGTTGAGAAGGAATCCAGTGGCCATCGGCATCGTCGCCGTTTTGGTGGTGGTTGTGATCTTCGTCGCGGCCTTCTTCTCTGCATTTTCAGTACTTCAGCGTCGTTACAAATGA

Protein sequence

MDSEEEDDDPLHKFIDYARPLLLFEDEENFDPNVNGTETNAAGWSLIAFRLSWGEQGLLANTIRVSVGYTVYSVLTCTINKNRLEALWIENHVGASFVNFSCLPTLFSPCHLKLSRLSDLTKQYSWYKGPLESSNRSSPIGYFAERIYGSRTSISFGDQVRQSSVPLLLRSSKVGIPHSYNITLHSSASVNNASFRRNYLSNSGCCKTAKGHHQILSPLKMDSSRALSHSIDVPATSQGSSEEESLLSSIEGKLEAFCSSITIFRAPNEISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWCYLSAFLKNNPSVDLQYLVELVVKSESRLRKCYEEEFYGFDSDKFSQIMLLDCCFILELLLRFSKKRLRRRNDSVFTTPGLLFDLRCDLMLLENQIPYFLLKDVYENVQDPTEENMSLNDLTFRFFKTLVVGDRQLVYDNFTVEADHLLEMVHSCFLSTYPRVETNDKSKSRELPSASKLKTAGIKIKNARSSKSLLDIKFQNGVLEIPPLKVYQKTEVILRNLVAYEIHQSGSDRQVKSYINFMSHLLQSDQDVKILYRRKILIDQEDDEEQIIRNLKWMSEKESLSGTYFAGIVQKLNEKPDRCVARWRKLRRNPVAIGIVAVLVVVVIFVAAFFSAFSVLQRRYK
Homology
BLAST of Cp4.1LG03g11810 vs. ExPASy Swiss-Prot
Match: Q9SD53 (UPF0481 protein At3g47200 OS=Arabidopsis thaliana OX=3702 GN=At3g47200 PE=2 SV=1)

HSP 1 Score: 143.7 bits (361), Expect = 7.5e-33
Identity = 113/356 (31.74%), Postives = 175/356 (49.16%), Query Frame = 0

Query: 237 SQGSSEEESLLSSIEGKLEAFCSSITIFRAPNEISIEDRNVFVPAKVSIGPFHHGAPHLE 296
           S GS E   LL S  GK      S  IFR P      +   + P  VSIGP+H+G  HL+
Sbjct: 28  SSGSKEPVLLLES-AGK-----ESCCIFRVPESFVALNPKAYKPKVVSIGPYHYGEKHLQ 87

Query: 297 SMENLKWCYLSAFLKNNPSVDLQ--YLVELVVKSESRLRKCYEEEFYGFDSDKFSQIMLL 356
            ++  K   L  FL      D++   LV+ VV  E ++RK Y EE        F  +M+L
Sbjct: 88  MIQQHKPRLLQLFLDEAKKKDVEENVLVKAVVDLEDKIRKSYSEELKTGHDLMF--MMVL 147

Query: 357 DCCFILELLLRFSKKRLRRRNDSVFTTPGLLFDLRCDLMLLENQIPYFLLKDVYENVQDP 416
           D CFIL + L  S   +    D +F+ P LL  ++ DL+LLENQ+P+F+L+ +Y  V   
Sbjct: 148 DGCFILMVFLIMS-GNIELSEDPIFSIPWLLSSIQSDLLLLENQVPFFVLQTLY--VGSK 207

Query: 417 TEENMSLNDLTFRFFKTLVVGDRQLVYDNFTVEADHLLEMVHSCFL--------STYPRV 476
              +  LN + F FFK  +  +      +   +A HLL+++   FL        ++ P V
Sbjct: 208 IGVSSDLNRIAFHFFKNPIDKEGSYWEKHRNYKAKHLLDLIRETFLPNTSESDKASSPHV 267

Query: 477 ETN-DKSKSRELP-----------SASKLKTAGIKIKNARSSK-SLLDIKFQNGVLEIPP 536
           +    + KS  +P           SA +L+  GIK +  RS + S+L+++ +   L+IP 
Sbjct: 268 QVQLHEGKSGNVPSVDSKAVPLILSAKRLRLQGIKFRLRRSKEDSILNVRLKKNKLQIPQ 327

Query: 537 LKVYQKTEVILRNLVAYEIHQSGSDRQVKSYINFMSHLLQSDQDVKILYRRKILID 570
           L+          N VA+E   + S  ++ +YI FM  LL +++DV  L   K++I+
Sbjct: 328 LRFDGFISSFFLNCVAFEQFYTDSSNEITTYIVFMGCLLNNEEDVTFLRNDKLIIE 372

BLAST of Cp4.1LG03g11810 vs. ExPASy Swiss-Prot
Match: P0C897 (Putative UPF0481 protein At3g02645 OS=Arabidopsis thaliana OX=3702 GN=At3g02645 PE=3 SV=1)

HSP 1 Score: 83.6 bits (205), Expect = 9.2e-15
Identity = 109/486 (22.43%), Postives = 194/486 (39.92%), Query Frame = 0

Query: 260 SITIFRAPNEISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWCYLSAFLKNNPSVDLQ 319
           +++IF  P  +     + + P +VSIGP+H   P L  ME  K            S    
Sbjct: 42  TVSIFNVPKALMCSHPDSYTPHRVSIGPYHCLKPELHEMERYKLMIARKIRNQYNSFRFH 101

Query: 320 YLVELVVKSESRLRKCYEEEFYGFDSDKFSQIMLLDCCFILELLLRFSKKRLRRRNDSVF 379
            LVE +   E ++R CY  ++ GF+ +    IM +D  F++E L  +S +++    +++ 
Sbjct: 102 DLVEKLQSMEIKIRACY-HKYIGFNGETLLWIMAVDSSFLIEFLKIYSFRKV----ETLI 161

Query: 380 TTPGLLFDLRCDLMLLENQIPYFLLKDVYENVQDPTE--ENMSLNDLT--FRFFKTLVV- 439
              G    LR D+M++ENQIP F+L+   E   + TE  +++ L+ LT   +    LV+ 
Sbjct: 162 NRVGHNEILR-DIMMIENQIPLFVLRKTLEFQLESTESADDLLLSVLTGLCKDLSPLVIK 221

Query: 440 -GDRQLVYDNFTVEADHLLEMVHSCFLSTYPRVE-------------------------- 499
             D Q++   F  E +H+L+ ++   +   PR+E                          
Sbjct: 222 FDDDQILKAQFQ-ECNHILDFLYQMIV---PRIEEEELEEDDEENRADENGGNRAIRFMD 281

Query: 500 ----------------------------------------------TNDKSKSRE----- 559
                                                          N+ + +R+     
Sbjct: 282 EIKHQFKRVFASRPADLILRFPWRIISNLPGFMALKLSADYLFTRQENEATTTRQESVSI 341

Query: 560 -------------LPSASKLKTAGIKIK-NARSSKSLLDIKFQNGVLEIPPLKVYQKTEV 619
                        +PS S L  AG++ K  A  + S +     +G   +P + +   TE 
Sbjct: 342 LDIEKPPLVEELTIPSVSDLHKAGVRFKPTAHGNISTVTFDSNSGQFYLPVINLDINTET 401

Query: 620 ILRNLVAYEIHQSGSDRQVKSYINFMSHLLQSDQDVKILYRRKILIDQEDDEEQIIRNLK 647
           +LRNLVAYE   +        Y   ++ ++ S++DV++L  + +L+ +   +++      
Sbjct: 402 VLRNLVAYEATNTSGPLVFTRYTELINGIIDSEEDVRLLREQGVLVSRLKSDQEAAEMWN 461

BLAST of Cp4.1LG03g11810 vs. NCBI nr
Match: XP_023526431.1 (UPF0481 protein At3g47200-like [Cucurbita pepo subsp. pepo] >XP_023526432.1 UPF0481 protein At3g47200-like [Cucurbita pepo subsp. pepo] >XP_023526433.1 UPF0481 protein At3g47200-like [Cucurbita pepo subsp. pepo] >XP_023526435.1 UPF0481 protein At3g47200-like [Cucurbita pepo subsp. pepo] >XP_023526436.1 UPF0481 protein At3g47200-like [Cucurbita pepo subsp. pepo] >XP_023526437.1 UPF0481 protein At3g47200-like [Cucurbita pepo subsp. pepo] >XP_023526438.1 UPF0481 protein At3g47200-like [Cucurbita pepo subsp. pepo] >XP_023526439.1 UPF0481 protein At3g47200-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 835 bits (2157), Expect = 3.36e-301
Identity = 431/431 (100.00%), Postives = 431/431 (100.00%), Query Frame = 0

Query: 221 MDSSRALSHSIDVPATSQGSSEEESLLSSIEGKLEAFCSSITIFRAPNEISIEDRNVFVP 280
           MDSSRALSHSIDVPATSQGSSEEESLLSSIEGKLEAFCSSITIFRAPNEISIEDRNVFVP
Sbjct: 1   MDSSRALSHSIDVPATSQGSSEEESLLSSIEGKLEAFCSSITIFRAPNEISIEDRNVFVP 60

Query: 281 AKVSIGPFHHGAPHLESMENLKWCYLSAFLKNNPSVDLQYLVELVVKSESRLRKCYEEEF 340
           AKVSIGPFHHGAPHLESMENLKWCYLSAFLKNNPSVDLQYLVELVVKSESRLRKCYEEEF
Sbjct: 61  AKVSIGPFHHGAPHLESMENLKWCYLSAFLKNNPSVDLQYLVELVVKSESRLRKCYEEEF 120

Query: 341 YGFDSDKFSQIMLLDCCFILELLLRFSKKRLRRRNDSVFTTPGLLFDLRCDLMLLENQIP 400
           YGFDSDKFSQIMLLDCCFILELLLRFSKKRLRRRNDSVFTTPGLLFDLRCDLMLLENQIP
Sbjct: 121 YGFDSDKFSQIMLLDCCFILELLLRFSKKRLRRRNDSVFTTPGLLFDLRCDLMLLENQIP 180

Query: 401 YFLLKDVYENVQDPTEENMSLNDLTFRFFKTLVVGDRQLVYDNFTVEADHLLEMVHSCFL 460
           YFLLKDVYENVQDPTEENMSLNDLTFRFFKTLVVGDRQLVYDNFTVEADHLLEMVHSCFL
Sbjct: 181 YFLLKDVYENVQDPTEENMSLNDLTFRFFKTLVVGDRQLVYDNFTVEADHLLEMVHSCFL 240

Query: 461 STYPRVETNDKSKSRELPSASKLKTAGIKIKNARSSKSLLDIKFQNGVLEIPPLKVYQKT 520
           STYPRVETNDKSKSRELPSASKLKTAGIKIKNARSSKSLLDIKFQNGVLEIPPLKVYQKT
Sbjct: 241 STYPRVETNDKSKSRELPSASKLKTAGIKIKNARSSKSLLDIKFQNGVLEIPPLKVYQKT 300

Query: 521 EVILRNLVAYEIHQSGSDRQVKSYINFMSHLLQSDQDVKILYRRKILIDQEDDEEQIIRN 580
           EVILRNLVAYEIHQSGSDRQVKSYINFMSHLLQSDQDVKILYRRKILIDQEDDEEQIIRN
Sbjct: 301 EVILRNLVAYEIHQSGSDRQVKSYINFMSHLLQSDQDVKILYRRKILIDQEDDEEQIIRN 360

Query: 581 LKWMSEKESLSGTYFAGIVQKLNEKPDRCVARWRKLRRNPVAIGIVAVLVVVVIFVAAFF 640
           LKWMSEKESLSGTYFAGIVQKLNEKPDRCVARWRKLRRNPVAIGIVAVLVVVVIFVAAFF
Sbjct: 361 LKWMSEKESLSGTYFAGIVQKLNEKPDRCVARWRKLRRNPVAIGIVAVLVVVVIFVAAFF 420

Query: 641 SAFSVLQRRYK 651
           SAFSVLQRRYK
Sbjct: 421 SAFSVLQRRYK 431

BLAST of Cp4.1LG03g11810 vs. NCBI nr
Match: XP_022955709.1 (UPF0481 protein At3g47200-like [Cucurbita moschata] >XP_022955710.1 UPF0481 protein At3g47200-like [Cucurbita moschata] >XP_022955711.1 UPF0481 protein At3g47200-like [Cucurbita moschata] >XP_022955712.1 UPF0481 protein At3g47200-like [Cucurbita moschata] >XP_022955713.1 UPF0481 protein At3g47200-like [Cucurbita moschata] >XP_022955714.1 UPF0481 protein At3g47200-like [Cucurbita moschata] >XP_022955716.1 UPF0481 protein At3g47200-like [Cucurbita moschata])

HSP 1 Score: 822 bits (2123), Expect = 5.01e-296
Identity = 423/431 (98.14%), Postives = 427/431 (99.07%), Query Frame = 0

Query: 221 MDSSRALSHSIDVPATSQGSSEEESLLSSIEGKLEAFCSSITIFRAPNEISIEDRNVFVP 280
           MDSSRALSHSIDVPATSQGSSEEESLLSSIEGKLEAFCSSITIFRAPNEISIEDRNVFVP
Sbjct: 1   MDSSRALSHSIDVPATSQGSSEEESLLSSIEGKLEAFCSSITIFRAPNEISIEDRNVFVP 60

Query: 281 AKVSIGPFHHGAPHLESMENLKWCYLSAFLKNNPSVDLQYLVELVVKSESRLRKCYEEEF 340
           AKVSIGPFHHGAPHLESMENLKWCYLSAFLKNNPSVDLQYLVELVVKSESRLRKCYEEEF
Sbjct: 61  AKVSIGPFHHGAPHLESMENLKWCYLSAFLKNNPSVDLQYLVELVVKSESRLRKCYEEEF 120

Query: 341 YGFDSDKFSQIMLLDCCFILELLLRFSKKRLRRRNDSVFTTPGLLFDLRCDLMLLENQIP 400
           YGFDSDKFSQIMLLDCCFILELLLRFSKKRLRRRNDSVFTTPGLLFDLRCDLMLLENQIP
Sbjct: 121 YGFDSDKFSQIMLLDCCFILELLLRFSKKRLRRRNDSVFTTPGLLFDLRCDLMLLENQIP 180

Query: 401 YFLLKDVYENVQDPTEENMSLNDLTFRFFKTLVVGDRQLVYDNFTVEADHLLEMVHSCFL 460
           YFLLKDVYENVQDPTEENMSLNDLTFRFFKT+V GDRQLVYDNF VEADHLLEMVHSCFL
Sbjct: 181 YFLLKDVYENVQDPTEENMSLNDLTFRFFKTMVAGDRQLVYDNFMVEADHLLEMVHSCFL 240

Query: 461 STYPRVETNDKSKSRELPSASKLKTAGIKIKNARSSKSLLDIKFQNGVLEIPPLKVYQKT 520
           STYPRVETNDKSKS+ELP+ASKLKTAGIKIKNARSSKSLLDIKFQNGVLEIPPLKVYQKT
Sbjct: 241 STYPRVETNDKSKSKELPTASKLKTAGIKIKNARSSKSLLDIKFQNGVLEIPPLKVYQKT 300

Query: 521 EVILRNLVAYEIHQSGSDRQVKSYINFMSHLLQSDQDVKILYRRKILIDQEDDEEQIIRN 580
           EVILRNLVAYEIHQSGSDRQVKSYINFMSHLLQSDQDVKILYRRKIL DQEDDEEQIIRN
Sbjct: 301 EVILRNLVAYEIHQSGSDRQVKSYINFMSHLLQSDQDVKILYRRKILNDQEDDEEQIIRN 360

Query: 581 LKWMSEKESLSGTYFAGIVQKLNEKPDRCVARWRKLRRNPVAIGIVAVLVVVVIFVAAFF 640
           LKWMSE+ESLSGTYFAGIVQKLNEKPDRCVARWRKLRR PVAIGIVAVLVVVVIFVAAFF
Sbjct: 361 LKWMSERESLSGTYFAGIVQKLNEKPDRCVARWRKLRRTPVAIGIVAVLVVVVIFVAAFF 420

Query: 641 SAFSVLQRRYK 651
           SAFSVLQRRYK
Sbjct: 421 SAFSVLQRRYK 431

BLAST of Cp4.1LG03g11810 vs. NCBI nr
Match: KAG7018572.1 (UPF0481 protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 820 bits (2117), Expect = 4.10e-295
Identity = 422/431 (97.91%), Postives = 427/431 (99.07%), Query Frame = 0

Query: 221 MDSSRALSHSIDVPATSQGSSEEESLLSSIEGKLEAFCSSITIFRAPNEISIEDRNVFVP 280
           MDSSRALSHSIDVPATSQGSSEEESLLSSIEGKLEAFCSSITIFRAPNEISIEDRNVFVP
Sbjct: 1   MDSSRALSHSIDVPATSQGSSEEESLLSSIEGKLEAFCSSITIFRAPNEISIEDRNVFVP 60

Query: 281 AKVSIGPFHHGAPHLESMENLKWCYLSAFLKNNPSVDLQYLVELVVKSESRLRKCYEEEF 340
           +KVSIGPFHHGAPHLESMENLKWCYLSAFLKNNPSVDLQYLVELVVKSESRLRKCYEEEF
Sbjct: 61  SKVSIGPFHHGAPHLESMENLKWCYLSAFLKNNPSVDLQYLVELVVKSESRLRKCYEEEF 120

Query: 341 YGFDSDKFSQIMLLDCCFILELLLRFSKKRLRRRNDSVFTTPGLLFDLRCDLMLLENQIP 400
           YGFDSDKFSQIMLLDCCFILELLLRFSKKRLRRRNDSVFTTPGLLFDLRCDLMLLENQIP
Sbjct: 121 YGFDSDKFSQIMLLDCCFILELLLRFSKKRLRRRNDSVFTTPGLLFDLRCDLMLLENQIP 180

Query: 401 YFLLKDVYENVQDPTEENMSLNDLTFRFFKTLVVGDRQLVYDNFTVEADHLLEMVHSCFL 460
           YFLLKDVYENVQDPTEE MSLNDLTFRFFKT+V GDRQLVYDNF VEADHLLEMVHSCFL
Sbjct: 181 YFLLKDVYENVQDPTEEIMSLNDLTFRFFKTMVAGDRQLVYDNFMVEADHLLEMVHSCFL 240

Query: 461 STYPRVETNDKSKSRELPSASKLKTAGIKIKNARSSKSLLDIKFQNGVLEIPPLKVYQKT 520
           STYPRVETNDKSKS+ELP+ASKLKTAGIKIKNARSSKSLLDIKFQNGVLEIPPLKVYQKT
Sbjct: 241 STYPRVETNDKSKSKELPTASKLKTAGIKIKNARSSKSLLDIKFQNGVLEIPPLKVYQKT 300

Query: 521 EVILRNLVAYEIHQSGSDRQVKSYINFMSHLLQSDQDVKILYRRKILIDQEDDEEQIIRN 580
           EVILRNLVAYEIHQSGSDRQVKSYINFMSHLLQSDQDVKILYRRKIL DQEDDEEQIIRN
Sbjct: 301 EVILRNLVAYEIHQSGSDRQVKSYINFMSHLLQSDQDVKILYRRKILNDQEDDEEQIIRN 360

Query: 581 LKWMSEKESLSGTYFAGIVQKLNEKPDRCVARWRKLRRNPVAIGIVAVLVVVVIFVAAFF 640
           LKWMSE+ESLSGTYFAGIVQKLNEKPDRCVARWRKLRRNPVAIGIVAVLVVVVIFVAAFF
Sbjct: 361 LKWMSERESLSGTYFAGIVQKLNEKPDRCVARWRKLRRNPVAIGIVAVLVVVVIFVAAFF 420

Query: 641 SAFSVLQRRYK 651
           SAFSVLQRRYK
Sbjct: 421 SAFSVLQRRYK 431

BLAST of Cp4.1LG03g11810 vs. NCBI nr
Match: XP_022979569.1 (UPF0481 protein At3g47200-like [Cucurbita maxima])

HSP 1 Score: 727 bits (1876), Expect = 8.01e-259
Identity = 370/390 (94.87%), Postives = 379/390 (97.18%), Query Frame = 0

Query: 221 MDSSRALSHSIDVPATSQGSSEEESLLSSIEGKLEAFCSSITIFRAPNEISIEDRNVFVP 280
           MDSSRALSHSIDVPATSQGSSEEESLLSSIE KLEAFCSSITIFRA NEISIEDRNVFVP
Sbjct: 1   MDSSRALSHSIDVPATSQGSSEEESLLSSIERKLEAFCSSITIFRATNEISIEDRNVFVP 60

Query: 281 AKVSIGPFHHGAPHLESMENLKWCYLSAFLKNNPSVDLQYLVELVVKSESRLRKCYEEEF 340
           AKVSIGPFHHGAPHLESMENLKWCYLSAFLKNNPSVDLQYL+ELVVKSESRLRKCYEEEF
Sbjct: 61  AKVSIGPFHHGAPHLESMENLKWCYLSAFLKNNPSVDLQYLIELVVKSESRLRKCYEEEF 120

Query: 341 YGFDSDKFSQIMLLDCCFILELLLRFSKKRLRRRNDSVFTTPGLLFDLRCDLMLLENQIP 400
           YGFDS+KFSQIMLLDCCFILELLLR+SKKRLRRRNDSVFTTPGLLFDLRCDLMLLENQIP
Sbjct: 121 YGFDSNKFSQIMLLDCCFILELLLRYSKKRLRRRNDSVFTTPGLLFDLRCDLMLLENQIP 180

Query: 401 YFLLKDVYENVQDPTEENMSLNDLTFRFFKTLVVGDRQLVYDNFTVEADHLLEMVHSCFL 460
           YFLLKDVY NVQDPTEENMSLNDLTFRFFKT+V GDRQ VYDNF VEADHLLEM+HSCFL
Sbjct: 181 YFLLKDVYANVQDPTEENMSLNDLTFRFFKTMVAGDRQFVYDNFMVEADHLLEMIHSCFL 240

Query: 461 STYPRVETNDKSKSRELPSASKLKTAGIKIKNARSSKSLLDIKFQNGVLEIPPLKVYQKT 520
           STYPR+ETND SKSRELPSASKLKTAGIKIKN +SSKSLLDIKFQNGVLEIPPLKVYQKT
Sbjct: 241 STYPRMETNDNSKSRELPSASKLKTAGIKIKNFKSSKSLLDIKFQNGVLEIPPLKVYQKT 300

Query: 521 EVILRNLVAYEIHQSGSDRQVKSYINFMSHLLQSDQDVKILYRRKILIDQEDDEEQIIRN 580
           EVILRNLVAYEIHQSGSDRQVKSYINFMSHLLQSDQDVKILYRRKIL DQE+DEEQIIRN
Sbjct: 301 EVILRNLVAYEIHQSGSDRQVKSYINFMSHLLQSDQDVKILYRRKILNDQENDEEQIIRN 360

Query: 581 LKWMSEKESLSGTYFAGIVQKLNEKPDRCV 610
           LKWM EKESLSGTYFAGIVQKLN+K DRCV
Sbjct: 361 LKWMREKESLSGTYFAGIVQKLNKKRDRCV 390

BLAST of Cp4.1LG03g11810 vs. NCBI nr
Match: XP_038880915.1 (UPF0481 protein At3g47200-like [Benincasa hispida])

HSP 1 Score: 715 bits (1846), Expect = 6.86e-254
Identity = 368/432 (85.19%), Postives = 393/432 (90.97%), Query Frame = 0

Query: 221 MDSSRALSHSIDVPATSQGSSEEESLLSSIEGKLEAFCSSITIFRAPNEISIEDRNVFVP 280
           M+SS+  SHSID+ A +QGSS+EESLLSS+EGKLEAFCSSITIFRAPN+ISIED+NVFVP
Sbjct: 1   MESSKPFSHSIDISAIAQGSSQEESLLSSVEGKLEAFCSSITIFRAPNDISIEDKNVFVP 60

Query: 281 AKVSIGPFHHGAPHLESMENLKWCYLSAFLKNNPSVDLQYLVELVVKSESRLRKCYEEEF 340
           AKVSIGPFHHGAPHLE MENLKW YLS FLK+NPS+ L  L+ELVVKSESRLRKCYE EF
Sbjct: 61  AKVSIGPFHHGAPHLEPMENLKWRYLSTFLKHNPSLTLDDLIELVVKSESRLRKCYEGEF 120

Query: 341 YGFDSDKFSQIMLLDCCFILELLLRFSKKRLRRRNDSVFTTPGLLFDLRCDLMLLENQIP 400
           Y  DSDKFSQ+MLLDCCFILELLLR+SKKR RR ND VF TPGLLFDLRCDLMLLENQIP
Sbjct: 121 YDLDSDKFSQMMLLDCCFILELLLRYSKKRFRRWNDPVFNTPGLLFDLRCDLMLLENQIP 180

Query: 401 YFLLKDVYENVQDPTEENMSLNDLTFRFFKTLVVGDRQLVYDNFTVEADHLLEMVHSCFL 460
           YFLL +VYENVQDP EENMSLNDLTFRFFKT+V GDR+ VYDNF VEADHLLEMVHSCFL
Sbjct: 181 YFLLDEVYENVQDPLEENMSLNDLTFRFFKTMVAGDRKFVYDNFMVEADHLLEMVHSCFL 240

Query: 461 STYPRVETNDKSKSRELPSASKLKTAGIKIKNARSSKSLLDIKFQNGVLEIPPLKVYQKT 520
           STYPR+ETNDKSKSRELPSASKLKTAGIK KNARS KSLLDIKFQ GVLEIPPL+VYQ+T
Sbjct: 241 STYPRMETNDKSKSRELPSASKLKTAGIKFKNARSPKSLLDIKFQKGVLEIPPLRVYQQT 300

Query: 521 EVILRNLVAYEIHQSGSDRQVKSYINFMSHLLQSDQDVKILYRRKILIDQEDDEEQIIRN 580
           E ILRNL AYEI Q GSD QVKSYINFMSHLLQSD+DVKIL RRKILID EDDEEQII+N
Sbjct: 301 EAILRNLAAYEIRQFGSDLQVKSYINFMSHLLQSDEDVKILCRRKILIDLEDDEEQIIQN 360

Query: 581 LKWM-SEKESLSGTYFAGIVQKLNEKPDRCVARWRKLRRNPVAIGIVAVLVVVVIFVAAF 640
           LKWM  EKESLSGTYFAGIVQKLNEKPDRC+ +WR LRRNPVAIG+ AV VVVVIFVAAF
Sbjct: 361 LKWMREEKESLSGTYFAGIVQKLNEKPDRCLTQWRGLRRNPVAIGVAAVWVVVVIFVAAF 420

Query: 641 FSAFSVLQRRYK 651
           FSA S+LQRRYK
Sbjct: 421 FSAISLLQRRYK 432

BLAST of Cp4.1LG03g11810 vs. ExPASy TrEMBL
Match: A0A6J1GVU1 (UPF0481 protein At3g47200-like OS=Cucurbita moschata OX=3662 GN=LOC111457630 PE=4 SV=1)

HSP 1 Score: 822 bits (2123), Expect = 2.43e-296
Identity = 423/431 (98.14%), Postives = 427/431 (99.07%), Query Frame = 0

Query: 221 MDSSRALSHSIDVPATSQGSSEEESLLSSIEGKLEAFCSSITIFRAPNEISIEDRNVFVP 280
           MDSSRALSHSIDVPATSQGSSEEESLLSSIEGKLEAFCSSITIFRAPNEISIEDRNVFVP
Sbjct: 1   MDSSRALSHSIDVPATSQGSSEEESLLSSIEGKLEAFCSSITIFRAPNEISIEDRNVFVP 60

Query: 281 AKVSIGPFHHGAPHLESMENLKWCYLSAFLKNNPSVDLQYLVELVVKSESRLRKCYEEEF 340
           AKVSIGPFHHGAPHLESMENLKWCYLSAFLKNNPSVDLQYLVELVVKSESRLRKCYEEEF
Sbjct: 61  AKVSIGPFHHGAPHLESMENLKWCYLSAFLKNNPSVDLQYLVELVVKSESRLRKCYEEEF 120

Query: 341 YGFDSDKFSQIMLLDCCFILELLLRFSKKRLRRRNDSVFTTPGLLFDLRCDLMLLENQIP 400
           YGFDSDKFSQIMLLDCCFILELLLRFSKKRLRRRNDSVFTTPGLLFDLRCDLMLLENQIP
Sbjct: 121 YGFDSDKFSQIMLLDCCFILELLLRFSKKRLRRRNDSVFTTPGLLFDLRCDLMLLENQIP 180

Query: 401 YFLLKDVYENVQDPTEENMSLNDLTFRFFKTLVVGDRQLVYDNFTVEADHLLEMVHSCFL 460
           YFLLKDVYENVQDPTEENMSLNDLTFRFFKT+V GDRQLVYDNF VEADHLLEMVHSCFL
Sbjct: 181 YFLLKDVYENVQDPTEENMSLNDLTFRFFKTMVAGDRQLVYDNFMVEADHLLEMVHSCFL 240

Query: 461 STYPRVETNDKSKSRELPSASKLKTAGIKIKNARSSKSLLDIKFQNGVLEIPPLKVYQKT 520
           STYPRVETNDKSKS+ELP+ASKLKTAGIKIKNARSSKSLLDIKFQNGVLEIPPLKVYQKT
Sbjct: 241 STYPRVETNDKSKSKELPTASKLKTAGIKIKNARSSKSLLDIKFQNGVLEIPPLKVYQKT 300

Query: 521 EVILRNLVAYEIHQSGSDRQVKSYINFMSHLLQSDQDVKILYRRKILIDQEDDEEQIIRN 580
           EVILRNLVAYEIHQSGSDRQVKSYINFMSHLLQSDQDVKILYRRKIL DQEDDEEQIIRN
Sbjct: 301 EVILRNLVAYEIHQSGSDRQVKSYINFMSHLLQSDQDVKILYRRKILNDQEDDEEQIIRN 360

Query: 581 LKWMSEKESLSGTYFAGIVQKLNEKPDRCVARWRKLRRNPVAIGIVAVLVVVVIFVAAFF 640
           LKWMSE+ESLSGTYFAGIVQKLNEKPDRCVARWRKLRR PVAIGIVAVLVVVVIFVAAFF
Sbjct: 361 LKWMSERESLSGTYFAGIVQKLNEKPDRCVARWRKLRRTPVAIGIVAVLVVVVIFVAAFF 420

Query: 641 SAFSVLQRRYK 651
           SAFSVLQRRYK
Sbjct: 421 SAFSVLQRRYK 431

BLAST of Cp4.1LG03g11810 vs. ExPASy TrEMBL
Match: A0A6J1IWZ4 (UPF0481 protein At3g47200-like OS=Cucurbita maxima OX=3661 GN=LOC111479248 PE=4 SV=1)

HSP 1 Score: 727 bits (1876), Expect = 3.88e-259
Identity = 370/390 (94.87%), Postives = 379/390 (97.18%), Query Frame = 0

Query: 221 MDSSRALSHSIDVPATSQGSSEEESLLSSIEGKLEAFCSSITIFRAPNEISIEDRNVFVP 280
           MDSSRALSHSIDVPATSQGSSEEESLLSSIE KLEAFCSSITIFRA NEISIEDRNVFVP
Sbjct: 1   MDSSRALSHSIDVPATSQGSSEEESLLSSIERKLEAFCSSITIFRATNEISIEDRNVFVP 60

Query: 281 AKVSIGPFHHGAPHLESMENLKWCYLSAFLKNNPSVDLQYLVELVVKSESRLRKCYEEEF 340
           AKVSIGPFHHGAPHLESMENLKWCYLSAFLKNNPSVDLQYL+ELVVKSESRLRKCYEEEF
Sbjct: 61  AKVSIGPFHHGAPHLESMENLKWCYLSAFLKNNPSVDLQYLIELVVKSESRLRKCYEEEF 120

Query: 341 YGFDSDKFSQIMLLDCCFILELLLRFSKKRLRRRNDSVFTTPGLLFDLRCDLMLLENQIP 400
           YGFDS+KFSQIMLLDCCFILELLLR+SKKRLRRRNDSVFTTPGLLFDLRCDLMLLENQIP
Sbjct: 121 YGFDSNKFSQIMLLDCCFILELLLRYSKKRLRRRNDSVFTTPGLLFDLRCDLMLLENQIP 180

Query: 401 YFLLKDVYENVQDPTEENMSLNDLTFRFFKTLVVGDRQLVYDNFTVEADHLLEMVHSCFL 460
           YFLLKDVY NVQDPTEENMSLNDLTFRFFKT+V GDRQ VYDNF VEADHLLEM+HSCFL
Sbjct: 181 YFLLKDVYANVQDPTEENMSLNDLTFRFFKTMVAGDRQFVYDNFMVEADHLLEMIHSCFL 240

Query: 461 STYPRVETNDKSKSRELPSASKLKTAGIKIKNARSSKSLLDIKFQNGVLEIPPLKVYQKT 520
           STYPR+ETND SKSRELPSASKLKTAGIKIKN +SSKSLLDIKFQNGVLEIPPLKVYQKT
Sbjct: 241 STYPRMETNDNSKSRELPSASKLKTAGIKIKNFKSSKSLLDIKFQNGVLEIPPLKVYQKT 300

Query: 521 EVILRNLVAYEIHQSGSDRQVKSYINFMSHLLQSDQDVKILYRRKILIDQEDDEEQIIRN 580
           EVILRNLVAYEIHQSGSDRQVKSYINFMSHLLQSDQDVKILYRRKIL DQE+DEEQIIRN
Sbjct: 301 EVILRNLVAYEIHQSGSDRQVKSYINFMSHLLQSDQDVKILYRRKILNDQENDEEQIIRN 360

Query: 581 LKWMSEKESLSGTYFAGIVQKLNEKPDRCV 610
           LKWM EKESLSGTYFAGIVQKLN+K DRCV
Sbjct: 361 LKWMREKESLSGTYFAGIVQKLNKKRDRCV 390

BLAST of Cp4.1LG03g11810 vs. ExPASy TrEMBL
Match: A0A1S3AY98 (UPF0481 protein At3g47200-like OS=Cucumis melo OX=3656 GN=LOC103483890 PE=4 SV=1)

HSP 1 Score: 652 bits (1683), Expect = 1.89e-229
Identity = 335/432 (77.55%), Postives = 380/432 (87.96%), Query Frame = 0

Query: 221 MDSSRALSHSIDVPATSQGSSEEESLLSSIEGKLEAFCSSITIFRAPNEISIEDRNVFVP 280
           MDSS  +SH+I++P  SQ SS EESLLSSIEGKLEA CSS+TIF+AP+EI+IE RNVFVP
Sbjct: 1   MDSSTPVSHTINIPGISQESSREESLLSSIEGKLEANCSSVTIFKAPSEINIEGRNVFVP 60

Query: 281 AKVSIGPFHHGAPHLESMENLKWCYLSAFLKNNPSVDLQYLVELVVKSESRLRKCYEEEF 340
           AKVSIGPFHHGA HL+S+ENLKW YLS FLK+N S+ LQ L+++VVKSESRL+KCYE++F
Sbjct: 61  AKVSIGPFHHGAAHLQSVENLKWRYLSTFLKHNSSLTLQDLIKIVVKSESRLKKCYEKKF 120

Query: 341 YGFDSDKFSQIMLLDCCFILELLLRFSKKRLRRRNDSVFTTPGLLFDLRCDLMLLENQIP 400
              D D+FS IMLLDCCFILELLLR+SK+R +RRND VFTTPGLLFD++CDLMLLENQIP
Sbjct: 121 CSLDRDEFSLIMLLDCCFILELLLRYSKRRFKRRNDPVFTTPGLLFDIKCDLMLLENQIP 180

Query: 401 YFLLKDVYENVQDPTEENMSLNDLTFRFFKTLVVGDRQLVYDNFTVEADHLLEMVHSCFL 460
           YFLL ++YE V DP EENM L+DLTFRFF+T+V GDR+ + DNF VEADHLLEMVHSCFL
Sbjct: 181 YFLLDEIYEKVLDPREENMFLSDLTFRFFRTMVPGDRKFMGDNFVVEADHLLEMVHSCFL 240

Query: 461 STYPRVETNDKSKSRELPSASKLKTAGIKIKNARSSKSLLDIKFQNGVLEIPPLKVYQKT 520
           STYP V+TNDK KS+ELPSASKLKTAGIK KNARSSKSLLDIKFQNGVLEIPPL+VYQ+T
Sbjct: 241 STYPPVKTNDKLKSKELPSASKLKTAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQQT 300

Query: 521 EVILRNLVAYEIHQSGSDRQVKSYINFMSHLLQSDQDVKILYRRKILIDQEDDEEQIIRN 580
           E ILRNL AYEI QSG+D+QVKSY+ FMSHLLQSD DVKIL R+KIL   EDDEEQII N
Sbjct: 301 EAILRNLAAYEIRQSGTDQQVKSYLKFMSHLLQSDGDVKILCRKKILYALEDDEEQIIEN 360

Query: 581 LKWMSE-KESLSGTYFAGIVQKLNEKPDRCVARWRKLRRNPVAIGIVAVLVVVVIFVAAF 640
           LKW+ E KESLSGTYFAGIVQKLNEKPDR V RWR+LRR P AIG+ A L+VVVIF AAF
Sbjct: 361 LKWIREQKESLSGTYFAGIVQKLNEKPDRSVVRWRRLRRKPTAIGVAADLMVVVIFGAAF 420

Query: 641 FSAFSVLQRRYK 651
           F+AFS+LQRRYK
Sbjct: 421 FAAFSILQRRYK 432

BLAST of Cp4.1LG03g11810 vs. ExPASy TrEMBL
Match: A0A0A0L821 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G166290 PE=4 SV=1)

HSP 1 Score: 618 bits (1593), Expect = 8.55e-216
Identity = 322/432 (74.54%), Postives = 369/432 (85.42%), Query Frame = 0

Query: 221 MDSSRALSHSIDVPATSQGSSEEESLLSSIEGKLEAFCSSITIFRAPNEISIEDRNVFVP 280
           MD S  +SH+I++   SQ S +EESLLS IE KLEA CSS TI++AP+EI+IEDRNVF+P
Sbjct: 1   MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLP 60

Query: 281 AKVSIGPFHHGAPHLESMENLKWCYLSAFLKNNPSVDLQYLVELVVKSESRLRKCYEEEF 340
           AKVSIGPFHHGAPHLES+E LKW YLS FL + PS+ LQ L++LVVKSESR RKCYE+EF
Sbjct: 61  AKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQDLIKLVVKSESRGRKCYEKEF 120

Query: 341 YGFDSDKFSQIMLLDCCFILELLLRFSKKRLRRRNDSVFTTPGLLFDLRCDLMLLENQIP 400
           Y  D D+FSQIMLLDCCFILELLLR++K+R RR ND VFTTPGLL+DLRCDL+LLENQIP
Sbjct: 121 YSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIP 180

Query: 401 YFLLKDVYENVQDPTEENMSLNDLTFRFFKTLVVGDRQLVYDNFTVEADHLLEMVHSCFL 460
           YFLL+++Y  V D  EENM L+DLT RFF+T+V GDR+ + DNF VEA+HLLEMV+SCFL
Sbjct: 181 YFLLEEIYAKVLDGLEENMYLSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFL 240

Query: 461 STYPRVETNDKSKSRELPSASKLKTAGIKIKNARSSKSLLDIKFQNGVLEIPPLKVYQKT 520
           STYP VETNDK KS+ELPSASKLK AGIK KNARSSKSLLDIKFQNGVLEIPPL+VYQKT
Sbjct: 241 STYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT 300

Query: 521 EVILRNLVAYEIHQSGSDRQVKSYINFMSHLLQSDQDVKILYRRKILIDQEDDEEQIIRN 580
           E ILRNL AYEI Q G+D QVKSY+NFMSHLLQSD+DVKIL R+KIL   +D+EEQII  
Sbjct: 301 ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEK 360

Query: 581 LKWMSE-KESLSGTYFAGIVQKLNEKPDRCVARWRKLRRNPVAIGIVAVLVVVVIFVAAF 640
           LKW+ E K+SLSGT+FAGIVQKL EKPDR VARWR+LR N  AI +  VL+VVVIF AAF
Sbjct: 361 LKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSNSTAISVATVLMVVVIFGAAF 420

Query: 641 FSAFSVLQRRYK 651
           F+AFSVLQRRYK
Sbjct: 421 FAAFSVLQRRYK 432

BLAST of Cp4.1LG03g11810 vs. ExPASy TrEMBL
Match: A0A6J1CA62 (UPF0481 protein At3g47200-like OS=Momordica charantia OX=3673 GN=LOC111009362 PE=4 SV=1)

HSP 1 Score: 605 bits (1561), Expect = 6.32e-211
Identity = 312/433 (72.06%), Postives = 370/433 (85.45%), Query Frame = 0

Query: 221 MDSSRALSHSIDVPATSQGSSEEESLLSSIEGKLEAFCSSITIFRAPNEISIEDRNVFVP 280
           M+ SRALSH+ID+PA S+  S+EESLL S+E K+EAFCSSI IF+ P+EISI++R VFVP
Sbjct: 1   MNPSRALSHAIDIPAISRERSDEESLLCSMEAKMEAFCSSIIIFKVPDEISIDNREVFVP 60

Query: 281 AKVSIGPFHHGAPHLESMENLKWCYLSAFLKNNPSVDLQYLVELVVKSESRLRKCYEEEF 340
           AKVSIGPFHHGAPHLESME+LKW YL AFLK+NPSV L  L+E V KSESR+RKCYE EF
Sbjct: 61  AKVSIGPFHHGAPHLESMEDLKWNYLCAFLKHNPSVGLDDLLEFVAKSESRVRKCYEVEF 120

Query: 341 YGFDSDKFSQIMLLDCCFILELLLRFSKKRLRRRNDSVFTTPGLLFDLRCDLMLLENQIP 400
           +  DS KF+++M+LDCCF+LELLLRFS KRL+RRND VFTTPGLL DL+ DL+LLENQIP
Sbjct: 121 HDLDSQKFARMMVLDCCFVLELLLRFSIKRLKRRNDPVFTTPGLLLDLKSDLILLENQIP 180

Query: 401 YFLLKDVYENVQDPTEENMSLNDLTFRFFKTLVVGDRQLVYDNFTVEADHLLEMVHSCFL 460
           YFLL++VYE VQD  EENM LNDL FRFF+T+V G+RQ VYDNF  +ADHLL++VHSCFL
Sbjct: 181 YFLLREVYEKVQDSREENMPLNDLAFRFFRTIVAGERQSVYDNFQQDADHLLDIVHSCFL 240

Query: 461 STYPRVET-NDKSKSRELPSASKLKTAGIKIKNARSSKSLLDIKFQNGVLEIPPLKVYQK 520
           STYPR+ET N+KSK+ ELP ASKLK+AGIK KNA + KS+LDIKFQNG LEIP L+V + 
Sbjct: 241 STYPRIETKNNKSKTAELPRASKLKSAGIKFKNAVTPKSVLDIKFQNGGLEIPTLEVSKH 300

Query: 521 TEVILRNLVAYEIHQSGSDRQVKSYINFMSHLLQSDQDVKILYRRKILIDQEDDEEQIIR 580
           TE IL+NL+AYEI Q GS +QVKSY++FMSHLLQSD+D+K+L  RKILI+ E DE QII 
Sbjct: 301 TETILKNLIAYEICQIGSAQQVKSYVDFMSHLLQSDEDMKLLCGRKILINLEKDETQIIA 360

Query: 581 NLKWM-SEKESLSGTYFAGIVQKLNEKPDRCVARWRKLRRNPVAIGIVAVLVVVVIFVAA 640
           NLKWM  +K +LSGTYFAG+VQKLNE PDR +  WR+LRRNPVAIG+VAV  +VVIFVAA
Sbjct: 361 NLKWMRQQKANLSGTYFAGVVQKLNEPPDRFIVWWRRLRRNPVAIGVVAVWALVVIFVAA 420

Query: 641 FFSAFSVLQRRYK 651
           FFSA S+LQRRY+
Sbjct: 421 FFSALSLLQRRYR 433

BLAST of Cp4.1LG03g11810 vs. TAIR 10
Match: AT4G31980.1 (unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF247, plant (InterPro:IPR004158), Protein of unknown function DUF862, eukaryotic (InterPro:IPR008580); BEST Arabidopsis thaliana protein match is: Plant protein of unknown function (DUF247) (TAIR:AT5G11290.1); Has 1967 Blast hits to 1844 proteins in 183 species: Archae - 0; Bacteria - 6; Metazoa - 223; Fungi - 83; Plants - 1477; Viruses - 0; Other Eukaryotes - 178 (source: NCBI BLink). )

HSP 1 Score: 168.3 bits (425), Expect = 2.0e-41
Identity = 126/439 (28.70%), Postives = 226/439 (51.48%), Query Frame = 0

Query: 224 SRALSHSIDVPATSQGSSEEESLLSSIEGKLEAFCSSIT----IFRAPNEISIEDRNVFV 283
           S    H +         +E ++L+ SI+ KL AF SS++    I++ PN++   + + + 
Sbjct: 253 SEVQCHRLCFAYERMNQNEGDALVDSIKAKL-AFLSSLSTKCCIYKVPNKLRRLNPDAYT 312

Query: 284 PAKVSIGPFHHGAPHLESMENLKWCYLSAFLKNNPSVDLQYLVELVVKSESRLRKCYEEE 343
           P  VS GP H G   L++ME+ K+ YL +F+    S  L+ LV L    E   R CY E+
Sbjct: 313 PRLVSFGPLHRGKEELQAMEDQKYRYLLSFIPRTNS-SLEDLVRLARTWEQNARSCYAED 372

Query: 344 FYGFDSDKFSQIMLLDCCFILELLLRFSKKRLRRRNDSVFTTPGLLFDLRCDLMLLENQI 403
                SD+F +++++D  F++ELLLR    RLR  ND +F    ++ D+  D++L+ENQ+
Sbjct: 373 -VKLHSDEFVEMLVVDGSFLVELLLRSHYPRLRGENDRIFGNSMMITDVCRDMILIENQL 432

Query: 404 PYFLLKDVY----ENVQDPTEENMSLNDLTFRFFKTLVVGDRQLVYDNFTVEADHLLEMV 463
           P+F++K+++       Q  T   + L    F +F + +  ++      F  E +H ++++
Sbjct: 433 PFFVVKEIFLLLLNYYQQGTPSIIQLAQRHFSYFLSRIDDEK------FITEPEHFVDLL 492

Query: 464 HSCFLSTYPRVETNDKSKSRELPSASKLKTAGIKIKNARSSKSLLDIKFQNGVLEIPPLK 523
            SC+L  +P        K    P A++L TAG++ K A +S  LLDI F +GVL+IP + 
Sbjct: 493 RSCYLPQFPIKLEYTTVKVDNAPEATELHTAGVRFKPAETSSCLLDISFADGVLKIPTIV 552

Query: 524 VYQKTEVILRNLVAYEIHQSGSDRQVKSYINFMSHLLQSDQDVKILYRRKILIDQEDDEE 583
           V   TE + +N++ +E  +  S++    YI  +   ++S  D  +L    I+++   +  
Sbjct: 553 VDDLTESLYKNIIGFEQCRC-SNKNFLDYIMLLGCFIKSPTDADLLIHSGIIVNYLGNSV 612

Query: 584 QIIRNLKWMSEKESLSGT--YFAGIVQKLNEKPDRCVARWRKLRR-----NPVAIGIV-- 643
             + NL     KE +     YF+ + + L    +    RW+ + R     NP A+  V  
Sbjct: 613 D-VSNLFNSISKEVIYDRRFYFSMLSENLQAYCNTPWNRWKAILRRDYFHNPWAVASVFA 672

Query: 644 AVLVVVVIFVAAFFSAFSV 646
           A+L++++ F+ +  S  ++
Sbjct: 673 ALLLLLLTFIQSVCSILAL 680

BLAST of Cp4.1LG03g11810 vs. TAIR 10
Match: AT5G22540.1 (Plant protein of unknown function (DUF247) )

HSP 1 Score: 162.9 bits (411), Expect = 8.5e-40
Identity = 123/403 (30.52%), Postives = 195/403 (48.39%), Query Frame = 0

Query: 263 IFRAPNEISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWCYLSAFLKNNPSVDL--QY 322
           I R P  ++  +   + P  VSIGP+HHG  HL+  +  K  +L  F+          Q 
Sbjct: 33  IVRIPQSLARINLKAYEPKIVSIGPYHHGKEHLKMTQQHKRRFLKFFVAKMEEKGFVPQE 92

Query: 323 LVELVVKSESRLRKCYEEEFYGFDSDKFSQIMLLDCCFILELLLRFS-KKRLRRRNDSVF 382
           LV+ V   E  +R  Y E+  G DS+   Q+M+LD CFIL L    S K      +D +F
Sbjct: 93  LVKAVSSLEGVIRGSYSEDL-GLDSENLVQMMVLDGCFILTLFFVVSGKVEYTNLDDPIF 152

Query: 383 TTPGLLFDLRCDLMLLENQIPYFLLKDVYENVQDPTEENMSLNDLTFRFFKTLVVGDRQL 442
             P +L  +R DL+LLENQ+PY LL+ ++E  +  T     LN++ F FF   +      
Sbjct: 153 RMPWILPSIRADLLLLENQVPYVLLQTLFETSKLVT--CSGLNEIAFEFFNYSLQKPETF 212

Query: 443 VYDNFTVEADHLLEMVHSCFLSTYPRVETNDKSKSRE---------LPSASKLKTAGIKI 502
              ++ +EA HLL+++   F+    +    D S             + SA KL   GIK 
Sbjct: 213 WEKHYGLEAKHLLDLIRKTFVPVPSQRRIKDHSSKSSFNDHEYLGFVLSAKKLHLRGIKF 272

Query: 503 KNARSSKSLLDIKFQNGVLEIPPLKVYQKTEVILRNLVAYEIHQSGSDRQVKSYINFMSH 562
           K  +++ S+LDI + NGVL IPP+ +   T  I  N VA+E   + S   + SY+ FM+ 
Sbjct: 273 KPRKNTDSILDISYSNGVLHIPPVVMDDFTASIFLNCVAFEQLYADSSNHITSYVAFMAC 332

Query: 563 LLQSDQDVKILYRRKILIDQEDDEEQIIRNLKWMSEKES--LSGTYFAGIVQKLNEKPDR 622
           L+  + D   L  R+IL +    E+++ R  K + +  +  L  +Y A + + +NE   +
Sbjct: 333 LINEESDASFLSERRILENYFGTEDEVSRFYKRIGKDIALDLEKSYLAKVFEGVNEYTSQ 392

Query: 623 -----CVARWRKLRRNP--VAIGIVAVLVVVVIFVAAFFSAFS 645
                C         +P   A    A+L+++   +  FF+A+S
Sbjct: 393 GFHVHCAEFIHTHFDSPWTFASSFAALLLLLFAALQVFFAAYS 432

BLAST of Cp4.1LG03g11810 vs. TAIR 10
Match: AT3G47210.1 (Plant protein of unknown function (DUF247) )

HSP 1 Score: 156.8 bits (395), Expect = 6.1e-38
Identity = 120/392 (30.61%), Postives = 189/392 (48.21%), Query Frame = 0

Query: 241 SEEESLLSSIEGKLEAFCSSITIFRAPNEISIEDRNVFVPAKVSIGPFHHGAPHLESMEN 300
           SE+  LL    GK      S  IFR P   +  +   + P  VSIGP+HHG  HLE ++ 
Sbjct: 78  SEKRVLLLESAGK-----ESCCIFRVPKSFAEMNPEAYKPKVVSIGPYHHGRKHLEMIQQ 137

Query: 301 LKWCYLSAFLKNNPSVDLQYLVELVVKSESRLRKCYEEEFYGFDSDKFSQIMLLDCCFIL 360
            K  +L  FL+   SVD   L   VV  E  +RK Y E   G    +   +M+LD CFIL
Sbjct: 138 HKLRFLHLFLR-TASVDRDVLFNAVVDWEDEIRKSYSEGLEG-SPHELVYMMILDGCFIL 197

Query: 361 ELLLRFSKK-RLRRRNDSVFTTPGLLFDLRCDLMLLENQIPYFLLKDVYENVQDPTEENM 420
            LLL  S+K  L    D + T P +L  ++ DL+LLENQ+P+F+L+ +++  +     + 
Sbjct: 198 MLLLIVSRKIELYESEDPILTIPWILPSIQSDLLLLENQVPFFVLQTLFDKSEIGVPGD- 257

Query: 421 SLNDLTFRFFKTLVVGDRQLVYDNFTVEADHLLEMVHSCFL--STYPRVE-TNDKSKSRE 480
            LN + F FF   +    +    +    A HLL+++   FL    Y   + T  KS+ + 
Sbjct: 258 -LNRMAFSFFNLSMDKPERYWVKHRNFNAKHLLDLIRMSFLPMDGYEDFQLTKGKSRKKS 317

Query: 481 ------LPSASKLKTAGIKIKNARSSKSLLDIKFQNGVLEIPPLKVYQKTEVILRNLVAY 540
                 L SA++L   GI       + S+LDI+ +   L+IP L++      IL N VA+
Sbjct: 318 SSGLTLLLSATRLSLQGIDFSLRSGADSMLDIRLKKNRLQIPVLRLDGFIISILLNCVAF 377

Query: 541 EIHQSGSDRQVKSYINFMSHLLQSDQDVKILYRRKILIDQEDDEEQIIRNLKWMSEKE-- 600
           E   + S   + SY+ FM  LL   +D   L RR+I+ +    E+++ +  K + +    
Sbjct: 378 EQFYAKSTNHITSYVVFMGCLLNGKEDATFLSRRRIIENYFGSEKEVSKFFKTICKDVVF 437

Query: 601 SLSGTYFAGIVQKLNEKPDRCVARWRKLRRNP 621
            +  +Y   +  ++NE   +  +  R    +P
Sbjct: 438 DIHASYLRNVFVEINENTSKWYSICRSFLLSP 460

BLAST of Cp4.1LG03g11810 vs. TAIR 10
Match: AT3G50180.1 (Plant protein of unknown function (DUF247) )

HSP 1 Score: 156.8 bits (395), Expect = 6.1e-38
Identity = 102/321 (31.78%), Postives = 172/321 (53.58%), Query Frame = 0

Query: 261 ITIFRAPNEISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWCYLSAFLKNNPSVDLQY 320
           + I++ P+ +   D+  + P  VS+GP+HHG    +SME  KW  ++  LK   +  ++ 
Sbjct: 178 LCIYKVPHYLHGNDKKSYFPQTVSLGPYHHGRQQTQSMECHKWRAVNMVLKRT-NQGIEV 237

Query: 321 LVELVVKSESRLRKCYEEEFYGFDSDKFSQIMLLDCCFILELLLRFSKKRLR---RRNDS 380
            ++ +++ E + R CYE       S++F++++LLD CFILELL   ++  L+     ND 
Sbjct: 238 FLDAMIELEEKARACYEGSIV-LSSNEFTEMLLLDGCFILELLQGVNEGFLKLGYDHNDP 297

Query: 381 VFTTPGLLFDLRCDLMLLENQIPYFLLKDVYENVQDPTEENMSLNDLTFRFFKTLVVGDR 440
           VF   G +  ++ D+++LENQ+P F+L  + E +Q  T+    L +L  RFF  L+    
Sbjct: 298 VFAVRGSMHSIQRDMIMLENQLPLFVLNRLLE-LQPGTQNQTGLVELVVRFFIPLMPTAE 357

Query: 441 QLVYDN----FTVEADHLLEMVHSCFL-------STYPRVETNDKSKSRELPSASKLKTA 500
            L  ++     +    H L++ H   L       + Y RV   DK   R +P+ ++L+ A
Sbjct: 358 TLTENSPPRGVSNGELHCLDVFHRSLLFPRSSGKANYSRVA--DKHLQRVIPTVTELRDA 417

Query: 501 GIKIKNARSSKSLLDIKFQNGVLEIPPLKVYQKTEVILRNLVAYEIHQSGSDRQVKSYIN 560
           G K K  ++ +   DIKF NG LEIP L ++  T+ +  NL+A+E     S   + SYI 
Sbjct: 418 GFKFKLNKTDR-FWDIKFSNGYLEIPGLLIHDGTKSLFLNLIAFEQCHIESSNDITSYII 477

Query: 561 FMSHLLQSDQDVKILYRRKIL 568
           FM +L+ S +D+  L+   I+
Sbjct: 478 FMDNLIDSPEDISYLHHCGII 492

BLAST of Cp4.1LG03g11810 vs. TAIR 10
Match: AT3G50140.1 (Plant protein of unknown function (DUF247) )

HSP 1 Score: 150.6 bits (379), Expect = 4.4e-36
Identity = 124/427 (29.04%), Postives = 200/427 (46.84%), Query Frame = 0

Query: 261 ITIFRAPNEISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWCYLSAFLKNNPSVDLQY 320
           I I+R P  +   D+N + P  VS+GP+HHG  HL  M+  KW  ++  +K      ++ 
Sbjct: 113 ICIYRVPLSLKKSDKNSYFPQAVSLGPYHHGDEHLRPMDYHKWRAVNMVMKRTKQ-GIEM 172

Query: 321 LVELVVKSESRLRKCYEEEFYGFDSDKFSQIMLLDCCFILELLL----RFSKKRLRRRND 380
            ++ + + E R R CYE    G  S+KF+Q+++LD CF+L+L       FSK     RND
Sbjct: 173 YIDAMKELEERARACYEGPI-GLSSNKFTQMLVLDGCFVLDLFRGAYEGFSKLGY-DRND 232

Query: 381 SVFTTPGLLFDLRCDLMLLENQIPYFLLKDVYENVQDPTEENMSLNDLTFRFF------- 440
            VF   G +  +R D+++LENQ+P F+L  + E       +   +  L  RFF       
Sbjct: 233 PVFAMRGSMHSIRRDMLMLENQLPLFVLNRLLELQLGTQYQTGLVAQLAVRFFNPLMPTY 292

Query: 441 --KTLVVGDRQLVYDNFTVEADHLLEMVHSC----------FLSTYPRVETN-------- 500
              T +   ++     F   AD   E +H             L   PR+  +        
Sbjct: 293 MSSTKIENSQENNNKFFNPIADKEKEELHCLDVFRRSLLQPSLKPDPRLSRSRWSRKPLV 352

Query: 501 -DKSKSRELPSASKLKTAGIKIKNARSSKSLLDIKFQNGVLEIPPLKVYQKTEVILRNLV 560
            DK + + L   ++L+ AGIK K  R S    DI+F+NG LEIP L ++  T+ +  NL+
Sbjct: 353 ADKRQQQLLHCVTELREAGIKFKR-RKSDRFWDIQFKNGCLEIPKLLIHDGTKSLFSNLI 412

Query: 561 AYEIHQSGSDRQVKSYINFMSHLLQSDQDVKILYRRKILID--QEDDEEQIIRNLKWMSE 620
           AYE     S   + SYI FM +L+ S +D++ L+   I+      D E   + N      
Sbjct: 413 AYEQCHIDSTNDITSYIIFMDNLIDSAEDIRYLHYYDIIEHWLGNDSEVADVFNRLCQEV 472

Query: 621 KESLSGTYFAGIVQKLNEKPDRCVARWRKLR--------RNPVAI--GIVAVLVVVVIFV 644
              L  TY + +  K++   +R   +W  L+         NP A      AV+++++   
Sbjct: 473 AFDLENTYLSELSNKVDRYYNR---KWNVLKATLKHKYFSNPWAYFSFFAAVILLLLTLF 532

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9SD537.5e-3331.74UPF0481 protein At3g47200 OS=Arabidopsis thaliana OX=3702 GN=At3g47200 PE=2 SV=1[more]
P0C8979.2e-1522.43Putative UPF0481 protein At3g02645 OS=Arabidopsis thaliana OX=3702 GN=At3g02645 ... [more]
Match NameE-valueIdentityDescription
XP_023526431.13.36e-301100.00UPF0481 protein At3g47200-like [Cucurbita pepo subsp. pepo] >XP_023526432.1 UPF0... [more]
XP_022955709.15.01e-29698.14UPF0481 protein At3g47200-like [Cucurbita moschata] >XP_022955710.1 UPF0481 prot... [more]
KAG7018572.14.10e-29597.91UPF0481 protein, partial [Cucurbita argyrosperma subsp. argyrosperma][more]
XP_022979569.18.01e-25994.87UPF0481 protein At3g47200-like [Cucurbita maxima][more]
XP_038880915.16.86e-25485.19UPF0481 protein At3g47200-like [Benincasa hispida][more]
Match NameE-valueIdentityDescription
A0A6J1GVU12.43e-29698.14UPF0481 protein At3g47200-like OS=Cucurbita moschata OX=3662 GN=LOC111457630 PE=... [more]
A0A6J1IWZ43.88e-25994.87UPF0481 protein At3g47200-like OS=Cucurbita maxima OX=3661 GN=LOC111479248 PE=4 ... [more]
A0A1S3AY981.89e-22977.55UPF0481 protein At3g47200-like OS=Cucumis melo OX=3656 GN=LOC103483890 PE=4 SV=1[more]
A0A0A0L8218.55e-21674.54Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G166290 PE=4 SV=1[more]
A0A6J1CA626.32e-21172.06UPF0481 protein At3g47200-like OS=Momordica charantia OX=3673 GN=LOC111009362 PE... [more]
Match NameE-valueIdentityDescription
AT4G31980.12.0e-4128.70unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF247,... [more]
AT5G22540.18.5e-4030.52Plant protein of unknown function (DUF247) [more]
AT3G47210.16.1e-3830.61Plant protein of unknown function (DUF247) [more]
AT3G50180.16.1e-3831.78Plant protein of unknown function (DUF247) [more]
AT3G50140.14.4e-3629.04Plant protein of unknown function (DUF247) [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR004158Protein of unknown function DUF247, plantPFAMPF03140DUF247coord: 263..633
e-value: 2.9E-86
score: 290.3
IPR004158Protein of unknown function DUF247, plantPANTHERPTHR31549PROTEIN, PUTATIVE (DUF247)-RELATED-RELATEDcoord: 240..644
IPR035203Cell division control protein 24, OB domain 3PFAMPF17244CDC24_OB3coord: 53..126
e-value: 7.4E-7
score: 29.2
NoneNo IPR availablePANTHERPTHR31549:SF38SUBFAMILY NOT NAMEDcoord: 240..644

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG03g11810.1Cp4.1LG03g11810.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane