Cp4.1LG02g15750 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG02g15750
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: tRNA-dihydrouridine synthase
Location: Cp4.1LG02: 13301697 .. 13309036 (+)
RNA-Seq Expression: Cp4.1LG02g15750
Synteny: Cp4.1LG02g15750
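
The Location field can be split into chromosome, start, end, and strand programmatically. The following is a minimal sketch, not part of the database itself: the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive (the usual GFF convention), which this page does not state.

```python
import re

# Pattern for a location string such as "Cp4.1LG02: 13301697 .. 13309036 (+)".
_LOCATION = re.compile(r"\s*([^\s:]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Split a location string into seqid, start, end, strand, and length.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = _LOCATION.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }


loc = parse_location("Cp4.1LG02: 13301697 .. 13309036 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the span for this gene comes out as 7340 bases on the plus strand of Cp4.1LG02.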
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, exon, CDS, polypeptide, three_prime_UTR
GGTGGCCTTAAACGCAACACCGAGTCAATCTTCGTTGCTCAATCTTCTCCGCCGCTCAATCTTCCTCCGCATCAACTCCGCCGCACCGACCAGCGCGGAAACGGAGGTTGTCAGGTTCCCGCCGCGCCGGTAACCATATTTCCATGTAAACCTTAACATTCGTTTTGTATAAATCTGCAAATTAAGCATTTAACTTGTTTGCGTTTCTGCAATTGTGTTCGCGAACTGCCAATCACTCTTATCATCGTAGCATTTCTTTGTTTAAATCATTGGTTCTGTAGAGGATGTTTGTTAAAAGGAGTTCTTAGCGGTTGTTCCGCCACTTTTAAAGCCTCTCAAGTTGCTCATGATCAGAAATTTTTGAGGACCGGACTGTGTATGAATTGAGGGATGATGAAGTTTGCTGGATGTTCTATTACAGCTTCTTTGACTCCTATCCAGCCTTTCATTCAGAGAAGCCATCCTAGGCTGTCAATCGGTGTTTTACAGGAGAATTTATATAAACCACAGAAACTTGGAGAAGGTTCAGAAGCAGAAATGGTTGCTAGACGTTATTCCCCTCCTTTGTTTAGGTGATTGTTTTCCCACTATAATTAATGCTCCGTGTTCCTTAATGTTTATATCTTTTGATTGAAATTGGCTTTTATTCACTTTCTCAGTGTTGCTCCTATGATGGATTGGACTGATAATCACTACAGAACTCTTGCTCGTATGATATCAAAGCATGCGTGGCTGTACACCGAGATGCTTGCTGCTGAAACAATTGTTTATCAGAAGGATAATCTTGTAAGTTTGCGGGAGAATCCTTTGGGGAATAGCTAATTTTTCAATATTTCCAACACATTTAATAAGTAGGGTTGCTTTTGTTTATCAAAGATGTCATTTCCTTCATTTTCCACATTTTACTCGAGGTTTTCAAAATTGTGTGTAACATGTCGGACTAGTAATCAATTATTTGAGGTAGCCCTCCCTCACCCTCAGGCGTACTTGTTGAGGATGATTGGGAGGGAGCGCCACATTGGCTGATTAAGGGAATGATAATGGGTTTATAATTAAGGAATACTTCTCCATTGGTATGAGGCCTTTTGGGGAAACCAAAAGCAACTCATGAGAGCTTATACTCAAAGTGGACAATATCATACAATTGTGGAGAGTCGTGGTTTCTAACATGGTATCAAAGTCATGCCCTTAACTTAGCTCGGTCAATAGAACACTTCAAGTGTCGAACAAAGAAGTTGTGATCCTCAAAAGTGTAGTCAAAAGTGACTAAAGTGTAGTCAAAAGTGACTCAAGTGTCGAACAAAGACTGTACTTTGTTCGAGGACTCCAGAAGAGTCGAGCTTCGAATAAGAGGAGGTTGTTCGAGGGCTCTTTAGACCTCAAGGGAGGATCAATGGTGTACTTTGTTCAAAAGTGACTGAAGTGTCGAACAAATGATGTACTTTGTTCGAGGACTCCAGAGGAGTCGAGCCTCGAATAAGAGGAGGTTGTTCGAGGGCTTTTAGACCTCAAGGGAGGCTCTATGGTGTACTTTGTTTGAAGGGAGGATTGTTGAGCATTGTTGGAAGGGAGTCCCACATCGACTGATTAAGGGAATGATCATGAATTTATAAGTAAAGAACATCTCCATTGGTATGAGGTCTTTTGGTGAAACCAAAAGCAACGCCATGAGAGCTTATACTCAAAGTGGACAATATCATACCATTGTGGAGAATCGTGGTTTCTAACAGTATTGCCTCCTTGTTAATACGTTGGGTTAGTAAATAATTATGTATGCCCCCTTCCTCCACTCTTATGATCTTATGAGAAGATTTGCCTCCAGTTACCCCAACAAAAGATCCTGGCCTTGATTCCTACAATATCCCACAACTTTCATGGTTTATAGTCCGAGTTGGACCATGGATTTCTCACATTTTACCAACATTTGGTCACCAAGAGTGACGGCATGGTCGTGTTTTCAATTTGTAGGATGCTCTAATTATTCAATTGTTGACTGCTAT
TTTTTACCTGTCATTATGAGGATGTACTTAATCTCCAAATTGGCCTGACCTCATTTCCTGTTTGTTTCTTTACCAGTCCAGGTTGGACTAGAGTTTTCTAATTTTTGATTGGTGAGAGTGAAGAATTGATGGGTGTTTTATTTTTCTCAAAGAAGAGTTACCATCTTATTGCTACATTTTGCTACATGCATGTTTGAACTAGTAGGTGTTTGAGTCATATGAAGAATTTCTTCTCTTTTCATGAATCAAACTTGATCTGATTACGATATGCTTCAAATGTCTATGTTTTTCTTGTGAAATTCGGTATTCGAAGTTCCTTCTTTTCATGTAATCTTCATTTATTTTGCACAGGACAGATTTTTAGCATTTTCTCCAGAGCAACATCCCATTGTCCTGCAAATTGGTGGAAACAATCTTAATAACATTGCAAAAGCTGTTGAACTTGCCAAGCCTTACGGCTACGATGAAATTAATTTAAAGTCTGTATTATATTTCTTTAGATTCTACAATAAATTCTTAAATTCTAATATCTTTTTTTAAATGTTTTTTAGCTGTGGATGTCCAAGTTCAAAGGTGGCTGGACATGGGTGCTTTGGTGTTCGCCTTATGCTGGATCCTAAGGTTTCTGGCTGTATGCTGGCTGTATGCTGGCTGTATTTTGATTTTGATCATCTTGTATATTTTATATTTTTCAGCCATGTGTCTGACGTAGGCTTTTCATGTTGACAAGGAAAGTTTGTTGGTGAAGCCATGTCAGTAATTGCTGCTAACACTGATGTCCCTGTAAGCGTGAAATGTCGAATTGGTGTCGATGACCATGATTCATATAATGAGCTTTGTAAGGTTTCTACTCTTTTTTCTCTTGTTAGAAAGCACCGACTACTGTTTCTTTGCTTAGATAAACATCGTTATGTCTTGAGGTTAGTATTGTATTATCGACAACTCAATCATAGGAACTCTATGATTTCTAAAATTAAGTTCGTTTAACTAGAGCGATTCCATGTTGGGTGACTTTCCGAACAATTTCATGTGAATGAAGACAAAACATGCAAACCTTAGTCCAGATGTTGGGCTTGGGTTGATACAAAATATTCTTTCCGAAGAGTTATTCTAGTTTGATGCAACATCATGTGGCTTCTTAGTTGTTGATTTCTCCTCTCATGTCAAGCACTTTTTGTTTCAGGTGATTTTGTTTACAAAGTTTCTTCTTTGTCGCCAACTAGGCATTTCATAATCCATTCCCGCAAGGCGTTACTCAACGGTATCAGCCCAGCTGAAAATCGAGAAATCCCTCCTTTGAAGTACGGCTCCTTAAGAACTTTATATAGGCCTAGAGAGCTTGTCATTTGTGGTTCAAGTATTTTTACTAAGAACAGGCCTTTGTAGTTCATAATATTGATGTGGTCACAGGTATGAATATTTCTATGCACTGATGCGCGACTTTCCAGACTTGAGATTCACAATAAATGGGGGCATTAAGACTGTTGATGAGGTGAACTTTCTGCTCAGCTCACTGGACCTTAAAGTATCAGAAATTGTGATTTGTTTTAAATGATAAATTTTTTACATCATAATTTAATGTTCATTTATTTTGAATGAGCTGATACAGGCTAATGCTGCTCTGAGACTAGGAGCTCATGGTGTAATGGTGGGCCGATCTGCTTACCAGAAGTATGTTTATGCACGTTTTGAAAGTCAATATTCTATCAGAGTTGGTTATGATATACCTTAATTTGACTGATGGCTATTGTTCATCCTCCGTAGCCCATGGCGTACTTTAGGACATGTCGACACTGCAATTTACGGTGCACCTAGCAGTGGTATCACACGACGTCAGGTGCAATTGTAAACTTGGCTGTGAGATTAGATTATGCTTCTGCTTGAAAGTTGACTATTATCTCATTTAGTTGTAAACTTTTCTATCTTCTAGGTCCTTGAACAGTATCAAGTATATGGGGATTCTGTTTTGGGTACGTATGGAAATAGACCAAGCATACGAG
AGGTTGTGAAGGTAAGATCTTGTTGGTGGTTTGCAGCCTTAAAACATAACTTTTGTCTATTCAGTTATTTGGTCTAATCTTTTTTTTGTTATCTTATTGCTTTATACTGATCCAGAACCTTTTCAATTGTTGAGTTTAGGTGATATTTTTTTTTATTTTTTTTCGGGCTTCCCCTCAAAGTTTTTTGAACGCATACGCTTGGAAGAGATTTCTACCCCGAACAGACATGGCTCTCTTTTGGGTTTTCCTTTCGAACTTCTCCTCAAGGTTTTTAAATTGAGTATGCTAGGGAAATGTTTTCATGCCCATATAAAGAGTGTTTCGTTCGCCTCCCGATGTGGAATCTCACAATCCACTCCTTCGGGGTCAGCATCATCGCGTCCTTGTTGGCACTTGTTCCCTTCTCCAATCGATGTGGGACCCCCAATTCACCCCCTTCTAGACCCAGCGCCCTTGCTGGCACACCGCCTCATCTCCATCCCCTTTGGGGACCTCATCTCCATCCCCTTTGGGGCTTAATCTCCTCATTGGTACATCGCCTTTTGGGGCTTAATCTCCTCATTGGTACATCGCCTAGTGTCTCGCTCTGATACCATTTGTAACAATTCAAACCCACCGCGAGTAGATATTGTTTCTTTGGGCTTTCCATTTTTAAAACACCTCTACTCTCTATTTTTAAAACACCTCTACTCTAGAGAGAGTTTCCACAGAGTTTCCACACCCTCCTTTTCCTCAAGTTGGATGTGGTGGGATTTCAGATTTCATATTTACTTCCCCACCAACTGAACTGAGATGGGATTTCATAAACTTCTTATGAAATCGGTGTCTCTCTACTGACTGTATTCTAGCAAAATGCATCCCCACCCCACACAAACACCTGGTATGGTAACTTGTCTCTTTTATCTGCTGCAGCCGTTGCTTCATCTTTTTTATACAGATCCTGGGAATGGTCCATGGAAGCGTAAAGCCGATGCCGCTTTTCAGCACTGCAAGGTATGTAAAATATGCTACAGAAGAACAACTTCGATTTTAGTTTTACAAAACGGAGTATTAAATTTTCAACTTTTCTCGCGCATTGTTATCATAGTTTTCTATCCTTCAAAAGTCACTTCCTTTTGAACCTTGACTGGATACCTTATAAGTGTACTGCTACTGGACATTGTAGACCATCAAGTCATTTTTCGATGAAACACTTGTAGCAATTCCAGATTATATATTGGATGCTCCTGTAGCAGAACCTCCATCTGGACGTGAAGATCTTTTTGCCAATACACTTAGTTTGATGCCTCCTCCATATGAAGATAGAGAGCAAAAGGTATTATTGGAGGCTTAATTTTGTATCATCGTTTAAGCCATGTATAACAGCCTAACCTTAGAATACAATTTTCACTTATTCTTTGCAAATTCATTGATGAGGTAATCAAAGAAGTTGCCCTTGTAAACTGGTCATTCATCTTACTTATAAGCTTTTCATGTCTCACTACTCGTTGTTGCCATTTTGTTTTCCAACCATCAAGAGTCTCAAGGTGTTTGATTCCTGAGGATATCAAGAGTCTCAAGGTAAGAATCTGCTGGTCCAAAACTAGGGATAAGATAATGGTCCAAAGTTCAAACCCTAGAACGACAAATCGTTCGACAGAGGACACCTATAATATCGCCGATTCAGATCGCCTAAAACCTACTAACAAAAATCGCCATCGACGTCCTAAATTTCGAACCAAAATCACCTCCGATTCCAAATCTCAAAGAACGCATTAAGAAAAATTCCATTTGAGAAATGATTTCGAATGATCGGCGAGAGTCCCTCAGTACGCCTAGCATCTACCTGGAAGTTCATGGAGAAACTGGTTCCTCGACCTCTGCGACAAGCATCGGGAGCGCCAGTGATGGAAGAAACCTCCATACTTGTTCAGATTACTAAACTTTTCGTCATCGGGACTCAATTAGCGATGCCTCAGCTTGACCGGAGGCGGCGGGGGAGGGAAGGAAGGAAGGAAGGA
AGAGGTATCTGGAACTAGAAGACTACTCCCCCTCCCCTGCTATTTTAGGGCTCCGAGATATTTTATTGGGCTTGTTTCGGCCCATCAATAAACGATTGGACACGAAGTTGTGAATACGCTAAACACGACCGCAGACAATCCGTCCCTTATTTTTTTCATATTTATTAAATATACAGTACAAGGAGAGAAGAGGATAATGTTGATGAATAGCATGACAATTAATTATTAAATTTCTGAGACCCACGTGACCACCCACGTGCCCTCCGTTCATTGATTGCATGTTACCACAGAGACGCGTGTCCACACTCAATCCTTTTCTCCCTCAAGGAAATCCGACACGTGGCACCATTGAGAACCGACATCCCCTGTAACTTTCATCTAGACCCTTTATTTATGTGTTTTTGTTTGGTATTTAGGCAAAATTATAAAGGCCAAGAATCGATTGAAAAGGGAATTTTGTGTTTGTTTCCCGGGAAAACGAAAAATGGGTGTGGCCGAAGAACCGATTCTCTCGCGCTTGGAGCGCCTCGATAATATGGTGAGTTGGGCTTTGCTTTCTTGTTTGTTTCTCGGGAAAATGACTAAGAAAATGGCCAAGAAAATGGCCAAGAAAATGGCCGTCCCCTTTTTAATTTTACGTAATTATAATTTATTTTATTTTTTGTGTGGGTTAAATAGTTGAGGCGGTTGGAGGAAATTAGAGGGTGCGGGAAGTCGCCTAAGAGCTCGTGTGCATCCACTCCATCAAGTGGGACCCTCACGAGCGATTACCAAACGTCGTCCGTTGATCTTTCCCCCAAAACTCTGGAGAAGCACTGCCGTCCGATCAACCATGTGGTCAAAGTCACTGAACTCAGAGGAAGCCTCGTGGAGCGGATGGACAATATCGAGGATAGAGTCCTCAAGGTACCTTTTTTTTTTCTCTATATATAAATTAAGATTATTTTTTATTTTTTTTATTGATTTTTTTTTTTTTTCATTGAAATTTTCAGCTTTGTTTGCAAATAGAGGGAGAGATAGAGAAGGAAAAAGAGATGATAATGGTCGGGAAGGATAAGAAGCCTAAGAAAAGCTTTAAAGAACGGATTCAATTGTGCATGACTGGACAGGGAATACGCCGTCGCCCATCTTGATTGTAGTATCGTAACCTTTGTTTGATAAAGATTTTGTTTCGAGCTCGAGATAGCTCATGAGCTAGTTCTAGTTTTGTTGAATATGTACATAAAATTTTGATGGTTTTTGAACATGTAAGCTTGTTTGTGACAAAATTTTCGTTTCTATGTGTTTCTTCCCATGGGACGTTGTAACGCCCTAAATCCACTGCTAGCAAATATTGTTCT

mRNA sequence

GGTGGCCTTAAACGCAACACCGAGTCAATCTTCGTTGCTCAATCTTCTCCGCCGCTCAATCTTCCTCCGCATCAACTCCGCCGCACCGACCAGCGCGGAAACGGAGGTTGTCAGGTTCCCGCCGCGCCGGTAACCATATTTCCATGTAAACCTTAACATTCGTTTTGTATAAATCTGCAAATTAAGCATTTAACTTGTTTGCGTTTCTGCAATTGTGTTCGCGAACTGCCAATCACTCTTATCATCGTAGCATTTCTTTGTTTAAATCATTGGTTCTGTAGAGGATGTTTGTTAAAAGGAGTTCTTAGCGGTTGTTCCGCCACTTTTAAAGCCTCTCAAGTTGCTCATGATCAGAAATTTTTGAGGACCGGACTGTGTATGAATTGAGGGATGATGAAGTTTGCTGGATGTTCTATTACAGCTTCTTTGACTCCTATCCAGCCTTTCATTCAGAGAAGCCATCCTAGGCTGTCAATCGGTGTTTTACAGGAGAATTTATATAAACCACAGAAACTTGGAGAAGGTTCAGAAGCAGAAATGGTTGCTAGACGTTATTCCCCTCCTTTGTTTAGTGTTGCTCCTATGATGGATTGGACTGATAATCACTACAGAACTCTTGCTCGTATGATATCAAAGCATGCGTGGCTGTACACCGAGATGCTTGCTGCTGAAACAATTGTTTATCAGAAGGATAATCTTGACAGATTTTTAGCATTTTCTCCAGAGCAACATCCCATTGTCCTGCAAATTGGTGGAAACAATCTTAATAACATTGCAAAAGCTGTTGAACTTGCCAAGCCTTACGGCTACGATGAAATTAATTTAAACTGTGGATGTCCAAGTTCAAAGGTGGCTGGACATGGGTGCTTTGGTGTTCGCCTTATGCTGGATCCTAAGGTTTCTGGCTCCATGTCAGTAATTGCTGCTAACACTGATGTCCCTGTAAGCGTGAAATGTCGAATTGGTGTCGATGACCATGATTCATATAATGAGCTTTGTGATTTTGTTTACAAAGTTTCTTCTTTGTCGCCAACTAGGCATTTCATAATCCATTCCCGCAAGGCGTTACTCAACGGTATCAGCCCAGCTGAAAATCGAGAAATCCCTCCTTTGAAGTATGAATATTTCTATGCACTGATGCGCGACTTTCCAGACTTGAGATTCACAATAAATGGGGGCATTAAGACTGTTGATGAGGCTAATGCTGCTCTGAGACTAGGAGCTCATGGTGTAATGGTGGGCCGATCTGCTTACCAGAACCCATGGCGTACTTTAGGACATGTCGACACTGCAATTTACGGTGCACCTAGCAGTGGTATCACACGACGTCAGGTCCTTGAACAGTATCAAGTATATGGGGATTCTGTTTTGGGTACGTATGGAAATAGACCAAGCATACGAGAGGTTGTGAAGCCGTTGCTTCATCTTTTTTATACAGATCCTGGGAATGGTCCATGGAAGCGTAAAGCCGATGCCGCTTTTCAGCACTGCAAGACCATCAAGTCATTTTTCGATGAAACACTTGTAGCAATTCCAGATTATATATTGGATGCTCCTGTAGCAGAACCTCCATCTGGACGTGAAGATCTTTTTGCCAATACACTTAGTTTGATGCCTCCTCCATATGAAGATAGAGAGCAAAAGGTAATCAAAGAAGTTGCCCTTGTAAACTGGTCATTCATCTTACTTATAAGCTTTTCATGTCTCACTACTCGTTGTTGCCATTTTGTTTTCCAACCATCAAGAGTCTCAAGGTGTTTGATTCCTGAGGATATCAAGAGTCTCAAGTTCATGGAGAAACTGGTTCCTCGACCTCTGCGACAAGCATCGGGAGCGCCAGTGATGGAAGAAACCTCCATACTTGTTCAGATTACTAAACTTTTCGTCATCGGGACTCAATTAGCGATGCCTCAGCTTGACCGGAGGCGGCGGGGGAGGGAAGGAAGGAAGGAAGGAAGAGGCAAAATTATAAAGGCCAAGAATCGATTGAAAAGGGAATT
TTGTGTTTGTTTCCCGGGAAAACGAAAAATGGGTGTGGCCGAAGAACCGATTCTCTCGCGCTTGGAGCGCCTCGATAATATGTTGAGGCGGTTGGAGGAAATTAGAGGGTGCGGGAAGTCGCCTAAGAGCTCGTGTGCATCCACTCCATCAAGTGGGACCCTCACGAGCGATTACCAAACGTCGTCCGTTGATCTTTCCCCCAAAACTCTGGAGAAGCACTGCCGTCCGATCAACCATGTGGTCAAAGTCACTGAACTCAGAGGAAGCCTCGTGGAGCGGATGGACAATATCGAGGATAGAGTCCTCAAGCTTTGTTTGCAAATAGAGGGAGAGATAGAGAAGGAAAAAGAGATGATAATGGTCGGGAAGGATAAGAAGCCTAAGAAAAGCTTTAAAGAACGGATTCAATTGTGCATGACTGGACAGGGAATACGCCGTCGCCCATCTTGATTGTAGTATCGTAACCTTTGTTTGATAAAGATTTTGTTTCGAGCTCGAGATAGCTCATGAGCTAGTTCTAGTTTTGTTGAATATGTACATAAAATTTTGATGGTTTTTGAACATGTAAGCTTGTTTGTGACAAAATTTTCGTTTCTATGTGTTTCTTCCCATGGGACGTTGTAACGCCCTAAATCCACTGCTAGCAAATATTGTTCT

Coding sequence (CDS)

ATGATGAAGTTTGCTGGATGTTCTATTACAGCTTCTTTGACTCCTATCCAGCCTTTCATTCAGAGAAGCCATCCTAGGCTGTCAATCGGTGTTTTACAGGAGAATTTATATAAACCACAGAAACTTGGAGAAGGTTCAGAAGCAGAAATGGTTGCTAGACGTTATTCCCCTCCTTTGTTTAGTGTTGCTCCTATGATGGATTGGACTGATAATCACTACAGAACTCTTGCTCGTATGATATCAAAGCATGCGTGGCTGTACACCGAGATGCTTGCTGCTGAAACAATTGTTTATCAGAAGGATAATCTTGACAGATTTTTAGCATTTTCTCCAGAGCAACATCCCATTGTCCTGCAAATTGGTGGAAACAATCTTAATAACATTGCAAAAGCTGTTGAACTTGCCAAGCCTTACGGCTACGATGAAATTAATTTAAACTGTGGATGTCCAAGTTCAAAGGTGGCTGGACATGGGTGCTTTGGTGTTCGCCTTATGCTGGATCCTAAGGTTTCTGGCTCCATGTCAGTAATTGCTGCTAACACTGATGTCCCTGTAAGCGTGAAATGTCGAATTGGTGTCGATGACCATGATTCATATAATGAGCTTTGTGATTTTGTTTACAAAGTTTCTTCTTTGTCGCCAACTAGGCATTTCATAATCCATTCCCGCAAGGCGTTACTCAACGGTATCAGCCCAGCTGAAAATCGAGAAATCCCTCCTTTGAAGTATGAATATTTCTATGCACTGATGCGCGACTTTCCAGACTTGAGATTCACAATAAATGGGGGCATTAAGACTGTTGATGAGGCTAATGCTGCTCTGAGACTAGGAGCTCATGGTGTAATGGTGGGCCGATCTGCTTACCAGAACCCATGGCGTACTTTAGGACATGTCGACACTGCAATTTACGGTGCACCTAGCAGTGGTATCACACGACGTCAGGTCCTTGAACAGTATCAAGTATATGGGGATTCTGTTTTGGGTACGTATGGAAATAGACCAAGCATACGAGAGGTTGTGAAGCCGTTGCTTCATCTTTTTTATACAGATCCTGGGAATGGTCCATGGAAGCGTAAAGCCGATGCCGCTTTTCAGCACTGCAAGACCATCAAGTCATTTTTCGATGAAACACTTGTAGCAATTCCAGATTATATATTGGATGCTCCTGTAGCAGAACCTCCATCTGGACGTGAAGATCTTTTTGCCAATACACTTAGTTTGATGCCTCCTCCATATGAAGATAGAGAGCAAAAGGTAATCAAAGAAGTTGCCCTTGTAAACTGGTCATTCATCTTACTTATAAGCTTTTCATGTCTCACTACTCGTTGTTGCCATTTTGTTTTCCAACCATCAAGAGTCTCAAGGTGTTTGATTCCTGAGGATATCAAGAGTCTCAAGTTCATGGAGAAACTGGTTCCTCGACCTCTGCGACAAGCATCGGGAGCGCCAGTGATGGAAGAAACCTCCATACTTGTTCAGATTACTAAACTTTTCGTCATCGGGACTCAATTAGCGATGCCTCAGCTTGACCGGAGGCGGCGGGGGAGGGAAGGAAGGAAGGAAGGAAGAGGCAAAATTATAAAGGCCAAGAATCGATTGAAAAGGGAATTTTGTGTTTGTTTCCCGGGAAAACGAAAAATGGGTGTGGCCGAAGAACCGATTCTCTCGCGCTTGGAGCGCCTCGATAATATGTTGAGGCGGTTGGAGGAAATTAGAGGGTGCGGGAAGTCGCCTAAGAGCTCGTGTGCATCCACTCCATCAAGTGGGACCCTCACGAGCGATTACCAAACGTCGTCCGTTGATCTTTCCCCCAAAACTCTGGAGAAGCACTGCCGTCCGATCAACCATGTGGTCAAAGTCACTGAACTCAGAGGAAGCCTCGTGGAGCGGATGGACAATATCGAGGATAGAGTCCTCAAGCTTTGTTTGCAAATAGAGGGAGAGATAGAGAAGGAAAAAGAGATGATAATGGTCGGGAAGGATAAGAAGCCTAAGAAAAG
CTTTAAAGAACGGATTCAATTGTGCATGACTGGACAGGGAATACGCCGTCGCCCATCTTGA

Protein sequence

MMKFAGCSITASLTPIQPFIQRSHPRLSIGVLQENLYKPQKLGEGSEAEMVARRYSPPLFSVAPMMDWTDNHYRTLARMISKHAWLYTEMLAAETIVYQKDNLDRFLAFSPEQHPIVLQIGGNNLNNIAKAVELAKPYGYDEINLNCGCPSSKVAGHGCFGVRLMLDPKVSGSMSVIAANTDVPVSVKCRIGVDDHDSYNELCDFVYKVSSLSPTRHFIIHSRKALLNGISPAENREIPPLKYEYFYALMRDFPDLRFTINGGIKTVDEANAALRLGAHGVMVGRSAYQNPWRTLGHVDTAIYGAPSSGITRRQVLEQYQVYGDSVLGTYGNRPSIREVVKPLLHLFYTDPGNGPWKRKADAAFQHCKTIKSFFDETLVAIPDYILDAPVAEPPSGREDLFANTLSLMPPPYEDREQKVIKEVALVNWSFILLISFSCLTTRCCHFVFQPSRVSRCLIPEDIKSLKFMEKLVPRPLRQASGAPVMEETSILVQITKLFVIGTQLAMPQLDRRRRGREGRKEGRGKIIKAKNRLKREFCVCFPGKRKMGVAEEPILSRLERLDNMLRRLEEIRGCGKSPKSSCASTPSSGTLTSDYQTSSVDLSPKTLEKHCRPINHVVKVTELRGSLVERMDNIEDRVLKLCLQIEGEIEKEKEMIMVGKDKKPKKSFKERIQLCMTGQGIRRRPS
Homology
BLAST of Cp4.1LG02g15750 vs. ExPASy Swiss-Prot
Match: Q7UBC5 (tRNA-dihydrouridine(20/20a) synthase OS=Shigella flexneri OX=623 GN=dusA PE=3 SV=3)

HSP 1 Score: 249.6 bits (636), Expect = 1.0e-64
Identity = 137/302 (45.36%), Positives = 187/302 (61.92%), Query Frame = 0

Query: 60  FSVAPMMDWTDNHYRTLARMISKHAWLYTEMLAAETIVYQKDNLDRFLAFSPEQHPIVLQ 119
           FSVAPM+DWTD H R   R++S++  LYTEM+    I++ K +   +LA+S E+HP+ LQ
Sbjct: 22  FSVAPMLDWTDRHCRYFLRLLSRNTLLYTEMVTTGAIIHGKGD---YLAYSEEEHPVALQ 81

Query: 120 IGGNNLNNIAKAVELAKPYGYDEINLNCGCPSSKVAGHGCFGVRLMLDPK-VSGSMSVIA 179
           +GG++   +A+  +LA+  GYDEINLN GCPS +V  +G FG  LM + + V+  +  + 
Sbjct: 82  LGGSDPAALAQCAKLAEARGYDEINLNVGCPSDRVQ-NGMFGACLMGNAQLVADCVKAMR 141

Query: 180 ANTDVPVSVKCRIGVDDHDSYNELCDFVYKVSSLSPTRHFIIHSRKALLNGISPAENREI 239
               +PV+VK RIG+DD DSY  LCDF+  VS       FIIH+RKA L+G+SP ENREI
Sbjct: 142 DVVSIPVTVKTRIGIDDQDSYEFLCDFINTVSGKGECEMFIIHARKAWLSGLSPKENREI 201

Query: 240 PPLKYEYFYALMRDFPDLRFTINGGIKTVDEANAALRLGAHGVMVGRSAYQNPWRTLGHV 299
           PPL Y   Y L RDFP L  +INGGIK+++EA A L+    GVMVGR AYQNP   L  V
Sbjct: 202 PPLDYPRVYQLKRDFPHLTMSINGGIKSLEEAKAHLQ-HMDGVMVGREAYQNP-GILAAV 261

Query: 300 DTAIYGAPSSGITRRQVLEQYQVYGDSVL--GTYGNRPSIREVVKPLLHLFYTDPGNGPW 359
           D  I+G+  +      V+     Y +  L  GTY     +  +++ +L LF   PG   W
Sbjct: 262 DREIFGSSDTDADPVAVVRAMYPYIEHELSQGTY-----LGHIIRHMLGLFQGIPGARQW 312
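
The percentages in each HSP summary line are the match counts divided by the alignment length. The following is a minimal sketch for recomputing them from the text of a summary line. The helper name `hsp_fractions` is hypothetical, and the pattern accepts either spelling of the positives label, since reports of this kind vary.

```python
import re

# Matches "Identity = 137/302" and the positives field (either spelling).
_PAIR = re.compile(r"(Identity|Postives|Positives)\s*=\s*(\d+)/(\d+)")


def hsp_fractions(line):
    """Return {label: (matches, alignment_length, percent)} for one HSP line."""
    result = {}
    for label, matches, length in _PAIR.findall(line):
        key = "identity" if label == "Identity" else "positives"
        matches, length = int(matches), int(length)
        result[key] = (matches, length, round(100 * matches / length, 2))
    return result


stats = hsp_fractions(
    "Identity = 137/302 (45.36%), Positives = 187/302 (61.92%), Query Frame = 0"
)
print(stats["identity"])   # (137, 302, 45.36)
print(stats["positives"])  # (187, 302, 61.92)
```

For the Q7UBC5 hit, 137 identical positions over a 302-column alignment give 45.36%, which agrees with the reported value.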

BLAST of Cp4.1LG02g15750 vs. ExPASy Swiss-Prot
Match: Q8CWK7 (tRNA-dihydrouridine(20/20a) synthase OS=Vibrio vulnificus (strain CMCP6) OX=216895 GN=dusA PE=3 SV=2)

HSP 1 Score: 249.6 bits (636), Expect = 1.0e-64
Identity = 132/299 (44.15%), Positives = 185/299 (61.87%), Query Frame = 0

Query: 61  SVAPMMDWTDNHYRTLARMISKHAWLYTEMLAAETIVYQKDNLDRFLAFSPEQHPIVLQI 120
           SVAPM+DWTD H R   R+++K   LYTEM+    I++ K +   FLA++ E+HP+ LQ+
Sbjct: 8   SVAPMLDWTDRHCRYFHRLMTKETLLYTEMITTGAIIHGKGD---FLAYNQEEHPVALQL 67

Query: 121 GGNNLNNIAKAVELAKPYGYDEINLNCGCPSSKVAGHGCFGVRLMLDPK-VSGSMSVIAA 180
           GG+N  ++A   +LA   GYDE+NLN GCPS +V  +G FG  LM +P+ V+  ++ +  
Sbjct: 68  GGSNPQDLATCAKLAAERGYDEVNLNVGCPSDRVQ-NGRFGACLMAEPQLVADCVAAMKE 127

Query: 181 NTDVPVSVKCRIGVDDHDSYNELCDFVYKVSSLSPTRHFIIHSRKALLNGISPAENREIP 240
             D+PV+VK RIG+DD DSY  L DFV  VS       F IH+RKA L+G+SP ENREIP
Sbjct: 128 VVDIPVTVKTRIGIDDQDSYEFLTDFVSIVSEKGGCEQFTIHARKAWLSGLSPKENREIP 187

Query: 241 PLKYEYFYALMRDFPDLRFTINGGIKTVDEANAALRLGAHGVMVGRSAYQNPWRTLGHVD 300
           PL Y   Y L +DF  L   INGG+K+++EA   L+    GVM+GR AYQ+P+  L  VD
Sbjct: 188 PLDYPRAYQLKKDFSHLTIAINGGVKSLEEAKEHLQ-HLDGVMIGREAYQSPY-LLASVD 247

Query: 301 TAIYGAPSSGITRRQVLEQYQVYGDSVLGTYGNRPSIREVVKPLLHLFYTDPGNGPWKR 359
             ++G+ S    RRQ++E+   Y +  L    N   +  + + +L LF   PG   W+R
Sbjct: 248 QELFGSQSPIKKRRQIVEEMYPYIEQQL---ANGAYLGHMTRHMLGLFQNMPGARQWRR 297

BLAST of Cp4.1LG02g15750 vs. ExPASy Swiss-Prot
Match: Q8X5V6 (tRNA-dihydrouridine(20/20a) synthase OS=Escherichia coli O157:H7 OX=83334 GN=dusA PE=3 SV=3)

HSP 1 Score: 248.4 bits (633), Expect = 2.3e-64
Identity = 137/302 (45.36%), Positives = 186/302 (61.59%), Query Frame = 0

Query: 60  FSVAPMMDWTDNHYRTLARMISKHAWLYTEMLAAETIVYQKDNLDRFLAFSPEQHPIVLQ 119
           FSVAPM+DWTD H R   R++S++  LYTEM+    I++ K +   +LA+S E+HP+ LQ
Sbjct: 28  FSVAPMLDWTDRHCRYFLRLLSRNTLLYTEMVTTGAIIHGKGD---YLAYSEEEHPVALQ 87

Query: 120 IGGNNLNNIAKAVELAKPYGYDEINLNCGCPSSKVAGHGCFGVRLMLDPK-VSGSMSVIA 179
           +GG++   +A+  +LA+  GYDEINLN GCPS +V  +G FG  LM + + V+  +  + 
Sbjct: 88  LGGSDPAALAQCAKLAEARGYDEINLNVGCPSDRVQ-NGMFGACLMGNAQLVADCVKAMR 147

Query: 180 ANTDVPVSVKCRIGVDDHDSYNELCDFVYKVSSLSPTRHFIIHSRKALLNGISPAENREI 239
               +PV+VK RIG+DD DSY  LCDF+  VS       FIIH+RKA L+G+SP ENREI
Sbjct: 148 DVVSIPVTVKTRIGIDDQDSYEFLCDFINTVSGKGECEMFIIHARKAWLSGLSPKENREI 207

Query: 240 PPLKYEYFYALMRDFPDLRFTINGGIKTVDEANAALRLGAHGVMVGRSAYQNPWRTLGHV 299
           PPL Y   Y L RDFP L  +INGGIK+++EA A L+    GVMVGR AYQNP   L  V
Sbjct: 208 PPLDYPRVYQLKRDFPHLTMSINGGIKSLEEAKAHLQ-HMDGVMVGREAYQNP-GILAAV 267

Query: 300 DTAIYGAPSSGITRRQVLEQYQVYGDSVL--GTYGNRPSIREVVKPLLHLFYTDPGNGPW 359
           D  I+G+  +      V+     Y +  L  GTY     +  + + +L LF   PG   W
Sbjct: 268 DREIFGSSDTDADPVAVVRAMYPYIERELSQGTY-----LGHITRHMLGLFQGIPGARQW 318

BLAST of Cp4.1LG02g15750 vs. ExPASy Swiss-Prot
Match: P32695 (tRNA-dihydrouridine(20/20a) synthase OS=Escherichia coli (strain K12) OX=83333 GN=dusA PE=1 SV=4)

HSP 1 Score: 248.4 bits (633), Expect = 2.3e-64
Identity = 137/302 (45.36%), Positives = 186/302 (61.59%), Query Frame = 0

Query: 60  FSVAPMMDWTDNHYRTLARMISKHAWLYTEMLAAETIVYQKDNLDRFLAFSPEQHPIVLQ 119
           FSVAPM+DWTD H R   R++S++  LYTEM+    I++ K +   +LA+S E+HP+ LQ
Sbjct: 28  FSVAPMLDWTDRHCRYFLRLLSRNTLLYTEMVTTGAIIHGKGD---YLAYSEEEHPVALQ 87

Query: 120 IGGNNLNNIAKAVELAKPYGYDEINLNCGCPSSKVAGHGCFGVRLMLDPK-VSGSMSVIA 179
           +GG++   +A+  +LA+  GYDEINLN GCPS +V  +G FG  LM + + V+  +  + 
Sbjct: 88  LGGSDPAALAQCAKLAEARGYDEINLNVGCPSDRVQ-NGMFGACLMGNAQLVADCVKAMR 147

Query: 180 ANTDVPVSVKCRIGVDDHDSYNELCDFVYKVSSLSPTRHFIIHSRKALLNGISPAENREI 239
               +PV+VK RIG+DD DSY  LCDF+  VS       FIIH+RKA L+G+SP ENREI
Sbjct: 148 DVVSIPVTVKTRIGIDDQDSYEFLCDFINTVSGKGECEMFIIHARKAWLSGLSPKENREI 207

Query: 240 PPLKYEYFYALMRDFPDLRFTINGGIKTVDEANAALRLGAHGVMVGRSAYQNPWRTLGHV 299
           PPL Y   Y L RDFP L  +INGGIK+++EA A L+    GVMVGR AYQNP   L  V
Sbjct: 208 PPLDYPRVYQLKRDFPHLTMSINGGIKSLEEAKAHLQ-HMDGVMVGREAYQNP-GILAAV 267

Query: 300 DTAIYGAPSSGITRRQVLEQYQVYGDSVL--GTYGNRPSIREVVKPLLHLFYTDPGNGPW 359
           D  I+G+  +      V+     Y +  L  GTY     +  + + +L LF   PG   W
Sbjct: 268 DREIFGSSDTDADPVAVVRAMYPYIERELSQGTY-----LGHITRHMLGLFQGIPGARQW 318

BLAST of Cp4.1LG02g15750 vs. ExPASy Swiss-Prot
Match: Q8FB30 (tRNA-dihydrouridine(20/20a) synthase OS=Escherichia coli O6:H1 (strain CFT073 / ATCC 700928 / UPEC) OX=199310 GN=dusA PE=3 SV=1)

HSP 1 Score: 248.1 bits (632), Expect = 3.0e-64
Identity = 136/302 (45.03%), Positives = 186/302 (61.59%), Query Frame = 0

Query: 60  FSVAPMMDWTDNHYRTLARMISKHAWLYTEMLAAETIVYQKDNLDRFLAFSPEQHPIVLQ 119
           FS+APM+DWTD H R   R++S++  LYTEM+    I++ K +   +LA+S E+HP+ LQ
Sbjct: 14  FSIAPMLDWTDRHCRYFLRLLSRNTLLYTEMVTTGAIIHGKGD---YLAYSEEEHPVALQ 73

Query: 120 IGGNNLNNIAKAVELAKPYGYDEINLNCGCPSSKVAGHGCFGVRLMLDPK-VSGSMSVIA 179
           +GG++   +A+  +LA+  GYDEINLN GCPS +V  +G FG  LM + + V+  +  + 
Sbjct: 74  LGGSDPAALAQCAKLAEARGYDEINLNVGCPSDRVQ-NGMFGACLMGNAQLVADCVKAMR 133

Query: 180 ANTDVPVSVKCRIGVDDHDSYNELCDFVYKVSSLSPTRHFIIHSRKALLNGISPAENREI 239
               +PV+VK RIG+DD DSY  LCDF+  VS       FIIH+RKA L+G+SP ENREI
Sbjct: 134 DVVSIPVTVKTRIGIDDQDSYEFLCDFINTVSGKGECEMFIIHARKAWLSGLSPKENREI 193

Query: 240 PPLKYEYFYALMRDFPDLRFTINGGIKTVDEANAALRLGAHGVMVGRSAYQNPWRTLGHV 299
           PPL Y   Y L RDFP L  +INGGIK+++EA A L+    GVMVGR AYQNP   L  V
Sbjct: 194 PPLDYPRVYQLKRDFPHLTMSINGGIKSLEEAKAHLQ-HMDGVMVGREAYQNP-GILAAV 253

Query: 300 DTAIYGAPSSGITRRQVLEQYQVYGDSVL--GTYGNRPSIREVVKPLLHLFYTDPGNGPW 359
           D  I+G+  +      V+     Y +  L  GTY     +  + + +L LF   PG   W
Sbjct: 254 DREIFGSSDTDADPVAVVRAMYPYIERELSQGTY-----LGHITRHMLGLFQGIPGARQW 304

BLAST of Cp4.1LG02g15750 vs. NCBI nr
Match: XP_023525441.1 (uncharacterized protein LOC111789048 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023525442.1 uncharacterized protein LOC111789048 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023525443.1 uncharacterized protein LOC111789048 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023525444.1 uncharacterized protein LOC111789048 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023525445.1 uncharacterized protein LOC111789048 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 853 bits (2205), Expect = 4.80e-308
Identity = 417/423 (98.58%), Positives = 419/423 (99.05%), Query Frame = 0

Query: 1   MMKFAGCSITASLTPIQPFIQRSHPRLSIGVLQENLYKPQKLGEGSEAEMVARRYSPPLF 60
           MMKFAGCSITASLTPIQPFIQRSHPRLSIGVLQENLYKPQKLGEGSEAEMVARRYSPPLF
Sbjct: 1   MMKFAGCSITASLTPIQPFIQRSHPRLSIGVLQENLYKPQKLGEGSEAEMVARRYSPPLF 60

Query: 61  SVAPMMDWTDNHYRTLARMISKHAWLYTEMLAAETIVYQKDNLDRFLAFSPEQHPIVLQI 120
           SVAPMMDWTDNHYRTLARMISKHAWLYTEMLAAETIVYQKDNLDRFLAFSPEQHPIVLQI
Sbjct: 61  SVAPMMDWTDNHYRTLARMISKHAWLYTEMLAAETIVYQKDNLDRFLAFSPEQHPIVLQI 120

Query: 121 GGNNLNNIAKAVELAKPYGYDEINLNCGCPSSKVAGHGCFGVRLMLDPKVSG-SMSVIAA 180
           GGNNLNNIAKAVELAKPYGYDEINLNCGCPSSKVAGHGCFGVRLMLDPK  G +MSVIAA
Sbjct: 121 GGNNLNNIAKAVELAKPYGYDEINLNCGCPSSKVAGHGCFGVRLMLDPKFVGEAMSVIAA 180

Query: 181 NTDVPVSVKCRIGVDDHDSYNELCDFVYKVSSLSPTRHFIIHSRKALLNGISPAENREIP 240
           NTDVPVSVKCRIGVDDHDSYNELCDFVYKVSSLSPTRHFIIHSRKALLNGISPAENREIP
Sbjct: 181 NTDVPVSVKCRIGVDDHDSYNELCDFVYKVSSLSPTRHFIIHSRKALLNGISPAENREIP 240

Query: 241 PLKYEYFYALMRDFPDLRFTINGGIKTVDEANAALRLGAHGVMVGRSAYQNPWRTLGHVD 300
           PLKYEYFYALMRDFPDLRFTINGGIKTVDEANAALRLGAHGVMVGRSAYQNPWRTLGHVD
Sbjct: 241 PLKYEYFYALMRDFPDLRFTINGGIKTVDEANAALRLGAHGVMVGRSAYQNPWRTLGHVD 300

Query: 301 TAIYGAPSSGITRRQVLEQYQVYGDSVLGTYGNRPSIREVVKPLLHLFYTDPGNGPWKRK 360
           TAIYGAPSSGITRRQVLEQYQVYGDSVLGTYGNRPSIREVVKPLLHLFYTDPGNGPWKRK
Sbjct: 301 TAIYGAPSSGITRRQVLEQYQVYGDSVLGTYGNRPSIREVVKPLLHLFYTDPGNGPWKRK 360

Query: 361 ADAAFQHCKTIKSFFDETLVAIPDYILDAPVAEPPSGREDLFANTLSLMPPPYEDREQKV 420
           ADAAFQHCKTIKSFFDETLVAIPDYILDAPVAEPPSGREDLFANTLSLMPPPYEDREQKV
Sbjct: 361 ADAAFQHCKTIKSFFDETLVAIPDYILDAPVAEPPSGREDLFANTLSLMPPPYEDREQKV 420

Query: 421 IKE 422
           + E
Sbjct: 421 LLE 423

BLAST of Cp4.1LG02g15750 vs. NCBI nr
Match: XP_022949368.1 (uncharacterized protein LOC111452746 isoform X1 [Cucurbita moschata] >XP_022949369.1 uncharacterized protein LOC111452746 isoform X1 [Cucurbita moschata] >XP_022949370.1 uncharacterized protein LOC111452746 isoform X1 [Cucurbita moschata] >XP_022949371.1 uncharacterized protein LOC111452746 isoform X1 [Cucurbita moschata] >XP_022949373.1 uncharacterized protein LOC111452746 isoform X1 [Cucurbita moschata])

HSP 1 Score: 848 bits (2192), Expect = 4.57e-306
Identity = 414/423 (97.87%), Positives = 417/423 (98.58%), Query Frame = 0

Query: 1   MMKFAGCSITASLTPIQPFIQRSHPRLSIGVLQENLYKPQKLGEGSEAEMVARRYSPPLF 60
           MMKFAGCSITASLTPIQPFIQRSHPRLSI VLQENLYKPQKLGEGSEAEMVARRYSPPLF
Sbjct: 1   MMKFAGCSITASLTPIQPFIQRSHPRLSISVLQENLYKPQKLGEGSEAEMVARRYSPPLF 60

Query: 61  SVAPMMDWTDNHYRTLARMISKHAWLYTEMLAAETIVYQKDNLDRFLAFSPEQHPIVLQI 120
           SVAPMMDWTDNHYRTLARMISKHAWLYTEMLAAETIVYQKDNLDRFLAFSPEQHPIVLQI
Sbjct: 61  SVAPMMDWTDNHYRTLARMISKHAWLYTEMLAAETIVYQKDNLDRFLAFSPEQHPIVLQI 120

Query: 121 GGNNLNNIAKAVELAKPYGYDEINLNCGCPSSKVAGHGCFGVRLMLDPKVSG-SMSVIAA 180
           GGNNLNNIAKAVELAKPYGYDEINLNCGCPSSKVAGHGCFGVRLMLDPK  G +MSVIAA
Sbjct: 121 GGNNLNNIAKAVELAKPYGYDEINLNCGCPSSKVAGHGCFGVRLMLDPKFVGEAMSVIAA 180

Query: 181 NTDVPVSVKCRIGVDDHDSYNELCDFVYKVSSLSPTRHFIIHSRKALLNGISPAENREIP 240
           NTDVPVSVKCRIGVDDHDSYNELCDFVYKVSS SPTRHFIIHSRKALLNGISPAENREIP
Sbjct: 181 NTDVPVSVKCRIGVDDHDSYNELCDFVYKVSSSSPTRHFIIHSRKALLNGISPAENREIP 240

Query: 241 PLKYEYFYALMRDFPDLRFTINGGIKTVDEANAALRLGAHGVMVGRSAYQNPWRTLGHVD 300
           PLKYEYFYALMRDFPDLRFTINGGIKTVDEANAALRLGAHGVMVGRSAYQNPWRTLGHVD
Sbjct: 241 PLKYEYFYALMRDFPDLRFTINGGIKTVDEANAALRLGAHGVMVGRSAYQNPWRTLGHVD 300

Query: 301 TAIYGAPSSGITRRQVLEQYQVYGDSVLGTYGNRPSIREVVKPLLHLFYTDPGNGPWKRK 360
           TAIYGAPSSGITRRQVLEQYQVYGDSVLGTYGNRPSIREVVKPLLHLFYTDPGNGPWKRK
Sbjct: 301 TAIYGAPSSGITRRQVLEQYQVYGDSVLGTYGNRPSIREVVKPLLHLFYTDPGNGPWKRK 360

Query: 361 ADAAFQHCKTIKSFFDETLVAIPDYILDAPVAEPPSGREDLFANTLSLMPPPYEDREQKV 420
           ADAAFQHCKTIKSFFDETLVAIPDYILDAPVAEPPSGREDLFANTLSLMPPPYEDREQK+
Sbjct: 361 ADAAFQHCKTIKSFFDETLVAIPDYILDAPVAEPPSGREDLFANTLSLMPPPYEDREQKI 420

Query: 421 IKE 422
           + E
Sbjct: 421 LLE 423

BLAST of Cp4.1LG02g15750 vs. NCBI nr
Match: KAG6606881.1 (hypothetical protein SDJN03_00223, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 847 bits (2188), Expect = 1.85e-305
Identity = 413/423 (97.64%), Positives = 416/423 (98.35%), Query Frame = 0

Query: 1   MMKFAGCSITASLTPIQPFIQRSHPRLSIGVLQENLYKPQKLGEGSEAEMVARRYSPPLF 60
           MMKFAGCSITASLTPIQPFIQRSHPRLSI VLQENLYKPQKLGEGSEAEMVARRYSPPLF
Sbjct: 1   MMKFAGCSITASLTPIQPFIQRSHPRLSISVLQENLYKPQKLGEGSEAEMVARRYSPPLF 60

Query: 61  SVAPMMDWTDNHYRTLARMISKHAWLYTEMLAAETIVYQKDNLDRFLAFSPEQHPIVLQI 120
           SVAPMMDWTDNHYRTLARMISKHAWLYTEMLAAETIVYQKDNLDRFLAFSPEQHPIVLQI
Sbjct: 61  SVAPMMDWTDNHYRTLARMISKHAWLYTEMLAAETIVYQKDNLDRFLAFSPEQHPIVLQI 120

Query: 121 GGNNLNNIAKAVELAKPYGYDEINLNCGCPSSKVAGHGCFGVRLMLDPKVSG-SMSVIAA 180
           GGNNLNNIAKAVELAKPYGYDEINLNCGCPSSKVAGHGCFGVRLMLDPK  G +MSVIAA
Sbjct: 121 GGNNLNNIAKAVELAKPYGYDEINLNCGCPSSKVAGHGCFGVRLMLDPKFVGEAMSVIAA 180

Query: 181 NTDVPVSVKCRIGVDDHDSYNELCDFVYKVSSLSPTRHFIIHSRKALLNGISPAENREIP 240
           NTDVPVSVKCRIGVDDHDSYNELCDFVYKVSS SPTRHFIIHSRKALLNGISPAENREIP
Sbjct: 181 NTDVPVSVKCRIGVDDHDSYNELCDFVYKVSSSSPTRHFIIHSRKALLNGISPAENREIP 240

Query: 241 PLKYEYFYALMRDFPDLRFTINGGIKTVDEANAALRLGAHGVMVGRSAYQNPWRTLGHVD 300
           PLKYEYFYALMRDFPDLRFTINGGIKTVDEANAALRLGAHGVMVGRSAYQNPWRTLGHVD
Sbjct: 241 PLKYEYFYALMRDFPDLRFTINGGIKTVDEANAALRLGAHGVMVGRSAYQNPWRTLGHVD 300

Query: 301 TAIYGAPSSGITRRQVLEQYQVYGDSVLGTYGNRPSIREVVKPLLHLFYTDPGNGPWKRK 360
           TAIYGAPSSGITRRQVLEQYQVYGDSVLGTYGNRPSIREVVKPLLHLFYTDPGNGPWKRK
Sbjct: 301 TAIYGAPSSGITRRQVLEQYQVYGDSVLGTYGNRPSIREVVKPLLHLFYTDPGNGPWKRK 360

Query: 361 ADAAFQHCKTIKSFFDETLVAIPDYILDAPVAEPPSGREDLFANTLSLMPPPYEDREQKV 420
           ADAAFQHCKTIKSFFDETLVAIPDYILDAPVAEPPSGREDLFANTL LMPPPYEDREQK+
Sbjct: 361 ADAAFQHCKTIKSFFDETLVAIPDYILDAPVAEPPSGREDLFANTLGLMPPPYEDREQKI 420

Query: 421 IKE 422
           + E
Sbjct: 421 LLE 423

BLAST of Cp4.1LG02g15750 vs. NCBI nr
Match: XP_022998438.1 (uncharacterized protein LOC111493074 isoform X1 [Cucurbita maxima] >XP_022998439.1 uncharacterized protein LOC111493074 isoform X1 [Cucurbita maxima] >XP_022998440.1 uncharacterized protein LOC111493074 isoform X1 [Cucurbita maxima] >XP_022998441.1 uncharacterized protein LOC111493074 isoform X1 [Cucurbita maxima] >XP_022998442.1 uncharacterized protein LOC111493074 isoform X1 [Cucurbita maxima])

HSP 1 Score: 842 bits (2175), Expect = 1.76e-303
Identity = 411/423 (97.16%), Positives = 414/423 (97.87%), Query Frame = 0

Query: 1   MMKFAGCSITASLTPIQPFIQRSHPRLSIGVLQENLYKPQKLGEGSEAEMVARRYSPPLF 60
           MMKFAGCSITASLTPIQPFIQRSHPRLSIGVLQENLYKPQKLGEGSEAEMVARRYSPPLF
Sbjct: 1   MMKFAGCSITASLTPIQPFIQRSHPRLSIGVLQENLYKPQKLGEGSEAEMVARRYSPPLF 60

Query: 61  SVAPMMDWTDNHYRTLARMISKHAWLYTEMLAAETIVYQKDNLDRFLAFSPEQHPIVLQI 120
           SVAPMMDWTDNHYRTLARMISKHAWLYTEMLAAETIVYQKDNLDRFLAFSPEQHPIVLQI
Sbjct: 61  SVAPMMDWTDNHYRTLARMISKHAWLYTEMLAAETIVYQKDNLDRFLAFSPEQHPIVLQI 120

Query: 121 GGNNLNNIAKAVELAKPYGYDEINLNCGCPSSKVAGHGCFGVRLMLDPKVSG-SMSVIAA 180
           GGNNLNNIAKAVELAKPYGYDEINLNCGCPSSKVAGHGCFGVRLMLDPK  G +MSVIAA
Sbjct: 121 GGNNLNNIAKAVELAKPYGYDEINLNCGCPSSKVAGHGCFGVRLMLDPKFVGEAMSVIAA 180

Query: 181 NTDVPVSVKCRIGVDDHDSYNELCDFVYKVSSLSPTRHFIIHSRKALLNGISPAENREIP 240
           NTDVPVSVKCRIGVDDHDSYNELCDFVYKVSSLSPTRHFIIHSRKALLNGISPAENREIP
Sbjct: 181 NTDVPVSVKCRIGVDDHDSYNELCDFVYKVSSLSPTRHFIIHSRKALLNGISPAENREIP 240

Query: 241 PLKYEYFYALMRDFPDLRFTINGGIKTVDEANAALRLGAHGVMVGRSAYQNPWRTLGHVD 300
           PLKYEYFYALMRDFPDLRFTINGGIKTVDEANAALRLGAHGVMVGRSAYQNPWRTLGHVD
Sbjct: 241 PLKYEYFYALMRDFPDLRFTINGGIKTVDEANAALRLGAHGVMVGRSAYQNPWRTLGHVD 300

Query: 301 TAIYGAPSSGITRRQVLEQYQVYGDSVLGTYGNRPSIREVVKPLLHLFYTDPGNGPWKRK 360
           TAIYGAPSSGITRRQVLEQYQVYGDS+LG YGNRPSIREVVKPLLHLFYTDPGNGPWKRK
Sbjct: 301 TAIYGAPSSGITRRQVLEQYQVYGDSILGKYGNRPSIREVVKPLLHLFYTDPGNGPWKRK 360

Query: 361 ADAAFQHCKTIKSFFDETLVAIPDYILDAPVAEPPSGREDLFANTLSLMPPPYEDREQKV 420
           ADAAFQHCKTIKSFFDETLVAIPDYILDAPVAE P GREDLF NTLSLMPPPYED EQKV
Sbjct: 361 ADAAFQHCKTIKSFFDETLVAIPDYILDAPVAESPPGREDLFVNTLSLMPPPYEDTEQKV 420

Query: 421 IKE 422
           + E
Sbjct: 421 LLE 423

BLAST of Cp4.1LG02g15750 vs. NCBI nr
Match: KAG7036588.1 (dusA [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 813 bits (2100), Expect = 5.66e-292
Identity = 402/429 (93.71%), Positives = 407/429 (94.87%), Query Frame = 0

Query: 1   MMKFAGCSITASLTPIQPFIQRSHPRLSIGVLQENLYKPQKLGEGSEAEMVARRYSPPLF 60
           MMKFAGCSITASLTPIQPFIQRSHPRLSI VLQENLYKPQKLGEGSEAEMVARRYSPPLF
Sbjct: 1   MMKFAGCSITASLTPIQPFIQRSHPRLSISVLQENLYKPQKLGEGSEAEMVARRYSPPLF 60

Query: 61  SVAPMMDWTDNHYRTLARMISKHAWLYTEMLAAETIVYQKDNLDRFLAFSPEQHPIVLQI 120
           SVAPMMDWTDNHYRTLARMISKHAWLYTEMLAAETIVYQKDNLDRFLAFSPEQHPIVLQI
Sbjct: 61  SVAPMMDWTDNHYRTLARMISKHAWLYTEMLAAETIVYQKDNLDRFLAFSPEQHPIVLQI 120

Query: 121 GGNNLNNIAKAVELAKPYGYDEINLNCGCPSSKVAGHGCFGVRLMLDPKVSG-SMSVIAA 180
           GGNNLNNIAKAVELAKPYGYDEINLNCGCPSSKVAGHGCFGVRLMLDPK  G +MSVIAA
Sbjct: 121 GGNNLNNIAKAVELAKPYGYDEINLNCGCPSSKVAGHGCFGVRLMLDPKFVGEAMSVIAA 180

Query: 181 NTDVPVSVKCRIGVDDHDSYNELCDFVYKVSSLSPTRHFIIHSRKALLNGISPAENREIP 240
           NTDVPVSVKCRIGVDDHDSYNELCDFVYKVSS SPTRHFIIHSRKALLNGISPAENREIP
Sbjct: 181 NTDVPVSVKCRIGVDDHDSYNELCDFVYKVSSSSPTRHFIIHSRKALLNGISPAENREIP 240

Query: 241 PLKYEYFYALMRDFP------DLRFTINGGIKTVDEANAALRLGAHGVMVGRSAYQNPWR 300
           PLKY +   L R         +LRFTINGGIKTVDEANAALRLGAHGVMVGRSAYQNPW 
Sbjct: 241 PLKYGFLRTLYRPRELVICGSNLRFTINGGIKTVDEANAALRLGAHGVMVGRSAYQNPWH 300

Query: 301 TLGHVDTAIYGAPSSGITRRQVLEQYQVYGDSVLGTYGNRPSIREVVKPLLHLFYTDPGN 360
           TLGHVDTAIYGAPSSGITRRQVLEQYQVYGDSVLGTYGNRPSIREVVKPLLHLFYTDPGN
Sbjct: 301 TLGHVDTAIYGAPSSGITRRQVLEQYQVYGDSVLGTYGNRPSIREVVKPLLHLFYTDPGN 360

Query: 361 GPWKRKADAAFQHCKTIKSFFDETLVAIPDYILDAPVAEPPSGREDLFANTLSLMPPPYE 420
           GPWKRKADAAF HCKTIKSFFDETLVAIPDYILDAPVAEPPSGREDLFANTLSLMPPPYE
Sbjct: 361 GPWKRKADAAFHHCKTIKSFFDETLVAIPDYILDAPVAEPPSGREDLFANTLSLMPPPYE 420

Query: 421 DREQKVIKE 422
           DREQK++ E
Sbjct: 421 DREQKILLE 429

BLAST of Cp4.1LG02g15750 vs. ExPASy TrEMBL
Match: A0A6J1GBU8 (uncharacterized protein LOC111452746 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111452746 PE=4 SV=1)

HSP 1 Score: 848 bits (2192), Expect = 2.21e-306
Identity = 414/423 (97.87%), Positives = 417/423 (98.58%), Query Frame = 0

Query: 1   MMKFAGCSITASLTPIQPFIQRSHPRLSIGVLQENLYKPQKLGEGSEAEMVARRYSPPLF 60
           MMKFAGCSITASLTPIQPFIQRSHPRLSI VLQENLYKPQKLGEGSEAEMVARRYSPPLF
Sbjct: 1   MMKFAGCSITASLTPIQPFIQRSHPRLSISVLQENLYKPQKLGEGSEAEMVARRYSPPLF 60

Query: 61  SVAPMMDWTDNHYRTLARMISKHAWLYTEMLAAETIVYQKDNLDRFLAFSPEQHPIVLQI 120
           SVAPMMDWTDNHYRTLARMISKHAWLYTEMLAAETIVYQKDNLDRFLAFSPEQHPIVLQI
Sbjct: 61  SVAPMMDWTDNHYRTLARMISKHAWLYTEMLAAETIVYQKDNLDRFLAFSPEQHPIVLQI 120

Query: 121 GGNNLNNIAKAVELAKPYGYDEINLNCGCPSSKVAGHGCFGVRLMLDPKVSG-SMSVIAA 180
           GGNNLNNIAKAVELAKPYGYDEINLNCGCPSSKVAGHGCFGVRLMLDPK  G +MSVIAA
Sbjct: 121 GGNNLNNIAKAVELAKPYGYDEINLNCGCPSSKVAGHGCFGVRLMLDPKFVGEAMSVIAA 180

Query: 181 NTDVPVSVKCRIGVDDHDSYNELCDFVYKVSSLSPTRHFIIHSRKALLNGISPAENREIP 240
           NTDVPVSVKCRIGVDDHDSYNELCDFVYKVSS SPTRHFIIHSRKALLNGISPAENREIP
Sbjct: 181 NTDVPVSVKCRIGVDDHDSYNELCDFVYKVSSSSPTRHFIIHSRKALLNGISPAENREIP 240

Query: 241 PLKYEYFYALMRDFPDLRFTINGGIKTVDEANAALRLGAHGVMVGRSAYQNPWRTLGHVD 300
           PLKYEYFYALMRDFPDLRFTINGGIKTVDEANAALRLGAHGVMVGRSAYQNPWRTLGHVD
Sbjct: 241 PLKYEYFYALMRDFPDLRFTINGGIKTVDEANAALRLGAHGVMVGRSAYQNPWRTLGHVD 300

Query: 301 TAIYGAPSSGITRRQVLEQYQVYGDSVLGTYGNRPSIREVVKPLLHLFYTDPGNGPWKRK 360
           TAIYGAPSSGITRRQVLEQYQVYGDSVLGTYGNRPSIREVVKPLLHLFYTDPGNGPWKRK
Sbjct: 301 TAIYGAPSSGITRRQVLEQYQVYGDSVLGTYGNRPSIREVVKPLLHLFYTDPGNGPWKRK 360

Query: 361 ADAAFQHCKTIKSFFDETLVAIPDYILDAPVAEPPSGREDLFANTLSLMPPPYEDREQKV 420
           ADAAFQHCKTIKSFFDETLVAIPDYILDAPVAEPPSGREDLFANTLSLMPPPYEDREQK+
Sbjct: 361 ADAAFQHCKTIKSFFDETLVAIPDYILDAPVAEPPSGREDLFANTLSLMPPPYEDREQKI 420

Query: 421 IKE 422
           + E
Sbjct: 421 LLE 423
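
The percentages on each HSP summary line follow directly from the residue counts printed beside them (for example, 414/423 gives 97.87%). The following is a minimal Python sketch for re-checking them, assuming the summary line keeps the "= count/length (pct%)" layout shown in this report. The helper name `check_summary` is hypothetical and is not part of BLAST or of this database.

```python
import re

# Matches every "= count/length (pct%)" fraction on a summary line,
# regardless of the label that precedes it.
FRACTION = re.compile(r"= (\d+)/(\d+) \(([\d.]+)%\)")


def check_summary(line):
    """Return (count, length, reported_pct, recomputed_pct) per fraction."""
    results = []
    for count, length, reported in FRACTION.findall(line):
        # Recompute the percentage from the raw counts, to two decimals.
        recomputed = round(100.0 * int(count) / int(length), 2)
        results.append((int(count), int(length), float(reported), recomputed))
    return results


print(check_summary("Identity = 414/423 (97.87%)"))
# prints [(414, 423, 97.87, 97.87)]
```

A mismatch between the reported and recomputed values would point to a transcription problem in the report rather than in the alignment itself.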

BLAST of Cp4.1LG02g15750 vs. ExPASy TrEMBL
Match: A0A6J1KEC0 (uncharacterized protein LOC111493074 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111493074 PE=4 SV=1)

HSP 1 Score: 842 bits (2175), Expect = 8.53e-304
Identity = 411/423 (97.16%), Positives = 414/423 (97.87%), Query Frame = 0

Query: 1   MMKFAGCSITASLTPIQPFIQRSHPRLSIGVLQENLYKPQKLGEGSEAEMVARRYSPPLF 60
           MMKFAGCSITASLTPIQPFIQRSHPRLSIGVLQENLYKPQKLGEGSEAEMVARRYSPPLF
Sbjct: 1   MMKFAGCSITASLTPIQPFIQRSHPRLSIGVLQENLYKPQKLGEGSEAEMVARRYSPPLF 60

Query: 61  SVAPMMDWTDNHYRTLARMISKHAWLYTEMLAAETIVYQKDNLDRFLAFSPEQHPIVLQI 120
           SVAPMMDWTDNHYRTLARMISKHAWLYTEMLAAETIVYQKDNLDRFLAFSPEQHPIVLQI
Sbjct: 61  SVAPMMDWTDNHYRTLARMISKHAWLYTEMLAAETIVYQKDNLDRFLAFSPEQHPIVLQI 120

Query: 121 GGNNLNNIAKAVELAKPYGYDEINLNCGCPSSKVAGHGCFGVRLMLDPKVSG-SMSVIAA 180
           GGNNLNNIAKAVELAKPYGYDEINLNCGCPSSKVAGHGCFGVRLMLDPK  G +MSVIAA
Sbjct: 121 GGNNLNNIAKAVELAKPYGYDEINLNCGCPSSKVAGHGCFGVRLMLDPKFVGEAMSVIAA 180

Query: 181 NTDVPVSVKCRIGVDDHDSYNELCDFVYKVSSLSPTRHFIIHSRKALLNGISPAENREIP 240
           NTDVPVSVKCRIGVDDHDSYNELCDFVYKVSSLSPTRHFIIHSRKALLNGISPAENREIP
Sbjct: 181 NTDVPVSVKCRIGVDDHDSYNELCDFVYKVSSLSPTRHFIIHSRKALLNGISPAENREIP 240

Query: 241 PLKYEYFYALMRDFPDLRFTINGGIKTVDEANAALRLGAHGVMVGRSAYQNPWRTLGHVD 300
           PLKYEYFYALMRDFPDLRFTINGGIKTVDEANAALRLGAHGVMVGRSAYQNPWRTLGHVD
Sbjct: 241 PLKYEYFYALMRDFPDLRFTINGGIKTVDEANAALRLGAHGVMVGRSAYQNPWRTLGHVD 300

Query: 301 TAIYGAPSSGITRRQVLEQYQVYGDSVLGTYGNRPSIREVVKPLLHLFYTDPGNGPWKRK 360
           TAIYGAPSSGITRRQVLEQYQVYGDS+LG YGNRPSIREVVKPLLHLFYTDPGNGPWKRK
Sbjct: 301 TAIYGAPSSGITRRQVLEQYQVYGDSILGKYGNRPSIREVVKPLLHLFYTDPGNGPWKRK 360

Query: 361 ADAAFQHCKTIKSFFDETLVAIPDYILDAPVAEPPSGREDLFANTLSLMPPPYEDREQKV 420
           ADAAFQHCKTIKSFFDETLVAIPDYILDAPVAE P GREDLF NTLSLMPPPYED EQKV
Sbjct: 361 ADAAFQHCKTIKSFFDETLVAIPDYILDAPVAESPPGREDLFVNTLSLMPPPYEDTEQKV 420

Query: 421 IKE 422
           + E
Sbjct: 421 LLE 423

BLAST of Cp4.1LG02g15750 vs. ExPASy TrEMBL
Match: A0A6J1DED7 (uncharacterized protein LOC111020285 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111020285 PE=4 SV=1)

HSP 1 Score: 786 bits (2030), Expect = 1.26e-281
Identity = 381/430 (88.60%), Positives = 402/430 (93.49%), Query Frame = 0

Query: 1   MMKFAGCSITASLTPIQPFIQRSHPRLSIGVLQENLYKP-------QKLGEGSEAEMVAR 60
           MMKFAGCSITASLTPIQ  IQ+SH RLS  +LQENL KP       QKL +G EAEMVAR
Sbjct: 1   MMKFAGCSITASLTPIQSVIQKSHCRLSSNILQENLSKPSGWSSSNQKLRQGPEAEMVAR 60

Query: 61  RYSPPLFSVAPMMDWTDNHYRTLARMISKHAWLYTEMLAAETIVYQKDNLDRFLAFSPEQ 120
           RYSPP FSVAPMMDWTDNHYRTLAR+ISKHAWLYTEMLAAETIVYQKDNLDRFLAFSP+Q
Sbjct: 61  RYSPPWFSVAPMMDWTDNHYRTLARLISKHAWLYTEMLAAETIVYQKDNLDRFLAFSPDQ 120

Query: 121 HPIVLQIGGNNLNNIAKAVELAKPYGYDEINLNCGCPSSKVAGHGCFGVRLMLDPKVSG- 180
           HPIVLQ+GGNNLNNIAKA ELA  YGYDEIN NCGCPS+KVAGHGCFGVRLMLDPK  G 
Sbjct: 121 HPIVLQLGGNNLNNIAKATELANAYGYDEINFNCGCPSAKVAGHGCFGVRLMLDPKFVGE 180

Query: 181 SMSVIAANTDVPVSVKCRIGVDDHDSYNELCDFVYKVSSLSPTRHFIIHSRKALLNGISP 240
           +MSVIAANTDVPVSVKCRIGVDDHDSYNELCDF+YKVSSLSPTRHFIIHSRKALLNGISP
Sbjct: 181 AMSVIAANTDVPVSVKCRIGVDDHDSYNELCDFIYKVSSLSPTRHFIIHSRKALLNGISP 240

Query: 241 AENREIPPLKYEYFYALMRDFPDLRFTINGGIKTVDEANAALRLGAHGVMVGRSAYQNPW 300
           AENR IPPLKYEYFYALMRDFPDLRFTINGGI +VDE NAAL+LGAHGVM+GR+AYQNPW
Sbjct: 241 AENRSIPPLKYEYFYALMRDFPDLRFTINGGINSVDEVNAALKLGAHGVMMGRAAYQNPW 300

Query: 301 RTLGHVDTAIYGAPSSGITRRQVLEQYQVYGDSVLGTYGNRPSIREVVKPLLHLFYTDPG 360
           RTLGHVDTAIYGAPSSGITRRQVLEQYQVYGDSVLGTYGN+P+IREVVKPLLHLFY+DPG
Sbjct: 301 RTLGHVDTAIYGAPSSGITRRQVLEQYQVYGDSVLGTYGNKPNIREVVKPLLHLFYSDPG 360

Query: 361 NGPWKRKADAAFQHCKTIKSFFDETLVAIPDYILDAPVAEPPSGREDLFANTLSLMPPPY 420
           NGPWKRKADAAFQHCKTIKSFF+ETLVA+PDYILDAPVAEPP GREDLFANTLSL+PPPY
Sbjct: 361 NGPWKRKADAAFQHCKTIKSFFEETLVAVPDYILDAPVAEPPPGREDLFANTLSLLPPPY 420

Query: 421 EDREQKVIKE 422
           ED+EQKV+ E
Sbjct: 421 EDKEQKVVLE 430

BLAST of Cp4.1LG02g15750 vs. ExPASy TrEMBL
Match: A0A6J1F1C8 (uncharacterized protein LOC111441480 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111441480 PE=4 SV=1)

HSP 1 Score: 781 bits (2017), Expect = 1.23e-279
Identity = 378/429 (88.11%), Positives = 401/429 (93.47%), Query Frame = 0

Query: 1   MMKFAGCSITASLTPIQPFIQRSHPRLSIGVLQENLYKP-------QKLGEGSEAEMVAR 60
           MMKFAGCS++ASL+P+Q FIQ+SHPRLS  +LQ+NLYKP       Q L +  EAEMVA 
Sbjct: 1   MMKFAGCSVSASLSPVQSFIQKSHPRLSSSILQKNLYKPSSWFSGNQILQQSPEAEMVAG 60

Query: 61  RYSPPLFSVAPMMDWTDNHYRTLARMISKHAWLYTEMLAAETIVYQKDNLDRFLAFSPEQ 120
           RYSPP FSVAPMMD TDNHYRTLAR+ISKHAWLY+EMLAAETIVYQKDNLDRFLAFSP+Q
Sbjct: 61  RYSPPWFSVAPMMDCTDNHYRTLARLISKHAWLYSEMLAAETIVYQKDNLDRFLAFSPDQ 120

Query: 121 HPIVLQIGGNNLNNIAKAVELAKPYGYDEINLNCGCPSSKVAGHGCFGVRLMLDPKVSG- 180
           HPIVLQIGGNNLNNIAKAVELAKPYGYDEINLNCGCPS KVAGHGCFG RLMLDPK  G 
Sbjct: 121 HPIVLQIGGNNLNNIAKAVELAKPYGYDEINLNCGCPSPKVAGHGCFGARLMLDPKFVGE 180

Query: 181 SMSVIAANTDVPVSVKCRIGVDDHDSYNELCDFVYKVSSLSPTRHFIIHSRKALLNGISP 240
           +MSVIAANT+ PVSVKCRIGVDDHDSYNELCDF+YKVSSLSPT+HFIIHSRKALLNGISP
Sbjct: 181 AMSVIAANTNAPVSVKCRIGVDDHDSYNELCDFIYKVSSLSPTKHFIIHSRKALLNGISP 240

Query: 241 AENREIPPLKYEYFYALMRDFPDLRFTINGGIKTVDEANAALRLGAHGVMVGRSAYQNPW 300
           AENR IPPLKYEYFYALMRDFPDLRFTINGGI TVDE NAALRLGAHGVM+GR+AYQNPW
Sbjct: 241 AENRNIPPLKYEYFYALMRDFPDLRFTINGGINTVDEVNAALRLGAHGVMMGRAAYQNPW 300

Query: 301 RTLGHVDTAIYGAPSSGITRRQVLEQYQVYGDSVLGTYGNRPSIREVVKPLLHLFYTDPG 360
           RTLGHVDTAIYGAPSSGITRRQVLEQYQVYGDSVLGTYGN+P+IR+VVKPLLHLFYTDPG
Sbjct: 301 RTLGHVDTAIYGAPSSGITRRQVLEQYQVYGDSVLGTYGNKPNIRDVVKPLLHLFYTDPG 360

Query: 361 NGPWKRKADAAFQHCKTIKSFFDETLVAIPDYILDAPVAEPPSGREDLFANTLSLMPPPY 420
           NGPWKRKAD AFQHCKTIKSFF+ETLVAIPDYILDAPVAEPPSGREDLFANTL LMPPPY
Sbjct: 361 NGPWKRKADFAFQHCKTIKSFFEETLVAIPDYILDAPVAEPPSGREDLFANTLRLMPPPY 420

BLAST of Cp4.1LG02g15750 vs. ExPASy TrEMBL
Match: A0A6J1F233 (uncharacterized protein LOC111441480 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111441480 PE=4 SV=1)

HSP 1 Score: 778 bits (2010), Expect = 1.49e-278
Identity = 377/433 (87.07%), Positives = 401/433 (92.61%), Query Frame = 0

Query: 1   MMKFAGCSITASLTPIQPFIQRSHPRLSIGVLQENLYKP-------QKLGEGSEAEMVAR 60
           MMKFAGCS++ASL+P+Q FIQ+SHPRLS  +LQ+NLYKP       Q L +  EAEMVA 
Sbjct: 1   MMKFAGCSVSASLSPVQSFIQKSHPRLSSSILQKNLYKPSSWFSGNQILQQSPEAEMVAG 60

Query: 61  RYSPPLFSVAPMMDWTDNHYRTLARMISKHAWLYTEMLAAETIVYQKDNLDRFLAFSPEQ 120
           RYSPP FSVAPMMD TDNHYRTLAR+ISKHAWLY+EMLAAETIVYQKDNLDRFLAFSP+Q
Sbjct: 61  RYSPPWFSVAPMMDCTDNHYRTLARLISKHAWLYSEMLAAETIVYQKDNLDRFLAFSPDQ 120

Query: 121 HPIVLQIGGNNLNNIAKAVELAKPYGYDEINLNCGCPSSKVAGHGCFGVRLMLDPKVSG- 180
           HPIVLQIGGNNLNNIAKAVELAKPYGYDEINLNCGCPS KVAGHGCFG RLMLDPK  G 
Sbjct: 121 HPIVLQIGGNNLNNIAKAVELAKPYGYDEINLNCGCPSPKVAGHGCFGARLMLDPKFVGE 180

Query: 181 SMSVIAANTDVPVSVKCRIGVDDHDSYNELCDFVYKVSSLSPTRHFIIHSRKALLNGISP 240
           +MSVIAANT+ PVSVKCRIGVDDHDSYNELCDF+YKVSSLSPT+HFIIHSRKALLNGISP
Sbjct: 181 AMSVIAANTNAPVSVKCRIGVDDHDSYNELCDFIYKVSSLSPTKHFIIHSRKALLNGISP 240

Query: 241 AENREIPPLKYEYFYALMRDFPDLRFTINGGIKTVDEANAALRLGAHGVMVGRSAYQNPW 300
           AENR IPPLKYEYFYALMRDFPDLRFTINGGI TVDE NAALRLGAHGVM+GR+AYQNPW
Sbjct: 241 AENRNIPPLKYEYFYALMRDFPDLRFTINGGINTVDEVNAALRLGAHGVMMGRAAYQNPW 300

Query: 301 RTLGHVDTAIYGAPSSGITRRQVLEQYQVYGDSVLGTYGNRPSIREVVKPLLHLFYTDPG 360
           RTLGHVDTAIYGAPSSGITRRQVLEQYQVYGDSVLGTYGN+P+IR+VVKPLLHLFYTDPG
Sbjct: 301 RTLGHVDTAIYGAPSSGITRRQVLEQYQVYGDSVLGTYGNKPNIRDVVKPLLHLFYTDPG 360

Query: 361 NGPWKRKADAAFQHCKTIKSFFDETLVAIPDYILDAPVAEPPSGREDLFANTLSLMPPPY 420
           NGPWKRKAD AFQHCKTIKSFF+ETLVAIPDYILDAPVAEPPSGREDLFANTL LMPPPY
Sbjct: 361 NGPWKRKADFAFQHCKTIKSFFEETLVAIPDYILDAPVAEPPSGREDLFANTLRLMPPPY 420

Query: 421 EDREQKVIKEVAL 425
           EDREQK + +  +
Sbjct: 421 EDREQKSMDDAVI 433

BLAST of Cp4.1LG02g15750 vs. TAIR 10
Match: AT3G63510.1 (FMN-linked oxidoreductases superfamily protein )

HSP 1 Score: 588.2 bits (1515), Expect = 8.6e-168
Identity = 271/392 (69.13%), Positives = 335/392 (85.46%), Query Frame = 0

Query: 32  LQENLYKPQKLGEGSEAEMVARRYSPPLFSVAPMMDWTDNHYRTLARMISKHAWLYTEML 91
           + E  +K   L   +     +  Y PP FSVAPMMDWTDNHYRTLAR+I+KHAWLYTEM+
Sbjct: 26  VSEMFFKSPMLKPTARFSSTSSPYLPPSFSVAPMMDWTDNHYRTLARLITKHAWLYTEMI 85

Query: 92  AAETIVYQKDNLDRFLAFSPEQHPIVLQIGGNNLNNIAKAVELAKPYGYDEINLNCGCPS 151
           AAET+V+Q+ NLDRFLAFSP+QHPIVLQ+GG+N+ N+AKA +L+  YGYDEINLNCGCPS
Sbjct: 86  AAETLVHQQTNLDRFLAFSPQQHPIVLQLGGSNVENLAKAAKLSDAYGYDEINLNCGCPS 145

Query: 152 SKVAGHGCFGVRLMLDPKVSG-SMSVIAANTDVPVSVKCRIGVDDHDSYNELCDFVYKVS 211
            KVAGHGCFGV LML PK+ G +MS IAANT+VPV+VKCRIGVD+HDSY+ELCDF+YKVS
Sbjct: 146 PKVAGHGCFGVSLMLKPKLVGEAMSAIAANTNVPVTVKCRIGVDNHDSYDELCDFIYKVS 205

Query: 212 SLSPTRHFIIHSRKALLNGISPAENREIPPLKYEYFYALMRDFPDLRFTINGGIKTVDEA 271
           +LSPTRHFI+HSRKALL GISPA+NR IPPLKYEY+YAL+RDFPDLRFTINGGI +V + 
Sbjct: 206 TLSPTRHFIVHSRKALLGGISPADNRRIPPLKYEYYYALVRDFPDLRFTINGGITSVSKV 265

Query: 272 NAALRLGAHGVMVGRSAYQNPWRTLGHVDTAIYGAPSSGITRRQVLEQYQVYGDSVLGTY 331
           NAAL+ GAHGVMVGR+AY NPW+TLG VDTA+YG PSSG+TRRQVLEQYQVYGDSVLGT+
Sbjct: 266 NAALKEGAHGVMVGRAAYNNPWQTLGQVDTAVYGVPSSGLTRRQVLEQYQVYGDSVLGTH 325

Query: 332 GN-RPSIREVVKPLLHLFYTDPGNGPWKRKADAAFQHCKTIKSFFDETLVAIPDYILDAP 391
           GN RP++R++VKPLL+LF+++ GN  WKR+ADAAF+ C+++ S  +E+L AIPD +LD+P
Sbjct: 326 GNGRPNVRDLVKPLLNLFHSENGNSLWKRRADAAFKECRSVGSLLEESLRAIPDCVLDSP 385

Query: 392 VA-EPPSGREDLFANTLSLMPPPYEDREQKVI 421
           ++  P SG ED+FA+  +++PPPYE  E+ ++
Sbjct: 386 ISGSPESGDEDVFADVHNVLPPPYEAGEEIIL 417

BLAST of Cp4.1LG02g15750 vs. TAIR 10
Match: AT5G47970.1 (Aldolase-type TIM barrel family protein )

HSP 1 Score: 580.9 bits (1496), Expect = 1.4e-165
Identity = 264/375 (70.40%), Positives = 318/375 (84.80%), Query Frame = 0

Query: 51  VARRYSPPLFSVAPMMDWTDNHYRTLARMISKHAWLYTEMLAAETIVYQKDNLDRFLAFS 110
           V+  YSPPLFS+APMM WTDNHYRTLAR+I+KHAWLYTEMLAAETIVYQ+DNLD FLAFS
Sbjct: 3   VSEAYSPPLFSIAPMMGWTDNHYRTLARLITKHAWLYTEMLAAETIVYQEDNLDSFLAFS 62

Query: 111 PEQHPIVLQIGGNNLNNIAKAVELAKPYGYDEINLNCGCPSSKVAGHGCFGVRLMLDPKV 170
           P+QHPIVLQIGG NL N+AKA  LA  Y YDEIN NCGCPS KV+G GCFG  LMLDPK 
Sbjct: 63  PDQHPIVLQIGGRNLENLAKATRLANAYAYDEINFNCGCPSPKVSGRGCFGALLMLDPKF 122

Query: 171 SG-SMSVIAANTDVPVSVKCRIGVDDHDSYNELCDFVYKVSSLSPTRHFIIHSRKALLNG 230
            G +MSVIAANT+  V+VKCRIGVDDHDSYNELCDF++ VSSLSPT+HFIIHSRKALL+G
Sbjct: 123 VGEAMSVIAANTNAAVTVKCRIGVDDHDSYNELCDFIHIVSSLSPTKHFIIHSRKALLSG 182

Query: 231 ISPAENREIPPLKYEYFYALMRDFPDLRFTINGGIKTVDEANAALRLGAHGVMVGRSAYQ 290
           +SP++NR IPPLKYE+F+AL+RDFPDL+FTINGGI +V EA+AALR GAHGVM+GR+ Y 
Sbjct: 183 LSPSDNRRIPPLKYEFFFALLRDFPDLKFTINGGINSVVEADAALRSGAHGVMLGRAVYY 242

Query: 291 NPWRTLGHVDTAIYGAPSSGITRRQVLEQYQVYGDSVLGTYG-NRPSIREVVKPLLHLFY 350
           NPW  LGHVDT +YG+PSSGITRRQVLE+Y++YG+SVLG YG  RP++R++V+PL++LF+
Sbjct: 243 NPWHILGHVDTVVYGSPSSGITRRQVLEKYKLYGESVLGKYGKGRPNLRDIVRPLINLFH 302

Query: 351 TDPGNGPWKRKADAAFQHCKTIKSFFDETLVAIPDYILDAPVAEPPSGREDLFANTLSLM 410
           ++ GNG WKR+ DAA  HC T++SF DE L AIPDY+LD+   +  +GREDLFA+   L+
Sbjct: 303 SESGNGQWKRRTDAALLHCTTLQSFLDEVLPAIPDYVLDSSAVKEATGREDLFADVQRLL 362

Query: 411 PPPYEDREQKVIKEV 424
           PPPYE    K ++ +
Sbjct: 363 PPPYEKESFKALERI 377

BLAST of Cp4.1LG02g15750 vs. TAIR 10
Match: AT5G47970.2 (Aldolase-type TIM barrel family protein )

HSP 1 Score: 580.9 bits (1496), Expect = 1.4e-165
Identity = 264/375 (70.40%), Positives = 318/375 (84.80%), Query Frame = 0

Query: 51  VARRYSPPLFSVAPMMDWTDNHYRTLARMISKHAWLYTEMLAAETIVYQKDNLDRFLAFS 110
           V+  YSPPLFS+APMM WTDNHYRTLAR+I+KHAWLYTEMLAAETIVYQ+DNLD FLAFS
Sbjct: 3   VSEAYSPPLFSIAPMMGWTDNHYRTLARLITKHAWLYTEMLAAETIVYQEDNLDSFLAFS 62

Query: 111 PEQHPIVLQIGGNNLNNIAKAVELAKPYGYDEINLNCGCPSSKVAGHGCFGVRLMLDPKV 170
           P+QHPIVLQIGG NL N+AKA  LA  Y YDEIN NCGCPS KV+G GCFG  LMLDPK 
Sbjct: 63  PDQHPIVLQIGGRNLENLAKATRLANAYAYDEINFNCGCPSPKVSGRGCFGALLMLDPKF 122

Query: 171 SG-SMSVIAANTDVPVSVKCRIGVDDHDSYNELCDFVYKVSSLSPTRHFIIHSRKALLNG 230
            G +MSVIAANT+  V+VKCRIGVDDHDSYNELCDF++ VSSLSPT+HFIIHSRKALL+G
Sbjct: 123 VGEAMSVIAANTNAAVTVKCRIGVDDHDSYNELCDFIHIVSSLSPTKHFIIHSRKALLSG 182

Query: 231 ISPAENREIPPLKYEYFYALMRDFPDLRFTINGGIKTVDEANAALRLGAHGVMVGRSAYQ 290
           +SP++NR IPPLKYE+F+AL+RDFPDL+FTINGGI +V EA+AALR GAHGVM+GR+ Y 
Sbjct: 183 LSPSDNRRIPPLKYEFFFALLRDFPDLKFTINGGINSVVEADAALRSGAHGVMLGRAVYY 242

Query: 291 NPWRTLGHVDTAIYGAPSSGITRRQVLEQYQVYGDSVLGTYG-NRPSIREVVKPLLHLFY 350
           NPW  LGHVDT +YG+PSSGITRRQVLE+Y++YG+SVLG YG  RP++R++V+PL++LF+
Sbjct: 243 NPWHILGHVDTVVYGSPSSGITRRQVLEKYKLYGESVLGKYGKGRPNLRDIVRPLINLFH 302

Query: 351 TDPGNGPWKRKADAAFQHCKTIKSFFDETLVAIPDYILDAPVAEPPSGREDLFANTLSLM 410
           ++ GNG WKR+ DAA  HC T++SF DE L AIPDY+LD+   +  +GREDLFA+   L+
Sbjct: 303 SESGNGQWKRRTDAALLHCTTLQSFLDEVLPAIPDYVLDSSAVKEATGREDLFADVQRLL 362

Query: 411 PPPYEDREQKVIKEV 424
           PPPYE    K ++ +
Sbjct: 363 PPPYEKESFKALERI 377

BLAST of Cp4.1LG02g15750 vs. TAIR 10
Match: AT3G63510.2 (FMN-linked oxidoreductases superfamily protein )

HSP 1 Score: 558.9 bits (1439), Expect = 5.6e-159
Identity = 259/368 (70.38%), Positives = 322/368 (87.50%), Query Frame = 0

Query: 56  SPPLFSVAPMMDWTDNHYRTLARMISKHAWLYTEMLAAETIVYQKDNLDRFLAFSPEQHP 115
           SPPL     ++  +DNHYRTLAR+I+KHAWLYTEM+AAET+V+Q+ NLDRFLAFSP+QHP
Sbjct: 18  SPPLRLPIFLLP-SDNHYRTLARLITKHAWLYTEMIAAETLVHQQTNLDRFLAFSPQQHP 77

Query: 116 IVLQIGGNNLNNIAKAVELAKPYGYDEINLNCGCPSSKVAGHGCFGVRLMLDPKVSG-SM 175
           IVLQ+GG+N+ N+AKA +L+  YGYDEINLNCGCPS KVAGHGCFGV LML PK+ G +M
Sbjct: 78  IVLQLGGSNVENLAKAAKLSDAYGYDEINLNCGCPSPKVAGHGCFGVSLMLKPKLVGEAM 137

Query: 176 SVIAANTDVPVSVKCRIGVDDHDSYNELCDFVYKVSSLSPTRHFIIHSRKALLNGISPAE 235
           S IAANT+VPV+VKCRIGVD+HDSY+ELCDF+YKVS+LSPTRHFI+HSRKALL GISPA+
Sbjct: 138 SAIAANTNVPVTVKCRIGVDNHDSYDELCDFIYKVSTLSPTRHFIVHSRKALLGGISPAD 197

Query: 236 NREIPPLKYEYFYALMRDFPDLRFTINGGIKTVDEANAALRLGAHGVMVGRSAYQNPWRT 295
           NR IPPLKYEY+YAL+RDFPDLRFTINGGI +V + NAAL+ GAHGVMVGR+AY NPW+T
Sbjct: 198 NRRIPPLKYEYYYALVRDFPDLRFTINGGITSVSKVNAALKEGAHGVMVGRAAYNNPWQT 257

Query: 296 LGHVDTAIYGAPSSGITRRQVLEQYQVYGDSVLGTYGN-RPSIREVVKPLLHLFYTDPGN 355
           LG VDTA+YG PSSG+TRRQVLEQYQVYGDSVLGT+GN RP++R++VKPLL+LF+++ GN
Sbjct: 258 LGQVDTAVYGVPSSGLTRRQVLEQYQVYGDSVLGTHGNGRPNVRDLVKPLLNLFHSENGN 317

Query: 356 GPWKRKADAAFQHCKTIKSFFDETLVAIPDYILDAPVA-EPPSGREDLFANTLSLMPPPY 415
             WKR+ADAAF+ C+++ S  +E+L AIPD +LD+P++  P SG ED+FA+  +++PPPY
Sbjct: 318 SLWKRRADAAFKECRSVGSLLEESLRAIPDCVLDSPISGSPESGDEDVFADVHNVLPPPY 377

Query: 416 EDREQKVI 421
           E  E+ ++
Sbjct: 378 EAGEEIIL 384

BLAST of Cp4.1LG02g15750 vs. TAIR 10
Match: AT1G07985.1 (Expressed protein )

HSP 1 Score: 102.8 bits (255), Expect = 1.1e-21
Identity = 66/124 (53.23%), Positives = 90/124 (72.58%), Query Frame = 0

Query: 547 MGVAEEPILSRLERLDNMLRRLEEIRGCGKSPKSSCASTPSSGTLTSDYQTSSVDL-SPK 606
           M V EEPILSRL+R+D M+R+LEE++  G SP+SS  STPSSGT     Q SS+DL SP+
Sbjct: 1   MAVVEEPILSRLDRIDFMVRKLEEMK--GSSPRSSSPSTPSSGT-----QPSSMDLSSPR 60

Query: 607 TLEK-HCRPINHVVKVTELRGSLVERMDNIEDRVLKLCLQIEGEIEKE-KEMIMVGKDKK 666
           ++ K HCR +  V + TE +G+L+ER++N+E++VLKLC Q E E+E+E K      K+KK
Sbjct: 61  SIGKVHCRSMEQVREETERKGTLLERLNNVEEQVLKLCSQFEEEVEEERKREDKTDKEKK 117

The following BLAST results are available for this feature:
Match Name  E-value  Identity (%)  Description
Q7UBC5      1.0e-64  45.36         tRNA-dihydrouridine(20/20a) synthase OS=Shigella flexneri OX=623 GN=dusA PE=3 SV...
Q8CWK7      1.0e-64  44.15         tRNA-dihydrouridine(20/20a) synthase OS=Vibrio vulnificus (strain CMCP6) OX=2168...
Q8X5V6      2.3e-64  45.36         tRNA-dihydrouridine(20/20a) synthase OS=Escherichia coli O157:H7 OX=83334 GN=dus...
P32695      2.3e-64  45.36         tRNA-dihydrouridine(20/20a) synthase OS=Escherichia coli (strain K12) OX=83333 G...
Q8FB30      3.0e-64  45.03         tRNA-dihydrouridine(20/20a) synthase OS=Escherichia coli O6:H1 (strain CFT073 / ...

Match Name      E-value    Identity (%)  Description
XP_023525441.1  4.80e-308  98.58         uncharacterized protein LOC111789048 isoform X1 [Cucurbita pepo subsp. pepo] >XP...
XP_022949368.1  4.57e-306  97.87         uncharacterized protein LOC111452746 isoform X1 [Cucurbita moschata] >XP_0229493...
KAG6606881.1    1.85e-305  97.64         hypothetical protein SDJN03_00223, partial [Cucurbita argyrosperma subsp. sorori...
XP_022998438.1  1.76e-303  97.16         uncharacterized protein LOC111493074 isoform X1 [Cucurbita maxima] >XP_022998439...
KAG7036588.1    5.66e-292  93.71         dusA [Cucurbita argyrosperma subsp. argyrosperma]

Match Name  E-value    Identity (%)  Description
A0A6J1GBU8  2.21e-306  97.87         uncharacterized protein LOC111452746 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1KEC0  8.53e-304  97.16         uncharacterized protein LOC111493074 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1DED7  1.26e-281  88.60         uncharacterized protein LOC111020285 isoform X1 OS=Momordica charantia OX=3673 G...
A0A6J1F1C8  1.23e-279  88.11         uncharacterized protein LOC111441480 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1F233  1.49e-278  87.07         uncharacterized protein LOC111441480 isoform X1 OS=Cucurbita moschata OX=3662 GN...

Match Name   E-value   Identity (%)  Description
AT3G63510.1  8.6e-168  69.13         FMN-linked oxidoreductases superfamily protein
AT5G47970.1  1.4e-165  70.40         Aldolase-type TIM barrel family protein
AT5G47970.2  1.4e-165  70.40         Aldolase-type TIM barrel family protein
AT3G63510.2  5.6e-159  70.38         FMN-linked oxidoreductases superfamily protein
AT1G07985.1  1.1e-21   53.23         Expressed protein

InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                       Source       Source Term    Source Description                              Alignment
IPR013785  Aldolase-type TIM barrel              GENE3D       3.20.20.70     Aldolase class I                                coord: 57..296; e-value: 2.3E-67; score: 229.0
IPR035587  DUS-like, FMN-binding domain          PFAM         PF01207        Dus                                             coord: 62..371; e-value: 3.6E-57; score: 193.8
IPR035587  DUS-like, FMN-binding domain          CDD          cd02801        DUS_like_FMN                                    coord: 60..292; e-value: 6.34133E-74; score: 236.237
None       No IPR available                      MOBIDB_LITE  mobidb-lite    disorder_prediction                             coord: 578..597
None       No IPR available                      MOBIDB_LITE  mobidb-lite    disorder_prediction                             coord: 574..597
None       No IPR available                      PANTHER      PTHR42907:SF2  TRNA-DIHYDROURIDINE SYNTHASE                    coord: 50..421
None       No IPR available                      SUPERFAMILY  51395          FMN-linked oxidoreductases                      coord: 60..359
IPR004653  tRNA-dihydrouridine(20/20a) synthase  PANTHER      PTHR42907      FMN-LINKED OXIDOREDUCTASES SUPERFAMILY PROTEIN  coord: 50..421

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG02g15750.1  Cp4.1LG02g15750.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0002943 tRNA dihydrouridine synthesis
molecular_function GO:0050660 flavin adenine dinucleotide binding
molecular_function GO:0017150 tRNA dihydrouridine synthase activity
molecular_function GO:0003824 catalytic activity