Cp4.1LG01g04010 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG01g04010
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionProtein of unknown function (DUF1336)
LocationCp4.1LG01 : 1589280 .. 1594692 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCCACGTGTAATGCGTGTGGGTCCCCTATGGGTTTGTGTTCCATTAGATTAATAGATTAATAAATTAATAATAGATTAGTTTAGATTAGATAGATGCGTTCTCACGCTGTCATCTCTTTTTTTTCCCTCTTTCTGCTCTGTTTCTCTCCAATCCTCTGTTCCACATTCCTTGTATATAAACACATTAAATTAAAATAAAAAGATCCAATTTTTTCATTACAGTCTATAGAGAGAGAAGGGAATTGGGATTTTAACAAACAAACGGTAGCTTTAACTTCTCTCCCTCTGCAACCAAGAAAATTCAGTGTTGTTGTTGGTGTCTGGGATGTTGTTTTCTGCTACGTGATTCTTCATCTTCGTCTTCTTCTTCTTGTTTGCCGACATGGGTGCTTGCGTGTCCAAGCCGGAGGGCTGCACCGGAGGCAAATTTAAGAAGCCTTTTAAGAGGAAGAACCGCCGGAGGAGGAGGAAAGCTTCCAAAACCACCGCGTTTTCTGGCCTTTCCGATGGATCGCACCTCTCCGACCCGATTGACCGCTGCTCATTTTCCAATCCCACTTTTCAAGGTTCCTTTTTTTTGTTCTCTCTTTCCGTGTTCTTTTTTTCTTTAGGGAAGAAGGAAAAATGGGGTCTGGGTTTATTTGTTTGTTTTTGGGTGTTGGGTTGAGTTTCTTTTAATATCAGAAATGTTTGATTTTGAATGATTTTTTGATGTGTTTGTGTGAAATGTACGTATTCTTTGATGCTGCGTCATGTTCAAAGCTCCTGCAAGTCATCTGCAGATACCCAGATCTTGAATTCTTCCAGAATGAGCATAGAAGATTCCCCATTTCTTTATTTTTTTATGAAAAAGAATTATAAGTAGACTGGGGTTGACTTGGGTTCTCTATAATTGTGGCATGAACAGGAAGTTCTGATGAGGCCTGGTTTGATACATTTCCAAGATTTGAATCCGATTGTGATGAAGACTACCAGAGCATTCCTGATGGTATAACTCCTTTTTTTTTTTTACTCTCCAACTGTTTGTTATTCAATATGTAATTGTCAATCTCGTGACTTCCTGCAGACATACAGTCGATTAATAGCTTCGAAGGCGTGTCGACGTCGAGTATTTCGTCTTCTGGGGATGCTAATCATGGAGATCATAATGTGAATCAGATTCACAGGCCAGGAAATTCAGCAAGAGTTCATTCTGTGAGGAGCTCTGGAAGTGAAGTTGTAATGAATCCAGATGATGCAGAACATCAACTGAAGGGTCATGGAGGACACTCGAGCGAGGCAAACGAACCGGTGTTCGTCGACGATATCTCGTCGACTGCTGGTGAAAGTTCTGCCAAAGGGGATGGAATATTGGATAATTGTGGGATTTTGCCTAGCAATTGCTTGCCTTGTCTTGCATCAACAATTAACTCTGTTGACAAGAGAAAATCTCTAAGTTCTAGTCCTCCAAGTGGGCTCAAAAAGGCCGCCTTGAAATTGTCTTTCAAATGGAAAGAAGGAAATCCCAATGCTGCTTTATGTGAGTGTTCATCTTTTATGATCTGTCTTACTGATCTTGATCGATTTTGATCGATTTCGTTTCTTTCAGTTTCGTCGAAATCGCTTCTACAAAGACCTATAGCAGGTTCTCAAGTGCCTTTTGTCCCAGCTGACAAGAAAATGCTAGATTGTTGGTCACATATCGAGCCAGATAGTTTCAAAGTTCGGGGACTAAACTATGCAAAGTAAGCGAGCAAACAACTCTTTCTACGAGTTCATCATTGCATCAATGATATTAATCCTACAAATTTGCAGGGACAAGAAGAAGGAATTTGCTCCCAGTCATGCTGCTTACCAACCCTTTGGAGTCGACGTATTCTTGTCGCATCGAAAAGTGGATCATATCGCTCGATTCGTCGAGCTGCCTGCAGCTCATTACACTGGAAAACTCCCACCTATCCTTGTTGTTAATGTTCAGGTAAACTCATCCCTTGCTTCTCCTATGATAAATGTACCTTCTCTGTCATAGCTGCTTCATGTTATTGAATTCTGCAGATTCCATTGTATCCTGCCGCCATTTTTCAAGGCGAAACCGATGGAGAAGGGATGAGTATAGTCTTGTACTTCAAGATATCTGATGGATTTGCTAAAGAACTTACATCTCATTTTCAAGAAAGCATCAGGGTAAGATTCAGCATTTGAAAATTTTCATTCATTTGAAATCTTTTAATAACTGTGTTTACTGGCTCCAAAAACAGAAGCTGATTGATGATGAAGTTGAAAGGGTGAAGGGTTTTCCAGTAGACACAGTAGTTCCATTTAGAGAGAGGTTGAAAATTTTGGGTCGAGTTGCGAACGTCGAGGAGCTTCCGATGAGCGCTGCGGAGCGGAAGCTCATGCAGGCTTACAACGAAAAGCCTGTCCTCTCGCGTCCTCAACATGAATTTTACATGGTAAGTAAAGCCTTCCATTGCATTACAGACTGCTTTTGAATGAAATTGCATATTGATTACTGAATGATTCATGGATTATTACAGGGAGAAAACTACTTGGAAATTGACTTAGACATGCACAGATTTAGTTACATTTCAAGGAAAGGCTTTGAAGCATTCCTTGACAGACTCAAGTGCTGCATTTTGGATGTCGGCCTCACAATTCAGGCATGTTTCTTGTAACGATGTTTTTGTGTCTTTTTTTTAGCTGACTGTCCTGTTTTTCTTACTCTGTTTTGGGTTCTGAAACAGGGGAACAAACCTGAAGAATTGCCAGAGGAGATCTTATGTTGTATTAGACTAAATGGAATTGATTACGTAAATTACCAGCAGTTGGGGATGAGTCAAGAGATTCTGTAAAGTGCATCGCCATTGTATTCATACAAAAAAGTTCACTAAAAAAGCAAATTTTTAATGAGATATGACGTTTTTTTCTTTGAAAAGATCAGTAATTTAACACTGCTTATAGCTAAGATATTTGAATTCATCTTCTTCTCCATCAAAGCTTACATGGGTTTAATGGCTTTCCACACAATCCTCTGTTTCCCATGTAAGCGCTTCGAGGGAAATCGCTCAGCGGCTTCCCCGCTGGAATTTCTCCCACTAACAGATTGTTCGACAGATCCAATTCCTTCAACTTTCGTAATTTCAGAAACCCTTTTGGAATTTTTCCAGTAAAATGGTTTCTTTGCAGCTTCAACGTTTCTAATTGGTTGGCATTCACCATAGCCTCTGGCAAGTCGAACCCCAGATTGTTGGAGCTCAAATCCAGGCTTTGTATTGATTTCAAACCTCCGATTGAGATGGGTAATCTTCCTTTCAGATTGTTATGCGACAAATTCAGAAACTGGATCGCTGGTTGTTGTCCGACATCGACTTGATCAAATTCCCCGGAGAAGTTATTGTCGGAGAGATCAATGTACGTTAAAGAATCATCTGGGAATCCGTGTTCTATCTTGAAGATCTGTCGTATGTCGCCTGTTAACTTGTTCGAATGCAAATCCAGCACTCGCAAGTCCTGTAATTTCGTAATGGAATGTGGAATCTCGTAAACCAGTGAATTCTGTGATAGCTTTAGTGAATAGAGCGCAGTGAGGCTTCCAATCCATTCTGGAATTTTCCCAGTAAGATAGTTCAGGGAAAGATCCAGTTCCTGAATTGGACTTGGAGTTCTTCGCAAGAACTCTGGAATTTCGCCACGGATTCCGCACCCTGCAAGATACATTTTAGAAAGAGATGGCAGTTTACCCAGCCATTGGGGGATCGCGGTCAGGCTCAAACGGTTGAAAGATAGATCAATGGTTTGAATGTTTTGAAGCGACGACATCTCCGGCAACGGCCCTTTAATGAAATTGTGAGAGAGATCCAGGAGAATGAGCTGCGAAAGCTGCCCAAATGATTCCGGTAATTCGCCGGAGAATCTGTTGTAATCAAGATACAACTCCGACAGAGCTGATAACCGACCCAAAGAAGAAGGAATTGTCCCCTCAAGTTTGTTGTTTGAAAGTGAAGCTCGTTTGAGAGAGACAAGCTCCCCGAAACTTGGAGGGATGGTTCCACTAAGCCGGTTACTACTCAATCTAAGAAAAGCAAGAGCCGGCGGGAACAACAATGGCCCCTCCAGATTGTTGTTATCCAAATACAAAACAGAGAGAGCCGTCAATTTTGTTAAGGAAATGGGAATCTTCCCGCTCAAGAAATTGTTCGAAAGATCAAGCTCATTCAACATCTGCAACTCACCAATTCTCTCAGGAATCCGACCAACGAAGGAGTTGCTATGTAAATCCATCCGAACCAACTTCGTAAGGTTCGTAATCGAATCCGGTATCGAACCCGAAAACCGATTTCCATCGAGATGCAAGGAATGCAGAGACATGAGGCTGCCAAGGCTTGCAGGGAGCGAACCCGACAGCCTATTATCATGGACAGCAAGCTCTTCAAGTTTCATAAGCTTACCTATGCTCTCAGGAAGTGGACCTCTCAGCTTGTTGCCATAAAGATAAAGCTTCCTCAGGTTGCGGAGGCGTAAGCCAATGAATTTAGGAATGGTTCCACCAAGGCCAATCAACCCTCCAAGATCAATGATCTCAACAGCATCAAGCAGAGTTATTGAAGAAGAAAGACAGCCCATCATCTCAGATTGAAAGGGGATATCGCCAGTGGATATGAAGCCTGGCAAATTGATCTCCTTGACTCTGCCAGTGGCATTGTGGCAAGAGATGCCGTCCCAGCCGCAACAGTTCCGGCCGACCCACTTGGCAAGCCTGCCGGAAGTGTCGTGGGAAATCGCAGACTTGAAACTGGTTAGACCCTTGAAATCTGCAGGGTGGCAAGCTTGACAGAGTGTGGTAATGGAGATCATAAGAACAACCCATTGTAGCAGAAGCCGATCCATTTTGATTTCGTAATACAGAGAAGTTCAATGGCTCATTATAACATGGCCAAAGGGCACGAACAAAAATGGTATCTCCGTAAACGTTTCGCTTTTGGCTTTTCATCTTTTTTGTCTCTCATCAATTCACTTTATATTAATTAACCCACAGATTTCTCGTTCATGATGATTAAACTAATAATTCCATGGTTTGACTAATCCAATAAACGTCCAAATACTAATTTCTTGCTCTTTCTGAAACAGAGCATTCTGAAACTCCTAGGTAGCTCTGTTTCTGTTTCTTCCATCCCAAGATTTGATAGCCATTGTTGTGTGATTAACAGAACAGGTTGTCTTTTGAGTGTTTCAAGAACGAACAGTTTCAAATCAAAGTGGTGAAGTTTGTATTCTTTTGGAGTAAATGTTGAATGGAAGAATGTGTCAGTGAGCTGCAAAAGACATGATTTTGATTTGCCCCACTTCAACCCACCAACCTGGAAAAAGCTGTTTCAATTCGGTCAACAAAAGAACCCATGTGTTTGAAAATCTGAACGTTACAGAATTTTTTA

mRNA sequence

ATGCCACGTTCTATAGAGAGAGAAGGGAATTGGGATTTTAACAAACAAACGGTAGCTTTAACTTCTCTCCCTCTGCAACCAAGAAAATTCAGTGTTGTTGTTGGTGTCTGGGATGTTGTTTTCTGCTACCCGGAGGGCTGCACCGGAGGCAAATTTAAGAAGCCTTTTAAGAGGAAGAACCGCCGGAGGAGGAGGAAAGCTTCCAAAACCACCGCGTTTTCTGGCCTTTCCGATGGATCGCACCTCTCCGACCCGATTGACCGCTGCTCATTTTCCAATCCCACTTTTCAAGGAAGTTCTGATGAGGCCTGGTTTGATACATTTCCAAGATTTGAATCCGATTGTGATGAAGACTACCAGAGCATTCCTGATGACATACAGTCGATTAATAGCTTCGAAGGCGTGTCGACGTCGAGTATTTCGTCTTCTGGGGATGCTAATCATGGAGATCATAATGTGAATCAGATTCACAGGCCAGGAAATTCAGCAAGAGTTCATTCTGTGAGGAGCTCTGGAAGTGAAGTTGTAATGAATCCAGATGATGCAGAACATCAACTGAAGGGTCATGGAGGACACTCGAGCGAGGCAAACGAACCGGTGTTCGTCGACGATATCTCGTCGACTGCTGGTGAAAGTTCTGCCAAAGGGGATGGAATATTGGATAATTGTGGGATTTTGCCTAGCAATTGCTTGCCTTGTCTTGCATCAACAATTAACTCTGTTGACAAGAGAAAATCTCTAAGTTCTAGTCCTCCAAGTGGGCTCAAAAAGGCCGCCTTGAAATTGTCTTTCAAATGGAAAGAAGGAAATCCCAATGCTGCTTTATTTTCGTCGAAATCGCTTCTACAAAGACCTATAGCAGGTTCTCAAGTGCCTTTTGTCCCAGCTGACAAGAAAATGCTAGATTGTTGGTCACATATCGAGCCAGATAGTTTCAAAGTTCGGGGACTAAACTATGCAAAGGACAAGAAGAAGGAATTTGCTCCCAGTCATGCTGCTTACCAACCCTTTGGAGTCGACGTATTCTTGTCGCATCGAAAAGTGGATCATATCGCTCGATTCGTCGAGCTGCCTGCAGCTCATTACACTGGAAAACTCCCACCTATCCTTGTTGTTAATGTTCAGATTCCATTGTATCCTGCCGCCATTTTTCAAGGCGAAACCGATGGAGAAGGGATGAGTATAGTCTTGTACTTCAAGATATCTGATGGATTTGCTAAAGAACTTACATCTCATTTTCAAGAAAGCATCAGGAAGCTGATTGATGATGAAGTTGAAAGGGTGAAGGGTTTTCCAGTAGACACAGTAGTTCCATTTAGAGAGAGGTTGAAAATTTTGGGTCGAGTTGCGAACGTCGAGGAGCTTCCGATGAGCGCTGCGGAGCGGAAGCTCATGCAGGCTTACAACGAAAAGCCTGTCCTCTCGCGTCCTCAACATGAATTTTACATGGGAGAAAACTACTTGGAAATTGACTTAGACATGCACAGATTTAGTTACATTTCAAGGAAAGGCTTTGAAGCATTCCTTGACAGACTCAAGTGCTGCATTTTGGATGGGAACAAACCTGAAGAATTGCCAGAGGAGATCTTATGTTGTATTAGACTAAATGGAATTGATTACGTAAATTACCAGCAGTTGGGGATGAGTCAAGAGATTCTGTAAAGTGCATCGCCATTGTATTCATACAAAAAAGTTCACTAAAAAAGCAAATTTTTAATGAGATATGACGTTTTTTTCTTTGAAAAGATCAGTAATTTAACACTGCTTATAGCTAAGATATTTGAATTCATCTTCTTCTCCATCAAAGCTTACATGGGTTTAATGGCTTTCCACACAATCCTCTGTTTCCCATGTAAGCGCTTCGAGGGAAATCGCTCAGCGGCTTCCCCGCTGGAATTTCTCCCACTAACAGATTGTTCGACAGATCCAATTCCTTCAACTTTCGTAATTTCAGAAACCCTTTTGGAATTTTTCCAGTAAAATGGTTTCTTTGCAGCTTCAACGTTTCTAATTGGTTGGCATTCACCATAGCCTCTGGCAAGTCGAACCCCAGATTGTTGGAGCTCAAATCCAGGCTTTGTATTGATTTCAAACCTCCGATTGAGATGGGTAATCTTCCTTTCAGATTGTTATGCGACAAATTCAGAAACTGGATCGCTGGTTGTTGTCCGACATCGACTTGATCAAATTCCCCGGAGAAGTTATTGTCGGAGAGATCAATGTACGTTAAAGAATCATCTGGGAATCCGTGTTCTATCTTGAAGATCTGTCGTATGTCGCCTGTTAACTTGTTCGAATGCAAATCCAGCACTCGCAAGTCCTGTAATTTCGTAATGGAATGTGGAATCTCGTAAACCAGTGAATTCTGTGATAGCTTTAGTGAATAGAGCGCAGTGAGGCTTCCAATCCATTCTGGAATTTTCCCAGTAAGATAGTTCAGGGAAAGATCCAGTTCCTGAATTGGACTTGGAGTTCTTCGCAAGAACTCTGGAATTTCGCCACGGATTCCGCACCCTGCAAGATACATTTTAGAAAGAGATGGCAGTTTACCCAGCCATTGGGGGATCGCGGTCAGGCTCAAACGGTTGAAAGATAGATCAATGGTTTGAATGTTTTGAAGCGACGACATCTCCGGCAACGGCCCTTTAATGAAATTGTGAGAGAGATCCAGGAGAATGAGCTGCGAAAGCTGCCCAAATGATTCCGGTAATTCGCCGGAGAATCTGTTGTAATCAAGATACAACTCCGACAGAGCTGATAACCGACCCAAAGAAGAAGGAATTGTCCCCTCAAGTTTGTTGTTTGAAAGTGAAGCTCGTTTGAGAGAGACAAGCTCCCCGAAACTTGGAGGGATGGTTCCACTAAGCCGGTTACTACTCAATCTAAGAAAAGCAAGAGCCGGCGGGAACAACAATGGCCCCTCCAGATTGTTGTTATCCAAATACAAAACAGAGAGAGCCGTCAATTTTGTTAAGGAAATGGGAATCTTCCCGCTCAAGAAATTGTTCGAAAGATCAAGCTCATTCAACATCTGCAACTCACCAATTCTCTCAGGAATCCGACCAACGAAGGAGTTGCTATGTAAATCCATCCGAACCAACTTCGTAAGGTTCGTAATCGAATCCGGTATCGAACCCGAAAACCGATTTCCATCGAGATGCAAGGAATGCAGAGACATGAGGCTGCCAAGGCTTGCAGGGAGCGAACCCGACAGCCTATTATCATGGACAGCAAGCTCTTCAAGTTTCATAAGCTTACCTATGCTCTCAGGAAGTGGACCTCTCAGCTTGTTGCCATAAAGATAAAGCTTCCTCAGGTTGCGGAGGCGTAAGCCAATGAATTTAGGAATGGTTCCACCAAGGCCAATCAACCCTCCAAGATCAATGATCTCAACAGCATCAAGCAGAGTTATTGAAGAAGAAAGACAGCCCATCATCTCAGATTGAAAGGGGATATCGCCAGTGGATATGAAGCCTGGCAAATTGATCTCCTTGACTCTGCCAGTGGCATTGTGGCAAGAGATGCCGTCCCAGCCGCAACAGTTCCGGCCGACCCACTTGGCAAGCCTGCCGGAAGTGTCGTGGGAAATCGCAGACTTGAAACTGGTTAGACCCTTGAAATCTGCAGGGTGGCAAGCTTGACAGAGTGTGGTAATGGAGATCATAAGAACAACCCATTGTAGCAGAAGCCGATCCATTTTGATTTCGTAATACAGAGAAGTTCAATGGCTCATTATAACATGGCCAAAGGGCACGAACAAAAATGGTATCTCCGTAAACGTTTCGCTTTTGGCTTTTCATCTTTTTTGTCTCTCATCAATTCACTTTATATTAATTAACCCACAGATTTCTCGTTCATGATGATTAAACTAATAATTCCATGGTTTGACTAATCCAATAAACGTCCAAATACTAATTTCTTGCTCTTTCTGAAACAGAGCATTCTGAAACTCCTAGGTAGCTCTGTTTCTGTTTCTTCCATCCCAAGATTTGATAGCCATTGTTGTGTGATTAACAGAACAGGTTGTCTTTTGAGTGTTTCAAGAACGAACAGTTTCAAATCAAAGTGGTGAAGTTTGTATTCTTTTGGAGTAAATGTTGAATGGAAGAATGTGTCAGTGAGCTGCAAAAGACATGATTTTGATTTGCCCCACTTCAACCCACCAACCTGGAAAAAGCTGTTTCAATTCGGTCAACAAAAGAACCCATGTGTTTGAAAATCTGAACGTTACAGAATTTTTTA

Coding sequence (CDS)

ATGCCACGTTCTATAGAGAGAGAAGGGAATTGGGATTTTAACAAACAAACGGTAGCTTTAACTTCTCTCCCTCTGCAACCAAGAAAATTCAGTGTTGTTGTTGGTGTCTGGGATGTTGTTTTCTGCTACCCGGAGGGCTGCACCGGAGGCAAATTTAAGAAGCCTTTTAAGAGGAAGAACCGCCGGAGGAGGAGGAAAGCTTCCAAAACCACCGCGTTTTCTGGCCTTTCCGATGGATCGCACCTCTCCGACCCGATTGACCGCTGCTCATTTTCCAATCCCACTTTTCAAGGAAGTTCTGATGAGGCCTGGTTTGATACATTTCCAAGATTTGAATCCGATTGTGATGAAGACTACCAGAGCATTCCTGATGACATACAGTCGATTAATAGCTTCGAAGGCGTGTCGACGTCGAGTATTTCGTCTTCTGGGGATGCTAATCATGGAGATCATAATGTGAATCAGATTCACAGGCCAGGAAATTCAGCAAGAGTTCATTCTGTGAGGAGCTCTGGAAGTGAAGTTGTAATGAATCCAGATGATGCAGAACATCAACTGAAGGGTCATGGAGGACACTCGAGCGAGGCAAACGAACCGGTGTTCGTCGACGATATCTCGTCGACTGCTGGTGAAAGTTCTGCCAAAGGGGATGGAATATTGGATAATTGTGGGATTTTGCCTAGCAATTGCTTGCCTTGTCTTGCATCAACAATTAACTCTGTTGACAAGAGAAAATCTCTAAGTTCTAGTCCTCCAAGTGGGCTCAAAAAGGCCGCCTTGAAATTGTCTTTCAAATGGAAAGAAGGAAATCCCAATGCTGCTTTATTTTCGTCGAAATCGCTTCTACAAAGACCTATAGCAGGTTCTCAAGTGCCTTTTGTCCCAGCTGACAAGAAAATGCTAGATTGTTGGTCACATATCGAGCCAGATAGTTTCAAAGTTCGGGGACTAAACTATGCAAAGGACAAGAAGAAGGAATTTGCTCCCAGTCATGCTGCTTACCAACCCTTTGGAGTCGACGTATTCTTGTCGCATCGAAAAGTGGATCATATCGCTCGATTCGTCGAGCTGCCTGCAGCTCATTACACTGGAAAACTCCCACCTATCCTTGTTGTTAATGTTCAGATTCCATTGTATCCTGCCGCCATTTTTCAAGGCGAAACCGATGGAGAAGGGATGAGTATAGTCTTGTACTTCAAGATATCTGATGGATTTGCTAAAGAACTTACATCTCATTTTCAAGAAAGCATCAGGAAGCTGATTGATGATGAAGTTGAAAGGGTGAAGGGTTTTCCAGTAGACACAGTAGTTCCATTTAGAGAGAGGTTGAAAATTTTGGGTCGAGTTGCGAACGTCGAGGAGCTTCCGATGAGCGCTGCGGAGCGGAAGCTCATGCAGGCTTACAACGAAAAGCCTGTCCTCTCGCGTCCTCAACATGAATTTTACATGGGAGAAAACTACTTGGAAATTGACTTAGACATGCACAGATTTAGTTACATTTCAAGGAAAGGCTTTGAAGCATTCCTTGACAGACTCAAGTGCTGCATTTTGGATGGGAACAAACCTGAAGAATTGCCAGAGGAGATCTTATGTTGTATTAGACTAAATGGAATTGATTACGTAAATTACCAGCAGTTGGGGATGAGTCAAGAGATTCTGTAA

Protein sequence

MPRSIEREGNWDFNKQTVALTSLPLQPRKFSVVVGVWDVVFCYPEGCTGGKFKKPFKRKNRRRRRKASKTTAFSGLSDGSHLSDPIDRCSFSNPTFQGSSDEAWFDTFPRFESDCDEDYQSIPDDIQSINSFEGVSTSSISSSGDANHGDHNVNQIHRPGNSARVHSVRSSGSEVVMNPDDAEHQLKGHGGHSSEANEPVFVDDISSTAGESSAKGDGILDNCGILPSNCLPCLASTINSVDKRKSLSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKSLLQRPIAGSQVPFVPADKKMLDCWSHIEPDSFKVRGLNYAKDKKKEFAPSHAAYQPFGVDVFLSHRKVDHIARFVELPAAHYTGKLPPILVVNVQIPLYPAAIFQGETDGEGMSIVLYFKISDGFAKELTSHFQESIRKLIDDEVERVKGFPVDTVVPFRERLKILGRVANVEELPMSAAERKLMQAYNEKPVLSRPQHEFYMGENYLEIDLDMHRFSYISRKGFEAFLDRLKCCILDGNKPEELPEEILCCIRLNGIDYVNYQQLGMSQEIL
BLAST of Cp4.1LG01g04010 vs. TrEMBL
Match: A0A0A0KVS2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G604060 PE=4 SV=1)

HSP 1 Score: 892.1 bits (2304), Expect = 3.4e-256
Identity = 454/530 (85.66%), Postives = 478/530 (90.19%), Query Frame = 1

Query: 44  PEGCTGGKFKKPFKRKNRRRRRKASKTTAFSGLSDGSHLSDPIDRCSFSNPTFQGSSDEA 103
           P+GC GGKFKK  KRKNRRRRRK SKT AFS LS+GSH SDPID CSFSNPTFQGS DEA
Sbjct: 8   PQGCVGGKFKKSSKRKNRRRRRKGSKTIAFSALSEGSHRSDPIDHCSFSNPTFQGSYDEA 67

Query: 104 WFDTFPRFESDCDEDYQSIPDDIQSINSFEGVSTSSISSSGDANHGDHNVN-------QI 163
           WFDT  +FESDCDEDYQS+PDD QSINS E  STSSISSSGDANHGDHNVN       QI
Sbjct: 68  WFDTVGKFESDCDEDYQSLPDDNQSINSLEAASTSSISSSGDANHGDHNVNRHSATSDQI 127

Query: 164 HRPGNSARVHSVRSSGSEVV-------MNPDDAEHQLKGHGGHSSEANEPVFVDDISSTA 223
           HRPGNSARVHSV SS S+V        +NPDDAE QLKG G HSSEANEPVF+D+ISSTA
Sbjct: 128 HRPGNSARVHSVSSSESQVARDSHLQAINPDDAEPQLKGCG-HSSEANEPVFIDEISSTA 187

Query: 224 GESSAKGDGILDNCGILPSNCLPCLASTINSVDKRKSLSSSPPSGLKKAALKLSFKWKEG 283
           GESSAKGDGILDNCGILPSNCLPCLASTINSV+KRKSLSSSPPSGLKKAALKLSFKWKEG
Sbjct: 188 GESSAKGDGILDNCGILPSNCLPCLASTINSVEKRKSLSSSPPSGLKKAALKLSFKWKEG 247

Query: 284 NPNAALFSSKSLLQRPIAGSQVPFVPADKKMLDCWSHIEPDSFKVRGLNYAKDKKKEFAP 343
           NPNAALFSSK+LLQRPIAGSQVPF PA+KKMLDCWSHIEPDSFKVRG+NYAKDKKKEFAP
Sbjct: 248 NPNAALFSSKALLQRPIAGSQVPFCPAEKKMLDCWSHIEPDSFKVRGVNYAKDKKKEFAP 307

Query: 344 SHAAYQPFGVDVFLSHRKVDHIARFVELPAAHYTGKLPPILVVNVQIPLYPAAIFQGETD 403
           +H AY PFGVDVFLSHRKVDHIARFVE+PAA  +G LPPILVVNVQIPLY AAIFQGETD
Sbjct: 308 NHTAYYPFGVDVFLSHRKVDHIARFVEMPAATSSGTLPPILVVNVQIPLYSAAIFQGETD 367

Query: 404 GEGMSIVLYFKISDGFAKELTSHFQESIRKLIDDEVERVKGFPVDTVVPFRERLKILGRV 463
           GEGMSIVLYFK+SD +A++LTSHFQE+I+KLIDDEVERVKGFPVD VVPFRERLKILGRV
Sbjct: 368 GEGMSIVLYFKLSDAYAEKLTSHFQENIKKLIDDEVERVKGFPVDNVVPFRERLKILGRV 427

Query: 464 ANVEELPMSAAERKLMQAYNEKPVLSRPQHEFYMGENYLEIDLDMHRFSYISRKGFEAFL 523
           ANVE+LPMSAAERKLMQAYNEKPVLSRPQHEFY+GENYLEIDLDMHRFSYISRKGFEAFL
Sbjct: 428 ANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFYLGENYLEIDLDMHRFSYISRKGFEAFL 487

Query: 524 DRLKCCILD------GNKPEELPEEILCCIRLNGIDYVNYQQLGMSQEIL 554
           DRLKCCILD      GN+PEELPEEILCCIRLNGIDYVNYQQLGM  EIL
Sbjct: 488 DRLKCCILDVGLTIQGNRPEELPEEILCCIRLNGIDYVNYQQLGMGLEIL 536

BLAST of Cp4.1LG01g04010 vs. TrEMBL
Match: A0A061E7C7_THECC (CW14 protein isoform 1 OS=Theobroma cacao GN=TCM_010075 PE=4 SV=1)

HSP 1 Score: 677.2 bits (1746), Expect = 1.7e-191
Identity = 354/530 (66.79%), Postives = 416/530 (78.49%), Query Frame = 1

Query: 44  PEGCTGGKFKKPFKRKNRRRRRKASKTTAFSGLSDGSHLSDPIDR-------CSFSNPTF 103
           PEGC   K +   K+KNR+RR+   K    S LS+ S  SD +DR        SF+NPTF
Sbjct: 8   PEGCVSPKLRSS-KKKNRKRRKSCLKKRVSSRLSEVS--SDKVDRPAPPDHHSSFTNPTF 67

Query: 104 QGSSDEAWFDTFPRFESDCDEDYQSIPDDIQSINSFEGVSTSSISSSGDANHGDHN--VN 163
           QGS DE WFD    F+SDCDE+++S+ +D+ S+N  EGVS SSISS  DAN G+H+  V+
Sbjct: 68  QGSIDE-WFDPVAVFDSDCDEEFESVQEDVLSLNGLEGVSISSISSLKDANCGEHSSLVD 127

Query: 164 QIHRPGNSARVHSVRSSGSEV-------VMNPDDAEHQLKGHGGHSSEANEPVFVDDISS 223
           Q+ +PG+ +  +S  +S  EV       V+N +D   Q K  G  S++A +PVF+DDI+S
Sbjct: 128 QMQKPGDLSAGNSACNSVGEVTRNSNSQVLNSEDVNSQSKSDGP-SNKAKQPVFLDDIAS 187

Query: 224 TAGESSAKGDGILDNCGILPSNCLPCLASTINSVDKRKSLSSSPPSGLKKAALKLSFKWK 283
           +  E S K +G+LDNCGILPSNCLPCLAST+ S++KR+SLSSSPPS  KK ALKL FKW+
Sbjct: 188 SVDEGSGKEEGLLDNCGILPSNCLPCLASTVPSIEKRRSLSSSPPSARKKNALKLPFKWR 247

Query: 284 EGNPNAALFSSKSLLQRPIAGSQVPFVPADKKMLDCWSHIEPDSFKVRGLNYAKDKKKEF 343
           EG+PNA LFSSK LLQRP AGSQVP  P +KKM DCWSHIEP +FKVRG NY +DKKK+F
Sbjct: 248 EGHPNATLFSSKMLLQRPKAGSQVPVCPIEKKMFDCWSHIEPGTFKVRGENYFRDKKKDF 307

Query: 344 APSHAAYQPFGVDVFLSHRKVDHIARFVELPAAHYTGKLPPILVVNVQIPLYPAAIFQGE 403
           AP+HAAY PFGVDVFLS RK+DHIARFVELP    +GKLP ILVVNVQIPLYPAA+FQ E
Sbjct: 308 APNHAAYYPFGVDVFLSPRKIDHIARFVELPVVSQSGKLPSILVVNVQIPLYPAALFQSE 367

Query: 404 TDGEGMSIVLYFKISDGFAKELTSHFQESIRKLIDDEVERVKGFPVDTVVPFRERLKILG 463
           TDGEGMS VLYFK+SD + KEL  HFQE+IR+LI DEVE+VKGFPVDT+VPFRERLKILG
Sbjct: 368 TDGEGMSFVLYFKLSDSYLKELPPHFQENIRRLIVDEVEKVKGFPVDTIVPFRERLKILG 427

Query: 464 RVANVEELPMSAAERKLMQAYNEKPVLSRPQHEFYMGENYLEIDLDMHRFSYISRKGFEA 523
           RVANVE+L MSAAERKLM AYNEKP LSRPQHEFY+GENY EID+DMHRFSYISRKGF+A
Sbjct: 428 RVANVEDLHMSAAERKLMHAYNEKPFLSRPQHEFYLGENYFEIDIDMHRFSYISRKGFDA 487

Query: 524 FLDRLKCCILD------GNKPEELPEEILCCIRLNGIDYVNYQQLGMSQE 552
           FLDRLK CILD      GNKPEELPE+ILCCIRL+GIDY+NY QLG+SQE
Sbjct: 488 FLDRLKLCILDVGLTIQGNKPEELPEQILCCIRLSGIDYMNYHQLGLSQE 532

BLAST of Cp4.1LG01g04010 vs. TrEMBL
Match: A0A067JET6_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_21729 PE=4 SV=1)

HSP 1 Score: 673.3 bits (1736), Expect = 2.5e-190
Identity = 350/546 (64.10%), Postives = 419/546 (76.74%), Query Frame = 1

Query: 44  PEGCTGGKFKKPFKRKNRRRRRKASKTTAFSGLSDGSHLSDPIDR--------------- 103
           PEGC GG+ +    +K  R++RK  +    S LSDGS  ++  DR               
Sbjct: 8   PEGCVGGRLRS---KKKTRKKRKGIRRRVSSRLSDGSLDNNKFDRPLSSVSAAAVPPDHR 67

Query: 104 CSFSNPTFQGSSDEAWFDTFPRFESDCDEDYQSIPDDIQSINSFEGVSTSSISSSGDANH 163
            SFSN TFQGS +EAWFD+ P FESDC+ED++S+PDD+ S+N  EG+  SSI+ S DA H
Sbjct: 68  SSFSNTTFQGSIEEAWFDSVPIFESDCEEDFESVPDDVLSLNGSEGLPPSSIAFSRDAKH 127

Query: 164 GDHNV--------NQIHRPGNSARVHSVRSSGSEV-------VMNPDDAEHQLKGHGGHS 223
           GDH +        + + + G+S+  +S R+S SE        V N D A+   K  G   
Sbjct: 128 GDHTIGFQYTSSGDHMKKAGDSSAGNSARNSVSEAARHPNNQVFNSDYADSLPKSEG--- 187

Query: 224 SEANEPVFVDDISSTAGESSAKGDGILDNCGILPSNCLPCLASTINSVDKRKSLSSSPPS 283
              ++PVF+D+I+S+  E+  KG+G+LDNCGILP+NCLPCLAST+  V+KR+SLSSSPPS
Sbjct: 188 --PSQPVFLDEIASSVDENGGKGEGLLDNCGILPANCLPCLASTVPPVEKRRSLSSSPPS 247

Query: 284 GLKKAALKLSFKWKEGNPNAALFSSKSLLQRPIAGSQVPFVPADKKMLDCWSHIEPDSFK 343
             KKAALKLSFKWKEG+PN ALFSSK +LQRPIAGSQVPF P DKKMLDCWSHIEP SFK
Sbjct: 248 ARKKAALKLSFKWKEGHPNNALFSSKPILQRPIAGSQVPFCPIDKKMLDCWSHIEPSSFK 307

Query: 344 VRGLNYAKDKKKEFAPSHAAYQPFGVDVFLSHRKVDHIARFVELPAAHYTGKLPPILVVN 403
           VRG NY +DKKKEFAP++AAY PFGVDVFLS RKVDHIARFVELPA + +GKLP ILVVN
Sbjct: 308 VRGQNYFRDKKKEFAPNYAAYYPFGVDVFLSPRKVDHIARFVELPAVNSSGKLPNILVVN 367

Query: 404 VQIPLYPAAIFQGETDGEGMSIVLYFKISDGFAKELTSHFQESIRKLIDDEVERVKGFPV 463
           VQIPLY AA FQ E DGEGMS VLYFK+S+ ++KE+ + FQESIR+LIDDEVE+VKGFPV
Sbjct: 368 VQIPLYNAAFFQSEIDGEGMSFVLYFKLSESYSKEVPTLFQESIRRLIDDEVEKVKGFPV 427

Query: 464 DTVVPFRERLKILGRVANVEELPMSAAERKLMQAYNEKPVLSRPQHEFYMG--ENYLEID 523
           DT+VPFRERLKILGRV N+E+L +SAAERKLMQAYNEKPVLSRPQHEFY+G  E Y EID
Sbjct: 428 DTIVPFRERLKILGRVVNIEDLHLSAAERKLMQAYNEKPVLSRPQHEFYLGERETYFEID 487

Query: 524 LDMHRFSYISRKGFEAFLDRLKCCILD------GNKPEELPEEILCCIRLNGIDYVNYQQ 552
           +DMHRFSYISRKGFEAFLDRLK C+LD      GNK EELPE++LCC+RLNGIDY+NY+Q
Sbjct: 488 IDMHRFSYISRKGFEAFLDRLKICVLDVGLTIQGNKVEELPEQVLCCVRLNGIDYMNYRQ 545

BLAST of Cp4.1LG01g04010 vs. TrEMBL
Match: M5XEC7_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003760mg PE=4 SV=1)

HSP 1 Score: 670.2 bits (1728), Expect = 2.1e-189
Identity = 353/541 (65.25%), Postives = 416/541 (76.89%), Query Frame = 1

Query: 44  PEGCTGGKFKKPFKRKNRRRRRKASKTTAFSGLSDGSHL----------SDPIDRCSFSN 103
           P+ C GG+     K+K+R+RRR   K  A       + L          S P DR +F+N
Sbjct: 8   PDRCVGGRLGSSKKKKSRKRRRDGVKRWAKQSPGRAARLPEGSPDRFDGSAPPDRSTFNN 67

Query: 104 PTFQGSSDEAWFDTFPRFESDCDEDYQSIPDDIQSINSFEGVSTSSISSSGDANHGDHNV 163
           PTFQ S +EAWFD    FESDCDED+QS PD++ S+N F+ VS SS SS  DAN G++N 
Sbjct: 68  PTFQESIEEAWFDPVAIFESDCDEDFQSAPDEMPSLNGFDHVSVSSNSSVKDANCGEYND 127

Query: 164 N--------QIHRPGNSARVHSVRSSGSEV-------VMNPDDAEHQLKGHGGHSSEANE 223
           N        ++H+PG+ +  +S  +S + V       +MN +D + Q K +     EANE
Sbjct: 128 NGLHTSSTDRMHKPGDLSTENSASNSVTVVSQRSNVQIMNVNDVDTQTKFNDHSVIEANE 187

Query: 224 PVFVDDISSTAGESSAKGDGILDNCGILPSNCLPCLASTINSVDKRKSLSSSPPSGLKKA 283
           PVF+D+ISS+  E+SAK +GILDNCGILPS CLPCLAST+ SV+KR+SL SSPPS  KKA
Sbjct: 188 PVFLDEISSSVDETSAKEEGILDNCGILPSTCLPCLASTVPSVEKRRSLISSPPSARKKA 247

Query: 284 ALKLSFKWKEGNPNAALFSSKSLLQRPIAGSQVPFVPADKKMLDCWSHIEPDSFKVRGLN 343
           ALKL FKWKE   NA LFSSK LLQRPIAGSQVPF P +KKM D WSHIEP++FKVRG N
Sbjct: 248 ALKLPFKWKE-QANATLFSSKKLLQRPIAGSQVPFCPIEKKMFDSWSHIEPNTFKVRGPN 307

Query: 344 YAKDKKKEFAPSHAAYQPFGVDVFLSHRKVDHIARFVELPAAHYTGKLPPILVVNVQIPL 403
           Y +DKKKEFAPS+AAY PFG+DVFLS RK+DHIARFVELP  + +G LP ILVVNVQ+PL
Sbjct: 308 YFRDKKKEFAPSYAAYYPFGLDVFLSQRKIDHIARFVELPIVNSSGDLPAILVVNVQVPL 367

Query: 404 YPAAIFQGETDGEGMSIVLYFKISDGFAKELTSHFQESIRKLIDDEVERVKGFPVDTVVP 463
           YPAAIFQGETDGEGM+ VLYFK+SD ++KEL S+FQE+IR+LI DEVE+VKGFPVDT+ P
Sbjct: 368 YPAAIFQGETDGEGMNFVLYFKLSDIYSKELPSNFQENIRRLIGDEVEKVKGFPVDTIAP 427

Query: 464 FRERLKILGRVANVEELPMSAAERKLMQAYNEKPVLSRPQHEFYMGENYLEIDLDMHRFS 523
           FRERLKILGRV NVE+L +SA ERKLMQAYNEKPVLSRPQHEFY+GENYLEIDLDMHRFS
Sbjct: 428 FRERLKILGRVVNVEDLHLSAPERKLMQAYNEKPVLSRPQHEFYLGENYLEIDLDMHRFS 487

Query: 524 YISRKGFEAFLDRLKCCILD------GNKPEELPEEILCCIRLNGIDYVNYQQLGMSQEI 554
           YISRKGFEAFLDRLK CILD      GNKPEELPE+ILCCIRLNGIDY+NY QLG++Q+ 
Sbjct: 488 YISRKGFEAFLDRLKLCILDVGLTIQGNKPEELPEQILCCIRLNGIDYMNYHQLGLTQDP 547

BLAST of Cp4.1LG01g04010 vs. TrEMBL
Match: M5XRV1_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003760mg PE=4 SV=1)

HSP 1 Score: 666.8 bits (1719), Expect = 2.3e-188
Identity = 353/544 (64.89%), Postives = 417/544 (76.65%), Query Frame = 1

Query: 44  PEGCTGGKFKKPFKRKNRRRRRKASKTTAFSGLSDGSHL----------SDPIDRCSFSN 103
           P+ C GG+     K+K+R+RRR   K  A       + L          S P DR +F+N
Sbjct: 8   PDRCVGGRLGSSKKKKSRKRRRDGVKRWAKQSPGRAARLPEGSPDRFDGSAPPDRSTFNN 67

Query: 104 PTFQGSSD---EAWFDTFPRFESDCDEDYQSIPDDIQSINSFEGVSTSSISSSGDANHGD 163
           PTFQ S++   EAWFD    FESDCDED+QS PD++ S+N F+ VS SS SS  DAN G+
Sbjct: 68  PTFQASTESIEEAWFDPVAIFESDCDEDFQSAPDEMPSLNGFDHVSVSSNSSVKDANCGE 127

Query: 164 HNVN--------QIHRPGNSARVHSVRSSGSEV-------VMNPDDAEHQLKGHGGHSSE 223
           +N N        ++H+PG+ +  +S  +S + V       +MN +D + Q K +     E
Sbjct: 128 YNDNGLHTSSTDRMHKPGDLSTENSASNSVTVVSQRSNVQIMNVNDVDTQTKFNDHSVIE 187

Query: 224 ANEPVFVDDISSTAGESSAKGDGILDNCGILPSNCLPCLASTINSVDKRKSLSSSPPSGL 283
           ANEPVF+D+ISS+  E+SAK +GILDNCGILPS CLPCLAST+ SV+KR+SL SSPPS  
Sbjct: 188 ANEPVFLDEISSSVDETSAKEEGILDNCGILPSTCLPCLASTVPSVEKRRSLISSPPSAR 247

Query: 284 KKAALKLSFKWKEGNPNAALFSSKSLLQRPIAGSQVPFVPADKKMLDCWSHIEPDSFKVR 343
           KKAALKL FKWKE   NA LFSSK LLQRPIAGSQVPF P +KKM D WSHIEP++FKVR
Sbjct: 248 KKAALKLPFKWKE-QANATLFSSKKLLQRPIAGSQVPFCPIEKKMFDSWSHIEPNTFKVR 307

Query: 344 GLNYAKDKKKEFAPSHAAYQPFGVDVFLSHRKVDHIARFVELPAAHYTGKLPPILVVNVQ 403
           G NY +DKKKEFAPS+AAY PFG+DVFLS RK+DHIARFVELP  + +G LP ILVVNVQ
Sbjct: 308 GPNYFRDKKKEFAPSYAAYYPFGLDVFLSQRKIDHIARFVELPIVNSSGDLPAILVVNVQ 367

Query: 404 IPLYPAAIFQGETDGEGMSIVLYFKISDGFAKELTSHFQESIRKLIDDEVERVKGFPVDT 463
           +PLYPAAIFQGETDGEGM+ VLYFK+SD ++KEL S+FQE+IR+LI DEVE+VKGFPVDT
Sbjct: 368 VPLYPAAIFQGETDGEGMNFVLYFKLSDIYSKELPSNFQENIRRLIGDEVEKVKGFPVDT 427

Query: 464 VVPFRERLKILGRVANVEELPMSAAERKLMQAYNEKPVLSRPQHEFYMGENYLEIDLDMH 523
           + PFRERLKILGRV NVE+L +SA ERKLMQAYNEKPVLSRPQHEFY+GENYLEIDLDMH
Sbjct: 428 IAPFRERLKILGRVVNVEDLHLSAPERKLMQAYNEKPVLSRPQHEFYLGENYLEIDLDMH 487

Query: 524 RFSYISRKGFEAFLDRLKCCILD------GNKPEELPEEILCCIRLNGIDYVNYQQLGMS 554
           RFSYISRKGFEAFLDRLK CILD      GNKPEELPE+ILCCIRLNGIDY+NY QLG++
Sbjct: 488 RFSYISRKGFEAFLDRLKLCILDVGLTIQGNKPEELPEQILCCIRLNGIDYMNYHQLGLT 547

BLAST of Cp4.1LG01g04010 vs. TAIR10
Match: AT1G59650.1 (AT1G59650.1 Protein of unknown function (DUF1336))

HSP 1 Score: 595.5 bits (1534), Expect = 3.3e-170
Identity = 312/518 (60.23%), Postives = 379/518 (73.17%), Query Frame = 1

Query: 44  PEGCTGGKFKKPFKRKNRRRRR-KASKTTAFSGLSDGSHLSDPIDRCSFSNPTFQGSSDE 103
           P+ C GGK +   +RK R RR+ +  +  + S LSDGS           +NPTF+ S DE
Sbjct: 8   PKSCVGGKIRSSKRRKTRTRRKIQKKRVVSSSRLSDGSF---------DNNPTFRASVDE 67

Query: 104 AWFDTFPRFESDCDEDYQSIPDDIQSINSFEGVSTSSISSSGDANHGDHNVNQIHRPGNS 163
           AWFD+   FE+DCD+D+ S+ +D  S+N  E +S SS+SS  D+N G             
Sbjct: 68  AWFDSNLAFETDCDDDFHSVQEDTLSVNGCERISVSSMSSVKDSNLGG------------ 127

Query: 164 ARVHSVRSSGSEVVMNPDDAEHQLKGHGGHSSEANEPVFVDDISSTAGESSAKGDGILDN 223
               S R+S S+V+         +        +  + VF+D+ISS A  SS K +G+L+N
Sbjct: 128 ----SARNSLSDVISQSKSESALI--------DTKQAVFIDEISSNADGSSNKDEGLLEN 187

Query: 224 CGILPSNCLPCLASTINSVDKRKSLSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKSLL 283
           CGILPSNCLPCL ST+ S++KR+SLSSSPPS  KKAA+KLSFKW+EG+P   LFS+   L
Sbjct: 188 CGILPSNCLPCLNSTVPSIEKRRSLSSSPPSTRKKAAVKLSFKWREGHPTGPLFSTTMQL 247

Query: 284 QRPIAGSQVPFVPADKKMLDCWSHIEPDSFKVRGLNYAKDKKKEFAPSHAAYQPFGVDVF 343
           QRP+AGSQVPF P +KKM D WS IEP SF+VR   Y +DKKKE AP++AAY PFGVDVF
Sbjct: 248 QRPMAGSQVPFCPLEKKMFDSWSIIEPGSFRVRSKTYFRDKKKELAPNYAAYNPFGVDVF 307

Query: 344 LSHRKVDHIARFVELPAAHYTG-KLPPILVVNVQIPLYPAAIFQGETDGEGMSIVLYFKI 403
           LS RKV+HIA++VELP    T  KLP ILVVNVQIPLYPAAIF GETDGEGM+ VLYFK+
Sbjct: 308 LSQRKVNHIAQYVELPVVTTTPTKLPSILVVNVQIPLYPAAIFHGETDGEGMNFVLYFKL 367

Query: 404 SDGFAKELTSHFQESIRKLIDDEVERVKGFPVDTVVPFRERLKILGRVANVEELPMSAAE 463
           SD + KEL  HFQESI++L+DDEVE+V+G+  DT VPFRERLKILGRVANV++L ++ AE
Sbjct: 368 SDNYLKELPPHFQESIQRLLDDEVEKVRGYTTDTNVPFRERLKILGRVANVDDLQLNGAE 427

Query: 464 RKLMQAYNEKPVLSRPQHEFYMGENYLEIDLDMHRFSYISRKGFEAFLDRLKCCILD--- 523
           +KLM AYNEKPVLSRPQHEFY GENY EID+DMHRFSYISRKGFEAFLDRLK C+LD   
Sbjct: 428 KKLMNAYNEKPVLSRPQHEFYSGENYFEIDIDMHRFSYISRKGFEAFLDRLKNCVLDVGL 487

Query: 524 ---GNKPEELPEEILCCIRLNGIDYVNYQQLGMSQEIL 554
              GNKPEELPE+ILCCIRLNGIDY+NY QL +SQE+L
Sbjct: 488 TIQGNKPEELPEQILCCIRLNGIDYMNYHQLALSQEVL 492

BLAST of Cp4.1LG01g04010 vs. TAIR10
Match: AT1G10410.1 (AT1G10410.1 Protein of unknown function (DUF1336))

HSP 1 Score: 573.5 bits (1477), Expect = 1.4e-163
Identity = 296/518 (57.14%), Postives = 377/518 (72.78%), Query Frame = 1

Query: 44  PEGCTGGKFKKPFKRKNRRRRRKASKTTAFSG-LSDGSHLSDPIDRCSFSNPTFQGSSDE 103
           P+ C G K +   +RK+RRRR+   K  A S  LSDGS  +      +FSNP+ + + ++
Sbjct: 8   PKSCVGAKLRSSKRRKSRRRRKIQRKRAAVSSRLSDGSFDNLDHHHRNFSNPSSRATGED 67

Query: 104 AWFDTFPRFESDCDEDYQSIPDDIQSINSFEGVSTSSISSSGDANHGDHNVNQIHRPGNS 163
           AWF++   FE+DCD+D+ S+ +D  S+N  E VS SS +++                   
Sbjct: 68  AWFESNVAFETDCDDDFHSVHEDALSLNGSERVSLSSTTTTS------------------ 127

Query: 164 ARVHSVRSSGSEVVMNPDDAEHQLKGHGGHSSEANEPVFVDDISSTAGESSAKGDGILDN 223
               S R + S  VM+   ++       G  ++ N+P  +D         S+  +G+L+N
Sbjct: 128 ----STRDTDSNEVMSQSKSD-------GDLNDTNQPDLID---------SSADEGLLEN 187

Query: 224 CGILPSNCLPCL-ASTINSVDKRKSLSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKSL 283
           C ILPSNCLPCL  +T+ S+DKR+SLSSSPPS  KK++L+LS+KW+EG+ + ALF SK  
Sbjct: 188 CRILPSNCLPCLNTTTVPSIDKRRSLSSSPPSSRKKSSLRLSYKWREGHASGALFLSKMQ 247

Query: 284 LQRPIAGSQVPFVPADKKMLDCWSHIEPDSFKVRGLNYAKDKKKEFAPSHAAYQPFGVDV 343
           L+RPIAGSQVPF P DKKMLDCWS I+P+SF+VRG  Y ++KKKEFAPSHAAY PFGVDV
Sbjct: 248 LKRPIAGSQVPFCPIDKKMLDCWSTIDPNSFRVRGKTYLREKKKEFAPSHAAYNPFGVDV 307

Query: 344 FLSHRKVDHIARFVELPAAHYTGKLPPILVVNVQIPLYPAAIFQGETDGEGMSIVLYFKI 403
           FLS  K+ H+A++V+LP    + KLP ILVVNVQIPLYP AIFQGE+DGEGM+IVLYFK+
Sbjct: 308 FLSEHKIHHVAQYVKLPVTTTSTKLPSILVVNVQIPLYPTAIFQGESDGEGMNIVLYFKL 367

Query: 404 SDGFAKELTSHFQESIRKLIDDEVERVKGFPVDTVVPFRERLKILGRVANVEELPMSAAE 463
           SD ++KEL  HFQESIR+LIDDEVE+VKGFP+DT  PFRERLKILGRVANV++L +S  E
Sbjct: 368 SDNYSKELPLHFQESIRRLIDDEVEKVKGFPLDTTAPFRERLKILGRVANVDDLHLSGPE 427

Query: 464 RKLMQAYNEKPVLSRPQHEFYMGENYLEIDLDMHRFSYISRKGFEAFLDRLKCCILD--- 523
           +KLMQAYNEKPVLSRPQHEFY+G+NY EID+DMHRF YISRKGFE F+DRLK C+LD   
Sbjct: 428 KKLMQAYNEKPVLSRPQHEFYLGDNYFEIDIDMHRFGYISRKGFETFIDRLKICVLDVGL 485

Query: 524 ---GNKPEELPEEILCCIRLNGIDYVNYQQLGMSQEIL 554
              GNKPEELPE+ILCC+RLNGID++NY QL  +QE+L
Sbjct: 488 TIQGNKPEELPEQILCCVRLNGIDFMNYHQL--TQELL 485

BLAST of Cp4.1LG01g04010 vs. TAIR10
Match: AT3G29180.1 (AT3G29180.1 Protein of unknown function (DUF1336))

HSP 1 Score: 331.6 bits (849), Expect = 9.0e-91
Identity = 212/503 (42.15%), Postives = 293/503 (58.25%), Query Frame = 1

Query: 58  RKNRRRRRKASKTTAFSGLSD---GSHLSDPIDRCSFSNPTFQGSSDEAWFDTFPRFESD 117
           R  R+ RR++SK   FS +SD    +++  P D    S  +F  S D+AWFD+    +SD
Sbjct: 13  RPRRKGRRRSSKH--FSKVSDIVPHANIRRPSD--VGSRVSFAISQDDAWFDSVSVLDSD 72

Query: 118 CDEDYQSIPDDIQSINSFEGVSTSSISSSGDANHGDHNVNQIHRPGNSARVHS--VRSSG 177
            DED+ S+P++        G +T +   +G     + +   +   G     H   ++  G
Sbjct: 73  EDEDFISLPEENVPSTPSAGGATGNNIPNGQVVQFESSSCFVDGKGKYEEYHETYLKIDG 132

Query: 178 SEVVMNPDDAEHQLKGHGGHSS-EANEPVFVDDISSTAGESSAKGDGILDNCGILPSNCL 237
           S+       ++   K   G S    N    + D +S  G    K +          S  +
Sbjct: 133 SKA--EKFVSKGMYKDPSGLSVLTGNNKKKLMDHASFKGLKDPKRNSQEKTLRTSLSRLM 192

Query: 238 PCLASTINSVDKRKSLSSSPPSGLKKAALKLSFKWKEGNPNAALFSSKSLLQRPIAGSQV 297
           P    T++  DK  +L+S      K A  +LSFK +       +   + LL RP AG  +
Sbjct: 193 P----TVSFNDK--TLNSPTSQKRKSAVYRLSFK-RRSCDGEEVTEQRKLLYRPKAGFTI 252

Query: 298 PFVPADKKMLDCWSHIEPDSFKVRGLNYAKDKKKEFAPSHAAYQPFGVDVFLSHRKVDHI 357
           P    +K+    WS I P +FK+RG  Y KDKKK  AP+   Y P GVD+F+  RK+DHI
Sbjct: 253 PSSGREKQSSGSWSEIPPSTFKLRGETYFKDKKKSPAPNQCPYTPIGVDLFVCPRKIDHI 312

Query: 358 ARFVELPAAHYTGKLPPILVVNVQIPLYPAAIFQGETDGEGMSIVLYFKISDGFAKELTS 417
           A+ +ELP      KLP +LVVN+Q+P YPAA+F G++DGEGMSIVLYFK+ D   KE + 
Sbjct: 313 AQHIELPNIKAEAKLPALLVVNIQLPTYPAAMFLGDSDGEGMSIVLYFKLRDNHEKETSQ 372

Query: 418 HFQESIRKLIDDEVERVKGFPVDTVVPFRERLKILGRVANVEELPMSAAERKLMQAYNEK 477
            +QESI+KL++DE+E+VKGF  D+ V FRERLKI+  + N E+L +S+ E+KL+QAYNEK
Sbjct: 373 QYQESIKKLVNDEMEKVKGFAKDSNVAFRERLKIVAGLVNPEDLALSSTEKKLVQAYNEK 432

Query: 478 PVLSRPQHEFYMGENYLEIDLDMHRFSYISRKGFEAFLDRLKCCILD------GNKPEEL 537
           PVLSRPQH F+ G NY EIDLD+HRFSYISRKG EAF DRLK   LD        KPEEL
Sbjct: 433 PVLSRPQHNFFKGPNYFEIDLDVHRFSYISRKGLEAFRDRLKNGTLDLGLTIQAQKPEEL 492

Query: 538 PEEILCCIRLNGIDYVNYQQLGM 549
           PE++LCC+RL+ ID+V++ Q+ M
Sbjct: 493 PEQVLCCLRLSKIDFVDHGQIPM 502

BLAST of Cp4.1LG01g04010 vs. TAIR10
Match: AT5G39430.1 (AT5G39430.1 Protein of unknown function (DUF1336))

HSP 1 Score: 312.0 bits (798), Expect = 7.4e-85
Identity = 205/513 (39.96%), Postives = 293/513 (57.12%), Query Frame = 1

Query: 58  RKNRRRRRKASKTTAFSGLSDGSHLSDPIDRCSFSNPTFQGSSDEAWFDTFPRFESDCDE 117
           R  R+ RR+ SK    S +SD   LSD   + SF       S ++AWFD+   F SD D+
Sbjct: 13  RPRRKGRRRFSKN--ISKVSDIRRLSDVGIQTSFDI-----SQNDAWFDSSSLF-SDSDD 72

Query: 118 DYQSI--PDDIQSINSFEG---------VSTSSISSSGDANHGDHNVNQIHRPGNSARVH 177
           D+ S+   D++       G            SS    G+ N+ +++ + +   G + ++ 
Sbjct: 73  DFISLHEADNVWLEGGVMGKIPNGQVVEFEASSCIVDGNGNYEEYHESYLKIDGGN-KIE 132

Query: 178 SVRSSGSEVVMNPDDAEHQLKGHGGHSSEANEPVFVDDISSTAG----ESSAKGDGILDN 237
              S+G     N       L G  G++ +      ++  SS  G    + + K   +  N
Sbjct: 133 KFMSNGLYKDTNG------LSGIIGNNKKK-----LNTYSSFKGLKELDPNPKEKALKSN 192

Query: 238 CGILPSNCLPCLASTINSVDKRKSLSSSPPSGLKKAALKLSFKWK--EGNPNAALFSSKS 297
                S  +P    + N     K+L+S      K A  ++SFK +  +G       SSK 
Sbjct: 193 L----SRLMPLPTVSFND----KTLNSPTSQNRKSAVYQVSFKRRSCDGEEVTEHRSSKR 252

Query: 298 LLQRPIAGSQVP-FVPADKKMLDCWSHIEPDSFKVRGLNYAKDKKKEFAPSHAAYQPFGV 357
           LL RP AG  +P +V    +    W  I P + K+RG  Y KDK+K  AP+   Y P GV
Sbjct: 253 LLYRPKAGYTIPCYVKEKHQSSGSWCEIPPSNLKLRGETYFKDKRKHPAPNQCPYTPIGV 312

Query: 358 DVFLSHRKVDHIARFVELPAAHYTGKLPPILVVNVQIPLYPAAIFQGETDGEGMSIVLYF 417
           D+F+  RK+DHIA+ +ELP       LP +L+VN+Q+P YPAA+F G+++GEGMSIVLYF
Sbjct: 313 DLFVCPRKIDHIAQHIELPNIKAVANLPALLIVNIQLPTYPAAMFLGDSNGEGMSIVLYF 372

Query: 418 KISDGFAKELTSHFQESIRKLIDDEVERVKGFPVDTVVPFRERLKILGRVANVEELPMSA 477
           K+ + F  E++  +Q+SI+KL++DE+E+VKGF  D +VPFRERLKI+  + N +EL +S+
Sbjct: 373 KLRENFKNEISQQYQDSIKKLVEDEMEKVKGFAKDNIVPFRERLKIVAGLVNPDELSLSS 432

Query: 478 AERKLMQAYNEKPVLSRPQHEFYMGENYLEIDLDMHRFSYISRKGFEAFLDRLKCCILD- 537
            E+KL+QAYNEKPVLSRPQH F+ G NY EIDLD+HRFSY+SRKG EAF DRLK   LD 
Sbjct: 433 TEKKLIQAYNEKPVLSRPQHNFFKGPNYFEIDLDVHRFSYLSRKGLEAFRDRLKNGTLDL 492

Query: 538 -----GNKPEELPEEILCCIRLNGIDYVNYQQL 547
                  K EELPE++LCC+RL+ ID+V+  Q+
Sbjct: 493 GLTIQAQKQEELPEKVLCCLRLSKIDFVDNGQI 497

BLAST of Cp4.1LG01g04010 vs. TAIR10
Match: AT1G13970.1 (AT1G13970.1 Protein of unknown function (DUF1336))

HSP 1 Score: 297.0 bits (759), Expect = 2.5e-80
Identity = 143/281 (50.89%), Postives = 206/281 (73.31%), Query Frame = 1

Query: 277 SSKSLLQRPIAGSQVPFVPADKKMLD-CWSHIEPDSFKVRGLNYAKDKKKEFAPSHAAYQ 336
           S++ LL RP AGS +     +K      WS + P SFK+RGLN+ +DK+K  AP+ + Y 
Sbjct: 217 SAEKLLYRPKAGSMIQRSLGEKMTSQGSWSEVSPSSFKLRGLNFFRDKQKCPAPNCSPYI 276

Query: 337 PFGVDVFLSHRKVDHIARFVELP----AAHYTGKLPPILVVNVQIPLYPAAIFQGETDGE 396
           P GVD+F   +K++HIA+ +ELP    A+     +P +L+VN+Q+P+YP ++F G+ DGE
Sbjct: 277 PIGVDLFACPKKINHIAQHIELPNLKPASSQVCDIPNLLIVNIQLPMYPTSMF-GDYDGE 336

Query: 397 GMSIVLYFKISDGFAKELTSHFQESIRKLIDDEVERVKGFPVDTVVPFRERLKILGRVAN 456
           G+S+VLYFK ++ + KE++SHF+E+I++ ++DE+E+VKGF  ++ VPFRERLKI+  + N
Sbjct: 337 GLSLVLYFKRNENYHKEISSHFKETIKRFMEDEMEKVKGFTRESTVPFRERLKIMAGLVN 396

Query: 457 VEELPMSAAERKLMQAYNEKPVLSRPQHEFYMGENYLEIDLDMHRFSYISRKGFEAFLDR 516
            E+  +S+ ERKL+ AYN++PVLSRPQH+F+ G NY EIDLD+HRFSYISRKG E+F DR
Sbjct: 397 PEDFQLSSTERKLITAYNDRPVLSRPQHDFFQGPNYFEIDLDIHRFSYISRKGLESFRDR 456

Query: 517 LKCCILD------GNKPEELPEEILCCIRLNGIDYVNYQQL 547
           +K  ILD         PEELPE++LCC+RLN ID+VN+ Q+
Sbjct: 457 IKNGILDLGLTIQAQSPEELPEQVLCCVRLNKIDFVNHGQI 496

BLAST of Cp4.1LG01g04010 vs. NCBI nr
Match: gi|659090942|ref|XP_008446285.1| (PREDICTED: uncharacterized protein LOC103489062 [Cucumis melo])

HSP 1 Score: 900.6 bits (2326), Expect = 1.4e-258
Identity = 457/530 (86.23%), Postives = 480/530 (90.57%), Query Frame = 1

Query: 44  PEGCTGGKFKKPFKRKNRRRRRKASKTTAFSGLSDGSHLSDPIDRCSFSNPTFQGSSDEA 103
           P+GC GGKFKK  KRKNRRRRRK SKT AFS LS+GSH SDPID CSFSNPTFQGS DEA
Sbjct: 8   PQGCVGGKFKKSSKRKNRRRRRKGSKTIAFSALSEGSHRSDPIDHCSFSNPTFQGSYDEA 67

Query: 104 WFDTFPRFESDCDEDYQSIPDDIQSINSFEGVSTSSISSSGDANHGDHNVN-------QI 163
           WFDT  +FESDCDEDYQS+PDD QSINSFEG S+SSISSSGDANHGDHNVN       QI
Sbjct: 68  WFDTVGKFESDCDEDYQSLPDDNQSINSFEGASSSSISSSGDANHGDHNVNRHSATPDQI 127

Query: 164 HRPGNSARVHSVRSSGSEVV-------MNPDDAEHQLKGHGGHSSEANEPVFVDDISSTA 223
           HRPGNSARVHSV SS S+V        MNPDDAE QLKG G HSSE NEPVF+D+ISSTA
Sbjct: 128 HRPGNSARVHSVSSSESQVARDSHSQAMNPDDAEPQLKGCG-HSSEGNEPVFIDEISSTA 187

Query: 224 GESSAKGDGILDNCGILPSNCLPCLASTINSVDKRKSLSSSPPSGLKKAALKLSFKWKEG 283
           GESSAKGDGILDNCGILPSNCLPCLA+TINSV+KRKSLSSSPPSGLKKAALKLSFKWKEG
Sbjct: 188 GESSAKGDGILDNCGILPSNCLPCLATTINSVEKRKSLSSSPPSGLKKAALKLSFKWKEG 247

Query: 284 NPNAALFSSKSLLQRPIAGSQVPFVPADKKMLDCWSHIEPDSFKVRGLNYAKDKKKEFAP 343
           NPNAALFSSK+LLQRPIAGSQVPF PA+KKMLDCWSHIEPDSFKVRG+NYAKDKKKEFAP
Sbjct: 248 NPNAALFSSKALLQRPIAGSQVPFCPAEKKMLDCWSHIEPDSFKVRGVNYAKDKKKEFAP 307

Query: 344 SHAAYQPFGVDVFLSHRKVDHIARFVELPAAHYTGKLPPILVVNVQIPLYPAAIFQGETD 403
           +HAAY PFGVDVFLSHRKVDHIARFVE+P A Y+G LPPILVVNVQIPLY AAIFQGETD
Sbjct: 308 NHAAYYPFGVDVFLSHRKVDHIARFVEMPTATYSGTLPPILVVNVQIPLYSAAIFQGETD 367

Query: 404 GEGMSIVLYFKISDGFAKELTSHFQESIRKLIDDEVERVKGFPVDTVVPFRERLKILGRV 463
           GEGMSIVLYFK+SD +A+ELTSHFQE+IRKLIDDEVERVKGFPVD VVPFRERLKILGRV
Sbjct: 368 GEGMSIVLYFKLSDAYAEELTSHFQENIRKLIDDEVERVKGFPVDNVVPFRERLKILGRV 427

Query: 464 ANVEELPMSAAERKLMQAYNEKPVLSRPQHEFYMGENYLEIDLDMHRFSYISRKGFEAFL 523
           ANVE+LPMSAAERKLMQAYNEKPVLSRPQHEFY+GENYLEIDLDMHRFSYISRKGFEAFL
Sbjct: 428 ANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFYLGENYLEIDLDMHRFSYISRKGFEAFL 487

Query: 524 DRLKCCILD------GNKPEELPEEILCCIRLNGIDYVNYQQLGMSQEIL 554
           DRLKCCILD      GN+PEELPEEILCCIRLNGIDYVNYQQLGM  EIL
Sbjct: 488 DRLKCCILDVGLTIQGNRPEELPEEILCCIRLNGIDYVNYQQLGMGPEIL 536

BLAST of Cp4.1LG01g04010 vs. NCBI nr
Match: gi|449434853|ref|XP_004135210.1| (PREDICTED: uncharacterized protein LOC101206832 [Cucumis sativus])

HSP 1 Score: 892.1 bits (2304), Expect = 4.9e-256
Identity = 454/530 (85.66%), Postives = 478/530 (90.19%), Query Frame = 1

Query: 44  PEGCTGGKFKKPFKRKNRRRRRKASKTTAFSGLSDGSHLSDPIDRCSFSNPTFQGSSDEA 103
           P+GC GGKFKK  KRKNRRRRRK SKT AFS LS+GSH SDPID CSFSNPTFQGS DEA
Sbjct: 8   PQGCVGGKFKKSSKRKNRRRRRKGSKTIAFSALSEGSHRSDPIDHCSFSNPTFQGSYDEA 67

Query: 104 WFDTFPRFESDCDEDYQSIPDDIQSINSFEGVSTSSISSSGDANHGDHNVN-------QI 163
           WFDT  +FESDCDEDYQS+PDD QSINS E  STSSISSSGDANHGDHNVN       QI
Sbjct: 68  WFDTVGKFESDCDEDYQSLPDDNQSINSLEAASTSSISSSGDANHGDHNVNRHSATSDQI 127

Query: 164 HRPGNSARVHSVRSSGSEVV-------MNPDDAEHQLKGHGGHSSEANEPVFVDDISSTA 223
           HRPGNSARVHSV SS S+V        +NPDDAE QLKG G HSSEANEPVF+D+ISSTA
Sbjct: 128 HRPGNSARVHSVSSSESQVARDSHLQAINPDDAEPQLKGCG-HSSEANEPVFIDEISSTA 187

Query: 224 GESSAKGDGILDNCGILPSNCLPCLASTINSVDKRKSLSSSPPSGLKKAALKLSFKWKEG 283
           GESSAKGDGILDNCGILPSNCLPCLASTINSV+KRKSLSSSPPSGLKKAALKLSFKWKEG
Sbjct: 188 GESSAKGDGILDNCGILPSNCLPCLASTINSVEKRKSLSSSPPSGLKKAALKLSFKWKEG 247

Query: 284 NPNAALFSSKSLLQRPIAGSQVPFVPADKKMLDCWSHIEPDSFKVRGLNYAKDKKKEFAP 343
           NPNAALFSSK+LLQRPIAGSQVPF PA+KKMLDCWSHIEPDSFKVRG+NYAKDKKKEFAP
Sbjct: 248 NPNAALFSSKALLQRPIAGSQVPFCPAEKKMLDCWSHIEPDSFKVRGVNYAKDKKKEFAP 307

Query: 344 SHAAYQPFGVDVFLSHRKVDHIARFVELPAAHYTGKLPPILVVNVQIPLYPAAIFQGETD 403
           +H AY PFGVDVFLSHRKVDHIARFVE+PAA  +G LPPILVVNVQIPLY AAIFQGETD
Sbjct: 308 NHTAYYPFGVDVFLSHRKVDHIARFVEMPAATSSGTLPPILVVNVQIPLYSAAIFQGETD 367

Query: 404 GEGMSIVLYFKISDGFAKELTSHFQESIRKLIDDEVERVKGFPVDTVVPFRERLKILGRV 463
           GEGMSIVLYFK+SD +A++LTSHFQE+I+KLIDDEVERVKGFPVD VVPFRERLKILGRV
Sbjct: 368 GEGMSIVLYFKLSDAYAEKLTSHFQENIKKLIDDEVERVKGFPVDNVVPFRERLKILGRV 427

Query: 464 ANVEELPMSAAERKLMQAYNEKPVLSRPQHEFYMGENYLEIDLDMHRFSYISRKGFEAFL 523
           ANVE+LPMSAAERKLMQAYNEKPVLSRPQHEFY+GENYLEIDLDMHRFSYISRKGFEAFL
Sbjct: 428 ANVEDLPMSAAERKLMQAYNEKPVLSRPQHEFYLGENYLEIDLDMHRFSYISRKGFEAFL 487

Query: 524 DRLKCCILD------GNKPEELPEEILCCIRLNGIDYVNYQQLGMSQEIL 554
           DRLKCCILD      GN+PEELPEEILCCIRLNGIDYVNYQQLGM  EIL
Sbjct: 488 DRLKCCILDVGLTIQGNRPEELPEEILCCIRLNGIDYVNYQQLGMGLEIL 536

BLAST of Cp4.1LG01g04010 vs. NCBI nr
Match: gi|658009499|ref|XP_008339963.1| (PREDICTED: uncharacterized protein LOC103402953 [Malus domestica])

HSP 1 Score: 680.6 bits (1755), Expect = 2.2e-192
Identity = 356/534 (66.67%), Postives = 424/534 (79.40%), Query Frame = 1

Query: 44  PEGCTGGKFKKPFKRKNRRRRR--KASKTTAFSG-LSDGS----HLSDPIDRCSFSNPTF 103
           PEGC GG+     +++ R+RRR  +A +T   +G LS+GS      S P DR +F+NPTF
Sbjct: 8   PEGCVGGRLSSSKRKRTRKRRRDGRAKQTPGRAGRLSEGSPDKFDRSAPPDRSTFNNPTF 67

Query: 104 QGSSDEAWFDTFPRFESDCDEDYQSIPDDIQSINSFEGVSTSSISSSGDANHGDHNV--- 163
           Q  S++AWFD   RFESDCDED+ S+ D++ S+N FE VS SS  S  DAN G++N+   
Sbjct: 68  QEGSEDAWFDPVARFESDCDEDFHSVQDEVLSVNGFERVSVSSNLSLRDANCGEYNIIDL 127

Query: 164 -----NQIHRPGNSARVHSVRSSGSEVVMNPDDAEHQLKGH--GGHS-SEANEPVFVDDI 223
                +Q+H+ G+SA       + S  V++     H + G+   GHS +EAN+PVF+D+I
Sbjct: 128 HASSADQMHKRGDSA-------NNSVSVVSQKSINHIMSGNDVDGHSTAEANQPVFLDEI 187

Query: 224 SSTAGESSAKGDGILDNCGILPSNCLPCLASTINSVDKRKSLSSSPPSGLKKAALKLSFK 283
           SS+  ESS K +GILDNCGILPS+CLPCLAST+ SV+KR+SLSSSPPS  KKAA+KL FK
Sbjct: 188 SSSVDESSTKEEGILDNCGILPSHCLPCLASTVPSVEKRRSLSSSPPSARKKAAIKLPFK 247

Query: 284 WKEGNPNAALFSSKSLLQRPIAGSQVPFVPADKKMLDCWSHIEPDSFKVRGLNYAKDKKK 343
           WKEG+PNA+L SSK LLQRPIAGSQVPF P +KKM D WSHIEP+SFKVRG NY KD+KK
Sbjct: 248 WKEGHPNASLLSSKMLLQRPIAGSQVPFCPMEKKMFDSWSHIEPNSFKVRGPNYFKDRKK 307

Query: 344 EFAPSHAAYQPFGVDVFLSHRKVDHIARFVELPAAHYTGKLPPILVVNVQIPLYPAAIFQ 403
           E APS+AAY PFG+DVFLS RK+DHIARFVELP    +G LP ILVVNVQ+PLYPAAIFQ
Sbjct: 308 EHAPSYAAYYPFGLDVFLSQRKIDHIARFVELPVVSSSGDLPAILVVNVQVPLYPAAIFQ 367

Query: 404 GETDGEGMSIVLYFKISDGFAKELTSHFQESIRKLIDDEVERVKGFPVDTVVPFRERLKI 463
           GETDGEGM+ VLYFK++D ++KEL  +FQE+IR+LI DEVE+VKGFPVDT+VPFRERLKI
Sbjct: 368 GETDGEGMNFVLYFKLNDMYSKELPPNFQENIRRLIGDEVEKVKGFPVDTIVPFRERLKI 427

Query: 464 LGRVANVEELPMSAAERKLMQAYNEKPVLSRPQHEFYMGENYLEIDLDMHRFSYISRKGF 523
           LGRVANVE+L +SA ERKLMQAYNEKPVLSRPQHEFYMGENYLEIDLDMHRFSYISRKGF
Sbjct: 428 LGRVANVEDLHLSAPERKLMQAYNEKPVLSRPQHEFYMGENYLEIDLDMHRFSYISRKGF 487

Query: 524 EAFLDRLKCCILD------GNKPEELPEEILCCIRLNGIDYVNYQQLGMSQEIL 554
           EAFLDRLK CILD      GNKPEELPE+ILCCIRLNGIDY+NY QLG++Q+ L
Sbjct: 488 EAFLDRLKHCILDVGLTIQGNKPEELPEQILCCIRLNGIDYMNYHQLGLTQDPL 534

BLAST of Cp4.1LG01g04010 vs. NCBI nr
Match: gi|590693731|ref|XP_007044415.1| (CW14 protein isoform 1 [Theobroma cacao])

HSP 1 Score: 677.2 bits (1746), Expect = 2.5e-191
Identity = 354/530 (66.79%), Postives = 416/530 (78.49%), Query Frame = 1

Query: 44  PEGCTGGKFKKPFKRKNRRRRRKASKTTAFSGLSDGSHLSDPIDR-------CSFSNPTF 103
           PEGC   K +   K+KNR+RR+   K    S LS+ S  SD +DR        SF+NPTF
Sbjct: 8   PEGCVSPKLRSS-KKKNRKRRKSCLKKRVSSRLSEVS--SDKVDRPAPPDHHSSFTNPTF 67

Query: 104 QGSSDEAWFDTFPRFESDCDEDYQSIPDDIQSINSFEGVSTSSISSSGDANHGDHN--VN 163
           QGS DE WFD    F+SDCDE+++S+ +D+ S+N  EGVS SSISS  DAN G+H+  V+
Sbjct: 68  QGSIDE-WFDPVAVFDSDCDEEFESVQEDVLSLNGLEGVSISSISSLKDANCGEHSSLVD 127

Query: 164 QIHRPGNSARVHSVRSSGSEV-------VMNPDDAEHQLKGHGGHSSEANEPVFVDDISS 223
           Q+ +PG+ +  +S  +S  EV       V+N +D   Q K  G  S++A +PVF+DDI+S
Sbjct: 128 QMQKPGDLSAGNSACNSVGEVTRNSNSQVLNSEDVNSQSKSDGP-SNKAKQPVFLDDIAS 187

Query: 224 TAGESSAKGDGILDNCGILPSNCLPCLASTINSVDKRKSLSSSPPSGLKKAALKLSFKWK 283
           +  E S K +G+LDNCGILPSNCLPCLAST+ S++KR+SLSSSPPS  KK ALKL FKW+
Sbjct: 188 SVDEGSGKEEGLLDNCGILPSNCLPCLASTVPSIEKRRSLSSSPPSARKKNALKLPFKWR 247

Query: 284 EGNPNAALFSSKSLLQRPIAGSQVPFVPADKKMLDCWSHIEPDSFKVRGLNYAKDKKKEF 343
           EG+PNA LFSSK LLQRP AGSQVP  P +KKM DCWSHIEP +FKVRG NY +DKKK+F
Sbjct: 248 EGHPNATLFSSKMLLQRPKAGSQVPVCPIEKKMFDCWSHIEPGTFKVRGENYFRDKKKDF 307

Query: 344 APSHAAYQPFGVDVFLSHRKVDHIARFVELPAAHYTGKLPPILVVNVQIPLYPAAIFQGE 403
           AP+HAAY PFGVDVFLS RK+DHIARFVELP    +GKLP ILVVNVQIPLYPAA+FQ E
Sbjct: 308 APNHAAYYPFGVDVFLSPRKIDHIARFVELPVVSQSGKLPSILVVNVQIPLYPAALFQSE 367

Query: 404 TDGEGMSIVLYFKISDGFAKELTSHFQESIRKLIDDEVERVKGFPVDTVVPFRERLKILG 463
           TDGEGMS VLYFK+SD + KEL  HFQE+IR+LI DEVE+VKGFPVDT+VPFRERLKILG
Sbjct: 368 TDGEGMSFVLYFKLSDSYLKELPPHFQENIRRLIVDEVEKVKGFPVDTIVPFRERLKILG 427

Query: 464 RVANVEELPMSAAERKLMQAYNEKPVLSRPQHEFYMGENYLEIDLDMHRFSYISRKGFEA 523
           RVANVE+L MSAAERKLM AYNEKP LSRPQHEFY+GENY EID+DMHRFSYISRKGF+A
Sbjct: 428 RVANVEDLHMSAAERKLMHAYNEKPFLSRPQHEFYLGENYFEIDIDMHRFSYISRKGFDA 487

Query: 524 FLDRLKCCILD------GNKPEELPEEILCCIRLNGIDYVNYQQLGMSQE 552
           FLDRLK CILD      GNKPEELPE+ILCCIRL+GIDY+NY QLG+SQE
Sbjct: 488 FLDRLKLCILDVGLTIQGNKPEELPEQILCCIRLSGIDYMNYHQLGLSQE 532

BLAST of Cp4.1LG01g04010 vs. NCBI nr
Match: gi|802787668|ref|XP_012091986.1| (PREDICTED: uncharacterized protein LOC105649804 isoform X2 [Jatropha curcas])

HSP 1 Score: 673.3 bits (1736), Expect = 3.6e-190
Identity = 350/546 (64.10%), Postives = 419/546 (76.74%), Query Frame = 1

Query: 44  PEGCTGGKFKKPFKRKNRRRRRKASKTTAFSGLSDGSHLSDPIDR--------------- 103
           PEGC GG+ +    +K  R++RK  +    S LSDGS  ++  DR               
Sbjct: 8   PEGCVGGRLRS---KKKTRKKRKGIRRRVSSRLSDGSLDNNKFDRPLSSVSAAAVPPDHR 67

Query: 104 CSFSNPTFQGSSDEAWFDTFPRFESDCDEDYQSIPDDIQSINSFEGVSTSSISSSGDANH 163
            SFSN TFQGS +EAWFD+ P FESDC+ED++S+PDD+ S+N  EG+  SSI+ S DA H
Sbjct: 68  SSFSNTTFQGSIEEAWFDSVPIFESDCEEDFESVPDDVLSLNGSEGLPPSSIAFSRDAKH 127

Query: 164 GDHNV--------NQIHRPGNSARVHSVRSSGSEV-------VMNPDDAEHQLKGHGGHS 223
           GDH +        + + + G+S+  +S R+S SE        V N D A+   K  G   
Sbjct: 128 GDHTIGFQYTSSGDHMKKAGDSSAGNSARNSVSEAARHPNNQVFNSDYADSLPKSEG--- 187

Query: 224 SEANEPVFVDDISSTAGESSAKGDGILDNCGILPSNCLPCLASTINSVDKRKSLSSSPPS 283
              ++PVF+D+I+S+  E+  KG+G+LDNCGILP+NCLPCLAST+  V+KR+SLSSSPPS
Sbjct: 188 --PSQPVFLDEIASSVDENGGKGEGLLDNCGILPANCLPCLASTVPPVEKRRSLSSSPPS 247

Query: 284 GLKKAALKLSFKWKEGNPNAALFSSKSLLQRPIAGSQVPFVPADKKMLDCWSHIEPDSFK 343
             KKAALKLSFKWKEG+PN ALFSSK +LQRPIAGSQVPF P DKKMLDCWSHIEP SFK
Sbjct: 248 ARKKAALKLSFKWKEGHPNNALFSSKPILQRPIAGSQVPFCPIDKKMLDCWSHIEPSSFK 307

Query: 344 VRGLNYAKDKKKEFAPSHAAYQPFGVDVFLSHRKVDHIARFVELPAAHYTGKLPPILVVN 403
           VRG NY +DKKKEFAP++AAY PFGVDVFLS RKVDHIARFVELPA + +GKLP ILVVN
Sbjct: 308 VRGQNYFRDKKKEFAPNYAAYYPFGVDVFLSPRKVDHIARFVELPAVNSSGKLPNILVVN 367

Query: 404 VQIPLYPAAIFQGETDGEGMSIVLYFKISDGFAKELTSHFQESIRKLIDDEVERVKGFPV 463
           VQIPLY AA FQ E DGEGMS VLYFK+S+ ++KE+ + FQESIR+LIDDEVE+VKGFPV
Sbjct: 368 VQIPLYNAAFFQSEIDGEGMSFVLYFKLSESYSKEVPTLFQESIRRLIDDEVEKVKGFPV 427

Query: 464 DTVVPFRERLKILGRVANVEELPMSAAERKLMQAYNEKPVLSRPQHEFYMG--ENYLEID 523
           DT+VPFRERLKILGRV N+E+L +SAAERKLMQAYNEKPVLSRPQHEFY+G  E Y EID
Sbjct: 428 DTIVPFRERLKILGRVVNIEDLHLSAAERKLMQAYNEKPVLSRPQHEFYLGERETYFEID 487

Query: 524 LDMHRFSYISRKGFEAFLDRLKCCILD------GNKPEELPEEILCCIRLNGIDYVNYQQ 552
           +DMHRFSYISRKGFEAFLDRLK C+LD      GNK EELPE++LCC+RLNGIDY+NY+Q
Sbjct: 488 IDMHRFSYISRKGFEAFLDRLKICVLDVGLTIQGNKVEELPEQVLCCVRLNGIDYMNYRQ 545

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0KVS2_CUCSA3.4e-25685.66Uncharacterized protein OS=Cucumis sativus GN=Csa_5G604060 PE=4 SV=1[more]
A0A061E7C7_THECC1.7e-19166.79CW14 protein isoform 1 OS=Theobroma cacao GN=TCM_010075 PE=4 SV=1[more]
A0A067JET6_JATCU2.5e-19064.10Uncharacterized protein OS=Jatropha curcas GN=JCGZ_21729 PE=4 SV=1[more]
M5XEC7_PRUPE2.1e-18965.25Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003760mg PE=4 SV=1[more]
M5XRV1_PRUPE2.3e-18864.89Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003760mg PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G59650.13.3e-17060.23 Protein of unknown function (DUF1336)[more]
AT1G10410.11.4e-16357.14 Protein of unknown function (DUF1336)[more]
AT3G29180.19.0e-9142.15 Protein of unknown function (DUF1336)[more]
AT5G39430.17.4e-8539.96 Protein of unknown function (DUF1336)[more]
AT1G13970.12.5e-8050.89 Protein of unknown function (DUF1336)[more]
Match NameE-valueIdentityDescription
gi|659090942|ref|XP_008446285.1|1.4e-25886.23PREDICTED: uncharacterized protein LOC103489062 [Cucumis melo][more]
gi|449434853|ref|XP_004135210.1|4.9e-25685.66PREDICTED: uncharacterized protein LOC101206832 [Cucumis sativus][more]
gi|658009499|ref|XP_008339963.1|2.2e-19266.67PREDICTED: uncharacterized protein LOC103402953 [Malus domestica][more]
gi|590693731|ref|XP_007044415.1|2.5e-19166.79CW14 protein isoform 1 [Theobroma cacao][more]
gi|802787668|ref|XP_012091986.1|3.6e-19064.10PREDICTED: uncharacterized protein LOC105649804 isoform X2 [Jatropha curcas][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR009769EDR2_C
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0030206 chondroitin sulfate biosynthetic process
biological_process GO:0006486 protein glycosylation
cellular_component GO:0005575 cellular_component
cellular_component GO:0005794 Golgi apparatus
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0003674 molecular_function
molecular_function GO:0030246 carbohydrate binding
molecular_function GO:0047220 galactosylxylosylprotein 3-beta-galactosyltransferase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG01g04010.1Cp4.1LG01g04010.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR009769Protein ENHANCED DISEASE RESISTANCE 2, C-terminalPFAMPF07059DUF1336coord: 304..539
score: 1.6
NoneNo IPR availablePANTHERPTHR31558FAMILY NOT NAMEDcoord: 57..546
score: 1.5E
NoneNo IPR availablePANTHERPTHR31558:SF3CW14 PROTEINcoord: 57..546
score: 1.5E