Cp4.1LG20g03450 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG20g03450
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Tetratricopeptide repeat protein 37
Location: Cp4.1LG20 : 1890307 .. 1897915 (-)
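
The location line gives a sequence identifier, start and end coordinates, and a strand. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the span of the feature can be derived as follows. `parse_location` and `span_length` are hypothetical helper names used only for illustration, not part of any database API.

```python
import re


def parse_location(text):
    """Split 'SEQ : START .. END (STRAND)' into (seqid, start, end, strand)."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Cp4.1LG20 : 1890307 .. 1897915 (-)")
print(seqid, strand, span_length(start, end))  # Cp4.1LG20 - 7609
```

The minus strand means the gene sequence listed on this page is the reverse complement of the reference sequence over that interval.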
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
TGAATGATCATATGTTTGAATTGTATGAACACTCCGGTTACTCGACAAAAGGGGAAAAAATAATAATAAAGAAATTCTTAACCACGTCACCTTAATTTTTTCTTTAATCCTGCGTTAAAATTACACCAAAAATGTCAGCAAGCGTAAGTCGGTGCAGGTTGTTTCCGCATCCAAACAGCGGCTAAGCAATCCGCGCAACCAAACAAGGGTCTTCTTCTTTCATTCTTTACTTCTTTCGCGTCGACGGTCGCGCCACAAAGTCTTTAAGCGCCGAAAAGCTCCAGCTGCTCCCAAAATGCCTCTCAAGACAGAACATGGAGCATCGGACAGCTCCCTCGACGAGCACTCGAAGGCGGCCCATTCTTCTAAAGTCGTTGTTCTTGCTGACCTCAATGTTGATCCTCCAGAAATGGACGACGACAGTTCCGTTCACGTTTCAGCTTCGACTATCTCTAGGTTACTTTCTTTTTTATACTCTGTCTACTTTTTTAATCGAATTTGGAAATTTTTGGTAGCGTTTACCCCTTTTGGTACATAAAATTTCCTTGCTAGGGAGTCCTGGTTTTAATTGTGGTCTATTTACTCTTGAAGAACCAAAATTGTACCGGCAAACTGTTGAGCGGTTTGTTTCTTGTTTTTCGATCGAGAGTAGTTAAGGTTCTGTTAACAGTCTAATTAAAACTAGAAAAACATAGTACAAAATAAGTTGCTGACCTTCTTGCTAATGATTCGGAGGTACTTTGAGAAAGTGTATTCGTCCTTACCTGCTAATCAAATTATCTTTTAAAGTTGGATTTCTGTAATGGATGTTGTGAATTGTGAATCAAGGCTTATGAAACGAAGAAAATGTGTCAAAATAATGCTTCATCGACTGTCTTTGTTTTGATCAGCAAGGTACCGCATGAACTCATTAGTACTGTTATTTTGGATGGAACATGCATTCCACCCAAAACCATGTAAAATTGGAGGGAGCACTAGTTACTAGGAGTTCAAACCCAAACATGAGATCAGCTGGATAACGAGTTATCTTACTAGACATGACAAAAGATTAAGATAATTGCAGAAGTCACTGTAATCAAGAAGCTCAAAAGGCTCTCATTGTTTAAAATTTTCTTGCAAACCTCATCAAAATTCCTTTCAGTGAAACGTCCTAAAATCTTCCCCTCCATTAAACCCCAAACAAAAAGGTTAAAGACAGTGTGACGGCATATGATCTTAAGCCTCTTCGTGAGGTTCAGCTGTATCATTAGGGGCCCTCCAAGGTAAGGCAAAATACCAAATTTGGGAACTGAAATATAATCAAGGTTTTTTACTTTCTTTGTCCCAGAAAAAGGTCCATTTTAGGGTTTGAGGTTTTCCTATTGGTTAGTACCTACTCAAAATTATTGTAATAAATTTTTGTGGCTTCTTGTTAAAAATAAGCGTGTGATTTTAGGTTATCTGTGGATGAGAGCAATCAAGACAAAACTGTGGCAATTTGCAAGGATACTAATACAATGGAAGTTGAAGGTAGACATGTAAGCAAAATAGGGAAGTGCCGTTCAAGAAACAATAAGGTAGAGTACTCTCTTGATTCTGCAGCTGATGCAGATGGTGATCAACATGGTCAAGGTGTTTCAACTTCACGCGAAGAAAAAGTCAGCAGCCTCAAAACTGTGAGTTTTGTCAACATTTGGTTTTCTTCTCATTTTAACCTTTGTTTTCTGTATATCACCTGCTTCCTTTGCTGTTGTATTTCCTGCATCTTCGATATGATGCATTCTATTGCTTGCCATGTGAAAAGGTGGTTTCAAAAATCTTGTGTGCATGTAAACCTTGTAAGAACGGCCAGCTATTTCTTAACTTAAGACCTCTATCAGACAAGTTTACATCTAATTTTTGCATGCATATAATTTTTTTGGCTCATGGGCTGCTTCTTTCGAATTGTACGACTAAAATTATGTTTTGGTTTGTGAGCTGGGGAAAATCAGATTTTTGGATGGTTAATGGGGGCGCCCTTT
TTCTTTCCCATATTTCAGACTTGTCCTTTTGGATGGTAAGTAGTTTTCTTTAAGACGTATCACAGGATAAAGGTAAAGACACAGGCCACTGCTGCCAAAGGTCAAAGAGATTGAAATACCAATAATAACAGTCACCATGTAGTTTGATCCCTTTGTTCTTTCAGCAGTTTGTTTGATCTGGGTTTTGTCCAGGGTTTTTTTGTGGCAGGTTTTGTTATATTTCCTTCATATCTTTTTATTTCTCAAGAAGATGCCTTATCGTTTCAAGGTTTGGTTTTGGTCTTGTTGGTCTGGTTACTGGGTTTTCTCTGGCTTTTCATGGATCTTCCTTTTTCCTTTTTGGTTGGCGTGTTTGGTGTGGTGGGCGCAGCATGTTATGGGACTTTTCTGTCTCATATTGGTGGTTTTAGCTTGTTGCAAATTCTGGAGTTCGTTTTTGTTTGGATTTAAAGCATCTTTGCAAGAAATTTTCTCCTTTGGTTGGATCGCCTTGAAACTGTTAAGTTTGCAGTCTGATTTTTTTTGGCAAGCATTTTTGGTAGAAGCTATTTTAGGGGCTTATTGTCTTAAGCTGGTGGATTTTTGCATCTTTGGGAGTTTTTTTTGGAGCTTTGTTTGCGAATCTTGCAAGTTTCTTCTGCAGGATATTTGTTTATTTTGGAATGTTTTTTCTTTCTTCTACGTGGAGTTTGTATCTTTTGAATTTTAGTATCTTTTTTTATTTTATTTTATTTTTAAAAGACGATTGCTTATGCAGTTTGATCTCCTGAAGGATCAGTGTACGGAGTGTTAACTTTTTTTGATTAACTTAAATGATGGAAAACTCTTGGTCTCTCTTCTTTGCCCTGCAATTTGGATACATGTATTATTTGAATCCTTGACTCTTTCTCTGGCTGGTCATTCAGGTAGGTTCTTACAAATCTTGTTTTTTCCTTTCTCAGTGAACATGGGGAATATTTATTCTTAACTTCATAATAGATTTTGTTTAAGTATGCCCTTGTAGCTTCTTTTAGGTAAATCTTTCTCTGTCTGATTTTGTTGATGAAATTGTTCCAAATAAAAATGTATTGTTGCACTTGGGTATCCTTTTCTCTTGCAGAAATTCAATCAAATTGTGTATTTATAGTCTTCTATGTTATCCATTTGTTTTCAAAGCTTCGCTTAAATGTTGAAGCTTATTCTGTATTCAATACATACATCCACGGCCACATCTCACTCCCCTCACCCCACAACCAAGTACTCAGGATGTGACTTCTTATTTACCTTGATTATTGGAATGCCTGCCTGTAGGGGTTAGTTCATGTAGCAAGAAAGATGCCAAAAAATGCTCACGCTCATTTCATTCTTGGCCTAATGTATCAGAGGTTGGGCCAGCCACAAAAGGTAGCTTGGCTTGCACACCATACTTGTTTTGGTTAATTTTCTTGTTCTATTATTAATTAAAACTGTGGTACTCTTTCAGGCTGTTTCAGCATATGAGAAGGCAGAGGAAATCTTACTCCAAAGTGATGTTGAGATTCACAGGCCAGAGTTGCTCTCATTGGTCCAAACTCATCATGCACAGGTTACATTTTATATCCATGTTCAACCTTACCTCTCTCCTCTTACGGTATACTTCGTCTCTATTTGTAAATAATAGATATTACCAACCTGATTATGCTCACCTGTTCTATGAACTGTCTAGTGTCTCCTTCTAGAAAGTGCAGCAGGGGATAATAGTTCAGACAAAGAACTTGATCAAGAGGAGCTTGACGAGATTCTTTCTAAACTTAAGCATTCAATTCAATCTGATGTTAGACAGGCAGCTATGTGGAATACCCTAGGCTTGATACTTCTTTCTACTGGTCGAGTGAAGGTGCATGTTTATTACTATTCACTTCCGTGGTTGCTCTCCTTTTAGAAACTAATGTGGACTCATGTTAATATGTCCAGAGTGCCCTTTCAGTGTTATCATCCCTGTTGGCCATTGTCCCTAACAACTATGATTGCCTTGGAAACCT
TGGAATTGCTTATCTTCAAAGGTATATCAACAATCGTCCATGCTTGTCTGAAATCTAACTTTCATGCGTCTCTTCAGGTTTTTCTAACTTCAGTTCCAGTAAATGTTGAGGAGTTAGGGAACCAATACTAGAACTTGAAAAAATAAGCAACTGAAGGCAGTATACTCGACAAAATTGATTTGTACACCAATAAACATTAGTGATGATGCAAATGAGATAATCATGGAATTAATGCTAAAACCCATTGCTTCGTTAAAGTTAATATATGCATGTTCAATTGACAATTCTCGAATTACGCAGAAGGTCCTAAGCTTTCTTTATAGTCAGGCATGCTTCTTACTTCCAATTTTTTCCCTAAGAAATACAGTATCCTTACTGGCTTTGCACATAACAATTATCCTATGGCTATTTGTTAATTGGATAATTTTGAGTACTTTCGATGCTGTGCATGTATAATCACTTGTATGCCTTTTTGCATGTCCTGTCATTGTCGATAGTTAGTACAAGAAGTTCAGGCCTTGAGAATGTGCCAATTGGATTAATGCTATCATACTAAGTACATGCTACTGTTGGATATATGTTGTAGTGGAAACATGGAACTATCAGAAAAATGTTTTCAAGAATTGATCCTGAAAGATCAAAATCATCCTGCTGCTCTCATCAACTATGCTGCTTTTCTCTTGTGCAAGTATGGTTCTACTGTTGTAGGTATGTGCTTGAGGTGGTTTGTTTGTTTGTTAGAGAATTTACTTATGAAAAGGACATTGTTGTCGTTTTCCCTCTTATTTTGTAAATATTTTGAATGGGTTCGGTCATCACCTTTTTCTTTCCCCTTTTTTGGCCTACTTATTAGAACCTTAATTAGTTTGTGTTTCCCCATACGCAAAAAGTACTCATTCTTCTAAATTTTAAGGTGCTGGAGCAAATGCTGGTGAAGGTGGTGTTGATGAGAAGGTTGTAGGTATGAATGTTGCAAAGGAATGTTTGCTAGCGACTCTAAAGGTAGATCCAAAAGCAGCACATGCCTGGGCAAATCTTGCTAATGTGTATTTTGTGACTGGGGACCACAGAAGTTCTGCCAAGTGCTTGGAGAAGGTGCTCATGGATCTCTATTCTTTGGGATAAGGTTTTCTTCTCATGTCGTCTAGTTTTGTGGAAAAGATGGGATATTCAGTCTTTACCGGAAACCTTTGTAGAACTTTACCTTGTGTATGCATGGCTGTATGAATTCATTAATAGAAGGGTCTTGATGTTTTGGCCTCAAATTAGCTAATTCTCTCTCCTCTTGTAAATACCCAACAGGTAGCAAAACTGGAGCCAAATTGCATGGCTATGAGATATTCTGTTGCTATGCACCGGCTTAAGGATGCAGAAAGGTCTCAAGATCGTAGTGAGCAGCTCTCCTGGGCTGGAAATGAAATGGCCTCAATTATTAGAGATGGAGATGGCTTGACAATCGATCATCCTGTAGCATGGGCTGGGCTTTCCATGGTTCACAAGGCTCAGCATGAGATTGCTTCTGGATTTCGTACAGATCAAAGTGAACTGAGAGAAGTGGAAGAACACGCCGTCTACAGTTTATATCAGGTATCAATCTCTGGTTCTGAACGTATCTTTTGTCTAACCAAGCTACTAGAAAGTATGCTTATAATCATGCTTGGCCCTAAATTATCATTTGGTCTATGTTGGAACGGAAGCTGACAATATTTATTCGGTACGATATTGATGGTGATTGGACTTTTGAAGATCTCTTGTATGCTAATTATCATTTGGAAATCCCCATTTTGATAGCCTTGACTATTCTTTAATGAAAGATACAGCCTTAGCTTTGTTGTTGGTCCCCAATCGTCCACCTTGACCCATACAAGTGTTTTAGAAACACAATTTTTTTTATAAGCTCTTAGTCGGCTATCGAATTTTTATTCTTTTTATAACGTATAGCTTTTCGTCTTACCAAAAGGCTTTTGAAGGAATGAAAACCAAGGACTTCGTATCCATTC
ATTTCAATGTGGTTTCATGTTGTTCAACTATGAGATCCCACGTCGATTGGAGAGGGGAACAAAACATTTCTTATAAAGGTGTGGAAACCTCTCATTATCAGACGTGTTTTAAAACTGTGAGGCTGACAGCGATACGTAACAAACCAAAGCGAATAATATCAACTAGCGGTGAGCTTGAGCCGTTACAAATGATTTTAAAGCCAGACACCAAGCGGTGTGCCAGCGACGACGCCGGGCCTCTAAGGGGGGTGGATTGTGAGATCCACATCGGTTGGAGAGAGGAACGAAACATTTTTTATAAGGGTGTGGAAACCTCTCCATAGCAGATGCGTTTTAAAACTGTGAGGCTGACGGCAGTACGTAATGGGCCAAAGTAGACAATATTTACTAGCGGTGGGCTTAGGCTGTTACATCAACTGCCGAGTACTTGGATCACAACTTTTGTTATTTCCAGCTCTATGAATCTAGTTTTTTGGAATAATACAAGGAGCAAAGCAGCCTACTGATAGGAAAATTACATTTTTGTGGCAAATCTTTATACATTGTTTTAATTTTGAAAGTCTACAATAATATATTTCCATCTTCTATTATTGATGTTCTTGTTAGATTGATGGTGGGTATTGCGTTACTCTTAACCTATAGGGAATAGCTGAGGACCCAGATGATGCTGTTCAGTGGCATCAATTCGGCCTCCACAGTCTCTGTACACGAGAATTTAAAACATCACAAAGATACCTCAAAGCTGCAATTGCTCGCTTTAAGAACTGTAGCTATGCATGGTCAAACCTAGGTATGTCAATACAATTTAACGTCGATTTTAGTAAATCTCTTGGATCTCCGTCTCTCATTAGTCAATCGTGTTCCATATTTTTGTATCGTTAGGTATCTCGTTGCAACTCTCAGACAACCCGACACAGGCCGAAGAAGTATACAAGAAAGCTTTGTCATTGGTGGCCACAGAACAAGCACATTCCGTATTTTGCAACCTCGGAAATCTGTATCGACAGCAAAAACAGTATGAACGTGCCAAAGCTATGTTCTCAAAGTCTCTGGAACTGCAACCTGGTTATGCTCCTGCATTTAACAATCTAGGATTAGTGTTTATTGCAGAGGGTCGATGGGAGGAGGCTAAGTATTGTTTTGAGAAAGCTCTTGAGGCTGATCCATTGCTCGATTCAGCTAAGTCGAATTTGATTAAAACAGTAGCGGTATCTCGATTATGTAATAGCTTATCATCGTGTCGTGTTAAAGATTGAAACCTTTGCCATTGGCTTCATCCTCATTAGATTAGGATGTAGAATATACATGATATTTCTTACAAGGAAATGTAAAAGCAGCATTTACTGAATGAAGATGCTTCCTGGATATATCTCCCCAGGCTGTACTTCCTTGATGGGAACCAATGTGTTCATCAGGACCATTGGTTCAGTTAGGTTAGTTTCTAATATATTTGATTCTCACTCGCAAACTCTTCGGTTCTATAGTTCATAAAACTTAGGTTCTCGTCTCTTCGGTTCTATAGTTCATAAAACTTAGGTTCTCTTCTGTACCTGATTGTGGTTAGGACCTGATTGTGGTTAGGATGAATTCCTGAATTTTGAAGTTTGCG

mRNA sequence

TGAATGATCATATGTTTGAATTGTATGAACACTCCGGTTACTCGACAAAAGGGGAAAAAATAATAATAAAGAAATTCTTAACCACGTCACCTTAATTTTTTCTTTAATCCTGCGTTAAAATTACACCAAAAATGTCAGCAAGCGTAAGTCGGTGCAGGTTGTTTCCGCATCCAAACAGCGGCTAAGCAATCCGCGCAACCAAACAAGGGTCTTCTTCTTTCATTCTTTACTTCTTTCGCGTCGACGGTCGCGCCACAAAGTCTTTAAGCGCCGAAAAGCTCCAGCTGCTCCCAAAATGCCTCTCAAGACAGAACATGGAGCATCGGACAGCTCCCTCGACGAGCACTCGAAGGCGGCCCATTCTTCTAAAGTCGTTGTTCTTGCTGACCTCAATGTTGATCCTCCAGAAATGGACGACGACAGTTCCGTTCACGTTTCAGCTTCGACTATCTCTAGGTTATCTGTGGATGAGAGCAATCAAGACAAAACTGTGGCAATTTGCAAGGATACTAATACAATGGAAGTTGAAGGTAGACATGTAAGCAAAATAGGGAAGTGCCGTTCAAGAAACAATAAGGTAGAGTACTCTCTTGATTCTGCAGCTGATGCAGATGGTGATCAACATGGTCAAGGTGTTTCAACTTCACGCGAAGAAAAAGTCAGCAGCCTCAAAACTGGGTTAGTTCATGTAGCAAGAAAGATGCCAAAAAATGCTCACGCTCATTTCATTCTTGGCCTAATGTATCAGAGGTTGGGCCAGCCACAAAAGGCTGTTTCAGCATATGAGAAGGCAGAGGAAATCTTACTCCAAAGTGATGTTGAGATTCACAGGCCAGAGTTGCTCTCATTGGTCCAAACTCATCATGCACAGTGTCTCCTTCTAGAAAGTGCAGCAGGGGATAATAGTTCAGACAAAGAACTTGATCAAGAGGAGCTTGACGAGATTCTTTCTAAACTTAAGCATTCAATTCAATCTGATGTTAGACAGGCAGCTATGTGGAATACCCTAGGCTTGATACTTCTTTCTACTGGTCGAGTGAAGAGTGCCCTTTCAGTGTTATCATCCCTGTTGGCCATTGTCCCTAACAACTATGATTGCCTTGGAAACCTTGGAATTGCTTATCTTCAAAGTGGAAACATGGAACTATCAGAAAAATGTTTTCAAGAATTGATCCTGAAAGATCAAAATCATCCTGCTGCTCTCATCAACTATGCTGCTTTTCTCTTGTGCAAGTATGGTTCTACTGTTGTAGGTGCTGGAGCAAATGCTGGTGAAGGTGGTGTTGATGAGAAGGTTGTAGGTATGAATGTTGCAAAGGAATGTTTGCTAGCGACTCTAAAGGTAGATCCAAAAGCAGCACATGCCTGGGCAAATCTTGCTAATGTGTATTTTGTGACTGGGGACCACAGAAGTTCTGCCAAGTGCTTGGAGAAGGTAGCAAAACTGGAGCCAAATTGCATGGCTATGAGATATTCTGTTGCTATGCACCGGCTTAAGGATGCAGAAAGGTCTCAAGATCGTAGTGAGCAGCTCTCCTGGGCTGGAAATGAAATGGCCTCAATTATTAGAGATGGAGATGGCTTGACAATCGATCATCCTGTAGCATGGGCTGGGCTTTCCATGGTTCACAAGGCTCAGCATGAGATTGCTTCTGGATTTCGTACAGATCAAAGTGAACTGAGAGAAGTGGAAGAACACGCCGTCTACAGTTTATATCAGGGAATAGCTGAGGACCCAGATGATGCTGTTCAGTGGCATCAATTCGGCCTCCACAGTCTCTGTACACGAGAATTTAAAACATCACAAAGATACCTCAAAGCTGCAATTGCTCGCTTTAAGAACTGTAGCTATGCATGGTCAAACCTAGGTATCTCGTTGCAACTCTCAGACAACCCGACACAGGCCGAAGAAGTATACAAGAAAGCTTTGTCATTGGTGGCCACAGAACAAGCACATTCCGTATTTTGCAACCTCGGAAATCTGTATCGACAGCAAAAAC
AGTATGAACGTGCCAAAGCTATGTTCTCAAAGTCTCTGGAACTGCAACCTGGTTATGCTCCTGCATTTAACAATCTAGGATTAGTGTTTATTGCAGAGGGTCGATGGGAGGAGGCTAAGTATTGTTTTGAGAAAGCTCTTGAGGCTGATCCATTGCTCGATTCAGCTAAGTCGAATTTGATTAAAACAGTAGCGGTATCTCGATTATGTAATAGCTTATCATCGTGTCGTGTTAAAGATTGAAACCTTTGCCATTGGCTTCATCCTCATTAGATTAGGATGTAGAATATACATGATATTTCTTACAAGGAAATGTAAAAGCAGCATTTACTGAATGAAGATGCTTCCTGGATATATCTCCCCAGGCTGTACTTCCTTGATGGGAACCAATGTGTTCATCAGGACCATTGGTTCAGTTAGGTTAGTTTCTAATATATTTGATTCTCACTCGCAAACTCTTCGGTTCTATAGTTCATAAAACTTAGGTTCTCGTCTCTTCGGTTCTATAGTTCATAAAACTTAGGTTCTCTTCTGTACCTGATTGTGGTTAGGACCTGATTGTGGTTAGGATGAATTCCTGAATTTTGAAGTTTGCG

Coding sequence (CDS)

ATGCCTCTCAAGACAGAACATGGAGCATCGGACAGCTCCCTCGACGAGCACTCGAAGGCGGCCCATTCTTCTAAAGTCGTTGTTCTTGCTGACCTCAATGTTGATCCTCCAGAAATGGACGACGACAGTTCCGTTCACGTTTCAGCTTCGACTATCTCTAGGTTATCTGTGGATGAGAGCAATCAAGACAAAACTGTGGCAATTTGCAAGGATACTAATACAATGGAAGTTGAAGGTAGACATGTAAGCAAAATAGGGAAGTGCCGTTCAAGAAACAATAAGGTAGAGTACTCTCTTGATTCTGCAGCTGATGCAGATGGTGATCAACATGGTCAAGGTGTTTCAACTTCACGCGAAGAAAAAGTCAGCAGCCTCAAAACTGGGTTAGTTCATGTAGCAAGAAAGATGCCAAAAAATGCTCACGCTCATTTCATTCTTGGCCTAATGTATCAGAGGTTGGGCCAGCCACAAAAGGCTGTTTCAGCATATGAGAAGGCAGAGGAAATCTTACTCCAAAGTGATGTTGAGATTCACAGGCCAGAGTTGCTCTCATTGGTCCAAACTCATCATGCACAGTGTCTCCTTCTAGAAAGTGCAGCAGGGGATAATAGTTCAGACAAAGAACTTGATCAAGAGGAGCTTGACGAGATTCTTTCTAAACTTAAGCATTCAATTCAATCTGATGTTAGACAGGCAGCTATGTGGAATACCCTAGGCTTGATACTTCTTTCTACTGGTCGAGTGAAGAGTGCCCTTTCAGTGTTATCATCCCTGTTGGCCATTGTCCCTAACAACTATGATTGCCTTGGAAACCTTGGAATTGCTTATCTTCAAAGTGGAAACATGGAACTATCAGAAAAATGTTTTCAAGAATTGATCCTGAAAGATCAAAATCATCCTGCTGCTCTCATCAACTATGCTGCTTTTCTCTTGTGCAAGTATGGTTCTACTGTTGTAGGTGCTGGAGCAAATGCTGGTGAAGGTGGTGTTGATGAGAAGGTTGTAGGTATGAATGTTGCAAAGGAATGTTTGCTAGCGACTCTAAAGGTAGATCCAAAAGCAGCACATGCCTGGGCAAATCTTGCTAATGTGTATTTTGTGACTGGGGACCACAGAAGTTCTGCCAAGTGCTTGGAGAAGGTAGCAAAACTGGAGCCAAATTGCATGGCTATGAGATATTCTGTTGCTATGCACCGGCTTAAGGATGCAGAAAGGTCTCAAGATCGTAGTGAGCAGCTCTCCTGGGCTGGAAATGAAATGGCCTCAATTATTAGAGATGGAGATGGCTTGACAATCGATCATCCTGTAGCATGGGCTGGGCTTTCCATGGTTCACAAGGCTCAGCATGAGATTGCTTCTGGATTTCGTACAGATCAAAGTGAACTGAGAGAAGTGGAAGAACACGCCGTCTACAGTTTATATCAGGGAATAGCTGAGGACCCAGATGATGCTGTTCAGTGGCATCAATTCGGCCTCCACAGTCTCTGTACACGAGAATTTAAAACATCACAAAGATACCTCAAAGCTGCAATTGCTCGCTTTAAGAACTGTAGCTATGCATGGTCAAACCTAGGTATCTCGTTGCAACTCTCAGACAACCCGACACAGGCCGAAGAAGTATACAAGAAAGCTTTGTCATTGGTGGCCACAGAACAAGCACATTCCGTATTTTGCAACCTCGGAAATCTGTATCGACAGCAAAAACAGTATGAACGTGCCAAAGCTATGTTCTCAAAGTCTCTGGAACTGCAACCTGGTTATGCTCCTGCATTTAACAATCTAGGATTAGTGTTTATTGCAGAGGGTCGATGGGAGGAGGCTAAGTATTGTTTTGAGAAAGCTCTTGAGGCTGATCCATTGCTCGATTCAGCTAAGTCGAATTTGATTAAAACAGTAGCGGTATCTCGATTATGTAATAGCTTATCATCGTGTCGTGTTAAAGATTGA

Protein sequence

MPLKTEHGASDSSLDEHSKAAHSSKVVVLADLNVDPPEMDDDSSVHVSASTISRLSVDESNQDKTVAICKDTNTMEVEGRHVSKIGKCRSRNNKVEYSLDSAADADGDQHGQGVSTSREEKVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAVSAYEKAEEILLQSDVEIHRPELLSLVQTHHAQCLLLESAAGDNSSDKELDQEELDEILSKLKHSIQSDVRQAAMWNTLGLILLSTGRVKSALSVLSSLLAIVPNNYDCLGNLGIAYLQSGNMELSEKCFQELILKDQNHPAALINYAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKECLLATLKVDPKAAHAWANLANVYFVTGDHRSSAKCLEKVAKLEPNCMAMRYSVAMHRLKDAERSQDRSEQLSWAGNEMASIIRDGDGLTIDHPVAWAGLSMVHKAQHEIASGFRTDQSELREVEEHAVYSLYQGIAEDPDDAVQWHQFGLHSLCTREFKTSQRYLKAAIARFKNCSYAWSNLGISLQLSDNPTQAEEVYKKALSLVATEQAHSVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIAEGRWEEAKYCFEKALEADPLLDSAKSNLIKTVAVSRLCNSLSSCRVKD
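
The protein sequence is the CDS read codon by codon up to the stop codon. The sketch below assumes the standard genetic code (NCBI translation table 1), which the page does not name explicitly. `translate` is a hypothetical helper, and only the first seven codons and the last five codons of the CDS above are used as input.

```python
from itertools import product

# Standard genetic code, laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGCCTCTCAAGACAGAACAT"))  # MPLKTEH
print(translate("CGTGTTAAAGATTGA"))        # RVKD
```

Both fragments agree with the start (`MPLKTEH`) and end (`RVKD`) of the protein sequence listed above.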
BLAST of Cp4.1LG20g03450 vs. Swiss-Prot
Match: OGT1_RAT (UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase 110 kDa subunit OS=Rattus norvegicus GN=Ogt PE=1 SV=1)

HSP 1 Score: 62.8 bits (151), Expect = 1.6e-08
Identity = 88/409 (21.52%), Positives = 170/409 (41.56%), Query Frame = 1

Query: 224 SIQSDVRQAAMWNTLGLILLSTGRVKSALSVLSSLLAIVPNNYDCLGNLGIAYLQSGNME 283
           +I+ +   A  ++ LG +    G+++ A+      L + P+  D   NL  A + +G+ME
Sbjct: 71  AIKQNPLLAEAYSNLGNVYKERGQLQEAIEHYRHALRLKPDFIDGYINLAAALVAAGDME 130

Query: 284 LSEKCFQELILKDQNHPAALINYAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKEC 343
            + + +            + + Y   L C            +  G + + +  +  AK C
Sbjct: 131 GAVQAY-----------VSALQYNPDLYC----------VRSDLGNLLKALGRLEEAKAC 190

Query: 344 LLATLKVDPKAAHAWANLANVYFVTGDHRSSAKCLEKVAKLEPNCMAMRYSVAMHRLKDA 403
            L  ++  P  A AW+NL  V+   G+   +    EK   L+PN +    ++  + LK+A
Sbjct: 191 YLKAIETQPNFAVAWSNLGCVFNAQGEIWLAIHHFEKAVTLDPNFLDAYINLG-NVLKEA 250

Query: 404 ERSQDRSEQLSWAGNEMASIIRDGDGLTIDHPVAWAGLSMVHKAQHEIASGFRTDQSELR 463
            R  DR+         +A+ +R    L+ +H V    L+ V+  Q  I     T +  + 
Sbjct: 251 -RIFDRA---------VAAYLR-ALSLSPNHAVVHGNLACVYYEQGLIDLAIDTYRRAI- 310

Query: 464 EVEEH---AVYSLYQGIAEDPDDAVQWHQFGLHSLCTREFKTSQRYLKAAIARFKNCSYA 523
           E++ H   A  +L   + E    A                  ++     A+      + +
Sbjct: 311 ELQPHFPDAYCNLANALKEKGSVA-----------------EAEDCYNTALRLCPTHADS 370

Query: 524 WSNLGISLQLSDNPTQAEEVYKKALSLVAT-EQAHSVFCNLGNLYRQQKQYERAKAMFSK 583
            +NL    +   N  +A  +Y+KAL +      AHS   NL ++ +QQ + + A   + +
Sbjct: 371 LNNLANIKREQGNIEEAVRLYRKALEVFPEFAAAHS---NLASVLQQQGKLQEALMHYKE 425

Query: 584 SLELQPGYAPAFNNLGLVFIAEGRWEEAKYCFEKALEADPLLDSAKSNL 629
           ++ + P +A A++N+G         + A  C+ +A++ +P    A SNL
Sbjct: 431 AIRISPTFADAYSNMGNTLKEMQDVQGALQCYTRAIQINPAFADAHSNL 425
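
The percentages in the HSP header follow from the counts given beside them: identical or positive-scoring columns divided by the alignment length. A minimal sketch, assuming the percentages are rounded to two decimal places; `percent` is a hypothetical helper name.

```python
def percent(matches, aligned_length):
    """Share of alignment columns that match, as a percentage with two decimals."""
    return round(100.0 * matches / aligned_length, 2)


# Counts taken from the HSP header above: 88 identical and 170 positive
# columns over an alignment of 409 columns.
print(percent(88, 409))   # 21.52
print(percent(170, 409))  # 41.56
```

An identity of roughly 21% over 409 columns, together with an Expect value of 1.6e-08, indicates a distant similarity, which is consistent with shared tetratricopeptide repeats rather than close homology across the whole protein.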

BLAST of Cp4.1LG20g03450 vs. Swiss-Prot
Match: OGT1_RABIT (UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase 110 kDa subunit OS=Oryctolagus cuniculus GN=OGT PE=1 SV=2)

HSP 1 Score: 62.8 bits (151), Expect = 1.6e-08
Identity = 88/409 (21.52%), Positives = 170/409 (41.56%), Query Frame = 1

Query: 224 SIQSDVRQAAMWNTLGLILLSTGRVKSALSVLSSLLAIVPNNYDCLGNLGIAYLQSGNME 283
           +I+ +   A  ++ LG +    G+++ A+      L + P+  D   NL  A + +G+ME
Sbjct: 81  AIKQNPLLAEAYSNLGNVYKERGQLQEAIEHYRHALRLKPDFIDGYINLAAALVAAGDME 140

Query: 284 LSEKCFQELILKDQNHPAALINYAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKEC 343
            + + +            + + Y   L C            +  G + + +  +  AK C
Sbjct: 141 GAVQAY-----------VSALQYNPDLYC----------VRSDLGNLLKALGRLEEAKAC 200

Query: 344 LLATLKVDPKAAHAWANLANVYFVTGDHRSSAKCLEKVAKLEPNCMAMRYSVAMHRLKDA 403
            L  ++  P  A AW+NL  V+   G+   +    EK   L+PN +    ++  + LK+A
Sbjct: 201 YLKAIETQPNFAVAWSNLGCVFNAQGEIWLAIHHFEKAVTLDPNFLDAYINLG-NVLKEA 260

Query: 404 ERSQDRSEQLSWAGNEMASIIRDGDGLTIDHPVAWAGLSMVHKAQHEIASGFRTDQSELR 463
            R  DR+         +A+ +R    L+ +H V    L+ V+  Q  I     T +  + 
Sbjct: 261 -RIFDRA---------VAAYLR-ALSLSPNHAVVHGNLACVYYEQGLIDLAIDTYRRAI- 320

Query: 464 EVEEH---AVYSLYQGIAEDPDDAVQWHQFGLHSLCTREFKTSQRYLKAAIARFKNCSYA 523
           E++ H   A  +L   + E    A                  ++     A+      + +
Sbjct: 321 ELQPHFPDAYCNLANALKEKGSVA-----------------EAEDCYNTALRLCPTHADS 380

Query: 524 WSNLGISLQLSDNPTQAEEVYKKALSLVAT-EQAHSVFCNLGNLYRQQKQYERAKAMFSK 583
            +NL    +   N  +A  +Y+KAL +      AHS   NL ++ +QQ + + A   + +
Sbjct: 381 LNNLANIKREQGNIEEAVRLYRKALEVFPEFAAAHS---NLASVLQQQGKLQEALMHYKE 435

Query: 584 SLELQPGYAPAFNNLGLVFIAEGRWEEAKYCFEKALEADPLLDSAKSNL 629
           ++ + P +A A++N+G         + A  C+ +A++ +P    A SNL
Sbjct: 441 AIRISPTFADAYSNMGNTLKEMQDVQGALQCYTRAIQINPAFADAHSNL 435

BLAST of Cp4.1LG20g03450 vs. Swiss-Prot
Match: OGT1_PIG (UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase 110 kDa subunit OS=Sus scrofa GN=OGT PE=2 SV=1)

HSP 1 Score: 62.8 bits (151), Expect = 1.6e-08
Identity = 88/409 (21.52%), Positives = 170/409 (41.56%), Query Frame = 1

Query: 224 SIQSDVRQAAMWNTLGLILLSTGRVKSALSVLSSLLAIVPNNYDCLGNLGIAYLQSGNME 283
           +I+ +   A  ++ LG +    G+++ A+      L + P+  D   NL  A + +G+ME
Sbjct: 81  AIKQNPLLAEAYSNLGNVYKERGQLQEAIEHYRHALRLKPDFIDGYINLAAALVAAGDME 140

Query: 284 LSEKCFQELILKDQNHPAALINYAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKEC 343
            + + +            + + Y   L C            +  G + + +  +  AK C
Sbjct: 141 GAVQAY-----------VSALQYNPDLYC----------VRSDLGNLLKALGRLEEAKAC 200

Query: 344 LLATLKVDPKAAHAWANLANVYFVTGDHRSSAKCLEKVAKLEPNCMAMRYSVAMHRLKDA 403
            L  ++  P  A AW+NL  V+   G+   +    EK   L+PN +    ++  + LK+A
Sbjct: 201 YLKAIETQPNFAVAWSNLGCVFNAQGEIWLAIHHFEKAVTLDPNFLDAYINLG-NVLKEA 260

Query: 404 ERSQDRSEQLSWAGNEMASIIRDGDGLTIDHPVAWAGLSMVHKAQHEIASGFRTDQSELR 463
            R  DR+         +A+ +R    L+ +H V    L+ V+  Q  I     T +  + 
Sbjct: 261 -RIFDRA---------VAAYLR-ALSLSPNHAVVHGNLACVYYEQGLIDLAIDTYRRAI- 320

Query: 464 EVEEH---AVYSLYQGIAEDPDDAVQWHQFGLHSLCTREFKTSQRYLKAAIARFKNCSYA 523
           E++ H   A  +L   + E    A                  ++     A+      + +
Sbjct: 321 ELQPHFPDAYCNLANALKEKGSVA-----------------EAEDCYNTALRLCPTHADS 380

Query: 524 WSNLGISLQLSDNPTQAEEVYKKALSLVAT-EQAHSVFCNLGNLYRQQKQYERAKAMFSK 583
            +NL    +   N  +A  +Y+KAL +      AHS   NL ++ +QQ + + A   + +
Sbjct: 381 LNNLANIKREQGNIEEAVRLYRKALEVFPEFAAAHS---NLASVLQQQGKLQEALMHYKE 435

Query: 584 SLELQPGYAPAFNNLGLVFIAEGRWEEAKYCFEKALEADPLLDSAKSNL 629
           ++ + P +A A++N+G         + A  C+ +A++ +P    A SNL
Sbjct: 441 AIRISPTFADAYSNMGNTLKEMQDVQGALQCYTRAIQINPAFADAHSNL 435

BLAST of Cp4.1LG20g03450 vs. Swiss-Prot
Match: OGT1_MOUSE (UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase 110 kDa subunit OS=Mus musculus GN=Ogt PE=1 SV=2)

HSP 1 Score: 62.8 bits (151), Expect = 1.6e-08
Identity = 88/409 (21.52%), Positives = 170/409 (41.56%), Query Frame = 1

Query: 224 SIQSDVRQAAMWNTLGLILLSTGRVKSALSVLSSLLAIVPNNYDCLGNLGIAYLQSGNME 283
           +I+ +   A  ++ LG +    G+++ A+      L + P+  D   NL  A + +G+ME
Sbjct: 81  AIKQNPLLAEAYSNLGNVYKERGQLQEAIEHYRHALRLKPDFIDGYINLAAALVAAGDME 140

Query: 284 LSEKCFQELILKDQNHPAALINYAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKEC 343
            + + +            + + Y   L C            +  G + + +  +  AK C
Sbjct: 141 GAVQAY-----------VSALQYNPDLYC----------VRSDLGNLLKALGRLEEAKAC 200

Query: 344 LLATLKVDPKAAHAWANLANVYFVTGDHRSSAKCLEKVAKLEPNCMAMRYSVAMHRLKDA 403
            L  ++  P  A AW+NL  V+   G+   +    EK   L+PN +    ++  + LK+A
Sbjct: 201 YLKAIETQPNFAVAWSNLGCVFNAQGEIWLAIHHFEKAVTLDPNFLDAYINLG-NVLKEA 260

Query: 404 ERSQDRSEQLSWAGNEMASIIRDGDGLTIDHPVAWAGLSMVHKAQHEIASGFRTDQSELR 463
            R  DR+         +A+ +R    L+ +H V    L+ V+  Q  I     T +  + 
Sbjct: 261 -RIFDRA---------VAAYLR-ALSLSPNHAVVHGNLACVYYEQGLIDLAIDTYRRAI- 320

Query: 464 EVEEH---AVYSLYQGIAEDPDDAVQWHQFGLHSLCTREFKTSQRYLKAAIARFKNCSYA 523
           E++ H   A  +L   + E    A                  ++     A+      + +
Sbjct: 321 ELQPHFPDAYCNLANALKEKGSVA-----------------EAEDCYNTALRLCPTHADS 380

Query: 524 WSNLGISLQLSDNPTQAEEVYKKALSLVAT-EQAHSVFCNLGNLYRQQKQYERAKAMFSK 583
            +NL    +   N  +A  +Y+KAL +      AHS   NL ++ +QQ + + A   + +
Sbjct: 381 LNNLANIKREQGNIEEAVRLYRKALEVFPEFAAAHS---NLASVLQQQGKLQEALMHYKE 435

Query: 584 SLELQPGYAPAFNNLGLVFIAEGRWEEAKYCFEKALEADPLLDSAKSNL 629
           ++ + P +A A++N+G         + A  C+ +A++ +P    A SNL
Sbjct: 441 AIRISPTFADAYSNMGNTLKEMQDVQGALQCYTRAIQINPAFADAHSNL 435

BLAST of Cp4.1LG20g03450 vs. Swiss-Prot
Match: OGT1_HUMAN (UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase 110 kDa subunit OS=Homo sapiens GN=OGT PE=1 SV=3)

HSP 1 Score: 62.8 bits (151), Expect = 1.6e-08
Identity = 88/409 (21.52%), Positives = 170/409 (41.56%), Query Frame = 1

Query: 224 SIQSDVRQAAMWNTLGLILLSTGRVKSALSVLSSLLAIVPNNYDCLGNLGIAYLQSGNME 283
           +I+ +   A  ++ LG +    G+++ A+      L + P+  D   NL  A + +G+ME
Sbjct: 81  AIKQNPLLAEAYSNLGNVYKERGQLQEAIEHYRHALRLKPDFIDGYINLAAALVAAGDME 140

Query: 284 LSEKCFQELILKDQNHPAALINYAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKEC 343
            + + +            + + Y   L C            +  G + + +  +  AK C
Sbjct: 141 GAVQAY-----------VSALQYNPDLYC----------VRSDLGNLLKALGRLEEAKAC 200

Query: 344 LLATLKVDPKAAHAWANLANVYFVTGDHRSSAKCLEKVAKLEPNCMAMRYSVAMHRLKDA 403
            L  ++  P  A AW+NL  V+   G+   +    EK   L+PN +    ++  + LK+A
Sbjct: 201 YLKAIETQPNFAVAWSNLGCVFNAQGEIWLAIHHFEKAVTLDPNFLDAYINLG-NVLKEA 260

Query: 404 ERSQDRSEQLSWAGNEMASIIRDGDGLTIDHPVAWAGLSMVHKAQHEIASGFRTDQSELR 463
            R  DR+         +A+ +R    L+ +H V    L+ V+  Q  I     T +  + 
Sbjct: 261 -RIFDRA---------VAAYLR-ALSLSPNHAVVHGNLACVYYEQGLIDLAIDTYRRAI- 320

Query: 464 EVEEH---AVYSLYQGIAEDPDDAVQWHQFGLHSLCTREFKTSQRYLKAAIARFKNCSYA 523
           E++ H   A  +L   + E    A                  ++     A+      + +
Sbjct: 321 ELQPHFPDAYCNLANALKEKGSVA-----------------EAEDCYNTALRLCPTHADS 380

Query: 524 WSNLGISLQLSDNPTQAEEVYKKALSLVAT-EQAHSVFCNLGNLYRQQKQYERAKAMFSK 583
            +NL    +   N  +A  +Y+KAL +      AHS   NL ++ +QQ + + A   + +
Sbjct: 381 LNNLANIKREQGNIEEAVRLYRKALEVFPEFAAAHS---NLASVLQQQGKLQEALMHYKE 435

Query: 584 SLELQPGYAPAFNNLGLVFIAEGRWEEAKYCFEKALEADPLLDSAKSNL 629
           ++ + P +A A++N+G         + A  C+ +A++ +P    A SNL
Sbjct: 441 AIRISPTFADAYSNMGNTLKEMQDVQGALQCYTRAIQINPAFADAHSNL 435

BLAST of Cp4.1LG20g03450 vs. TrEMBL
Match: A0A0A0LTP6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G050030 PE=4 SV=1)

HSP 1 Score: 1139.0 bits (2945), Expect = 0.0e+00
Identity = 581/648 (89.66%), Positives = 609/648 (93.98%), Query Frame = 1

Query: 1   MPLKTEHGASDSSLDEHSKAAHSSKVVVLADLNVDPPEMDDDSSVHVSASTISRLSVDES 60
           MPLKTEHGA DSSLD+HSKA +SSKVVVLADLNVDPPEMDDDSSVHVSASTISRLSVDES
Sbjct: 1   MPLKTEHGAPDSSLDDHSKAVYSSKVVVLADLNVDPPEMDDDSSVHVSASTISRLSVDES 60

Query: 61  NQDKTVAICKDTNTMEVEGRHVSKIGKCRSRNNKVEYSLDSAADADGDQHGQGVSTSREE 120
           N DKT  ICKDTN MEVEGR VSKIGKCRSRNNKVEYSLDSAAD DGDQ  QGVSTSREE
Sbjct: 61  NHDKTTEICKDTNAMEVEGRRVSKIGKCRSRNNKVEYSLDSAADPDGDQPCQGVSTSREE 120

Query: 121 KVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAVSAYEKAEEILLQSDVEIHRP 180
           KVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAV AYEKAEEILLQSDVEIHRP
Sbjct: 121 KVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAVLAYEKAEEILLQSDVEIHRP 180

Query: 181 ELLSLVQTHHAQCLLLESAAGDNSSDKELDQEELDEILSKLKHSIQSDVRQAAMWNTLGL 240
           E LSL+Q HHAQCLLLES  GDN+S++EL+QEELD++ SKLKHS+QSDVRQAA+WNTLGL
Sbjct: 181 EFLSLIQIHHAQCLLLESV-GDNTSNEELEQEELDDVCSKLKHSMQSDVRQAAVWNTLGL 240

Query: 241 ILLSTGRVKSALSVLSSLLAIVPNNYDCLGNLGIAYLQSGNMELSEKCFQELILKDQNHP 300
           +LL+TGRVKSA++VLSSLLAIVPNN DCLGNLGIAYLQSGNMELSEKCFQELIL DQNH 
Sbjct: 241 LLLTTGRVKSAITVLSSLLAIVPNNCDCLGNLGIAYLQSGNMELSEKCFQELILTDQNHL 300

Query: 301 AALINYAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKECLLATLKVDPKAAHAWAN 360
           AAL+ YAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKECLLA LKVDPKAAHAWAN
Sbjct: 301 AALVYYAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKECLLAALKVDPKAAHAWAN 360

Query: 361 LANVYFVTGDHRSSAKCLEKVAKLEPNCMAMRYSVAMHRLKDAERSQDRSEQLSWAGNEM 420
           LAN YFVTGDHRSSAKCLEK AKLEPNCM+MRY+VAMHRLKDAERSQDRSEQLSWAGNEM
Sbjct: 361 LANAYFVTGDHRSSAKCLEKGAKLEPNCMSMRYAVAMHRLKDAERSQDRSEQLSWAGNEM 420

Query: 421 ASIIRDGDGLTIDHPVAWAGLSMVHKAQHEIASGFRTDQSELREVEEHAVYSLYQGIAED 480
           AS+IRDGDGLTIDH VAWAG SMVHK QHEIA+GFRTD SELRE E+HAVYSL Q IAED
Sbjct: 421 ASVIRDGDGLTIDHSVAWAGFSMVHKIQHEIAAGFRTDLSELREKEDHAVYSLNQAIAED 480

Query: 481 PDDAVQWHQFGLHSLCTREFKTSQRYLKAAIARFKNCSYAWSNLGISLQLSDNPTQAEEV 540
            DDAVQWHQFGLHSLCTREFKTSQRYLKAAIARFK CS+AWSNLGISLQL  NPT+AEEV
Sbjct: 481 TDDAVQWHQFGLHSLCTREFKTSQRYLKAAIARFKKCSFAWSNLGISLQLPKNPTEAEEV 540

Query: 541 YKKALSLVATEQAHSVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIA 600
           Y+KALSLVATEQAH+VFCNLGNLYRQQKQYERAKAMFSK+L LQ GYAPAFNNLGLVFIA
Sbjct: 541 YRKALSLVATEQAHTVFCNLGNLYRQQKQYERAKAMFSKTLGLQLGYAPAFNNLGLVFIA 600

Query: 601 EGRWEEAKYCFEKALEADPLLDSAKSNLIKTVAVSRLCNSLSSCRVKD 649
           EG+WEEAKYCFEKALEADPLLDSA SNL+KTVAV RLCNSLSSC VKD
Sbjct: 601 EGQWEEAKYCFEKALEADPLLDSANSNLLKTVAVHRLCNSLSSCHVKD 647

BLAST of Cp4.1LG20g03450 vs. TrEMBL
Match: D7SHD3_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_17s0000g09890 PE=4 SV=1)

HSP 1 Score: 936.4 bits (2419), Expect = 1.8e-269
Identity = 463/648 (71.45%), Positives = 557/648 (85.96%), Query Frame = 1

Query: 1   MPLKTEHGASDSSLDEHSKAAHSSKVVVLADLNVDPPEMDDDSSVHVSASTISRLSVDES 60
           +P+K+E G +++S DE SK    SKVVVLADLNVDPPE DDD S+HVSA  ++RL+ D+S
Sbjct: 12  LPIKSEVGVTENSADESSKRPQISKVVVLADLNVDPPETDDDDSLHVSAPDLTRLTNDDS 71

Query: 61  NQDKTVAICKDTNTMEVEGRHVSKIGKCRSRNNKVEYSLDSAADADGDQHGQGVSTSREE 120
           +QDK+  + KDT+ ++ EG+ ++K+GK RSR  KVEY LD  ADAD DQHGQG  TSREE
Sbjct: 72  SQDKSTLVSKDTDMVDGEGKRLNKLGKPRSRVTKVEYPLDYGADADADQHGQGAPTSREE 131

Query: 121 KVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAVSAYEKAEEILLQSDVEIHRP 180
           KVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAVSAYEKA EILL+ + EI RP
Sbjct: 132 KVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAVSAYEKAAEILLRCEEEIDRP 191

Query: 181 ELLSLVQTHHAQCLLLESAAGDNSSDKELDQEELDEILSKLKHSIQSDVRQAAMWNTLGL 240
           ELLSLVQ HHAQCLLL S+ GD+S+DKEL+ EEL+EIL K+K S+QSD+RQAA+WNTLGL
Sbjct: 192 ELLSLVQIHHAQCLLLGSS-GDHSADKELEPEELEEILLKMKDSMQSDIRQAAVWNTLGL 251

Query: 241 ILLSTGRVKSALSVLSSLLAIVPNNYDCLGNLGIAYLQSGNMELSEKCFQELILKDQNHP 300
           ILL TGR+++A+SVLSSLL I P+N DCLGNLGIAYL+SGN+EL+EKCFQ LILKDQNHP
Sbjct: 252 ILLRTGRLQNAISVLSSLLTIAPDNLDCLGNLGIAYLRSGNLELAEKCFQNLILKDQNHP 311

Query: 301 AALINYAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKECLLATLKVDPKAAHAWAN 360
           AALINYAA L+CKYGS + GAGAN+GEG  +++++  NVAKECLLA +KV+PKAAH WAN
Sbjct: 312 AALINYAAVLMCKYGSIIAGAGANSGEGASEDQLIAANVAKECLLAAVKVEPKAAHVWAN 371

Query: 361 LANVYFVTGDHRSSAKCLEKVAKLEPNCMAMRYSVAMHRLKDAERSQDRSEQLSWAGNEM 420
           LAN Y++ GD RSS+KC EK AKLEPNCM+ RY+VA+H++KDAER QD SEQLSWAGNEM
Sbjct: 372 LANAYYLMGDCRSSSKCFEKAAKLEPNCMSTRYAVAVHQIKDAERYQDPSEQLSWAGNEM 431

Query: 421 ASIIRDGDGLTIDHPVAWAGLSMVHKAQHEIASGFRTDQSELREVEEHAVYSLYQGIAED 480
           ASI+R+GD   I+HP+AWAGL+MVHK Q+EIA+ F T+   L E+EE AV+ L Q IAED
Sbjct: 432 ASILREGDSALIEHPIAWAGLAMVHKIQNEIAAAFETEHKGLMEMEERAVHILKQAIAED 491

Query: 481 PDDAVQWHQFGLHSLCTREFKTSQRYLKAAIARFKNCSYAWSNLGISLQLSDNPTQAEEV 540
           PDDAVQWHQ GLH+LC ++FKTSQ+YLKAA+AR K CSY WSNLGISLQLS+ P QAE+V
Sbjct: 492 PDDAVQWHQLGLHNLCVQQFKTSQKYLKAAVARSKECSYMWSNLGISLQLSEEPAQAEQV 551

Query: 541 YKKALSLVATEQAHSVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIA 600
           YK+ALSLV  +QA+++F NLGNLYRQQK+Y+ AKAMF+KSLELQPGYAPA+NNLGLVFIA
Sbjct: 552 YKRALSLVTPQQAYTIFSNLGNLYRQQKKYQSAKAMFTKSLELQPGYAPAYNNLGLVFIA 611

Query: 601 EGRWEEAKYCFEKALEADPLLDSAKSNLIKTVAVSRLCNSLSSCRVKD 649
           EGRW+EA++CF KAL+ADPLLD+AKSN+IK  A+SR+C  LSSC ++D
Sbjct: 612 EGRWKEAEFCFNKALQADPLLDAAKSNMIKAAAMSRVCQHLSSCSLQD 658

BLAST of Cp4.1LG20g03450 vs. TrEMBL
Match: A0A061FZ59_THECC (Tetratricopeptide repeat (TPR)-containing protein OS=Theobroma cacao GN=TCM_014545 PE=4 SV=1)

HSP 1 Score: 905.6 bits (2339), Expect = 3.5e-260
Identity = 449/624 (71.96%), Positives = 537/624 (86.06%), Query Frame = 1

Query: 25  KVVVLADLNVDPPEMDDDSSVHVSASTISRLSVDESNQDKTVAICKDTNTMEVEGRHVSK 84
           KVVVLADLNVDPPE +D  S+ + A  ++RL+ DES+ +K+  I K+++ +E E + ++K
Sbjct: 31  KVVVLADLNVDPPETEDHDSLLLPAPDLTRLTNDESSHEKSTFISKESDAVEGEAKKLTK 90

Query: 85  IGKCRSRNNKVEYSLDSAADADGDQHGQGVSTSREEKVSSLKTGLVHVARKMPKNAHAHF 144
            GKCRSR +K + SLD  ADADGDQ  QG  +SREEKVSSLKTGLVHVARKMPKNAHAHF
Sbjct: 91  SGKCRSRISKADSSLDCGADADGDQPSQGTPSSREEKVSSLKTGLVHVARKMPKNAHAHF 150

Query: 145 ILGLMYQRLGQPQKAVSAYEKAEEILLQSDVEIHRPELLSLVQTHHAQCLLLESAAGDNS 204
           +LGLMYQRLGQPQKA+ AYEKA EIL++ +VEI RPELLSLVQ HHAQCLLLE++ GDN 
Sbjct: 151 VLGLMYQRLGQPQKAILAYEKAAEILVRCEVEIARPELLSLVQIHHAQCLLLENS-GDNG 210

Query: 205 SDKELDQEELDEILSKLKHSIQSDVRQAAMWNTLGLILLSTGRVKSALSVLSSLLAIVPN 264
            DKEL+ +EL+EILSKLK S+QSDVRQA +WNTLGLILL TGR++SA++VLSSLLA+ P+
Sbjct: 211 LDKELENDELEEILSKLKESMQSDVRQAGVWNTLGLILLKTGRLQSAIAVLSSLLALAPD 270

Query: 265 NYDCLGNLGIAYLQSGNMELSEKCFQELILKDQNHPAALINYAAFLLCKYGSTVVGAGAN 324
           +YDCLGNLGIAYLQSGN+ELS + FQ+LI+KDQNHPAAL+NYAA LLCKYGS V GAGAN
Sbjct: 271 DYDCLGNLGIAYLQSGNLELSARYFQDLIIKDQNHPAALMNYAAILLCKYGSVVAGAGAN 330

Query: 325 AGEGGVDEKVVGMNVAKECLLATLKVDPKAAHAWANLANVYFVTGDHRSSAKCLEKVAKL 384
           A E    ++V  +NVAKECLLA LK DPKAAH WANLAN Y++ GD+RSS+KCLEK AKL
Sbjct: 331 ASEVASGDQVASVNVAKECLLAALKSDPKAAHTWANLANAYYLIGDYRSSSKCLEKAAKL 390

Query: 385 EPNCMAMRYSVAMHRLKDAERSQDRSEQLSWAGNEMASIIRDGDGLTIDHPVAWAGLSMV 444
           EPNCM+ RY+VA+HR+KDAERSQD SEQLSWAGNEMAS++R+GD + ID P+AWAGLSMV
Sbjct: 391 EPNCMSTRYAVAVHRIKDAERSQDPSEQLSWAGNEMASVLREGDSVPIDPPIAWAGLSMV 450

Query: 445 HKAQHEIASGFRTDQSELREVEEHAVYSLYQGIAEDPDDAVQWHQFGLHSLCTREFKTSQ 504
           HK QHEI + F T+Q+EL EVEE A++SL Q   EDPDDAVQW+Q GLHSLC++ FKT+Q
Sbjct: 451 HKTQHEIVAAFETEQNELVEVEERAIFSLKQAAGEDPDDAVQWNQLGLHSLCSQNFKTAQ 510

Query: 505 RYLKAAIARFKNCSYAWSNLGISLQLSDNPTQAEEVYKKALSLVATEQAHSVFCNLGNLY 564
           +YLKAA+ RFK CSYAWSNLGIS+QLS+  +QAE VYK+ALSL   EQAH++F NLGNLY
Sbjct: 511 KYLKAAVVRFKECSYAWSNLGISIQLSEEASQAESVYKRALSLATVEQAHAIFSNLGNLY 570

Query: 565 RQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIAEGRWEEAKYCFEKALEADPLLDSA 624
           RQQKQYERAKAMF+KSLELQPGYAPAFNNLGLVF+AEG+WEEAK+CF+KAL++DPLLD+A
Sbjct: 571 RQQKQYERAKAMFTKSLELQPGYAPAFNNLGLVFVAEGQWEEAKFCFDKALQSDPLLDAA 630

Query: 625 KSNLIKTVAVSRLCNSLSSCRVKD 649
           KSN+IKTVA+SRLC  LSS  ++D
Sbjct: 631 KSNMIKTVALSRLCAGLSSFFIQD 653

BLAST of Cp4.1LG20g03450 vs. TrEMBL
Match: A0A067KGL4_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_11365 PE=4 SV=1)

HSP 1 Score: 902.1 bits (2330), Expect = 3.9e-259
Identity = 450/637 (70.64%), Positives = 538/637 (84.46%), Query Frame = 1

Query: 1   MPLKTEHGASDSSLDEHSKAAHSSKVVVLADLNVDPPEMDDDSSVHVSASTISRLSVDES 60
           + +KTE   +D S++E  K     KVVVLADLNV+PPE D   SVH+S   ++RL+ DES
Sbjct: 13  LSIKTEVDTADGSMEETCKTLQPPKVVVLADLNVNPPETDATDSVHLSVPELTRLTNDES 72

Query: 61  NQDKTVAICKDTNTMEVEGRHVSKIGKCRSRNNKVEYSLDSAADADGDQHGQGVSTSREE 120
            QDKT   CK+ +T E EG+ ++K+GKCRSRN+KV+ SLD   D D DQ GQG  +SREE
Sbjct: 73  -QDKTNFSCKEVDTAEAEGKKLNKLGKCRSRNSKVDASLDYGPDIDADQPGQGPPSSREE 132

Query: 121 KVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAVSAYEKAEEILLQSDVEIHRP 180
           KVSSLKTGL+HVARKMPKNAHAHFILGLMYQRLGQP KAV AYEKAEEILL+ D E+ RP
Sbjct: 133 KVSSLKTGLLHVARKMPKNAHAHFILGLMYQRLGQPPKAVFAYEKAEEILLRCDAEVARP 192

Query: 181 ELLSLVQTHHAQCLLLESAAGDNSSDKELDQEELDEILSKLKHSIQSDVRQAAMWNTLGL 240
           ELLSLVQ HHAQC+LLE +A DNS DKEL+ EEL+EI+S+LK S+Q D+RQA +WNTLGL
Sbjct: 193 ELLSLVQIHHAQCILLEYSA-DNSLDKELEAEELEEIISRLKESMQLDIRQAGVWNTLGL 252

Query: 241 ILLSTGRVKSALSVLSSLLAIVPNNYDCLGNLGIAYLQSGNMELSEKCFQELILKDQNHP 300
           ILL +GR++SA+SVLSSLLAI  NNYDCLGNLGIAYLQSGN+ELS KCFQ+LILKDQNHP
Sbjct: 253 ILLKSGRLQSAISVLSSLLAIDTNNYDCLGNLGIAYLQSGNIELSAKCFQDLILKDQNHP 312

Query: 301 AALINYAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKECLLATLKVDPKAAHAWAN 360
           AA +NYAA LLCK+GS V GAGANAGEG   ++   ++VAKECLLA LKVDPKA H WAN
Sbjct: 313 AAFVNYAALLLCKHGSLVAGAGANAGEGAFRDQFEAVDVAKECLLAALKVDPKAGHTWAN 372

Query: 361 LANVYFVTGDHRSSAKCLEKVAKLEPNCMAMRYSVAMHRLKDAERSQDRSEQLSWAGNEM 420
           LAN Y++ GDH+SS+KCLEK AKLEPNCM+ RY+VA+HR+KDAERSQD +EQLSWAGNEM
Sbjct: 373 LANAYYLMGDHKSSSKCLEKAAKLEPNCMSTRYAVAIHRIKDAERSQDPNEQLSWAGNEM 432

Query: 421 ASIIRDGDGLTIDHPVAWAGLSMVHKAQHEIASGFRTDQSELREVEEHAVYSLYQGIAED 480
           ASI+R+GD + I+ P AWAGL+MVHKAQHEIA+ F T+ SEL ++EE A+YSL Q IAED
Sbjct: 433 ASILREGDSVPIELPTAWAGLAMVHKAQHEIAAAFETEHSELVDIEERALYSLKQAIAED 492

Query: 481 PDDAVQWHQFGLHSLCTREFKTSQRYLKAAIARFKNCSYAWSNLGISLQLSDNPTQAEEV 540
           PDD VQWHQ GLH  C+R+F+T+Q+Y K A+ + K CSYAWSNLGISLQLS+  +QAE+V
Sbjct: 493 PDDGVQWHQLGLHCFCSRQFETAQKYFKVAVTQLKECSYAWSNLGISLQLSEESSQAEDV 552

Query: 541 YKKALSLVATEQAHSVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIA 600
           YK+ALS  A+EQAH++F NLGNLYRQQKQYERAKAMF+KSLEL+PGYAPA+NNLGLVF+A
Sbjct: 553 YKRALSFAASEQAHAIFSNLGNLYRQQKQYERAKAMFNKSLELKPGYAPAYNNLGLVFVA 612

Query: 601 EGRWEEAKYCFEKALEADPLLDSAKSNLIKTVAVSRL 638
           EGR EEAK+CF++AL++DPLLD+AKSN+IK VA+SRL
Sbjct: 613 EGRLEEAKFCFDRALQSDPLLDAAKSNMIKAVAMSRL 647

BLAST of Cp4.1LG20g03450 vs. TrEMBL
Match: B9RAP6_RICCO (O-linked n-acetylglucosamine transferase, ogt, putative OS=Ricinus communis GN=RCOM_1507660 PE=4 SV=1)

HSP 1 Score: 895.6 bits (2313), Expect = 3.6e-257
Identity = 445/636 (69.97%), Positives = 544/636 (85.53%), Query Frame = 1

Query: 2   PLKTEHGASDSSLDEHSKAAHSSKVVVLADLNVDPPEMDDDSSVHVSASTISRLSVDESN 61
           P+KTE  A ++SLD+ SKAA   K+VVLADLN +PPE D + SV++S   +SRL+ DES 
Sbjct: 14  PIKTELNAMETSLDDSSKAAQPPKLVVLADLNANPPETDTNDSVNLSVPDLSRLTNDES- 73

Query: 62  QDKTVAICKDTNTMEVEGRHVSKIGKCRSRNNKVEYSLDSAADADGDQHGQGVSTSREEK 121
           QDK+   CK+ +T+E EG+ ++K+GKCRSRN+K++ SLD   D D DQ GQG  +SREEK
Sbjct: 74  QDKSSVACKEGDTVEFEGKKLNKLGKCRSRNSKLDASLDYGPDIDADQPGQGPISSREEK 133

Query: 122 VSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAVSAYEKAEEILLQSDVEIHRPE 181
           VSSLKTGLVHVA+KMPKNAHAHFILGLMYQRLGQPQKAV AYEKAEEILL+S+ E+ RPE
Sbjct: 134 VSSLKTGLVHVAKKMPKNAHAHFILGLMYQRLGQPQKAVFAYEKAEEILLRSEAEVARPE 193

Query: 182 LLSLVQTHHAQCLLLESAAGDNSSDKELDQEELDEILSKLKHSIQSDVRQAAMWNTLGLI 241
            LSLVQ HHAQC+LLE+++ DNS DKEL+ EEL+E+LS++K S+QSDVRQAA+WNTLGLI
Sbjct: 194 FLSLVQIHHAQCILLENSS-DNSLDKELEAEELEEVLSRMKESMQSDVRQAAVWNTLGLI 253

Query: 242 LLSTGRVKSALSVLSSLLAIVPNNYDCLGNLGIAYLQSGNMELSEKCFQELILKDQNHPA 301
           LL +GR++SA+SV SSLLA+  +NYDCLGNLGIAYLQSG++ELS KCFQELILKDQNHPA
Sbjct: 254 LLKSGRLQSAISVWSSLLAMDTSNYDCLGNLGIAYLQSGDLELSAKCFQELILKDQNHPA 313

Query: 302 ALINYAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKECLLATLKVDPKAAHAWANL 361
           A +NYAA LLCKYGS V G GANAGEG        ++VA ECLLA LKVDPKAAH WANL
Sbjct: 314 AFVNYAALLLCKYGSVVAGPGANAGEGASVYWAEPVHVAMECLLAGLKVDPKAAHLWANL 373

Query: 362 ANVYFVTGDHRSSAKCLEKVAKLEPNCMAMRYSVAMHRLKDAERSQDRSEQLSWAGNEMA 421
           AN Y++TGD+RSS+KCLEK AKLEPNCM  RY+VA+ R+KDAERSQD +EQLSWAGNEMA
Sbjct: 374 ANAYYLTGDYRSSSKCLEKSAKLEPNCMCTRYAVAVQRIKDAERSQDPNEQLSWAGNEMA 433

Query: 422 SIIRDGDGLTIDHPVAWAGLSMVHKAQHEIASGFRTDQSELREVEEHAVYSLYQGIAEDP 481
           SI+R+G+ + I+ P+AWAGL+MVHKAQHEIA+ F T+++EL +VEE A+YSL Q IAEDP
Sbjct: 434 SILREGESVPIEFPIAWAGLAMVHKAQHEIAAAFETERNELADVEERALYSLKQAIAEDP 493

Query: 482 DDAVQWHQFGLHSLCTREFKTSQRYLKAAIARFKNCSYAWSNLGISLQLSDNPTQAEEVY 541
           DD VQWHQ G+H LC+R+F+T+Q+YLK A+  FK CSYAWSNLG+SLQLS+  ++AE+VY
Sbjct: 494 DDGVQWHQLGMHCLCSRQFETAQKYLKVAVTHFKECSYAWSNLGVSLQLSEESSRAEDVY 553

Query: 542 KKALSLVATEQAHSVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIAE 601
           K+AL+  A+EQAH++F NLGNLYRQQKQYERAKAMF+KSLEL+PGYAPA+NNLGLVF+AE
Sbjct: 554 KQALACEASEQAHTIFSNLGNLYRQQKQYERAKAMFTKSLELRPGYAPAYNNLGLVFVAE 613

Query: 602 GRWEEAKYCFEKALEADPLLDSAKSNLIKTVAVSRL 638
            +WEEAK+CF+KAL+ADPLLD+AKSN+IK + + RL
Sbjct: 614 SQWEEAKFCFDKALQADPLLDAAKSNMIKAMTMCRL 647

BLAST of Cp4.1LG20g03450 vs. TAIR10
Match: AT5G63200.1 (AT5G63200.1 tetratricopeptide repeat (TPR)-containing protein)

HSP 1 Score: 831.6 bits (2147), Expect = 3.2e-241
Identity = 421/624 (67.47%), Positives = 511/624 (81.89%), Query Frame = 1

Query: 25  KVVVLADLNVDPPEMDD-DSSVHV-SASTISRLSVDESNQDKTVAICKDTNTMEVEGRHV 84
           K+VVLADLN +PPE DD DSS+ + +   I+RLS +ES+Q+  +  CK+    EVE + +
Sbjct: 24  KLVVLADLNFNPPETDDLDSSIPIPTPPPITRLSNEESHQEGGILTCKEVEPGEVEAKKI 83

Query: 85  SKIGKCRSRNNKVEYSLDSAADADGDQHGQGVSTSREEKVSSLKTGLVHVARKMPKNAHA 144
           SK+GKCRSR+ K+E S D   DADGD   QGV  SREEK+S+LK GL+HVARKMPKNAHA
Sbjct: 84  SKVGKCRSRS-KIESSSDCGVDADGDLANQGVPASREEKISNLKMGLIHVARKMPKNAHA 143

Query: 145 HFILGLMYQRLGQPQKAVSAYEKAEEILLQSDVEIHRPELLSLVQTHHAQCLLLESAAGD 204
           HFILGLM+QRLGQ QKA+  YEKAEEILL  + EI RPELL LVQ HH QCLLL+   GD
Sbjct: 144 HFILGLMFQRLGQSQKAIPEYEKAEEILLGCEPEIARPELLLLVQIHHGQCLLLDGF-GD 203

Query: 205 NSSDKELDQEELDEILSKLKHSIQSDVRQAAMWNTLGLILLSTGRVKSALSVLSSLLAIV 264
             S KEL+ EEL+EILSKLK SI+ DVRQAA+WNTLGL+LL  G + SA+SVLSSLLA+V
Sbjct: 204 TDSVKELEGEELEEILSKLKDSIKLDVRQAAVWNTLGLMLLKAGCLMSAISVLSSLLALV 263

Query: 265 PNNYDCLGNLGIAYLQSGNMELSEKCFQELILKDQNHPAALINYAAFLLCKYGSTVVGAG 324
           P+NYDCL NLG+AYLQSG+MELS KCFQ+L+LKD NHPAALINYAA LLCK+ STV GAG
Sbjct: 264 PDNYDCLANLGVAYLQSGDMELSAKCFQDLVLKDHNHPAALINYAAELLCKHSSTVAGAG 323

Query: 325 ANAGEGGVDEKVVGMNVAKECLLATLKVDPKAAHAWANLANVYFVTGDHRSSAKCLEKVA 384
           AN G    +++   MNVAKECLLA L+ DPK+AHAW NLAN Y++ GDHRSS+KCLEK A
Sbjct: 324 ANGGADASEDQKAPMNVAKECLLAALRSDPKSAHAWVNLANSYYMMGDHRSSSKCLEKAA 383

Query: 385 KLEPNCMAMRYSVAMHRLKDAERSQDRSEQLSWAGNEMASIIRDGDGLTIDHPVAWAGLS 444
           KL+PNCMA R++VA+ R+KDAERSQD S+QLSWAGNEMAS+IR+G+ + ID P+AWAGL+
Sbjct: 384 KLDPNCMATRFAVAVQRIKDAERSQDASDQLSWAGNEMASVIREGESVPIDPPIAWAGLA 443

Query: 445 MVHKAQHEIASGFRTDQSELREVEEHAVYSLYQGIAEDPDDAVQWHQFGLHSLCTREFKT 504
           M HKAQHEIA+ F  D++EL E+EE AVYSL Q + EDP+DAV+WHQ GLHSLC++++K 
Sbjct: 444 MAHKAQHEIAAAFVADRNELTEMEERAVYSLKQAVTEDPEDAVRWHQLGLHSLCSQQYKL 503

Query: 505 SQRYLKAAIARFKNCSYAWSNLGISLQLSDNPTQAEEVYKKALSLVATEQAHSVFCNLGN 564
           SQ+YLKAA+ R + CSYAWSNLGISLQLSD  ++AEEVYK+AL++   +QAH++  NLGN
Sbjct: 504 SQKYLKAAVGRSRECSYAWSNLGISLQLSDEHSEAEEVYKRALTVSKEDQAHAILSNLGN 563

Query: 565 LYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIAEGRWEEAKYCFEKALEADPLLD 624
           LYRQ+KQYE +KAMFSK+LEL+PGYAPA+NNLGLVF+AE RWEEAK CFEK+LEAD LLD
Sbjct: 564 LYRQKKQYEVSKAMFSKALELKPGYAPAYNNLGLVFVAERRWEEAKSCFEKSLEADSLLD 623

Query: 625 SAKSNLIKTVAVSRLCNSLSSCRV 647
           +A+SNL+K   +SRLC   SS  V
Sbjct: 624 AAQSNLLKATTMSRLCTCFSSSTV 645

BLAST of Cp4.1LG20g03450 vs. TAIR10
Match: AT3G11540.1 (AT3G11540.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 60.1 bits (144), Expect = 5.9e-09
Identity = 40/112 (35.71%), Positives = 62/112 (55.36%), Query Frame = 1

Query: 516 NCSYAWSNLGISLQLSDNPTQAEEVYKKALSLVATEQAHSVFCNLGNLYRQQKQYERAKA 575
           +C+ A +NLG+  +  DN  +A E Y+ ALS +    A S+  NLG +Y  Q + + A +
Sbjct: 327 HCAEACNNLGVLYKDRDNLDKAVECYQMALS-IKPNFAQSLN-NLGVVYTVQGKMDAAAS 386

Query: 576 MFSKSLELQPGYAPAFNNLGLVFIAEGRWEEAKYCFEKALEADPLLDSAKSN 628
           M  K++   P YA AFNNLG+++   G    A   +E+ L+ DP   +A  N
Sbjct: 387 MIEKAILANPTYAEAFNNLGVLYRDAGNITMAIDAYEECLKIDPDSRNAGQN 436

BLAST of Cp4.1LG20g03450 vs. TAIR10
Match: AT4G08320.2 (AT4G08320.2 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 55.1 bits (131), Expect = 1.9e-07
Identity = 33/94 (35.11%), Positives = 53/94 (56.38%), Query Frame = 1

Query: 536 QAEEVYKKALSLVATEQAHSVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLG 595
           +A E+Y  A++L  T++    +CN    Y Q      A     KS+E+ P Y+ A++ LG
Sbjct: 193 EAVELYSFAIAL--TDKNAVFYCNRAAAYTQINMCSEAIKDCLKSIEIDPNYSKAYSRLG 252

Query: 596 LVFIAEGRWEEA-KYCFEKALEADPLLDSAKSNL 629
           L + A+G++ EA +  F+KAL  DP  +S K N+
Sbjct: 253 LAYYAQGKYAEAIEKGFKKALLLDPHNESVKENI 284

BLAST of Cp4.1LG20g03450 vs. TAIR10
Match: AT3G04240.1 (AT3G04240.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 52.0 bits (123), Expect = 1.6e-06
Identity = 88/418 (21.05%), Positives = 161/418 (38.52%), Query Frame = 1

Query: 212 EELDEILSKLKHSIQSDVRQAAMWNTLGLILLSTGRVKSALSVLSSLLAIVPNNYDCLGN 271
           +E D  +++ + +++   + A  +  +       G    A+      + + PN  D   N
Sbjct: 101 QEYDMCIARNEEALRIQPQFAECYGNMANAWKEKGDTDRAIRYYLIAIELRPNFADAWSN 160

Query: 272 LGIAYLQSGNMELSEKCFQELILKDQNHPAALINYAAFLLCKYGSTVVGAGANAGEGGVD 331
           L  AY++ G +  + +C Q+ +                        +V A +N G     
Sbjct: 161 LASAYMRKGRLSEATQCCQQAL-------------------SLNPLLVDAHSNLGNLMKA 220

Query: 332 EKVVGMNVAKECLLATLKVDPKAAHAWANLANVYFVTGDHRSSAKCLEKVAKLEPNCMAM 391
           + ++  + A  C L  +++ P  A AW+NLA ++  +GD                   A+
Sbjct: 221 QGLI--HEAYSCYLEAVRIQPTFAIAWSNLAGLFMESGDLN----------------RAL 280

Query: 392 RYSVAMHRLKDAERSQDRSEQLSWAGNEMASIIRDGDGLTI-DHPVAWAGLSMVHKAQHE 451
           +Y     +LK A    D    L   GN   ++ R  + +    H +     S +  A   
Sbjct: 281 QYYKEAVKLKPA--FPDAYLNL---GNVYKALGRPTEAIMCYQHALQMRPNSAM--AFGN 340

Query: 452 IASGFRTDQSELREVEEHAVYSLYQGIAEDPDDAVQWHQFGLHSLCTREFKTSQRYLKAA 511
           IAS +  +Q +L    + A+    Q ++ DP     ++  G           + R     
Sbjct: 341 IASIYY-EQGQL----DLAIRHYKQALSRDPRFLEAYNNLGNALKDIGRVDEAVRCYNQC 400

Query: 512 IARFKNCSYAWSNLGISLQLSDNPTQAEEVYKKALSLVATEQAHSVFCNLGNLYRQQKQY 571
           +A   N   A +NLG      +    A  ++K  L++  T    + F NL  +Y+QQ  Y
Sbjct: 401 LALQPNHPQAMANLGNIYMEWNMMGPASSLFKATLAV--TTGLSAPFNNLAIIYKQQGNY 460

Query: 572 ERAKAMFSKSLELQPGYAPAFNNLGLVFIAEGRWEEAKYCFEKALEADPLLDSAKSNL 629
             A + +++ L + P  A A  N G  +   GR  EA   +  A+   P +  A +NL
Sbjct: 461 SDAISCYNEVLRIDPLAADALVNRGNTYKEIGRVTEAIQDYMHAINFRPTMAEAHANL 467

BLAST of Cp4.1LG20g03450 vs. TAIR10
Match: AT3G16320.1 (AT3G16320.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 50.4 bits (119), Expect = 4.7e-06
Identity = 56/243 (23.05%), Positives = 90/243 (37.04%), Query Frame = 1

Query: 348 LKVDPKAAHAWANLANVYFVTGDHRSSAKCLEKVAKLEPNCMAMRYSVAMHRLKDAERSQ 407
           + VD  +  +W  + N Y +  DH ++ K  ++  +L        +++  H     E  +
Sbjct: 482 ISVDRLSPESWCAVGNCYSLRKDHDTALKMFQRAIQLNER-FTYAHTLCGHEFAALEEFE 541

Query: 408 DRSEQLSWAGNEMASIIRDGDGLTIDHPVAWAGLSMVHKAQHEIASGFRTDQSELREVEE 467
           D                R   G+   H  AW GL M +  Q +        Q  L+    
Sbjct: 542 DAER-----------CYRKALGIDTRHYNAWYGLGMTYLRQEKFEFAQHQFQLALQINPR 601

Query: 468 HAVYSLYQGIAEDPDDAVQWHQFGLHSLCTREFKTSQRYLKAAIARFKNCSYAWSNLGIS 527
            +V   Y GIA             LH    R  +      KA +   KN    +    I 
Sbjct: 602 SSVIMCYYGIA-------------LHE-SKRNDEALMMMEKAVLTDAKNPLPKYYKAHIL 661

Query: 528 LQLSDNPTQAEEVYKKALSLVATEQAHSVFCNLGNLYRQQKQYERAKAMFSKSLELQPGY 587
             L D   +A++V ++       E   SV  +LG +Y Q KQY++A   F  +L+L P  
Sbjct: 662 TSLGDYH-KAQKVLEELKECAPQES--SVHASLGKIYNQLKQYDKAVLHFGIALDLSPSP 695

Query: 588 APA 591
           + A
Sbjct: 722 SDA 695

BLAST of Cp4.1LG20g03450 vs. NCBI nr
Match: gi|659067387|ref|XP_008439186.1| (PREDICTED: probable UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase SEC [Cucumis melo])

HSP 1 Score: 1166.0 bits (3015), Expect = 0.0e+00
Identity = 595/648 (91.82%), Positives = 617/648 (95.22%), Query Frame = 1

Query: 1   MPLKTEHGASDSSLDEHSKAAHSSKVVVLADLNVDPPEMDDDSSVHVSASTISRLSVDES 60
           MPLKTEHGASDSSLDEHSKA +SSKVVVLADLNVDPPEMDDDS VHVSAS ISRLSVDES
Sbjct: 1   MPLKTEHGASDSSLDEHSKAVYSSKVVVLADLNVDPPEMDDDSCVHVSASAISRLSVDES 60

Query: 61  NQDKTVAICKDTNTMEVEGRHVSKIGKCRSRNNKVEYSLDSAADADGDQHGQGVSTSREE 120
           N DKTV ICKDTN MEVEGR VSKIGKCRSRNNKVEYSLDSAAD DGDQHGQGVSTSREE
Sbjct: 61  NHDKTVEICKDTNAMEVEGRRVSKIGKCRSRNNKVEYSLDSAADPDGDQHGQGVSTSREE 120

Query: 121 KVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAVSAYEKAEEILLQSDVEIHRP 180
           KVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKA+ AYEKAEEILLQSDVEIHRP
Sbjct: 121 KVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKALVAYEKAEEILLQSDVEIHRP 180

Query: 181 ELLSLVQTHHAQCLLLESAAGDNSSDKELDQEELDEILSKLKHSIQSDVRQAAMWNTLGL 240
           E LSLVQ HHAQCLLLES  GDN+S++EL+QEELDE+ SKLKHS+QSDVRQAA+WNTLGL
Sbjct: 181 EFLSLVQIHHAQCLLLESV-GDNTSNEELEQEELDEVCSKLKHSMQSDVRQAAVWNTLGL 240

Query: 241 ILLSTGRVKSALSVLSSLLAIVPNNYDCLGNLGIAYLQSGNMELSEKCFQELILKDQNHP 300
           +LL+TGRVKSA+SVLSSLLAIVPNN DCLGNLGIAYLQSGNMELSEKCFQELIL DQNHP
Sbjct: 241 LLLTTGRVKSAISVLSSLLAIVPNNCDCLGNLGIAYLQSGNMELSEKCFQELILTDQNHP 300

Query: 301 AALINYAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKECLLATLKVDPKAAHAWAN 360
           AALINYAAFLLCK+GSTVVGAGANAGEGGVDEKVVGMNVAKECLLA LKVDPKAAHAWAN
Sbjct: 301 AALINYAAFLLCKHGSTVVGAGANAGEGGVDEKVVGMNVAKECLLAALKVDPKAAHAWAN 360

Query: 361 LANVYFVTGDHRSSAKCLEKVAKLEPNCMAMRYSVAMHRLKDAERSQDRSEQLSWAGNEM 420
           LAN YFVTGDHRSSAKCLEK AKLEPNCM+MRY+VAMHRLKDAERSQDRSEQLSWAGNEM
Sbjct: 361 LANAYFVTGDHRSSAKCLEKGAKLEPNCMSMRYAVAMHRLKDAERSQDRSEQLSWAGNEM 420

Query: 421 ASIIRDGDGLTIDHPVAWAGLSMVHKAQHEIASGFRTDQSELREVEEHAVYSLYQGIAED 480
           ASIIRDGDGLTIDH VAWAGLSMVHK QHEIA+GFRTDQSELRE E+HAVYSL Q IAED
Sbjct: 421 ASIIRDGDGLTIDHSVAWAGLSMVHKTQHEIAAGFRTDQSELREKEDHAVYSLNQAIAED 480

Query: 481 PDDAVQWHQFGLHSLCTREFKTSQRYLKAAIARFKNCSYAWSNLGISLQLSDNPTQAEEV 540
            DDAVQWHQ GLHSLCTREFKTSQRYLKAAIARFKNCS+AWSNLGISLQLSDN T+AEEV
Sbjct: 481 TDDAVQWHQLGLHSLCTREFKTSQRYLKAAIARFKNCSFAWSNLGISLQLSDNLTEAEEV 540

Query: 541 YKKALSLVATEQAHSVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIA 600
           YKKALSLVATEQAH+VFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIA
Sbjct: 541 YKKALSLVATEQAHTVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIA 600

Query: 601 EGRWEEAKYCFEKALEADPLLDSAKSNLIKTVAVSRLCNSLSSCRVKD 649
           EG+WE AKYCFEKALEADPLLDSA SNL+KTVAV RLCNSLSSC VKD
Sbjct: 601 EGQWEGAKYCFEKALEADPLLDSANSNLLKTVAVHRLCNSLSSCHVKD 647

BLAST of Cp4.1LG20g03450 vs. NCBI nr
Match: gi|778657980|ref|XP_011651897.1| (PREDICTED: probable UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase SPINDLY [Cucumis sativus])

HSP 1 Score: 1139.0 bits (2945), Expect = 0.0e+00
Identity = 581/648 (89.66%), Positives = 609/648 (93.98%), Query Frame = 1

Query: 1   MPLKTEHGASDSSLDEHSKAAHSSKVVVLADLNVDPPEMDDDSSVHVSASTISRLSVDES 60
           MPLKTEHGA DSSLD+HSKA +SSKVVVLADLNVDPPEMDDDSSVHVSASTISRLSVDES
Sbjct: 1   MPLKTEHGAPDSSLDDHSKAVYSSKVVVLADLNVDPPEMDDDSSVHVSASTISRLSVDES 60

Query: 61  NQDKTVAICKDTNTMEVEGRHVSKIGKCRSRNNKVEYSLDSAADADGDQHGQGVSTSREE 120
           N DKT  ICKDTN MEVEGR VSKIGKCRSRNNKVEYSLDSAAD DGDQ  QGVSTSREE
Sbjct: 61  NHDKTTEICKDTNAMEVEGRRVSKIGKCRSRNNKVEYSLDSAADPDGDQPCQGVSTSREE 120

Query: 121 KVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAVSAYEKAEEILLQSDVEIHRP 180
           KVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAV AYEKAEEILLQSDVEIHRP
Sbjct: 121 KVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAVLAYEKAEEILLQSDVEIHRP 180

Query: 181 ELLSLVQTHHAQCLLLESAAGDNSSDKELDQEELDEILSKLKHSIQSDVRQAAMWNTLGL 240
           E LSL+Q HHAQCLLLES  GDN+S++EL+QEELD++ SKLKHS+QSDVRQAA+WNTLGL
Sbjct: 181 EFLSLIQIHHAQCLLLESV-GDNTSNEELEQEELDDVCSKLKHSMQSDVRQAAVWNTLGL 240

Query: 241 ILLSTGRVKSALSVLSSLLAIVPNNYDCLGNLGIAYLQSGNMELSEKCFQELILKDQNHP 300
           +LL+TGRVKSA++VLSSLLAIVPNN DCLGNLGIAYLQSGNMELSEKCFQELIL DQNH 
Sbjct: 241 LLLTTGRVKSAITVLSSLLAIVPNNCDCLGNLGIAYLQSGNMELSEKCFQELILTDQNHL 300

Query: 301 AALINYAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKECLLATLKVDPKAAHAWAN 360
           AAL+ YAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKECLLA LKVDPKAAHAWAN
Sbjct: 301 AALVYYAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKECLLAALKVDPKAAHAWAN 360

Query: 361 LANVYFVTGDHRSSAKCLEKVAKLEPNCMAMRYSVAMHRLKDAERSQDRSEQLSWAGNEM 420
           LAN YFVTGDHRSSAKCLEK AKLEPNCM+MRY+VAMHRLKDAERSQDRSEQLSWAGNEM
Sbjct: 361 LANAYFVTGDHRSSAKCLEKGAKLEPNCMSMRYAVAMHRLKDAERSQDRSEQLSWAGNEM 420

Query: 421 ASIIRDGDGLTIDHPVAWAGLSMVHKAQHEIASGFRTDQSELREVEEHAVYSLYQGIAED 480
           AS+IRDGDGLTIDH VAWAG SMVHK QHEIA+GFRTD SELRE E+HAVYSL Q IAED
Sbjct: 421 ASVIRDGDGLTIDHSVAWAGFSMVHKIQHEIAAGFRTDLSELREKEDHAVYSLNQAIAED 480

Query: 481 PDDAVQWHQFGLHSLCTREFKTSQRYLKAAIARFKNCSYAWSNLGISLQLSDNPTQAEEV 540
            DDAVQWHQFGLHSLCTREFKTSQRYLKAAIARFK CS+AWSNLGISLQL  NPT+AEEV
Sbjct: 481 TDDAVQWHQFGLHSLCTREFKTSQRYLKAAIARFKKCSFAWSNLGISLQLPKNPTEAEEV 540

Query: 541 YKKALSLVATEQAHSVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIA 600
           Y+KALSLVATEQAH+VFCNLGNLYRQQKQYERAKAMFSK+L LQ GYAPAFNNLGLVFIA
Sbjct: 541 YRKALSLVATEQAHTVFCNLGNLYRQQKQYERAKAMFSKTLGLQLGYAPAFNNLGLVFIA 600

Query: 601 EGRWEEAKYCFEKALEADPLLDSAKSNLIKTVAVSRLCNSLSSCRVKD 649
           EG+WEEAKYCFEKALEADPLLDSA SNL+KTVAV RLCNSLSSC VKD
Sbjct: 601 EGQWEEAKYCFEKALEADPLLDSANSNLLKTVAVHRLCNSLSSCHVKD 647

BLAST of Cp4.1LG20g03450 vs. NCBI nr
Match: gi|225456798|ref|XP_002275611.1| (PREDICTED: probable UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase SEC [Vitis vinifera])

HSP 1 Score: 936.4 bits (2419), Expect = 2.7e-269
Identity = 463/648 (71.45%), Positives = 557/648 (85.96%), Query Frame = 1

Query: 1   MPLKTEHGASDSSLDEHSKAAHSSKVVVLADLNVDPPEMDDDSSVHVSASTISRLSVDES 60
           +P+K+E G +++S DE SK    SKVVVLADLNVDPPE DDD S+HVSA  ++RL+ D+S
Sbjct: 12  LPIKSEVGVTENSADESSKRPQISKVVVLADLNVDPPETDDDDSLHVSAPDLTRLTNDDS 71

Query: 61  NQDKTVAICKDTNTMEVEGRHVSKIGKCRSRNNKVEYSLDSAADADGDQHGQGVSTSREE 120
           +QDK+  + KDT+ ++ EG+ ++K+GK RSR  KVEY LD  ADAD DQHGQG  TSREE
Sbjct: 72  SQDKSTLVSKDTDMVDGEGKRLNKLGKPRSRVTKVEYPLDYGADADADQHGQGAPTSREE 131

Query: 121 KVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAVSAYEKAEEILLQSDVEIHRP 180
           KVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAVSAYEKA EILL+ + EI RP
Sbjct: 132 KVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAVSAYEKAAEILLRCEEEIDRP 191

Query: 181 ELLSLVQTHHAQCLLLESAAGDNSSDKELDQEELDEILSKLKHSIQSDVRQAAMWNTLGL 240
           ELLSLVQ HHAQCLLL S +GD+S+DKEL+ EEL+EIL K+K S+QSD+RQAA+WNTLGL
Sbjct: 192 ELLSLVQIHHAQCLLLGS-SGDHSADKELEPEELEEILLKMKDSMQSDIRQAAVWNTLGL 251

Query: 241 ILLSTGRVKSALSVLSSLLAIVPNNYDCLGNLGIAYLQSGNMELSEKCFQELILKDQNHP 300
           ILL TGR+++A+SVLSSLL I P+N DCLGNLGIAYL+SGN+EL+EKCFQ LILKDQNHP
Sbjct: 252 ILLRTGRLQNAISVLSSLLTIAPDNLDCLGNLGIAYLRSGNLELAEKCFQNLILKDQNHP 311

Query: 301 AALINYAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKECLLATLKVDPKAAHAWAN 360
           AALINYAA L+CKYGS + GAGAN+GEG  +++++  NVAKECLLA +KV+PKAAH WAN
Sbjct: 312 AALINYAAVLMCKYGSIIAGAGANSGEGASEDQLIAANVAKECLLAAVKVEPKAAHVWAN 371

Query: 361 LANVYFVTGDHRSSAKCLEKVAKLEPNCMAMRYSVAMHRLKDAERSQDRSEQLSWAGNEM 420
           LAN Y++ GD RSS+KC EK AKLEPNCM+ RY+VA+H++KDAER QD SEQLSWAGNEM
Sbjct: 372 LANAYYLMGDCRSSSKCFEKAAKLEPNCMSTRYAVAVHQIKDAERYQDPSEQLSWAGNEM 431

Query: 421 ASIIRDGDGLTIDHPVAWAGLSMVHKAQHEIASGFRTDQSELREVEEHAVYSLYQGIAED 480
           ASI+R+GD   I+HP+AWAGL+MVHK Q+EIA+ F T+   L E+EE AV+ L Q IAED
Sbjct: 432 ASILREGDSALIEHPIAWAGLAMVHKIQNEIAAAFETEHKGLMEMEERAVHILKQAIAED 491

Query: 481 PDDAVQWHQFGLHSLCTREFKTSQRYLKAAIARFKNCSYAWSNLGISLQLSDNPTQAEEV 540
           PDDAVQWHQ GLH+LC ++FKTSQ+YLKAA+AR K CSY WSNLGISLQLS+ P QAE+V
Sbjct: 492 PDDAVQWHQLGLHNLCVQQFKTSQKYLKAAVARSKECSYMWSNLGISLQLSEEPAQAEQV 551

Query: 541 YKKALSLVATEQAHSVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIA 600
           YK+ALSLV  +QA+++F NLGNLYRQQK+Y+ AKAMF+KSLELQPGYAPA+NNLGLVFIA
Sbjct: 552 YKRALSLVTPQQAYTIFSNLGNLYRQQKKYQSAKAMFTKSLELQPGYAPAYNNLGLVFIA 611

Query: 601 EGRWEEAKYCFEKALEADPLLDSAKSNLIKTVAVSRLCNSLSSCRVKD 649
           EGRW+EA++CF KAL+ADPLLD+AKSN+IK  A+SR+C  LSSC ++D
Sbjct: 612 EGRWKEAEFCFNKALQADPLLDAAKSNMIKAAAMSRVCQHLSSCSLQD 658

BLAST of Cp4.1LG20g03450 vs. NCBI nr
Match: gi|590669690|ref|XP_007037847.1| (Tetratricopeptide repeat (TPR)-containing protein [Theobroma cacao])

HSP 1 Score: 905.6 bits (2339), Expect = 5.0e-260
Identity = 449/624 (71.96%), Positives = 537/624 (86.06%), Query Frame = 1

Query: 25  KVVVLADLNVDPPEMDDDSSVHVSASTISRLSVDESNQDKTVAICKDTNTMEVEGRHVSK 84
           KVVVLADLNVDPPE +D  S+ + A  ++RL+ DES+ +K+  I K+++ +E E + ++K
Sbjct: 31  KVVVLADLNVDPPETEDHDSLLLPAPDLTRLTNDESSHEKSTFISKESDAVEGEAKKLTK 90

Query: 85  IGKCRSRNNKVEYSLDSAADADGDQHGQGVSTSREEKVSSLKTGLVHVARKMPKNAHAHF 144
            GKCRSR +K + SLD  ADADGDQ  QG  +SREEKVSSLKTGLVHVARKMPKNAHAHF
Sbjct: 91  SGKCRSRISKADSSLDCGADADGDQPSQGTPSSREEKVSSLKTGLVHVARKMPKNAHAHF 150

Query: 145 ILGLMYQRLGQPQKAVSAYEKAEEILLQSDVEIHRPELLSLVQTHHAQCLLLESAAGDNS 204
           +LGLMYQRLGQPQKA+ AYEKA EIL++ +VEI RPELLSLVQ HHAQCLLLE++ GDN 
Sbjct: 151 VLGLMYQRLGQPQKAILAYEKAAEILVRCEVEIARPELLSLVQIHHAQCLLLENS-GDNG 210

Query: 205 SDKELDQEELDEILSKLKHSIQSDVRQAAMWNTLGLILLSTGRVKSALSVLSSLLAIVPN 264
            DKEL+ +EL+EILSKLK S+QSDVRQA +WNTLGLILL TGR++SA++VLSSLLA+ P+
Sbjct: 211 LDKELENDELEEILSKLKESMQSDVRQAGVWNTLGLILLKTGRLQSAIAVLSSLLALAPD 270

Query: 265 NYDCLGNLGIAYLQSGNMELSEKCFQELILKDQNHPAALINYAAFLLCKYGSTVVGAGAN 324
           +YDCLGNLGIAYLQSGN+ELS + FQ+LI+KDQNHPAAL+NYAA LLCKYGS V GAGAN
Sbjct: 271 DYDCLGNLGIAYLQSGNLELSARYFQDLIIKDQNHPAALMNYAAILLCKYGSVVAGAGAN 330

Query: 325 AGEGGVDEKVVGMNVAKECLLATLKVDPKAAHAWANLANVYFVTGDHRSSAKCLEKVAKL 384
           A E    ++V  +NVAKECLLA LK DPKAAH WANLAN Y++ GD+RSS+KCLEK AKL
Sbjct: 331 ASEVASGDQVASVNVAKECLLAALKSDPKAAHTWANLANAYYLIGDYRSSSKCLEKAAKL 390

Query: 385 EPNCMAMRYSVAMHRLKDAERSQDRSEQLSWAGNEMASIIRDGDGLTIDHPVAWAGLSMV 444
           EPNCM+ RY+VA+HR+KDAERSQD SEQLSWAGNEMAS++R+GD + ID P+AWAGLSMV
Sbjct: 391 EPNCMSTRYAVAVHRIKDAERSQDPSEQLSWAGNEMASVLREGDSVPIDPPIAWAGLSMV 450

Query: 445 HKAQHEIASGFRTDQSELREVEEHAVYSLYQGIAEDPDDAVQWHQFGLHSLCTREFKTSQ 504
           HK QHEI + F T+Q+EL EVEE A++SL Q   EDPDDAVQW+Q GLHSLC++ FKT+Q
Sbjct: 451 HKTQHEIVAAFETEQNELVEVEERAIFSLKQAAGEDPDDAVQWNQLGLHSLCSQNFKTAQ 510

Query: 505 RYLKAAIARFKNCSYAWSNLGISLQLSDNPTQAEEVYKKALSLVATEQAHSVFCNLGNLY 564
           +YLKAA+ RFK CSYAWSNLGIS+QLS+  +QAE VYK+ALSL   EQAH++F NLGNLY
Sbjct: 511 KYLKAAVVRFKECSYAWSNLGISIQLSEEASQAESVYKRALSLATVEQAHAIFSNLGNLY 570

Query: 565 RQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIAEGRWEEAKYCFEKALEADPLLDSA 624
           RQQKQYERAKAMF+KSLELQPGYAPAFNNLGLVF+AEG+WEEAK+CF+KAL++DPLLD+A
Sbjct: 571 RQQKQYERAKAMFTKSLELQPGYAPAFNNLGLVFVAEGQWEEAKFCFDKALQSDPLLDAA 630

Query: 625 KSNLIKTVAVSRLCNSLSSCRVKD 649
           KSN+IKTVA+SRLC  LSS  ++D
Sbjct: 631 KSNMIKTVALSRLCAGLSSFFIQD 653

BLAST of Cp4.1LG20g03450 vs. NCBI nr
Match: gi|802649036|ref|XP_012079925.1| (PREDICTED: probable UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase SPINDLY [Jatropha curcas])

HSP 1 Score: 902.1 bits (2330), Expect = 5.5e-259
Identity = 450/637 (70.64%), Positives = 538/637 (84.46%), Query Frame = 1

Query: 1   MPLKTEHGASDSSLDEHSKAAHSSKVVVLADLNVDPPEMDDDSSVHVSASTISRLSVDES 60
           + +KTE   +D S++E  K     KVVVLADLNV+PPE D   SVH+S   ++RL+ DES
Sbjct: 13  LSIKTEVDTADGSMEETCKTLQPPKVVVLADLNVNPPETDATDSVHLSVPELTRLTNDES 72

Query: 61  NQDKTVAICKDTNTMEVEGRHVSKIGKCRSRNNKVEYSLDSAADADGDQHGQGVSTSREE 120
            QDKT   CK+ +T E EG+ ++K+GKCRSRN+KV+ SLD   D D DQ GQG  +SREE
Sbjct: 73  -QDKTNFSCKEVDTAEAEGKKLNKLGKCRSRNSKVDASLDYGPDIDADQPGQGPPSSREE 132

Query: 121 KVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAVSAYEKAEEILLQSDVEIHRP 180
           KVSSLKTGL+HVARKMPKNAHAHFILGLMYQRLGQP KAV AYEKAEEILL+ D E+ RP
Sbjct: 133 KVSSLKTGLLHVARKMPKNAHAHFILGLMYQRLGQPPKAVFAYEKAEEILLRCDAEVARP 192

Query: 181 ELLSLVQTHHAQCLLLESAAGDNSSDKELDQEELDEILSKLKHSIQSDVRQAAMWNTLGL 240
           ELLSLVQ HHAQC+LLE +A DNS DKEL+ EEL+EI+S+LK S+Q D+RQA +WNTLGL
Sbjct: 193 ELLSLVQIHHAQCILLEYSA-DNSLDKELEAEELEEIISRLKESMQLDIRQAGVWNTLGL 252

Query: 241 ILLSTGRVKSALSVLSSLLAIVPNNYDCLGNLGIAYLQSGNMELSEKCFQELILKDQNHP 300
           ILL +GR++SA+SVLSSLLAI  NNYDCLGNLGIAYLQSGN+ELS KCFQ+LILKDQNHP
Sbjct: 253 ILLKSGRLQSAISVLSSLLAIDTNNYDCLGNLGIAYLQSGNIELSAKCFQDLILKDQNHP 312

Query: 301 AALINYAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKECLLATLKVDPKAAHAWAN 360
           AA +NYAA LLCK+GS V GAGANAGEG   ++   ++VAKECLLA LKVDPKA H WAN
Sbjct: 313 AAFVNYAALLLCKHGSLVAGAGANAGEGAFRDQFEAVDVAKECLLAALKVDPKAGHTWAN 372

Query: 361 LANVYFVTGDHRSSAKCLEKVAKLEPNCMAMRYSVAMHRLKDAERSQDRSEQLSWAGNEM 420
           LAN Y++ GDH+SS+KCLEK AKLEPNCM+ RY+VA+HR+KDAERSQD +EQLSWAGNEM
Sbjct: 373 LANAYYLMGDHKSSSKCLEKAAKLEPNCMSTRYAVAIHRIKDAERSQDPNEQLSWAGNEM 432

Query: 421 ASIIRDGDGLTIDHPVAWAGLSMVHKAQHEIASGFRTDQSELREVEEHAVYSLYQGIAED 480
           ASI+R+GD + I+ P AWAGL+MVHKAQHEIA+ F T+ SEL ++EE A+YSL Q IAED
Sbjct: 433 ASILREGDSVPIELPTAWAGLAMVHKAQHEIAAAFETEHSELVDIEERALYSLKQAIAED 492

Query: 481 PDDAVQWHQFGLHSLCTREFKTSQRYLKAAIARFKNCSYAWSNLGISLQLSDNPTQAEEV 540
           PDD VQWHQ GLH  C+R+F+T+Q+Y K A+ + K CSYAWSNLGISLQLS+  +QAE+V
Sbjct: 493 PDDGVQWHQLGLHCFCSRQFETAQKYFKVAVTQLKECSYAWSNLGISLQLSEESSQAEDV 552

Query: 541 YKKALSLVATEQAHSVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIA 600
           YK+ALS  A+EQAH++F NLGNLYRQQKQYERAKAMF+KSLEL+PGYAPA+NNLGLVF+A
Sbjct: 553 YKRALSFAASEQAHAIFSNLGNLYRQQKQYERAKAMFNKSLELKPGYAPAYNNLGLVFVA 612

Query: 601 EGRWEEAKYCFEKALEADPLLDSAKSNLIKTVAVSRL 638
           EGR EEAK+CF++AL++DPLLD+AKSN+IK VA+SRL
Sbjct: 613 EGRLEEAKFCFDRALQSDPLLDAAKSNMIKAVAMSRL 647
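
The percentages in each HSP statistics line follow directly from the raw counts (matches divided by alignment length). As a minimal sketch, assuming the line layout used in this record, the Python below recomputes them from one such line. The names `HSP_STATS` and `check_hsp_stats` are hypothetical helpers introduced only for illustration; they are not part of BLAST or of this database.

```python
import re

# Pattern for the HSP statistics line as it appears in this record.
# "Pos\w+" tolerates both "Positives" and the "Postives" spelling.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w+ = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_hsp_stats(line):
    """Return (identity_pct, positives_pct) recomputed from the raw counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, _, pos, pos_len, _ = match.groups()
    identity_pct = round(100 * int(ident) / int(ident_len), 2)
    positives_pct = round(100 * int(pos) / int(pos_len), 2)
    return identity_pct, positives_pct


# Statistics line copied from the Jatropha curcas hit above.
line = "Identity = 450/637 (70.64%), Positives = 538/637 (84.46%), Query Frame = 1"
print(check_hsp_stats(line))
```

Comparing the returned pair with the percentages printed in the line is a quick consistency check when alignments are copied between reports.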

The following BLAST results are available for this feature:
Match Name  E-value  Identity  Description
OGT1_RAT    1.6e-08  21.52     UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase 110 kDa subunit...
OGT1_RABIT  1.6e-08  21.52     UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase 110 kDa subunit...
OGT1_PIG    1.6e-08  21.52     UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase 110 kDa subunit...
OGT1_MOUSE  1.6e-08  21.52     UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase 110 kDa subunit...
OGT1_HUMAN  1.6e-08  21.52     UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase 110 kDa subunit...

Match Name        E-value   Identity  Description
A0A0A0LTP6_CUCSA  0.0e+00   89.66     Uncharacterized protein OS=Cucumis sativus GN=Csa_1G050030 PE=4 SV=1
D7SHD3_VITVI      1.8e-269  71.45     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_17s0000g09890 PE=4 SV=...
A0A061FZ59_THECC  3.5e-260  71.96     Tetratricopeptide repeat (TPR)-containing protein OS=Theobroma cacao GN=TCM_0145...
A0A067KGL4_JATCU  3.9e-259  70.64     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_11365 PE=4 SV=1
B9RAP6_RICCO      3.6e-257  69.97     O-linked n-acetylglucosamine transferase, ogt, putative OS=Ricinus communis GN=R...

Match Name   E-value   Identity  Description
AT5G63200.1  3.2e-241  67.47     tetratricopeptide repeat (TPR)-containing protein
AT3G11540.1  5.9e-09   35.71     Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G08320.2  1.9e-07   35.11     Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G04240.1  1.6e-06   21.05     Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G16320.1  4.7e-06   23.05     Tetratricopeptide repeat (TPR)-like superfamily protein

Match Name                        E-value   Identity  Description
gi|659067387|ref|XP_008439186.1|  0.0e+00   91.82     PREDICTED: probable UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransfe...
gi|778657980|ref|XP_011651897.1|  0.0e+00   89.66     PREDICTED: probable UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransfe...
gi|225456798|ref|XP_002275611.1|  2.7e-269  71.45     PREDICTED: probable UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransfe...
gi|590669690|ref|XP_007037847.1|  5.0e-260  71.96     Tetratricopeptide repeat (TPR)-containing protein [Theobroma cacao]
gi|802649036|ref|XP_012079925.1|  5.5e-259  70.64     PREDICTED: probable UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransfe...

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
Vocabulary: INTERPRO
Term       Definition
IPR019734  TPR_repeat
IPR013026  TPR-contain_dom
IPR011990  TPR-like_helical_dom_sf
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0000956 nuclear-transcribed mRNA catabolic process
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0016757 transferase activity, transferring glycosyl groups

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG20g03450.1  Cp4.1LG20g03450.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                             Source   Source Term       Source Description   Alignment
IPR011990  Tetratricopeptide-like helical domain       GENE3D   G3DSA:1.25.40.10
    coord: 433..625  score: 3.4E-40
    coord: 136..311  score: 4.0E-23
    coord: 340..386  score: 3.4
IPR011990  Tetratricopeptide-like helical domain       unknown  SSF48452          TPR-like
    coord: 481..619  score: 4.84E-26
    coord: 338..391  score: 4.11E-26
    coord: 200..303  score: 4.11E-26
    coord: 137..169  score: 4.11
IPR013026  Tetratricopeptide repeat-containing domain  PROFILE  PS50293           TPR_REGION
    coord: 467..621  score: 26.07
    coord: 140..173  score: 9.496
    coord: 232..299  score: 14.179
    coord: 355..388  score: 9
IPR019734  Tetratricopeptide repeat                    PFAM     PF13181           TPR_8
    coord: 140..169  score: 8.8E-4
    coord: 355..388  score: 4.
IPR019734  Tetratricopeptide repeat                    SMART    SM00028           tpr_5
    coord: 554..587  score: 1.1E-6
    coord: 266..299  score: 0.011
    coord: 588..621  score: 6.5E-5
    coord: 518..551  score: 0.095
    coord: 355..388  score: 3.6E-4
    coord: 232..265  score: 3.2
    coord: 140..173  score: 0
IPR019734  Tetratricopeptide repeat                    PROFILE  PS50005           TPR
    coord: 140..173  score: 8.703
    coord: 554..587  score: 11.594
    coord: 266..299  score: 7.169
    coord: 518..551  score: 7.759
    coord: 355..388  score: 9.588
    coord: 588..621  score: 11.122
    coord: 232..265  score:
None       No IPR available                            PANTHER  PTHR23083         TETRATRICOPEPTIDE REPEAT PROTEIN, TPR
    coord: 112..451  score: 2.1E-145
    coord: 467..631  score: 2.1E
None       No IPR available                            PANTHER  PTHR23083:SF343   BARDET-BIEDL SYNDROME 4 PROTEIN HOMOLOG
    coord: 467..631  score: 2.1E-145
    coord: 112..451  score: 2.1E
None       No IPR available                            PFAM     PF13424           TPR_12
    coord: 556..617  score: 3.7
None       No IPR available                            PFAM     PF14559           TPR_19
    coord: 242..301  score: 1.