CaUC02G028810 (gene) Watermelon (USVL246-FR2) v1

Overview
Name: CaUC02G028810
Type: gene
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: Indole-3-glycerol-phosphate synthase
Location: Ciama_Chr02: 3017948 .. 3029061 (-)
RNA-Seq Expression: CaUC02G028810
Synteny: CaUC02G028810
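The Location field gives a start and end coordinate on Ciama_Chr02 plus a strand sign. A minimal sketch of how the genomic span of the feature could be derived from those two numbers is shown below. It assumes the coordinates are 1-based and inclusive (the usual GFF3-style convention, which this page does not state explicitly); `span_length` is a hypothetical helper name, not part of any database API.

```python
def span_length(start: int, end: int) -> int:
    """Return the number of bases covered by a 1-based, inclusive interval.

    Assumption: both coordinates are 1-based and inclusive.
    """
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from the Location field; the (-) strand does not
# change the length, only the orientation of the sequence.
gene_span = span_length(3017948, 3029061)
print(gene_span)
```

Under that assumption the feature covers 11114 bases of the chromosome.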
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TTTAAAATAGGGAAAACAGAGTTTCAAGTTGCAACCTTCATCGGCTCCAAGTCCCAACAAGAACAACAAAAACGCCCACAAATTTGACCTATTTTTTGTTTTTTGGTTTTGTTTTGTTTCATATTTTGGTGTAACTCCTTAATTCGCCAATATCTATAAAACAATCTGCTTCACAGACGCAGGGTTTTCATTTTCAAACGAGCTGAAGTTTTGTTCGACAATGGAGGGCCTCGCTTCCCTCAGAACAATTCCAAAGGTTTCCTTTCCACCCATTTCCTCTTCTACTCGCAGGTCCAAGTTTCTTTCCAGAAGGTTGAATTTCAGCCCTTCAATGGACACTTCTTTGAGGAATTCAATCTCTCTTGCCTCTTCTTCTTCCTCCTCTCCCATTTCAACTGCCATCCGAGCTCAACAGGTTTTGATTTACGCCTTCTATGTTCTTCCAATTTGGGAGTTTTTTTGTTGGGGTTGAAGGATTTTCTTTGTATTTCGTGCGTTCGTTGTTTCTTTGTGCTTAATCTGCATTTTGCTCATCTGGGTAGATTTAGTTTTCTCGTATATTCATTTAAATGGTCGAATTTTTGGAGTTGTTTTGTAATGAGTGCATATACTGATTGTTTTTTTAGATAGAATCCGAGGCTGGCTCAGCTGCGGCTTCACCAGTTACTGAATCAGAAGAAAATGCTCTGAAAGTGAAGGAATGGGAAGTGGGGATGTTCCAAGATGAAGTAGCTGCAAGTCAAGGCATAAGGATTCGTCGCAGGCCGCCCACGGGACCGCCATTGCATTATGTTGGACCCTTTCAGTTCAGATTACAGAATGAAGGGAACACTCCTAGGAATATTTTGGAGGAGATCATATGGTACAAAGACAAGGAAGTTTCACAGGTAAAATTACATAGGACTGTTAGATGATATAATATTAAATTTGCCTTCACCCGTCATCTTAAGGGTCAATCGTTGATTTAAGATGGTATTAGATCAGGTAATCCAGGAGCTCCTATGTTTGAATTCCTTCAATGTTAAATTTGCCTTCACCCATCAGCTTAAGCTTTTGGGACATTTAAAACGACCTTTGATTGATTTGGATTCTCATACTTTAGGAAGTTGTCTGTTGGCTAACAGTGTTTTTGGTTTTTGAATGTTTTCAGATGAAAGAAAGAAGGCCTCTCGTCTCGTTGAAAAGGGACCTTGAGAGGGCCCCTCCTGCTAGAGATTTCCTTGGAGCTCTCAAGGCTGCCTACCTTAGAACTAATTTGCCTGGTTTGATTGCTGAAGTAAAAAAGGCTTCTCCCAGCAGAGGAGTTTTAAGGGAGGATTTTGATCCAGTGAGTTTCCTTGACTATTTATTTTGTTGTTAGATTTCATAAATCTCAACACGGTTTTGAGTCTTTGACACAGTAACTATAGCAGAAAATCCCTCCTTTCATAGTACCAACTTTTGATTGGATTGTAAAATTCAAGAAGGTTATACAAACCTAGGTTTGAGCAATAAAGAGGATTTTTAGGTTTAAGAAATACGAGTTTGTGAAATCTTTGTTTAGATAGTAAAAGAAATAGGTTATTAAAACCTAGGTTTGTGTAACTTGTATCAAGTAGGTGTTTACCATCTGTAATAGTTGGTATATAATTTGAAGAGTAGGTTTCATGAGTGTGGGTTTATTAGACATGGATTCATGTAGTATGTGAAATTAACAACTCTATAGTATGGGATAAATAAAACCACTAGCAATTGTAAGTTAGGGTCCCTAACCCAAGTTAGATAAACCTGATCTATGAACCGAACAGCACTTAAATGTTTGTTTTCTCCCCTTCAGAAACCAAACTTTCATTGTGAAGAAACAAAAGAAAACAAGACAAAAGTTCCAAGGGCATTTAAAAAAAAGAAAAGAAAAACAGCCCACACTAAAGGAGTCTAGCACCTAACTATACAAAAAGGGGCTCAAACTTCAATCCGTAAAAAATTAGTGGGTTTACTTTCCTGTGCTATGCTCTTGATGG
ATGGGGGTAAAAATGACTGAAGATGTGGTAGGCGTTCATTGCCAATGCCCAGTTTTGTCACAATGTAGTACGTCTTGGACTTGGAGTGTGGGGATGGCTAAACTGGATATAAGATCTTCCCCGACCCTAGGAATGGTAGTGATTTCTTTTACCGGGTTAATTGATGCATCAAGCAGGGACCTATCGTGTGTGCATGCCCAAAACTTCAGCTATAAAACTACACTACGAAATTCAAGCCCTTTTATAGTGCTCAAGATTCAACTACTTGTTAAGAGTAGCATTTGTGTTTAATCTTGTTACTTTTGCATTGGAACAATCAAGTTGTTTTTATTTAATGTAAATTTGATAAATGATGATAGGTTGAGATTGCCCAAGCGTATGAGAAAGGTGGAGCAGCATGCCTTAGTGTTCTCACAGATCAGAAGTTTTTTCAGGTGAAGAACATTTTACGAGCTCTTTCTTTCTTTTTATTACTCGGATGGTGTCCGACTTTTTATTCATTTTCTTTTAGTATTTGAAGATGCTTCGATTACTTCCTGTGTTTTACATCTGTTGTTTTGGGGATTAGGGAAGCTTTGAGAATCTGGAGAAGATAAGGAATGCTGGAGTGAAGGTTTGTCATATTTAATCATAATTTTACATATTAGTATGTGAATTCTAAAAGTAGGAAACATATGCAGTGCCCTCTCTTGTGCAAGGAGTTTGTGGTTGATGCATGGCAGATCTACTATGCCCGATCCAAAGGTGCCGATGCCATTCTTTTGATTGCTGCTGTTTTGCCTGATCTCGACATTAAATATATGACTAAAATCTGCAAGATGGTCGGTTTGACCCCCCTTGTTGAGGTCGGTGTACACATTATTATGCAATGCACCATAATCTACAAAAATTTCATGTCTTTCCATCTTGTATAGCTGAATTGAATCCTGTTTTGCGCAGGTCAGATATTGCTTGCAATTATTGTACAATGTTCAAAGAAGCTATTCATGTTTTTTACCTATTACATTATTCGTAGGTTCATGACGAGAAGGAAATGGATCGTATGCTCGCAATTGATGGCATTGAGCTTATTGGCATCAACAATCGGAATCTTGGTAATGGATGACTACTTTTGTTACTTCGTTCATGTTGCTAAACTTCTCATTTCTCCATTAATCAGTATATTGCATTCTTTGCACCAAACTGCATCTCTACTCTCTTCTAACTTGATTGTTTTATTAGGACATCGTATTCCTTACATTTGAATGACCATAGCTTTAAAGTAGTTTAGTATTCCAAAACACGAGCAATTGTCTGTTCTTGCTTTTGAAGCGTTCCTATTGAAAGTTATTTTACTTATATGCGTGTGTAAAACAATTACTTGCAGAAACATTTGAGGTCGATATCAGCAACACAAAAAGGCTTCTTGAAGGAGAGCGTGGACAAAAGATCCGCAAGAAGAACGTAACAGTAAGTATCAAATAGACAGTTCTGCTGTTCAACAACTTCTCAACAAACTAATTCTTTCCATCTTCTTTTATACCAAATGAAAAAACACATCAAACTCTTACATTATCTGATATTCACTTTGTTTGTTGTGAGGATTTGTTCAGATAGTGGGAGAATCTGGGCTGTTCACTCCTGATGATATTGCTTATGTGCAAGAAGCTGGTGTTAAAGCTGTAAGCTTCTCATATTTGGATTCATGGCTTCCATTGTTGCATCAGACATGGAATTTTGAGATTGTGTAATGTTGGTTTCTGCAGGTTCTGGTTGGCGAGTCGATTGTGAAACAGAGCGACCCGACGAAGGGAATAACTGGACTTTTTGGTAAAGATATTTCTGTTTAAAGCTTATAGAATCTTGAAACTGTTGTATTTTCTGAATGTGTTGGTTATCCCCAGTTGGCTCTAATTGTATTTACCTATATGGTATTTTGGGTTCTAAGTGATTAGATAGAGCTTGGATTATACTGCCTTCCAAATTGGAAAAATGAAGCATAAGTTACACATTAATACAAT
AAATGTTGCAGTATACCATATGAGGTTCTCTCTTTGATTGATTTTTTTTTAATGCATAAATATTGAGATGATTGTACGGTTGACCTAAAATTTGTGTCCAAAATGGAATTTGTCCCACTTTTTAAAAATCAAGTTTTTTGACATCTTGCTTGTGTCTTCACTAGGGGAAATTACCAAAATGACTTTATTGTGTGTGCTTGACTCTTGACGCTTGCTCTTGCTCTCACTACTGTACATACTTGCTGTTGGTCCTTACTTTTGACTCTTAACGAGAGCGAGAGAAAATTGAGAGTTAGGAGCGAAAAAACCTAAACTCTAAGGAGCAAGATTTCACCTGCACACACAATGTTGCATTTTGGTAATTTTACTCTATATGACGTGAGTGAAAGGTTAAAAAAAATGTACTTTTAAGTTTTAAGAACTCACTCAAATTTCAACTATCATGGGTTGATCTAGTGGCAGTGAGAATATCTAAAAAAGGCCAAAGAGCCAACGGGTCAAGGATTCAATCCATAGTGGCCACCTACCGGACACCACGGATAAAAAAAAAATTGACTCAAATTTCTTTTGGTCCTTCTAAAATAGATCATTTGTGAATCATATCTCAACAATATACAAATTTGACATATTTTTTCTTCATCAATGAAATTGTTTTGCATAAAAGAAAAGGGTTAGTTGCAATTATAAAAAAAATTATAAATATAGCAAAATTTAGCGAAATTTAGATTTTGTTAGTGAGTTTATCATTGTTAGATTTTACTATATTTGTAATTTTTTTAAAAATATTGACTTAATTATTATCTTTAAAAGAATATTTTAGTGGGTCCGTTTTTCGTCCTTCAAGAACAATCAAACCCCATCCCCGCTATTTGGGGTAGATTCTCCAACTGCCCCCATCCCTGCTGTTTGGGGGTGGAATCGTTGGATTCCCCCATTCGAGAATCTAGTCCTCACAGGGATACGTTTATAGAGATTTTTTTATTATCTGTATTTTATATTGTACATTTTTTAAATAATATTATTTAATTTGAGTTTGAGTCTCTTTTTTCAATATAAAGTGCATTTTTTTGTGAAAAAAATAAATTTTAATAGACATAGATGCCTACTATTTAAAACAAAAACAAAATACGAATAATAAAAAAATGGTAAAATTAAAACAATTCATTTTTTTCATAAAGAAAATGAAAGGAAACTAATGGACTTGTAAGAAAATTTTGTTTTTTTTAATTGACCATTCTTTTAATTAGTATTTTCTTTTGTATATAGAATATACTATATTTATAATCATAAAACATATTTATATATCTTTAAATTAAATGGGGCGAGTTCGGAGTTGGAGATATCCTCCTCATCCTTGCTCGATCCGTTTAGTATTTCTAGTTTAACTCACATTCCCGATCCAACTAGGGATTCCCTGCCAACTTAACGTCTTTTTTTTCCCCTTTTGCCAACTTTAGGAGGGTGTAAATGGATACGTTTATGAAAATTCATAGTTTAAACTGATACCCAAAGTCAGGGAGAAATTTGTATTTTTTCATAAATAAAAAATAGATTAAAATACCATTTTGGTCCCTAGATTTTGAAAGTTGTTTAATTTTAGTACATGTAGTTTCAAATCTCAAATTTTAGTCTTTGCAATTTTAATAAATTTTAAATTTAGTCCCTATTGCTAATTTATTGTTGATTTTTTCTAAAAACTTATTATTATCTCTAACATTTTCATTAAGAATTTTGGAAACACATTCACATATTGAATTTTCTTGCATAAATTGTTGTTATTGGTTAACTAATCTCGATAAAAATTAACTCTTAGATACTAAATTTAAGATTTATTGAAGGTACATCGACTAAAGTTGAATAAACTGAGAATCTTTACATTTTCATTTTACCCTTATTTATTTTAAGAATTCTTTTTTTGCCATTTCTTACAACTTACAACTATTTGGGTCAACGTGGTTACATGCGAAGTGGTGTATTTTTTCGTGTATAGATATGATCTA
GACTATTAATATGAAATATCATAATCACGTCTTTTGCCCCATAATAACTATTCAAATTGAGTAGTTATATGACAAAGTGGTATATCCTTTTGTATATGGGTGTGATCTAAACTATATATATATGAAACATTAGAATCTCACTGTGGAGATATGTCTGAATTGACAATCACGTCCTTGCCTTATTAAAACTATTCGGGTCAAAATGAGTACAAAGAAGTAGTGTATCATTTCGTGTATGAGTTTGATTTAGTCCATAAATATGAAATGTCACAATCTAACACTATATGTTGAAACGCATGTACAATCTCGTCATTTGCCTTTTAATTTTTTTTTCATAAATGGCTATTTTAGTTAAACAATATAAGAGATTACCTCAAAATAAAACTATAAATTTTACGTGGATTGCCCTTATTATTATGGTCCTAAAAAAATGGGTGTATAAAGCACATGCTACGAGCGCATGTAGTTGAATAATTCCCATTTGCCGTTTGGGACGTGTGATCTCGTCGGTTTCTTCAGAGAAACTTCCACTCGAACTCGCCTCTATTTCTCCGGCGATGTTCTCTGCCGGAGCCGCGGCCTCCGGCGAACTGGAGTTCCGGTGGGACGATGACGCGTGGTACAATGTGACCGTGAAACTCGAAGGCGATGATCTTAGGATCAGTTATTGCGAGTTCGATAAGGAACATGACAATGTCTTCAACGCCAACCATTTCCGGAGCTTATCGGAGTTGAGCGACTTCGAAGCTAGGTTTCGGCCTCTGTCCAGACAGTTGCAGGATTCCGAATGCCCTAACGTCGTCCCTGGAATGCCCGTTTGCGCTTCCCACTCCTCTCGAGCCGATGATGTTCGCTTCTACGATGCTCTTGTGGAAGGGGTTCGTTCCTTTGCTTTGCACTTCTCCTATTGTTTGAGTCTGAGTTCACCGCTGATTCTTTTTGTTTGAAACTTTAGCCTCTTAGTTCCATTAACTTTATCGAGCAGGACCTTATTTAGAAACTTTGTTTTGTGATTTTCCTTCTCATACTGAAGATAGTCTCTGGTTTTTTTGGTTTGTTTGCTATTTTGAGGATCTTAGATGGCCTAAGACTGTTCCGTATGCCATAGGGTAGTCTGAGTTTTGGAAGGCCTTCATCATGTTTTTTCAGCATGCATGCTTTACAGCCATAATACCATTTAATGTTCTGAAAATAACTTTATGAACCATCGTATCATGGATTGGCCTAGTGGTAAATATGTAGGCATGACCTTGATAAAGGATTTAGGAGTCATAGATTCTATTTATGGTGACCACCTACCTAGGATTTAATATCATACAAGTTTCTTTGACACCTAAATGTTGTAGGGTTAGGCGGGTTGTCCTGTGGGATTAGTTGAGGTGTGCGTAAACTGGCCCAAAAAGGAAAATGAACTTTCCGAAATTATATGGATCAAATATTTCTTTTCCTTAATTTTCTATATGATTGAAACTCCAATCTCTTATTATAACTCTTTTGTTCATTTTAATTCTTAAATTTATAGAACTTCATTTCTTTTTATTATGAATGGTTATAAGCTCAATTAACTAGACCATATTTACATGCAATCAATGATATCTATCTATGAGTGCGATCATACTAGTACTAATATTGTCATGGACAATGACCTTCCTCACATCGATATGATATTGTTCACTTTGGGCATAAACCTTCATGACTTTTATTTTTGGTTTCATCCAAAAGGTTTAATGCCATTGGAGATTTTATCTTTCCTCCTTATATATCCATGATCTTCTCTTTATCTAGCCAACGTGGGACCTTGGTCGTAATCCTAACAATCATCTCCTCTAACAAAGTACCATTGAGTCTCCCCTCAAACAATTATATATTCGTATAATTCGTATCCATTCCAAATCCTTTTCTCTAGAGTCCACCGCCAATCAGATAGAGCTCAACCATGGGTCTTCACCATGTCTACAACTTTCAATGCTCACCACCCAAGGATTCTACCGACATGACTA
AGTTAGGATCATGACTCTAATACTACTTGACCCTCCCCATATCTCCACAGTGGCAATGATATTGTCCACTTTGGGCATAAACTCTTGTGATTTTGTTTTTGGTTTCACCCAAAAGGACTCATGCCATTGAAGATAGTGTTCCTCACTTATCTATCCATGATCTTCCCCTTATCTAGCTGATGTGGGACTTTGGTCGCATTCCCAACAAATCCATTGGATCCTATCAAAACTCGACGTCTACGGTTAAGTGTGCTTGGGCGATAGTAGTACTAAGTTGGTGGTCACTTTAGAAGTCCTTGTGTCGTACCTCTTTTTATTTCTATTTGTTATAATAAAAAAAATCTATCTTTTGAATTGAATATATGAAGCGTATACTCTAATCTAAACAGTGTTTTTTTGACATGAGTTGAGGTTTCATTTTATTGCACATTTTAGGCTTTCTTGTGGACATAAGATGAAGAAATAACGTATGAAATGTACTATGGACTTGTATTTTATTTTTGGGCACTAAGTCAAATAATATGGGCATGTTTGAATTAAACCTTTTTATGCATGTTTTAATTATCAATCCCAGAACATATAGTTGATGAAAAATAATATATTGGATTTTTGAAGAATGATTAATTCAAAATTTCTTAGAGTATGAATTCAATTAGAAGCCTTGGCATTTTCATTTTTTCCATGACTCGTGCATTGGTAGGTGGATTATCTTGAACACTCTTATGCAAATGGAGAAGAAGAGTGCTTATGCAACTTTATCCTTTTATGGCAGCATGGTCCAAACTCTGGGAATTTGACATTTGCCAGTATTGCTAACTTGTGTCAAATTCAATTTGATGAAATTAATGACACAGTGTTAGCAACCTTCTTCGCGAAAGTTAGGGAGAAAATCCAAACCAGAATGAATAGAGGTGGTACCTGTTCTGAAGATCGCCTTCTCACCCATAATGACGGTGGTGCTCATCAGAAGGATGAATGCAGCCTAAAATTAAAGCGTCGGCTATCCTTTTTTGAACGCATGGACCAGGTGAGTATTATTTGTAGAGCTCAAGATTTTACTAAGCAATAAGTTATTAAGCTCCAACTTCCACTTACATTGGAGGGCTCCATTTCATTCACCATTTCATTATACTTTTATCAGTATTTGATGGCTCTCTCCATTTGTAGGACACACGGCGTGCCAAGCGTTCTTCTGGGGCAGTAGGACTATGGGAAGGTAAAATTTGTTATTTACGGCATGAACCTAACCTTTCGTTATCTCATCAAGTGTGGACAATGTGATCTATGGGGCAAATATTTTTTCTCCACTTGAAACTGTGGTATCTTAATCAATAATAATGCACCATTTAACAGTACTTTTTTAATCACTATTTTGTTTCTTGATTGAACTTCTTTTCCCCCCATAGACCAACAGACTCTCAGTTCCAGAAAAAGTGGGGTGATCGAGCAAGATACTGATATTGGTGGAGTGAAGTATCAGTACATGATTTTACTTGAGAATCTAGATAAAGAATTGTCTCCAGTAAAACTTGCCAAATTCTTACATGCACAAACATTGATATTACCTCGAGTATACATTTTTCCAAGTTTAACATTTGAGGCGTATGCAAAGGGAGCTGTTGTATTGAATTGCAGAAAGAACTTAGAGAGGTTGTGCGATTTTTTGGATAATCCAGACCATGTCATTTTATCCTCCCAAGGAAGGTAAACTTTTTCACTAAATGAACTTTTTGTACAGACATATTATAAATCATCAAACTTTTCTACGAAATATGCATTTCGCTCTCTCTCTCTGGCTTTGATACTAACTGTCTTGGATTTAATCATTTAGCTCAACTTGATTACAATTCTATGCCTAATTTCTATAAATCCTCGACTGTTGAGATTTGTTTCCATTTTATAGCTTGGGAGTTTTCCACTCATTTTCTGCAGGCCCTTGGTAGTAACCGGAAGAATAGCAGGACGCGAAACTTTTGGGACATTGGCGGCAGGGGCCATGG
TGCTAGACTCGGAAGTAAGTTCTTATTCACTCCTTGTTCGTTTTTGTACTAACAGGAGTGGCTCTGTTGATCTCTCATAGGTTCATTTGAAGTTGAATTTAGAAAAATGAAAGAGATGTCCAATTTTTAATTAGTTAAGCAACATGTATCATTCACGCCGAATTCTTAGTATAAATGACGCCTTCAAGTTTCTCCATTCAGATATAATATTAAATTTATCTTCACCTATTAGATTAAGCTTTTGGGTTAATCGGTGATTTAACATGGAATCAGAGCAGGAAGTCTTGTGTTCAAATCCCAACAATACAATGTGATTTCCACTCCCATTAATATTGGTTTTCACTTGTTAGGTCTTCTACGTATTCCAAGCCCACAAGTGAGGGAGAGTGTTAGATAATATAATATTAAATTTACTTTCACCCATCAGCTTAAGTTTTTGGGTCAACATGTGATTTAACATCTCCTAGATAGTTCAACGTGATCTTTTTCAGTTGCAATAAATCGAATTCCATTTCTTGATATAAATCCTTGAACGGAAGTCTTACAAAAATGATGGAAGACAATCTGGAATCGATATTGAACAGGGGCTATAAACCAATTTAGCTTAGGAAGTAATCTAGGAATTTAAGTTGGTATCCTACTTCTGATGGTTGTTGCTGTTCATCTTGATTTGTTACAGAATAAATTTGGTAATGAAAAAGATGGGAGGGTGCGTTGCGAACTGAAGGTTGTGAAAGTAGGAACAAATGAATATTTGACTGCAAAGCACATGAAGGAATTGTTCATGGAGTTTCTTAACCATCAAAGGAGGTTGCACCAAAGATTGGCCATGGAGGAGGGAAAGATCTATTGCAATGGTGCTTTGTAATTACAAGTTGTAAATCAATCATAGGTTAGTTGGATAAGGTTCTTGTTTTTGTGGAAAAAGAAATTACTTTGAGCTGTACATTTTAGATTTTACACTGGATAATGGTGTAGGATACCAACTTCCATTTTTGTATAGGCCTTCTCCTAGTTGAGTTTATGTGGGATTTTGGCTGTATTAAAGAATGAACATTTTGATGTAATACTCATGCTCTTTTTGTTCCCTCAAACTCCAAGGACTATTAGGCAC

mRNA sequence

TTTAAAATAGGGAAAACAGAGTTTCAAGTTGCAACCTTCATCGGCTCCAAGTCCCAACAAGAACAACAAAAACGCCCACAAATTTGACCTATTTTTTGTTTTTTGGTTTTGTTTTGTTTCATATTTTGGTGTAACTCCTTAATTCGCCAATATCTATAAAACAATCTGCTTCACAGACGCAGGGTTTTCATTTTCAAACGAGCTGAAGTTTTGTTCGACAATGGAGGGCCTCGCTTCCCTCAGAACAATTCCAAAGGTTTCCTTTCCACCCATTTCCTCTTCTACTCGCAGGTCCAAGTTTCTTTCCAGAAGGTTGAATTTCAGCCCTTCAATGGACACTTCTTTGAGGAATTCAATCTCTCTTGCCTCTTCTTCTTCCTCCTCTCCCATTTCAACTGCCATCCGAGCTCAACAGATAGAATCCGAGGCTGGCTCAGCTGCGGCTTCACCAGTTACTGAATCAGAAGAAAATGCTCTGAAAGTGAAGGAATGGGAAGTGGGGATGTTCCAAGATGAAGTAGCTGCAAGTCAAGGCATAAGGATTCGTCGCAGGCCGCCCACGGGACCGCCATTGCATTATATGAAAGAAAGAAGGCCTCTCGTCTCGTTGAAAAGGGACCTTGAGAGGGCCCCTCCTGCTAGAGATTTCCTTGGAGCTCTCAAGGCTGCCTACCTTAGAACTAATTTGCCTGGTTTGATTGCTGAAGTAAAAAAGGCTTCTCCCAGCAGAGGAGTTTTAAGGGAGGATTTTGATCCAGTTGAGATTGCCCAAGCGTATGAGAAAGGTGGAGCAGCATGCCTTAGTGTTCTCACAGATCAGAAGTTTTTTCAGGGAAGCTTTGAGAATCTGGAGAAGATAAGGAATGCTGGAGTGAAGTGCCCTCTCTTGTGCAAGGAGTTTGTGGTTGATGCATGGCAGATCTACTATGCCCGATCCAAAGGTGCCGATGCCATTCTTTTGATTGCTGCTGTTTTGCCTGATCTCGACATTAAATATATGACTAAAATCTGCAAGATGGTCGGTTTGACCCCCCTTGTTGAGGTTCATGACGAGAAGGAAATGGATCGTATGCTCGCAATTGATGGCATTGAGCTTATTGGCATCAACAATCGGAATCTTGAAACATTTGAGGTCGATATCAGCAACACAAAAAGGCTTCTTGAAGGAGAGCGTGGACAAAAGATCCGCAAGAAGAACGTAACAATAGTGGGAGAATCTGGGCTGTTCACTCCTGATGATATTGCTTATGTGCAAGAAGCTGGTGTTAAAGCTGTTCTGGTTGGCGAGTCGATTGTGAAACAGAGCGACCCGACGAAGGGAATAACTGGACTTTTTGAGAAACTTCCACTCGAACTCGCCTCTATTTCTCCGGCGATGTTCTCTGCCGGAGCCGCGGCCTCCGGCGAACTGGAGTTCCGGTGGGACGATGACGCGTGGTACAATGTGACCGTGAAACTCGAAGGCGATGATCTTAGGATCAGTTATTGCGAGTTCGATAAGGAACATGACAATGTCTTCAACGCCAACCATTTCCGGAGCTTATCGGAGTTGAGCGACTTCGAAGCTAGGTTTCGGCCTCTGTCCAGACAGTTGCAGGATTCCGAATGCCCTAACGTCGTCCCTGGAATGCCCGTTTGCGCTTCCCACTCCTCTCGAGCCGATGATGTTCGCTTCTACGATGCTCTTGTGGAAGGGGTGGATTATCTTGAACACTCTTATGCAAATGGAGAAGAAGAGTGCTTATGCAACTTTATCCTTTTATGGCAGCATGGTCCAAACTCTGGGAATTTGACATTTGCCAGTATTGCTAACTTGTGTCAAATTCAATTTGATGAAATTAATGACACAGTGTTAGCAACCTTCTTCGCGAAAGTTAGGGAGAAAATCCAAACCAGAATGAATAGAGGTGGTACCTGTTCTGAAGATCGCCTTCTCACCCATAATGACGGTGGTGCTCATCAGAAGGATGAATGCAGCCTAAAATTAAAGCGTCGGCTAT
CCTTTTTTGAACGCATGGACCAGGTGGACACACGGCGTGCCAAGCGTTCTTCTGGGGCAGTAGGACTATGGGAAGACCAACAGACTCTCAGTTCCAGAAAAAGTGGGGTGATCGAGCAAGATACTGATATTGGTGGAGTGAAGTATCAGTACATGATTTTACTTGAGAATCTAGATAAAGAATTGTCTCCAGTAAAACTTGCCAAATTCTTACATGCACAAACATTGATATTACCTCGAGTATACATTTTTCCAAGTTTAACATTTGAGGCGTATGCAAAGGGAGCTGTTGTATTGAATTGCAGAAAGAACTTAGAGAGGTTGTGCGATTTTTTGGATAATCCAGACCATGTCATTTTATCCTCCCAAGGAAGGCCCTTGGTAGTAACCGGAAGAATAGCAGGACGCGAAACTTTTGGGACATTGGCGGCAGGGGCCATGGTGCTAGACTCGGAAAATAAATTTGGTAATGAAAAAGATGGGAGGGTGCGTTGCGAACTGAAGGTTGTGAAAGTAGGAACAAATGAATATTTGACTGCAAAGCACATGAAGGAATTGTTCATGGAGTTTCTTAACCATCAAAGGAGGTTGCACCAAAGATTGGCCATGGAGGAGGGAAAGATCTATTGCAATGGTGCTTTGTAATTACAAGTTGTAAATCAATCATAGGTTAGTTGGATAAGGTTCTTGTTTTTGTGGAAAAAGAAATTACTTTGAGCTGTACATTTTAGATTTTACACTGGATAATGGTGTAGGATACCAACTTCCATTTTTGTATAGGCCTTCTCCTAGTTGAGTTTATGTGGGATTTTGGCTGTATTAAAGAATGAACATTTTGATGTAATACTCATGCTCTTTTTGTTCCCTCAAACTCCAAGGACTATTAGGCAC

Coding sequence (CDS)

ATGGAGGGCCTCGCTTCCCTCAGAACAATTCCAAAGGTTTCCTTTCCACCCATTTCCTCTTCTACTCGCAGGTCCAAGTTTCTTTCCAGAAGGTTGAATTTCAGCCCTTCAATGGACACTTCTTTGAGGAATTCAATCTCTCTTGCCTCTTCTTCTTCCTCCTCTCCCATTTCAACTGCCATCCGAGCTCAACAGATAGAATCCGAGGCTGGCTCAGCTGCGGCTTCACCAGTTACTGAATCAGAAGAAAATGCTCTGAAAGTGAAGGAATGGGAAGTGGGGATGTTCCAAGATGAAGTAGCTGCAAGTCAAGGCATAAGGATTCGTCGCAGGCCGCCCACGGGACCGCCATTGCATTATATGAAAGAAAGAAGGCCTCTCGTCTCGTTGAAAAGGGACCTTGAGAGGGCCCCTCCTGCTAGAGATTTCCTTGGAGCTCTCAAGGCTGCCTACCTTAGAACTAATTTGCCTGGTTTGATTGCTGAAGTAAAAAAGGCTTCTCCCAGCAGAGGAGTTTTAAGGGAGGATTTTGATCCAGTTGAGATTGCCCAAGCGTATGAGAAAGGTGGAGCAGCATGCCTTAGTGTTCTCACAGATCAGAAGTTTTTTCAGGGAAGCTTTGAGAATCTGGAGAAGATAAGGAATGCTGGAGTGAAGTGCCCTCTCTTGTGCAAGGAGTTTGTGGTTGATGCATGGCAGATCTACTATGCCCGATCCAAAGGTGCCGATGCCATTCTTTTGATTGCTGCTGTTTTGCCTGATCTCGACATTAAATATATGACTAAAATCTGCAAGATGGTCGGTTTGACCCCCCTTGTTGAGGTTCATGACGAGAAGGAAATGGATCGTATGCTCGCAATTGATGGCATTGAGCTTATTGGCATCAACAATCGGAATCTTGAAACATTTGAGGTCGATATCAGCAACACAAAAAGGCTTCTTGAAGGAGAGCGTGGACAAAAGATCCGCAAGAAGAACGTAACAATAGTGGGAGAATCTGGGCTGTTCACTCCTGATGATATTGCTTATGTGCAAGAAGCTGGTGTTAAAGCTGTTCTGGTTGGCGAGTCGATTGTGAAACAGAGCGACCCGACGAAGGGAATAACTGGACTTTTTGAGAAACTTCCACTCGAACTCGCCTCTATTTCTCCGGCGATGTTCTCTGCCGGAGCCGCGGCCTCCGGCGAACTGGAGTTCCGGTGGGACGATGACGCGTGGTACAATGTGACCGTGAAACTCGAAGGCGATGATCTTAGGATCAGTTATTGCGAGTTCGATAAGGAACATGACAATGTCTTCAACGCCAACCATTTCCGGAGCTTATCGGAGTTGAGCGACTTCGAAGCTAGGTTTCGGCCTCTGTCCAGACAGTTGCAGGATTCCGAATGCCCTAACGTCGTCCCTGGAATGCCCGTTTGCGCTTCCCACTCCTCTCGAGCCGATGATGTTCGCTTCTACGATGCTCTTGTGGAAGGGGTGGATTATCTTGAACACTCTTATGCAAATGGAGAAGAAGAGTGCTTATGCAACTTTATCCTTTTATGGCAGCATGGTCCAAACTCTGGGAATTTGACATTTGCCAGTATTGCTAACTTGTGTCAAATTCAATTTGATGAAATTAATGACACAGTGTTAGCAACCTTCTTCGCGAAAGTTAGGGAGAAAATCCAAACCAGAATGAATAGAGGTGGTACCTGTTCTGAAGATCGCCTTCTCACCCATAATGACGGTGGTGCTCATCAGAAGGATGAATGCAGCCTAAAATTAAAGCGTCGGCTATCCTTTTTTGAACGCATGGACCAGGTGGACACACGGCGTGCCAAGCGTTCTTCTGGGGCAGTAGGACTATGGGAAGACCAACAGACTCTCAGTTCCAGAAAAAGTGGGGTGATCGAGCAAGATACTGATATTGGTGGAGTGAAGTATCAGTACATGATTTTACTTGAGAATCTAGATAAAGAATTGTCTCCAGTAAAACTTGCCAAATTCTTACATGCACA
AACATTGATATTACCTCGAGTATACATTTTTCCAAGTTTAACATTTGAGGCGTATGCAAAGGGAGCTGTTGTATTGAATTGCAGAAAGAACTTAGAGAGGTTGTGCGATTTTTTGGATAATCCAGACCATGTCATTTTATCCTCCCAAGGAAGGCCCTTGGTAGTAACCGGAAGAATAGCAGGACGCGAAACTTTTGGGACATTGGCGGCAGGGGCCATGGTGCTAGACTCGGAAAATAAATTTGGTAATGAAAAAGATGGGAGGGTGCGTTGCGAACTGAAGGTTGTGAAAGTAGGAACAAATGAATATTTGACTGCAAAGCACATGAAGGAATTGTTCATGGAGTTTCTTAACCATCAAAGGAGGTTGCACCAAAGATTGGCCATGGAGGAGGGAAAGATCTATTGCAATGGTGCTTTGTAA

Protein sequence

MEGLASLRTIPKVSFPPISSSTRRSKFLSRRLNFSPSMDTSLRNSISLASSSSSSPISTAIRAQQIESEAGSAAASPVTESEENALKVKEWEVGMFQDEVAASQGIRIRRRPPTGPPLHYMKERRPLVSLKRDLERAPPARDFLGALKAAYLRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDQKFFQGSFENLEKIRNAGVKCPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERGQKIRKKNVTIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGITGLFEKLPLELASISPAMFSAGAAASGELEFRWDDDAWYNVTVKLEGDDLRISYCEFDKEHDNVFNANHFRSLSELSDFEARFRPLSRQLQDSECPNVVPGMPVCASHSSRADDVRFYDALVEGVDYLEHSYANGEEECLCNFILLWQHGPNSGNLTFASIANLCQIQFDEINDTVLATFFAKVREKIQTRMNRGGTCSEDRLLTHNDGGAHQKDECSLKLKRRLSFFERMDQVDTRRAKRSSGAVGLWEDQQTLSSRKSGVIEQDTDIGGVKYQYMILLENLDKELSPVKLAKFLHAQTLILPRVYIFPSLTFEAYAKGAVVLNCRKNLERLCDFLDNPDHVILSSQGRPLVVTGRIAGRETFGTLAAGAMVLDSENKFGNEKDGRVRCELKVVKVGTNEYLTAKHMKELFMEFLNHQRRLHQRLAMEEGKIYCNGAL
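The protein sequence above corresponds to a codon-by-codon translation of the coding sequence. The following is a minimal sketch of that relationship, assuming the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are illustrative, and only two short fragments of the CDS (its first 30 and last 15 bases) are used rather than the full sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 15 bases of the CDS shown on this page.
cds_start = "ATGGAGGGCCTCGCTTCCCTCAGAACAATT"
cds_end = "AATGGTGCTTTGTAA"
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first ten residues (MEGLASLRTI) and the last four residues (NGAL) of the protein sequence listed above.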
Homology
BLAST of CaUC02G028810 vs. NCBI nr
Match: XP_038888477.1 (uncharacterized protein LOC120078315 isoform X1 [Benincasa hispida])

HSP 1 Score: 772.3 bits (1993), Expect = 4.1e-219
Identity = 382/422 (90.52%), Positives = 397/422 (94.08%), Query Frame = 0

Query: 386 MFSAGAAASGELEFRWDDDAWYNVTVKLEGDDLRISYCEFDKEHDNVFNANHFRSLSELS 445
           MFSAG AASG++EF+ DDDAWYNVTVKLEGDDLRISYCEF +EHDNVFNANHFRSLSELS
Sbjct: 1   MFSAGDAASGDIEFQCDDDAWYNVTVKLEGDDLRISYCEFGEEHDNVFNANHFRSLSELS 60

Query: 446 DFEARFRPLSRQLQDSECPNVVPGMPVCASHSSRADDVRFYDALVEGVDYLEHSYANGEE 505
           +FEARFRPLSRQLQDSECPNV PGMPVCAS+SS+ADDVRFYDALVEGVDYLEHSYANGEE
Sbjct: 61  NFEARFRPLSRQLQDSECPNVDPGMPVCASYSSQADDVRFYDALVEGVDYLEHSYANGEE 120

Query: 506 ECLCNFILLWQHGPNSGNLTFASIANLCQIQFDEINDTVLATFFAKVREKIQTRMNRGGT 565
           ECLCNFILLWQHGPNSGNLT ASIAN+CQIQFDEINDTVLATFFAKVREKI TRMNRG T
Sbjct: 121 ECLCNFILLWQHGPNSGNLTIASIANMCQIQFDEINDTVLATFFAKVREKIHTRMNRGDT 180

Query: 566 CSEDRLLTHNDGGAHQKDECSLKLKRRLSFFERMDQVDTRRAKRSSGAVGLWEDQQTLSS 625
           CSEDRLLTHN GG HQKDECSLKLKRRLSFFERMD  DTRRAKRSS  +  WEDQQ LSS
Sbjct: 181 CSEDRLLTHNGGGVHQKDECSLKLKRRLSFFERMDP-DTRRAKRSSVTLEPWEDQQALSS 240

Query: 626 RKSGVIEQDTDIGGVKYQYMILLENLDKELSPVKLAKFLHAQTLILPRVYIFPSLTFEAY 685
           RKS VIE DTDIGG+KYQYMILLENLDK LSPVK+AKFLHAQTLILPRVYIFPSLTFE+Y
Sbjct: 241 RKSEVIEADTDIGGMKYQYMILLENLDKGLSPVKVAKFLHAQTLILPRVYIFPSLTFESY 300

Query: 686 AKGAVVLNCRKNLERLCDFLDNPDHVILSSQGRPLVVTGRIAGRETFGTLAAGAMVLDSE 745
           A+GAVVLNCRKNLERLCDFLDNPDHVILSSQGRPLVVTGRIA  ETFGTL AGAMVLDSE
Sbjct: 301 ARGAVVLNCRKNLERLCDFLDNPDHVILSSQGRPLVVTGRIARHETFGTLTAGAMVLDSE 360

Query: 746 NKFGNEKDGRVRCELKVVKVGTNEYLTAKHMKELFMEFLNHQRRLHQRLAMEEGKIYCNG 805
           NKFGNEKDG V CELKVVKVGT+EYLTAKHMKELFMEFL+HQRRLHQRLAMEE KIYCNG
Sbjct: 361 NKFGNEKDGMVCCELKVVKVGTDEYLTAKHMKELFMEFLSHQRRLHQRLAMEESKIYCNG 420

Query: 806 AL 808
           AL
Sbjct: 421 AL 421
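The Identity and Positives figures in an HSP header are ratios over the alignment length, reported as percentages. A small sketch of that arithmetic, using the counts from the Benincasa hispida hit above, follows; `percent` is a hypothetical helper, and rounding to two decimals is an assumption chosen to match how the values are displayed here.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the alignment length (2 decimals)."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


# Counts taken from the HSP header: 382 identical and 397 positive
# positions over an alignment of 422 columns.
identity = percent(382, 422)
positives = percent(397, 422)
print(identity, positives)
```

Positives count identical residues plus conservative substitutions (the `+` positions in the match line), so that figure is never lower than Identity.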

BLAST of CaUC02G028810 vs. NCBI nr
Match: TYJ09753.1 (hypothetical protein E1A91_A11G161700v1 [Gossypium mustelinum])

HSP 1 Score: 722.6 bits (1864), Expect = 3.7e-204
Identity = 418/838 (49.88%), Positives = 530/838 (63.25%), Query Frame = 0

Query: 1   MEGLASLRTIPKVSFPPISSSTRRSKFLSRRLNFSPSMDTSLRNSISLASSSSSSPISTA 60
           MEGL SL++ P+ S     S  +     +RR     SMD  LR         SS P   A
Sbjct: 1   MEGLISLKSSPRTSLSSFPSFNQPPNSFARRF----SMDLPLRR--------SSFP---A 60

Query: 61  IRAQQIESEAGSAAASPVTESEENALKVKEWEVGMFQDEVAASQGIRIRRRPPTGPPLHY 120
           IRAQQ          SP  E EE+ALKVKEWEVGMFQ+EVAASQGIRIRRRPPT PPLHY
Sbjct: 61  IRAQQ------PGTISPKEEDEEDALKVKEWEVGMFQNEVAASQGIRIRRRPPTSPPLHY 120

Query: 121 --------------------------------MKERRPLVSLKRDLERAPPARDFLGALK 180
                                           MKER+PL +LK+ +E AP  RDF+GALK
Sbjct: 121 VGPFEFRLQNDGNTPRNILEEIVWHKDTEVSQMKERKPLATLKKFIENAPLTRDFVGALK 180

Query: 181 AAYLRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDQKFFQGSFE 240
           AA+ RT LPGLIAEVKKASPSRG+LREDFDPVEIA+AYEKGGAACLSVLTD+KFF+GSFE
Sbjct: 181 AAHSRTGLPGLIAEVKKASPSRGILREDFDPVEIARAYEKGGAACLSVLTDEKFFKGSFE 240

Query: 241 NLEKIRNAGVKCPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVG 300
           NLE IRNAGV+CPLLCKEFV+DAWQIYYAR KGADAILLIAAVLPDLDI+YM KICKM+G
Sbjct: 241 NLEAIRNAGVQCPLLCKEFVIDAWQIYYARIKGADAILLIAAVLPDLDIRYMVKICKMLG 300

Query: 301 LTPLVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERGQKIRKKNVT 360
           L  LVEVHDE+EMDR+L IDGIELIGINNRNLETFEVDISNTK+LLEGE GQ IR+K++ 
Sbjct: 301 LAALVEVHDEREMDRVLGIDGIELIGINNRNLETFEVDISNTKKLLEGEHGQLIRQKDII 360

Query: 361 IVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGITGLFEKLPLELASISPAMFS 420
           +VGESGLFTPD + YVQEAGVKA    E  V+ +      +G    L             
Sbjct: 361 VVGESGLFTPDHVGYVQEAGVKA----ERKVEDTPSVMSHSGDSHSL------------- 420

Query: 421 AGAAASGELEFR-WDDDAWYNVTVKLEG---DDLRISYCEFDKEHDNVFNANHFRSLSEL 480
             A    + EFR + DDAWY+V + LEG   + LR+ Y EF    DNVF A++F+S  EL
Sbjct: 421 LTAEEGYDTEFRSYADDAWYSVQLLLEGERSEKLRVKYDEFPAASDNVFLADNFKSEDEL 480

Query: 481 SDFEARFRPLSRQLQDSECPNVVPGMPVCASHSSRADDVRFYDALVEGVDYLEHSYANGE 540
            DF  RFR +S QLQD  C  VV GM VCAS S  A +V FYDA+V+ V   +HS  NG+
Sbjct: 481 HDFLGRFRKVSAQLQDPNCSKVVKGMSVCASDSFAAGEVLFYDAIVDDVLRKKHSNLNGQ 540

Query: 541 EECLCNFILLWQHGPNSGNLTFASIANLCQIQFDEINDTVLATFFAKVREKIQTRMN-RG 600
           EEC C F+L W HGPN GNLT   +A++C +Q  E++  ++      ++  ++   +   
Sbjct: 541 EECECIFLLFWLHGPNVGNLTNKGVADICLLQDSELHPKLIYFMEISMQNILKALPDFVS 600

Query: 601 GTCSEDRLLTHNDGGAHQKDECSLKLKRRLSFFERMDQVDTRRAKRSSGAVGLWEDQQTL 660
           GT S+D +             C++  + R +    +     +          +W  Q   
Sbjct: 601 GTTSDDFV-------------CNIVARLRETNGRPLSGCLRQGKYAQLSLSEVWPPQGGN 660

Query: 661 SSRKSGVIEQDTDIGGVKYQYMILLENLDKELSPVKLAKFLHAQTLILPRVYIFPSLTFE 720
              +     QDTD+GG K  Y+IL++NL+K+LS   ++KF+H QT I  +VYIFPSL +E
Sbjct: 661 CDNR-----QDTDVGGDKKLYVILVQNLEKDLSSSAVSKFIHEQTSIATQVYIFPSLPWE 720

Query: 721 AYAKGAVVLNCRKNLERLCDFLDNPDHVILSSQGRPLVVTGRIAGRETFGTLAAGAMVLD 780
            Y  G + ++C+K++E+L  FL +P+   +SS GRPLV T +++  + +       ++L 
Sbjct: 721 PYTNGVITMDCKKDVEQLFGFLQSPNQFTVSSSGRPLVATEKLSLNDHW------TLMLK 776

Query: 781 SENKFGNEKDGRVRCELKVVKVGTNEYLTAKHMKELFMEFLNHQRRLHQRLAMEEGKI 802
           S NK  N ++G    ELKVV  GT EY  AK +++LF++F++HQ+ L+++L  EE  I
Sbjct: 781 SPNKLLNRREGEFSSELKVVCSGTEEYKKAKELRDLFLQFIDHQKTLYKKLCTEETSI 776

BLAST of CaUC02G028810 vs. NCBI nr
Match: TYJ09754.1 (hypothetical protein E1A91_A11G161700v1 [Gossypium mustelinum])

HSP 1 Score: 714.9 bits (1844), Expect = 7.7e-202
Identity = 418/847 (49.35%), Positives = 530/847 (62.57%), Query Frame = 0

Query: 1   MEGLASLRTIPKVSFPPISSSTRRSKFLSRRLNFSPSMDTSLRNSISLASSSSSSPISTA 60
           MEGL SL++ P+ S     S  +     +RR     SMD  LR         SS P   A
Sbjct: 1   MEGLISLKSSPRTSLSSFPSFNQPPNSFARRF----SMDLPLRR--------SSFP---A 60

Query: 61  IRAQQIESEAGSAAASPVTESEENALKVKEWEVGMFQDEVAASQGIRIRRRPPTGPPLHY 120
           IRAQQ          SP  E EE+ALKVKEWEVGMFQ+EVAASQGIRIRRRPPT PPLHY
Sbjct: 61  IRAQQ------PGTISPKEEDEEDALKVKEWEVGMFQNEVAASQGIRIRRRPPTSPPLHY 120

Query: 121 --------------------------------MKERRPLVSLKRDLERAPPARDFLGALK 180
                                           MKER+PL +LK+ +E AP  RDF+GALK
Sbjct: 121 VGPFEFRLQNDGNTPRNILEEIVWHKDTEVSQMKERKPLATLKKFIENAPLTRDFVGALK 180

Query: 181 AAYLRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDQKFFQGSFE 240
           AA+ RT LPGLIAEVKKASPSRG+LREDFDPVEIA+AYEKGGAACLSVLTD+KFF+GSFE
Sbjct: 181 AAHSRTGLPGLIAEVKKASPSRGILREDFDPVEIARAYEKGGAACLSVLTDEKFFKGSFE 240

Query: 241 NLEKIRNAGVKCPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVG 300
           NLE IRNAGV+CPLLCKEFV+DAWQIYYAR KGADAILLIAAVLPDLDI+YM KICKM+G
Sbjct: 241 NLEAIRNAGVQCPLLCKEFVIDAWQIYYARIKGADAILLIAAVLPDLDIRYMVKICKMLG 300

Query: 301 LTPLVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERGQKIRKKNVT 360
           L  LVEVHDE+EMDR+L IDGIELIGINNRNLETFEVDISNTK+LLEGE GQ IR+K++ 
Sbjct: 301 LAALVEVHDEREMDRVLGIDGIELIGINNRNLETFEVDISNTKKLLEGEHGQLIRQKDII 360

Query: 361 IVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGITGLFEKLPLELASISPAMFS 420
           +VGESGLFTPD + YVQEAGVKA    E  V+ +      +G    L             
Sbjct: 361 VVGESGLFTPDHVGYVQEAGVKA----ERKVEDTPSVMSHSGDSHSL------------- 420

Query: 421 AGAAASGELEFR-WDDDAWYNVTVKLEG---DDLRISYCEFDKEHDNVFNANHFRSLSEL 480
             A    + EFR + DDAWY+V + LEG   + LR+ Y EF    DNVF A++F+S  EL
Sbjct: 421 LTAEEGYDTEFRSYADDAWYSVQLLLEGERSEKLRVKYDEFPAASDNVFLADNFKSEDEL 480

Query: 481 SDFEARFRPLSRQLQDSECPNVVPGMPVCASHSSRADDVRFYDALVEGVDYLEHSYANGE 540
            DF  RFR +S QLQD  C  VV GM VCAS S  A +V FYDA+V+ V   +HS  NG+
Sbjct: 481 HDFLGRFRKVSAQLQDPNCSKVVKGMSVCASDSFAAGEVLFYDAIVDDVLRKKHSNLNGQ 540

Query: 541 EECLCNFILLWQHGPNSGNLTFASIANLCQIQFDEINDTVLATFFAKVREKIQTRMN-RG 600
           EEC C F+L W HGPN GNLT   +A++C +Q  E++  ++      ++  ++   +   
Sbjct: 541 EECECIFLLFWLHGPNVGNLTNKGVADICLLQDSELHPKLIYFMEISMQNILKALPDFVS 600

Query: 601 GTCSEDRLLTHNDGGAHQKDECSLKLKRRLSFFERMDQVDTRRAKRSSGAVGLWEDQQTL 660
           GT S+D +             C++  + R +    +     +          +W  Q   
Sbjct: 601 GTTSDDFV-------------CNIVARLRETNGRPLSGCLRQGKYAQLSLSEVWPPQGGN 660

Query: 661 SSRKSGVIEQDTDIGGVKYQYMILLENLDKELSPVKLAKFLHAQTLILPRVYIFPSLTFE 720
              +     QDTD+GG K  Y+IL++NL+K+LS   ++KF+H QT I  +VYIFPSL +E
Sbjct: 661 CDNR-----QDTDVGGDKKLYVILVQNLEKDLSSSAVSKFIHEQTSIATQVYIFPSLPWE 720

Query: 721 AYAKGAVVLNCRKNLERLCDFLDNPDHVILSSQG---------RPLVVTGRIAGRETFGT 780
            Y  G + ++C+K++E+L  FL +P+   +SS G         RPLV T +++  + +  
Sbjct: 721 PYTNGVITMDCKKDVEQLFGFLQSPNQFTVSSSGRSSIHITACRPLVATEKLSLNDHW-- 780

Query: 781 LAAGAMVLDSENKFGNEKDGRVRCELKVVKVGTNEYLTAKHMKELFMEFLNHQRRLHQRL 802
                ++L S NK  N ++G    ELKVV  GT EY  AK +++LF++F++HQ+ L+++L
Sbjct: 781 ----TLMLKSPNKLLNRREGEFSSELKVVCSGTEEYKKAKELRDLFLQFIDHQKTLYKKL 785

BLAST of CaUC02G028810 vs. NCBI nr
Match: XP_008466441.1 (PREDICTED: uncharacterized protein LOC103503848 isoform X1 [Cucumis melo] >TYK31453.1 uncharacterized protein E5676_scaffold455G007670 [Cucumis melo var. makuwa])

HSP 1 Score: 711.4 bits (1835), Expect = 8.5e-201
Identity = 353/422 (83.65%), Positives = 381/422 (90.28%), Query Frame = 0

Query: 386 MFSAGAAASGELEFRWDDDAWYNVTVKLEGDDLRISYCEFDKEHDNVFNANHFRSLSELS 445
           MFSAG A+SG+LEF  DDDAWYN  VKL+G  LR+SYCEF +EHDNVF+A+HF+SLSELS
Sbjct: 1   MFSAGDASSGDLEFLSDDDAWYNANVKLQGKVLRVSYCEFSEEHDNVFDADHFQSLSELS 60

Query: 446 DFEARFRPLSRQLQDSECPNVVPGMPVCASHSSRADDVRFYDALVEGVDYLEHSYANGEE 505
            FEARFRP+SRQLQDSECPNV PGMPVCAS+SSRADDVRFYDALVEGVDYLEHSYANGEE
Sbjct: 61  VFEARFRPMSRQLQDSECPNVHPGMPVCASYSSRADDVRFYDALVEGVDYLEHSYANGEE 120

Query: 506 ECLCNFILLWQHGPNSGNLTFASIANLCQIQFDEINDTVLATFFAKVREKIQTRMNRGGT 565
           ECLCNFILLWQ GPNSGNLT ASIAN+CQIQFD+INDTVLATFF KVREKI+TR NRG  
Sbjct: 121 ECLCNFILLWQRGPNSGNLTIASIANMCQIQFDKINDTVLATFFRKVREKIETRTNRGNI 180

Query: 566 CSEDRLLTHNDGGAHQKDECSLKLKRRLSFFERMDQVDTRRAKRSSGAVGLWEDQQTLSS 625
           CSED   THN GGA QKD+CSLKLK RLSFFERMDQ +TRRAKRSSG V  WEDQ +LSS
Sbjct: 181 CSEDHFPTHNGGGACQKDDCSLKLKHRLSFFERMDQ-ETRRAKRSSGDVEPWEDQLSLSS 240

Query: 626 RKSGVIEQDTDIGGVKYQYMILLENLDKELSPVKLAKFLHAQTLILPRVYIFPSLTFEAY 685
           RK  VIEQDTDIGG+KYQYMILLENLDK L+P+KLAKFL+ +TLILPRVYIFPSLTFE Y
Sbjct: 241 RKREVIEQDTDIGGMKYQYMILLENLDKGLAPLKLAKFLYEETLILPRVYIFPSLTFELY 300

Query: 686 AKGAVVLNCRKNLERLCDFLDNPDHVILSSQGRPLVVTGRIAGRETFGTLAAGAMVLDSE 745
           A+GAVV+NCRKNL+RL DFLD+PDHVILSSQGRPLVVTG+IA  ETFGTLAAGAMVLDSE
Sbjct: 301 ARGAVVMNCRKNLKRLYDFLDSPDHVILSSQGRPLVVTGKIARHETFGTLAAGAMVLDSE 360

Query: 746 NKFGNEKDGRVRCELKVVKVGTNEYLTAKHMKELFMEFLNHQRRLHQRLAMEEGKIYCNG 805
           NKFGNEKDGR  CELKVVKVGT+EYLTAKHMKELF+EFL HQR L QRLAMEE KIYCNG
Sbjct: 361 NKFGNEKDGRASCELKVVKVGTDEYLTAKHMKELFVEFLCHQRGLQQRLAMEESKIYCNG 420

Query: 806 AL 808
           AL
Sbjct: 421 AL 421

BLAST of CaUC02G028810 vs. NCBI nr
Match: OVA19762.1 (Indole-3-glycerol phosphate synthase [Macleaya cordata])

HSP 1 Score: 706.4 bits (1822), Expect = 2.7e-199
Identity = 394/785 (50.19%), Positives = 505/785 (64.33%), Query Frame = 0

Query: 1   MEGLASLRTIPKVSFPPISS--STRRSKFLSRRLNFSPSMDTSLRNSISLASSSSSSPIS 60
           ME  +   T P+VS P IS+   T+RS  + R +N    + T  R SI+L          
Sbjct: 1   MESTSIRATTPRVSVPDISTLKCTQRSLMIRRSMNLGSPLKTG-RKSIAL---------- 60

Query: 61  TAIRAQQIESEAGSAAASPVTESEENALKVKEWEVGMFQDEVAASQGIRIRRRPPTGPPL 120
             IRAQQ E + GS   SP ++  +NALK+KEWEVG FQ+E+AA+QGIRIRRRPPTGPPL
Sbjct: 61  RCIRAQQSELKDGSVTISPESD-YKNALKIKEWEVGRFQEEIAANQGIRIRRRPPTGPPL 120

Query: 121 HY--------------------------------MKERRPLVSLKRDLERAPPARDFLGA 180
           HY                                +KERRPL SLK+ L+ APP RDF+GA
Sbjct: 121 HYVGPFEFRLQNEGNTPRNILEEIVWNKDMEVSQLKERRPLSSLKKALDNAPPVRDFVGA 180

Query: 181 LKAAYLRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDQKFFQGS 240
           L+ ++LRT LP LIAEVKKASPSRGVLREDFDPV+IAQAYEKGGAACLSVLTD+K+FQGS
Sbjct: 181 LRKSHLRTGLPALIAEVKKASPSRGVLREDFDPVKIAQAYEKGGAACLSVLTDEKYFQGS 240

Query: 241 FENLEKIRNAGVKCPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKM 300
           FENLEKIR AG++CPLLCKEF++DAWQIYYAR+KGADAILLIAAVLPDLDI+YMTKICK+
Sbjct: 241 FENLEKIRGAGIECPLLCKEFIIDAWQIYYARTKGADAILLIAAVLPDLDIQYMTKICKI 300

Query: 301 VGLTPLVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERGQKIRKKN 360
           +GL  LVEVHDE+EMDR+L IDGIELIGINNRNLETFEVDISNTK+LLEGERG+ IR+K+
Sbjct: 301 LGLAALVEVHDEREMDRVLGIDGIELIGINNRNLETFEVDISNTKQLLEGERGEIIRQKD 360

Query: 361 VTIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGITGLFEKLPLELASISPAM 420
           + +VGESGLFTPDDIAYVQEAGVKAVLVGESIVKQ+DP KGI+GLFE       S+S   
Sbjct: 361 IIVVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQNDPGKGISGLFE------MSLSTTD 420

Query: 421 FSAGAAASGE-----LEFRWDDDAWYNVTVKLEGDDLRISYCEFDKEHDNVFNANHFRSL 480
            S G   + +     L+FR  DDAWY + + LE + L + Y  F +E++  +NA  F++L
Sbjct: 421 HSTGGNVTSDSPTPILDFRSTDDAWYTILLALENETLTVKYVGFSEEYNEEYNAGDFKNL 480

Query: 481 SELSDFEARFRPLSRQLQDSECPNVVPGMPVCASHSSRADDVRFYDALVEGVDYLEHSYA 540
            ++  FE +FRP S QLQD+EC  +V GM V AS+     D++FY+A+++ + Y  H + 
Sbjct: 481 KQVEFFEEKFRPASVQLQDTECWMLVTGMTVSASYVYSEFDLKFYNAVIDSITYKAHKFD 540

Query: 541 NGEEECLCNFILLWQHGPNSGNLTFASIANLCQIQFDEIN-DTVLATFFAKVREKIQTRM 600
            GEEEC C+F++ W HGP  G+     I ++C++       D  LA+F    REK++   
Sbjct: 541 EGEEECRCSFVVSWLHGPMVGSKESIRIEHICKLTSGSAQIDPTLASFLKMSREKLE--- 600

Query: 601 NRGGTCSEDRLLTHNDGGAHQKDECSLKLKRRLSFFERMDQVDTRRAKRSSGAVGLWEDQ 660
                     L +HN G    +D+CSL                              E  
Sbjct: 601 ----------LASHNSGFI-VEDDCSL------------------------------EGV 660

Query: 661 QTLSSRKSGVIEQDTDIGGVKYQ----------YMILLENLDKELSPVKLAKFLHAQTLI 720
             +  +K+G   QD D+GG              + IL++NL+K+L+P  +  F+H    I
Sbjct: 661 AVIDWKKNG---QDIDMGGKPITAENLREKDNCHFILIDNLEKDLAPRTIVNFIHKHVSI 720

Query: 721 LPRVYIFPSLTFEAYAKGAVVLNCRKNLERLCDFLDNPDHVILSSQGRPLVVTGRIAGRE 736
             + Y+FPSL  E Y +G +V++ ++ L++L DFL NP H+I+SS GRP V+T       
Sbjct: 721 KSQAYVFPSLLSETYTRGIIVVDSKEKLQKLSDFLRNPAHLIMSSSGRPWVMTEEKLKDR 720

BLAST of CaUC02G028810 vs. ExPASy Swiss-Prot
Match: P49572 (Indole-3-glycerol phosphate synthase, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=IGPS PE=1 SV=2)

HSP 1 Score: 438.7 bits (1127), Expect = 1.4e-121
Identity = 240/406 (59.11%), Positives = 291/406 (71.67%), Query Frame = 0

Query: 1   MEGLASLRTIP-KVSFPPISSSTRRSKFLSRRLNFSPSMDTSLRNSISLASSSSSSPIST 60
           MEGL  ++ +P KV+ P +            R N S S+  S+         +  +P   
Sbjct: 1   MEGLVPVQRLPIKVASPSL-----------YRCNNSVSIRRSISGFAMDRKINFRAPSQF 60

Query: 61  AIRAQQIESEAGSAAASPVTESEENALKVKEWEVGMFQDEVAASQGIRIRRRPPTGPPLH 120
           +IRAQQ + +   A +S   E + N L++KEWEV M+Q+E+A SQGIRIRR+PP+  PL 
Sbjct: 61  SIRAQQSDLKESLAVSSSSVEDKGNVLRIKEWEVEMYQEELAISQGIRIRRKPPSKAPLG 120

Query: 121 Y---------------------------------MKERRPLVSLKRDLERAPPARDFLGA 180
           Y                                 MKE  PL  LK+ +E APP RDF+GA
Sbjct: 121 YSGPFELRLHNNDADSPRNILEEITWYKDVEVSRMKELNPLDVLKKAVEDAPPTRDFVGA 180

Query: 181 LKAAYLRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDQKFFQGS 240
           L+ A+ RT  PGLIAEVKKASPSRG+L+E+FDPVEIAQAYEKGGAACLSVLTDQK+FQG 
Sbjct: 181 LRMAHKRTGFPGLIAEVKKASPSRGILKENFDPVEIAQAYEKGGAACLSVLTDQKYFQGG 240

Query: 241 FENLEKIRNAGVKCPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKM 300
           FENLE IR+AGVKCPLLCKEFVVD WQIYYAR+KGADA+LLIAAVL DL+I ++ KICK 
Sbjct: 241 FENLEAIRSAGVKCPLLCKEFVVDPWQIYYARTKGADAVLLIAAVLADLEITFLLKICKK 300

Query: 301 VGLTPLVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERGQKIRKKN 360
           + L  LVEVHDE+EM R+L I+GIEL+GINNR+LETFEVDISNTK+LLEGE G++IR+++
Sbjct: 301 LSLAALVEVHDEREMGRVLGIEGIELVGINNRSLETFEVDISNTKKLLEGEHGRQIRERD 360

Query: 361 VTIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGITGLF 373
           + +VGESGLFTPDDIAYVQ AGVKAVLVGESIVKQ+DP KGI GLF
Sbjct: 361 MIVVGESGLFTPDDIAYVQAAGVKAVLVGESIVKQNDPEKGIAGLF 395
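
The percentages in an HSP header follow directly from the residue counts. As a quick consistency check, the sketch below parses one header line and recomputes both values. The function name `parse_hsp_stats` is hypothetical and is not part of BLAST or of this database; the regular expression only assumes the header layout shown in the HSP blocks on this page and tolerates the "Postives" spelling that some output uses.

```python
import re

# Hypothetical helper, written only for the header layout shown on this page.
# "Posi?tives" accepts both "Positives" and the misspelled "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages from one HSP header line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(ident_len),
        # Recomputed from the counts, rounded to two decimals like the report.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the header, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


if __name__ == "__main__":
    header = "Identity = 240/406 (59.11%), Positives = 291/406 (71.67%), Query Frame = 0"
    stats = parse_hsp_stats(header)
    print(stats["identity_pct"], stats["positives_pct"])
```

For the P49572 hit, the recomputed values agree with the reported ones: 240 of 406 aligned positions are identical and 291 of 406 are scored as positive.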

BLAST of CaUC02G028810 vs. ExPASy Swiss-Prot
Match: B0JTM2 (Indole-3-glycerol phosphate synthase OS=Microcystis aeruginosa (strain NIES-843) OX=449447 GN=trpC PE=3 SV=1)

HSP 1 Score: 299.7 bits (766), Expect = 1.0e-79
Identity = 156/255 (61.18%), Positives = 192/255 (75.29%), Query Frame = 0

Query: 121 MKERRPLVSLKRDLERAPPARDFLGALKAAYLRTNLPGLIAEVKKASPSRGVLREDFDPV 180
           ++ER PL+ L++ +    P  DFL ALK    +   P LIAEVKKASPS+GV+ EDFDPV
Sbjct: 46  LRERLPLLELRQKIANTAPPCDFLAALKQGKTQ---PALIAEVKKASPSKGVILEDFDPV 105

Query: 181 EIAQAYEKGGAACLSVLTDQKFFQGSFENLEKIRNAGVKCPLLCKEFVVDAWQIYYARSK 240
            IA+ YE+GGA CLSVLTD KFFQGS+ENL  +R A V  PLLCKEF++  +QIYYARSK
Sbjct: 106 AIARTYEQGGATCLSVLTDSKFFQGSYENLTLVRQA-VSLPLLCKEFILYPYQIYYARSK 165

Query: 241 GADAILLIAAVLPDLDIKYMTKICKMVGLTPLVEVHDEKEMDRMLAIDGIELIGINNRNL 300
           GADA+LLIAA+L D D+ Y  KI K +G+T LVEVH   E DR+LAI+GIELIGINNRNL
Sbjct: 166 GADAVLLIAAILSDQDLAYFVKIVKGLGMTALVEVHSLAEFDRVLAIEGIELIGINNRNL 225

Query: 301 ETFEVDISNTKRLLEGERGQKIRKKNVTIVGESGLFTPDDIAYVQEAGVKAVLVGESIVK 360
           ETF VD+ NT++LLE  RG+++R+K + IV ESGL T  D+A V++AG  AVL+GES+VK
Sbjct: 226 ETFTVDLDNTRQLLEA-RGEQVREKGILIVSESGLHTATDLAKVKQAGANAVLIGESLVK 285

Query: 361 QSDPTKGITGLFEKL 376
             DP  GI  LFE L
Sbjct: 286 LPDPALGIQKLFENL 295

BLAST of CaUC02G028810 vs. ExPASy Swiss-Prot
Match: B7K0H0 (Indole-3-glycerol phosphate synthase OS=Rippkaea orientalis (strain PCC 8801) OX=41431 GN=trpC PE=3 SV=1)

HSP 1 Score: 296.2 bits (757), Expect = 1.1e-78
Identity = 154/281 (54.80%), Positives = 198/281 (70.46%), Query Frame = 0

Query: 106 IRIRRRPPTGPPLHY--------------MKERRPLVSLKRDLERAPPARDFLGALKAAY 165
           +R + + P G P H               ++E  PL+ L++ ++  PP +DFLGA+    
Sbjct: 17  LRYQVKVPDGEPRHILEEIVWHKEKEVDRLRESLPLLELRKQVQHLPPPQDFLGAITQGK 76

Query: 166 LRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDQKFFQGSFENLE 225
            +   P LIAEVKKASPS+GV+REDFDPV IAQAY KGGA+CLSVLTD KFFQGSFENL 
Sbjct: 77  TQ---PALIAEVKKASPSKGVIREDFDPVAIAQAYVKGGASCLSVLTDAKFFQGSFENLA 136

Query: 226 KIRNAGVKCPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTP 285
            +R + V  PLLCKEF++  +QIY AR+KGADA+LLIAA+L D D+ Y  KI + +G+T 
Sbjct: 137 LVRQS-VDLPLLCKEFILYPYQIYLARTKGADAVLLIAAILSDRDLSYFLKIIQTLGMTA 196

Query: 286 LVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERGQKIRKKNVTIVG 345
           L+EVH   E+DR+LAI+G+ LIGINNRNLETFEVD+  T +LL   R  KI+   + I+ 
Sbjct: 197 LIEVHSLTELDRVLAIEGVSLIGINNRNLETFEVDLKTTSQLLAARR-DKIQALGIKIIS 256

Query: 346 ESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGITGLF 373
           ESGL TPDD+ +VQ+AG   VL+GES+VKQ DPT+ I  LF
Sbjct: 257 ESGLHTPDDLQFVQQAGSDGVLIGESLVKQPDPTQAIANLF 292

BLAST of CaUC02G028810 vs. ExPASy Swiss-Prot
Match: B1WQE4 (Indole-3-glycerol phosphate synthase OS=Crocosphaera subtropica (strain ATCC 51142 / BH68) OX=43989 GN=trpC PE=3 SV=1)

HSP 1 Score: 288.9 bits (738), Expect = 1.8e-76
Identity = 147/252 (58.33%), Positives = 189/252 (75.00%), Query Frame = 0

Query: 121 MKERRPLVSLKRDLERAPPARDFLGALKAAYLRTNLPGLIAEVKKASPSRGVLREDFDPV 180
           M++R  L+ L++  + APPA+DFLGA+     +   P LIAEVKKASPS+GV+REDF+PV
Sbjct: 46  MRDRLSLLDLRKQEQSAPPAKDFLGAISQGKTQ---PALIAEVKKASPSKGVIREDFEPV 105

Query: 181 EIAQAYEKGGAACLSVLTDQKFFQGSFENLEKIRNAGVKCPLLCKEFVVDAWQIYYARSK 240
            IAQAY +GGA+CLSVLTD KFFQGSF+NL  +R A V  PLLCKEF++  +QIY AR K
Sbjct: 106 AIAQAYVQGGASCLSVLTDSKFFQGSFDNLALVRQA-VDIPLLCKEFIIYPYQIYLARVK 165

Query: 241 GADAILLIAAVLPDLDIKYMTKICKMVGLTPLVEVHDEKEMDRMLAIDGIELIGINNRNL 300
           GADAILLIAA+L D D++Y+ KI   +G+TPLVEVH   E+DR+LAI+G+ L+GINNRNL
Sbjct: 166 GADAILLIAAILKDSDLQYLIKIIHGLGMTPLVEVHSLAELDRVLAIEGVSLVGINNRNL 225

Query: 301 ETFEVDISNTKRLLEGERGQKIRKKNVTIVGESGLFTPDDIAYVQEAGVKAVLVGESIVK 360
           ETFEV +  T  L+   R  +I+++ + IV ESG+ TP  +  V EAG  AVL+GES+VK
Sbjct: 226 ETFEVSLETTTNLITA-RQDEIKERGIYIVSESGIHTPQHLQQVTEAGANAVLIGESLVK 285

Query: 361 QSDPTKGITGLF 373
           Q DPT+ I  LF
Sbjct: 286 QDDPTQAIANLF 292

BLAST of CaUC02G028810 vs. ExPASy Swiss-Prot
Match: Q55508 (Indole-3-glycerol phosphate synthase OS=Synechocystis sp. (strain PCC 6803 / Kazusa) OX=1111708 GN=trpC PE=3 SV=1)

HSP 1 Score: 275.0 bits (702), Expect = 2.7e-72
Identity = 149/297 (50.17%), Positives = 204/297 (68.69%), Query Frame = 0

Query: 106 IRIRRRPPTGP----------------PLHYMKE-----------RR---PLVSLKRDLE 165
           + IRRRPP  P                P H ++E           RR   PLV L+  ++
Sbjct: 1   MEIRRRPPNPPIKVDILQYQIKHPEAAPRHILEEIVWHKEKEVAQRRELVPLVKLQSLVK 60

Query: 166 RAPPARDFLGALKAAYLRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKGGAACLS 225
              P  DF+GAL+ +      P LIAEVKKASPS+G++R DFDPV IA+AYE GGA CLS
Sbjct: 61  DMTPPLDFVGALRQS---PRQPALIAEVKKASPSKGIIRADFDPVAIAKAYEAGGANCLS 120

Query: 226 VLTDQKFFQGSFENLEKIRNAGVKCPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDL 285
           VLTD+KFFQGSFENL+ +R+A V+ PLLCKEF++  +QIY ARS+GADA+LLIAA+L D 
Sbjct: 121 VLTDEKFFQGSFENLQLVRSA-VQLPLLCKEFIIYPYQIYLARSRGADAVLLIAAILSDK 180

Query: 286 DIKYMTKICKMVGLTPLVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLE 345
           D++Y  KI + +G+  LVEVH  +EMDR+LA+DG++LIG+NNRNL+TF VD+  T+ L  
Sbjct: 181 DLRYFLKIIEGLGMAALVEVHTLEEMDRVLALDGVQLIGVNNRNLQTFTVDLQTTEDLF- 240

Query: 346 GERGQKIRKKNVTIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGITGLF 373
            +R +++ + ++T+V ESG++   D+  +Q+AG +AVLVGES+VKQ DP + I  L+
Sbjct: 241 AQRREQLTQGDITLVSESGIYELADLQRLQQAGARAVLVGESLVKQPDPQQAIAALY 292

BLAST of CaUC02G028810 vs. ExPASy TrEMBL
Match: A0A5D2X7G9 (Indole-3-glycerol-phosphate synthase OS=Gossypium mustelinum OX=34275 GN=E1A91_A11G161700v1 PE=3 SV=1)

HSP 1 Score: 722.6 bits (1864), Expect = 1.8e-204
Identity = 418/838 (49.88%), Positives = 530/838 (63.25%), Query Frame = 0

Query: 1   MEGLASLRTIPKVSFPPISSSTRRSKFLSRRLNFSPSMDTSLRNSISLASSSSSSPISTA 60
           MEGL SL++ P+ S     S  +     +RR     SMD  LR         SS P   A
Sbjct: 1   MEGLISLKSSPRTSLSSFPSFNQPPNSFARRF----SMDLPLRR--------SSFP---A 60

Query: 61  IRAQQIESEAGSAAASPVTESEENALKVKEWEVGMFQDEVAASQGIRIRRRPPTGPPLHY 120
           IRAQQ          SP  E EE+ALKVKEWEVGMFQ+EVAASQGIRIRRRPPT PPLHY
Sbjct: 61  IRAQQ------PGTISPKEEDEEDALKVKEWEVGMFQNEVAASQGIRIRRRPPTSPPLHY 120

Query: 121 --------------------------------MKERRPLVSLKRDLERAPPARDFLGALK 180
                                           MKER+PL +LK+ +E AP  RDF+GALK
Sbjct: 121 VGPFEFRLQNDGNTPRNILEEIVWHKDTEVSQMKERKPLATLKKFIENAPLTRDFVGALK 180

Query: 181 AAYLRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDQKFFQGSFE 240
           AA+ RT LPGLIAEVKKASPSRG+LREDFDPVEIA+AYEKGGAACLSVLTD+KFF+GSFE
Sbjct: 181 AAHSRTGLPGLIAEVKKASPSRGILREDFDPVEIARAYEKGGAACLSVLTDEKFFKGSFE 240

Query: 241 NLEKIRNAGVKCPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVG 300
           NLE IRNAGV+CPLLCKEFV+DAWQIYYAR KGADAILLIAAVLPDLDI+YM KICKM+G
Sbjct: 241 NLEAIRNAGVQCPLLCKEFVIDAWQIYYARIKGADAILLIAAVLPDLDIRYMVKICKMLG 300

Query: 301 LTPLVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERGQKIRKKNVT 360
           L  LVEVHDE+EMDR+L IDGIELIGINNRNLETFEVDISNTK+LLEGE GQ IR+K++ 
Sbjct: 301 LAALVEVHDEREMDRVLGIDGIELIGINNRNLETFEVDISNTKKLLEGEHGQLIRQKDII 360

Query: 361 IVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGITGLFEKLPLELASISPAMFS 420
           +VGESGLFTPD + YVQEAGVKA    E  V+ +      +G    L             
Sbjct: 361 VVGESGLFTPDHVGYVQEAGVKA----ERKVEDTPSVMSHSGDSHSL------------- 420

Query: 421 AGAAASGELEFR-WDDDAWYNVTVKLEG---DDLRISYCEFDKEHDNVFNANHFRSLSEL 480
             A    + EFR + DDAWY+V + LEG   + LR+ Y EF    DNVF A++F+S  EL
Sbjct: 421 LTAEEGYDTEFRSYADDAWYSVQLLLEGERSEKLRVKYDEFPAASDNVFLADNFKSEDEL 480

Query: 481 SDFEARFRPLSRQLQDSECPNVVPGMPVCASHSSRADDVRFYDALVEGVDYLEHSYANGE 540
            DF  RFR +S QLQD  C  VV GM VCAS S  A +V FYDA+V+ V   +HS  NG+
Sbjct: 481 HDFLGRFRKVSAQLQDPNCSKVVKGMSVCASDSFAAGEVLFYDAIVDDVLRKKHSNLNGQ 540

Query: 541 EECLCNFILLWQHGPNSGNLTFASIANLCQIQFDEINDTVLATFFAKVREKIQTRMN-RG 600
           EEC C F+L W HGPN GNLT   +A++C +Q  E++  ++      ++  ++   +   
Sbjct: 541 EECECIFLLFWLHGPNVGNLTNKGVADICLLQDSELHPKLIYFMEISMQNILKALPDFVS 600

Query: 601 GTCSEDRLLTHNDGGAHQKDECSLKLKRRLSFFERMDQVDTRRAKRSSGAVGLWEDQQTL 660
           GT S+D +             C++  + R +    +     +          +W  Q   
Sbjct: 601 GTTSDDFV-------------CNIVARLRETNGRPLSGCLRQGKYAQLSLSEVWPPQGGN 660

Query: 661 SSRKSGVIEQDTDIGGVKYQYMILLENLDKELSPVKLAKFLHAQTLILPRVYIFPSLTFE 720
              +     QDTD+GG K  Y+IL++NL+K+LS   ++KF+H QT I  +VYIFPSL +E
Sbjct: 661 CDNR-----QDTDVGGDKKLYVILVQNLEKDLSSSAVSKFIHEQTSIATQVYIFPSLPWE 720

Query: 721 AYAKGAVVLNCRKNLERLCDFLDNPDHVILSSQGRPLVVTGRIAGRETFGTLAAGAMVLD 780
            Y  G + ++C+K++E+L  FL +P+   +SS GRPLV T +++  + +       ++L 
Sbjct: 721 PYTNGVITMDCKKDVEQLFGFLQSPNQFTVSSSGRPLVATEKLSLNDHW------TLMLK 776

Query: 781 SENKFGNEKDGRVRCELKVVKVGTNEYLTAKHMKELFMEFLNHQRRLHQRLAMEEGKI 802
           S NK  N ++G    ELKVV  GT EY  AK +++LF++F++HQ+ L+++L  EE  I
Sbjct: 781 SPNKLLNRREGEFSSELKVVCSGTEEYKKAKELRDLFLQFIDHQKTLYKKLCTEETSI 776

BLAST of CaUC02G028810 vs. ExPASy TrEMBL
Match: A0A5D2X7H3 (Indole-3-glycerol-phosphate synthase OS=Gossypium mustelinum OX=34275 GN=E1A91_A11G161700v1 PE=3 SV=1)

HSP 1 Score: 714.9 bits (1844), Expect = 3.7e-202
Identity = 418/847 (49.35%), Positives = 530/847 (62.57%), Query Frame = 0

Query: 1   MEGLASLRTIPKVSFPPISSSTRRSKFLSRRLNFSPSMDTSLRNSISLASSSSSSPISTA 60
           MEGL SL++ P+ S     S  +     +RR     SMD  LR         SS P   A
Sbjct: 1   MEGLISLKSSPRTSLSSFPSFNQPPNSFARRF----SMDLPLRR--------SSFP---A 60

Query: 61  IRAQQIESEAGSAAASPVTESEENALKVKEWEVGMFQDEVAASQGIRIRRRPPTGPPLHY 120
           IRAQQ          SP  E EE+ALKVKEWEVGMFQ+EVAASQGIRIRRRPPT PPLHY
Sbjct: 61  IRAQQ------PGTISPKEEDEEDALKVKEWEVGMFQNEVAASQGIRIRRRPPTSPPLHY 120

Query: 121 --------------------------------MKERRPLVSLKRDLERAPPARDFLGALK 180
                                           MKER+PL +LK+ +E AP  RDF+GALK
Sbjct: 121 VGPFEFRLQNDGNTPRNILEEIVWHKDTEVSQMKERKPLATLKKFIENAPLTRDFVGALK 180

Query: 181 AAYLRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDQKFFQGSFE 240
           AA+ RT LPGLIAEVKKASPSRG+LREDFDPVEIA+AYEKGGAACLSVLTD+KFF+GSFE
Sbjct: 181 AAHSRTGLPGLIAEVKKASPSRGILREDFDPVEIARAYEKGGAACLSVLTDEKFFKGSFE 240

Query: 241 NLEKIRNAGVKCPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVG 300
           NLE IRNAGV+CPLLCKEFV+DAWQIYYAR KGADAILLIAAVLPDLDI+YM KICKM+G
Sbjct: 241 NLEAIRNAGVQCPLLCKEFVIDAWQIYYARIKGADAILLIAAVLPDLDIRYMVKICKMLG 300

Query: 301 LTPLVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERGQKIRKKNVT 360
           L  LVEVHDE+EMDR+L IDGIELIGINNRNLETFEVDISNTK+LLEGE GQ IR+K++ 
Sbjct: 301 LAALVEVHDEREMDRVLGIDGIELIGINNRNLETFEVDISNTKKLLEGEHGQLIRQKDII 360

Query: 361 IVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGITGLFEKLPLELASISPAMFS 420
           +VGESGLFTPD + YVQEAGVKA    E  V+ +      +G    L             
Sbjct: 361 VVGESGLFTPDHVGYVQEAGVKA----ERKVEDTPSVMSHSGDSHSL------------- 420

Query: 421 AGAAASGELEFR-WDDDAWYNVTVKLEG---DDLRISYCEFDKEHDNVFNANHFRSLSEL 480
             A    + EFR + DDAWY+V + LEG   + LR+ Y EF    DNVF A++F+S  EL
Sbjct: 421 LTAEEGYDTEFRSYADDAWYSVQLLLEGERSEKLRVKYDEFPAASDNVFLADNFKSEDEL 480

Query: 481 SDFEARFRPLSRQLQDSECPNVVPGMPVCASHSSRADDVRFYDALVEGVDYLEHSYANGE 540
            DF  RFR +S QLQD  C  VV GM VCAS S  A +V FYDA+V+ V   +HS  NG+
Sbjct: 481 HDFLGRFRKVSAQLQDPNCSKVVKGMSVCASDSFAAGEVLFYDAIVDDVLRKKHSNLNGQ 540

Query: 541 EECLCNFILLWQHGPNSGNLTFASIANLCQIQFDEINDTVLATFFAKVREKIQTRMN-RG 600
           EEC C F+L W HGPN GNLT   +A++C +Q  E++  ++      ++  ++   +   
Sbjct: 541 EECECIFLLFWLHGPNVGNLTNKGVADICLLQDSELHPKLIYFMEISMQNILKALPDFVS 600

Query: 601 GTCSEDRLLTHNDGGAHQKDECSLKLKRRLSFFERMDQVDTRRAKRSSGAVGLWEDQQTL 660
           GT S+D +             C++  + R +    +     +          +W  Q   
Sbjct: 601 GTTSDDFV-------------CNIVARLRETNGRPLSGCLRQGKYAQLSLSEVWPPQGGN 660

Query: 661 SSRKSGVIEQDTDIGGVKYQYMILLENLDKELSPVKLAKFLHAQTLILPRVYIFPSLTFE 720
              +     QDTD+GG K  Y+IL++NL+K+LS   ++KF+H QT I  +VYIFPSL +E
Sbjct: 661 CDNR-----QDTDVGGDKKLYVILVQNLEKDLSSSAVSKFIHEQTSIATQVYIFPSLPWE 720

Query: 721 AYAKGAVVLNCRKNLERLCDFLDNPDHVILSSQG---------RPLVVTGRIAGRETFGT 780
            Y  G + ++C+K++E+L  FL +P+   +SS G         RPLV T +++  + +  
Sbjct: 721 PYTNGVITMDCKKDVEQLFGFLQSPNQFTVSSSGRSSIHITACRPLVATEKLSLNDHW-- 780

Query: 781 LAAGAMVLDSENKFGNEKDGRVRCELKVVKVGTNEYLTAKHMKELFMEFLNHQRRLHQRL 802
                ++L S NK  N ++G    ELKVV  GT EY  AK +++LF++F++HQ+ L+++L
Sbjct: 781 ----TLMLKSPNKLLNRREGEFSSELKVVCSGTEEYKKAKELRDLFLQFIDHQKTLYKKL 785

BLAST of CaUC02G028810 vs. ExPASy TrEMBL
Match: A0A5D3E5V0 (SAWADEE domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold455G007670 PE=4 SV=1)

HSP 1 Score: 711.4 bits (1835), Expect = 4.1e-201
Identity = 353/422 (83.65%), Positives = 381/422 (90.28%), Query Frame = 0

Query: 386 MFSAGAAASGELEFRWDDDAWYNVTVKLEGDDLRISYCEFDKEHDNVFNANHFRSLSELS 445
           MFSAG A+SG+LEF  DDDAWYN  VKL+G  LR+SYCEF +EHDNVF+A+HF+SLSELS
Sbjct: 1   MFSAGDASSGDLEFLSDDDAWYNANVKLQGKVLRVSYCEFSEEHDNVFDADHFQSLSELS 60

Query: 446 DFEARFRPLSRQLQDSECPNVVPGMPVCASHSSRADDVRFYDALVEGVDYLEHSYANGEE 505
            FEARFRP+SRQLQDSECPNV PGMPVCAS+SSRADDVRFYDALVEGVDYLEHSYANGEE
Sbjct: 61  VFEARFRPMSRQLQDSECPNVHPGMPVCASYSSRADDVRFYDALVEGVDYLEHSYANGEE 120

Query: 506 ECLCNFILLWQHGPNSGNLTFASIANLCQIQFDEINDTVLATFFAKVREKIQTRMNRGGT 565
           ECLCNFILLWQ GPNSGNLT ASIAN+CQIQFD+INDTVLATFF KVREKI+TR NRG  
Sbjct: 121 ECLCNFILLWQRGPNSGNLTIASIANMCQIQFDKINDTVLATFFRKVREKIETRTNRGNI 180

Query: 566 CSEDRLLTHNDGGAHQKDECSLKLKRRLSFFERMDQVDTRRAKRSSGAVGLWEDQQTLSS 625
           CSED   THN GGA QKD+CSLKLK RLSFFERMDQ +TRRAKRSSG V  WEDQ +LSS
Sbjct: 181 CSEDHFPTHNGGGACQKDDCSLKLKHRLSFFERMDQ-ETRRAKRSSGDVEPWEDQLSLSS 240

Query: 626 RKSGVIEQDTDIGGVKYQYMILLENLDKELSPVKLAKFLHAQTLILPRVYIFPSLTFEAY 685
           RK  VIEQDTDIGG+KYQYMILLENLDK L+P+KLAKFL+ +TLILPRVYIFPSLTFE Y
Sbjct: 241 RKREVIEQDTDIGGMKYQYMILLENLDKGLAPLKLAKFLYEETLILPRVYIFPSLTFELY 300

Query: 686 AKGAVVLNCRKNLERLCDFLDNPDHVILSSQGRPLVVTGRIAGRETFGTLAAGAMVLDSE 745
           A+GAVV+NCRKNL+RL DFLD+PDHVILSSQGRPLVVTG+IA  ETFGTLAAGAMVLDSE
Sbjct: 301 ARGAVVMNCRKNLKRLYDFLDSPDHVILSSQGRPLVVTGKIARHETFGTLAAGAMVLDSE 360

Query: 746 NKFGNEKDGRVRCELKVVKVGTNEYLTAKHMKELFMEFLNHQRRLHQRLAMEEGKIYCNG 805
           NKFGNEKDGR  CELKVVKVGT+EYLTAKHMKELF+EFL HQR L QRLAMEE KIYCNG
Sbjct: 361 NKFGNEKDGRASCELKVVKVGTDEYLTAKHMKELFVEFLCHQRGLQQRLAMEESKIYCNG 420

Query: 806 AL 808
           AL
Sbjct: 421 AL 421

BLAST of CaUC02G028810 vs. ExPASy TrEMBL
Match: A0A1S3CR98 (uncharacterized protein LOC103503848 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103503848 PE=4 SV=1)

HSP 1 Score: 711.4 bits (1835), Expect = 4.1e-201
Identity = 353/422 (83.65%), Positives = 381/422 (90.28%), Query Frame = 0

Query: 386 MFSAGAAASGELEFRWDDDAWYNVTVKLEGDDLRISYCEFDKEHDNVFNANHFRSLSELS 445
           MFSAG A+SG+LEF  DDDAWYN  VKL+G  LR+SYCEF +EHDNVF+A+HF+SLSELS
Sbjct: 1   MFSAGDASSGDLEFLSDDDAWYNANVKLQGKVLRVSYCEFSEEHDNVFDADHFQSLSELS 60

Query: 446 DFEARFRPLSRQLQDSECPNVVPGMPVCASHSSRADDVRFYDALVEGVDYLEHSYANGEE 505
            FEARFRP+SRQLQDSECPNV PGMPVCAS+SSRADDVRFYDALVEGVDYLEHSYANGEE
Sbjct: 61  VFEARFRPMSRQLQDSECPNVHPGMPVCASYSSRADDVRFYDALVEGVDYLEHSYANGEE 120

Query: 506 ECLCNFILLWQHGPNSGNLTFASIANLCQIQFDEINDTVLATFFAKVREKIQTRMNRGGT 565
           ECLCNFILLWQ GPNSGNLT ASIAN+CQIQFD+INDTVLATFF KVREKI+TR NRG  
Sbjct: 121 ECLCNFILLWQRGPNSGNLTIASIANMCQIQFDKINDTVLATFFRKVREKIETRTNRGNI 180

Query: 566 CSEDRLLTHNDGGAHQKDECSLKLKRRLSFFERMDQVDTRRAKRSSGAVGLWEDQQTLSS 625
           CSED   THN GGA QKD+CSLKLK RLSFFERMDQ +TRRAKRSSG V  WEDQ +LSS
Sbjct: 181 CSEDHFPTHNGGGACQKDDCSLKLKHRLSFFERMDQ-ETRRAKRSSGDVEPWEDQLSLSS 240

Query: 626 RKSGVIEQDTDIGGVKYQYMILLENLDKELSPVKLAKFLHAQTLILPRVYIFPSLTFEAY 685
           RK  VIEQDTDIGG+KYQYMILLENLDK L+P+KLAKFL+ +TLILPRVYIFPSLTFE Y
Sbjct: 241 RKREVIEQDTDIGGMKYQYMILLENLDKGLAPLKLAKFLYEETLILPRVYIFPSLTFELY 300

Query: 686 AKGAVVLNCRKNLERLCDFLDNPDHVILSSQGRPLVVTGRIAGRETFGTLAAGAMVLDSE 745
           A+GAVV+NCRKNL+RL DFLD+PDHVILSSQGRPLVVTG+IA  ETFGTLAAGAMVLDSE
Sbjct: 301 ARGAVVMNCRKNLKRLYDFLDSPDHVILSSQGRPLVVTGKIARHETFGTLAAGAMVLDSE 360

Query: 746 NKFGNEKDGRVRCELKVVKVGTNEYLTAKHMKELFMEFLNHQRRLHQRLAMEEGKIYCNG 805
           NKFGNEKDGR  CELKVVKVGT+EYLTAKHMKELF+EFL HQR L QRLAMEE KIYCNG
Sbjct: 361 NKFGNEKDGRASCELKVVKVGTDEYLTAKHMKELFVEFLCHQRGLQQRLAMEESKIYCNG 420

Query: 806 AL 808
           AL
Sbjct: 421 AL 421

BLAST of CaUC02G028810 vs. ExPASy TrEMBL
Match: A0A200RAM3 (Indole-3-glycerol-phosphate synthase OS=Macleaya cordata OX=56857 GN=BVC80_9059g89 PE=3 SV=1)

HSP 1 Score: 706.4 bits (1822), Expect = 1.3e-199
Identity = 394/785 (50.19%), Positives = 505/785 (64.33%), Query Frame = 0

Query: 1   MEGLASLRTIPKVSFPPISS--STRRSKFLSRRLNFSPSMDTSLRNSISLASSSSSSPIS 60
           ME  +   T P+VS P IS+   T+RS  + R +N    + T  R SI+L          
Sbjct: 1   MESTSIRATTPRVSVPDISTLKCTQRSLMIRRSMNLGSPLKTG-RKSIAL---------- 60

Query: 61  TAIRAQQIESEAGSAAASPVTESEENALKVKEWEVGMFQDEVAASQGIRIRRRPPTGPPL 120
             IRAQQ E + GS   SP ++  +NALK+KEWEVG FQ+E+AA+QGIRIRRRPPTGPPL
Sbjct: 61  RCIRAQQSELKDGSVTISPESD-YKNALKIKEWEVGRFQEEIAANQGIRIRRRPPTGPPL 120

Query: 121 HY--------------------------------MKERRPLVSLKRDLERAPPARDFLGA 180
           HY                                +KERRPL SLK+ L+ APP RDF+GA
Sbjct: 121 HYVGPFEFRLQNEGNTPRNILEEIVWNKDMEVSQLKERRPLSSLKKALDNAPPVRDFVGA 180

Query: 181 LKAAYLRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDQKFFQGS 240
           L+ ++LRT LP LIAEVKKASPSRGVLREDFDPV+IAQAYEKGGAACLSVLTD+K+FQGS
Sbjct: 181 LRKSHLRTGLPALIAEVKKASPSRGVLREDFDPVKIAQAYEKGGAACLSVLTDEKYFQGS 240

Query: 241 FENLEKIRNAGVKCPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKM 300
           FENLEKIR AG++CPLLCKEF++DAWQIYYAR+KGADAILLIAAVLPDLDI+YMTKICK+
Sbjct: 241 FENLEKIRGAGIECPLLCKEFIIDAWQIYYARTKGADAILLIAAVLPDLDIQYMTKICKI 300

Query: 301 VGLTPLVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERGQKIRKKN 360
           +GL  LVEVHDE+EMDR+L IDGIELIGINNRNLETFEVDISNTK+LLEGERG+ IR+K+
Sbjct: 301 LGLAALVEVHDEREMDRVLGIDGIELIGINNRNLETFEVDISNTKQLLEGERGEIIRQKD 360

Query: 361 VTIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGITGLFEKLPLELASISPAM 420
           + +VGESGLFTPDDIAYVQEAGVKAVLVGESIVKQ+DP KGI+GLFE       S+S   
Sbjct: 361 IIVVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQNDPGKGISGLFE------MSLSTTD 420

Query: 421 FSAGAAASGE-----LEFRWDDDAWYNVTVKLEGDDLRISYCEFDKEHDNVFNANHFRSL 480
            S G   + +     L+FR  DDAWY + + LE + L + Y  F +E++  +NA  F++L
Sbjct: 421 HSTGGNVTSDSPTPILDFRSTDDAWYTILLALENETLTVKYVGFSEEYNEEYNAGDFKNL 480

Query: 481 SELSDFEARFRPLSRQLQDSECPNVVPGMPVCASHSSRADDVRFYDALVEGVDYLEHSYA 540
            ++  FE +FRP S QLQD+EC  +V GM V AS+     D++FY+A+++ + Y  H + 
Sbjct: 481 KQVEFFEEKFRPASVQLQDTECWMLVTGMTVSASYVYSEFDLKFYNAVIDSITYKAHKFD 540

Query: 541 NGEEECLCNFILLWQHGPNSGNLTFASIANLCQIQFDEIN-DTVLATFFAKVREKIQTRM 600
            GEEEC C+F++ W HGP  G+     I ++C++       D  LA+F    REK++   
Sbjct: 541 EGEEECRCSFVVSWLHGPMVGSKESIRIEHICKLTSGSAQIDPTLASFLKMSREKLE--- 600

Query: 601 NRGGTCSEDRLLTHNDGGAHQKDECSLKLKRRLSFFERMDQVDTRRAKRSSGAVGLWEDQ 660
                     L +HN G    +D+CSL                              E  
Sbjct: 601 ----------LASHNSGFI-VEDDCSL------------------------------EGV 660

Query: 661 QTLSSRKSGVIEQDTDIGGVKYQ----------YMILLENLDKELSPVKLAKFLHAQTLI 720
             +  +K+G   QD D+GG              + IL++NL+K+L+P  +  F+H    I
Sbjct: 661 AVIDWKKNG---QDIDMGGKPITAENLREKDNCHFILIDNLEKDLAPRTIVNFIHKHVSI 720

Query: 721 LPRVYIFPSLTFEAYAKGAVVLNCRKNLERLCDFLDNPDHVILSSQGRPLVVTGRIAGRE 736
             + Y+FPSL  E Y +G +V++ ++ L++L DFL NP H+I+SS GRP V+T       
Sbjct: 721 KSQAYVFPSLLSETYTRGIIVVDSKEKLQKLSDFLRNPAHLIMSSSGRPWVMTEEKLKDR 720

BLAST of CaUC02G028810 vs. TAIR 10
Match: AT5G48220.1 (Aldolase-type TIM barrel family protein )

HSP 1 Score: 449.1 bits (1154), Expect = 7.4e-126
Identity = 227/334 (67.96%), Positives = 269/334 (80.54%), Query Frame = 0

Query: 73  AAASPVTESEENAL--KVKEWEVGMFQDEVAASQGIRIRRRPPTGPPLHY---------- 132
           A  S +TE  ++AL  KV E EVGM+Q+EV  SQGIRIRRRPPTGPPLHY          
Sbjct: 40  AQKSGITEGSDSALEAKVSEQEVGMYQNEVVESQGIRIRRRPPTGPPLHYVGPFEFRLQN 99

Query: 133 ----------------------MKERRPLVSLKRDLERAPPARDFLGALKAAYLRTNLPG 192
                                 MKER+PL SLK+ L+  PPA+DF+GAL++A+ RT LPG
Sbjct: 100 EGNTPRNILEEIVWHKDKEVAQMKERKPLYSLKKALDNVPPAKDFIGALRSAHQRTGLPG 159

Query: 193 LIAEVKKASPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDQKFFQGSFENLEKIRNAGV 252
           LIAEVKKASPSRG+LREDF+PVEIAQAYEKGGAACLSVLTD K+F+GS+ENL+ I  AGV
Sbjct: 160 LIAEVKKASPSRGILREDFNPVEIAQAYEKGGAACLSVLTDDKYFKGSYENLQAIMEAGV 219

Query: 253 KCPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKMVGLTPLVEVHDE 312
           KCPLL KEF+V+AWQIYY RSKGADA+LLIA+VLPDLDIKYM KICK++G+  LVEVHDE
Sbjct: 220 KCPLLLKEFIVEAWQIYYGRSKGADAVLLIASVLPDLDIKYMIKICKILGMATLVEVHDE 279

Query: 313 KEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERGQKIRKKNVTIVGESGLFTP 372
           +EMDR+LAI+G+ELIGINNRNLETFEVD+  TK+LLEGERG+ IR+K++ +VGESGLFTP
Sbjct: 280 REMDRVLAIEGVELIGINNRNLETFEVDLGITKKLLEGERGELIRQKDILVVGESGLFTP 339

BLAST of CaUC02G028810 vs. TAIR 10
Match: AT5G48220.3 (Aldolase-type TIM barrel family protein )

HSP 1 Score: 446.4 bits (1147), Expect = 4.8e-125
Identity = 230/353 (65.16%), Positives = 277/353 (78.47%), Query Frame = 0

Query: 57  ISTAIRAQQIESEAGSAA---ASPVTESEENAL--KVKEWEVGMFQDEVAASQGIRIRRR 116
           +S A + Q+   +    A   +S +TE  ++AL  KV E EVGM+Q+EV  SQGIRIRRR
Sbjct: 6   VSIAFQLQEFGIDVRLCALYESSGITEGSDSALEAKVSEQEVGMYQNEVVESQGIRIRRR 65

Query: 117 PPTGPPLHY--------------------------------MKERRPLVSLKRDLERAPP 176
           PPTGPPLHY                                MKER+PL SLK+ L+  PP
Sbjct: 66  PPTGPPLHYVGPFEFRLQNEGNTPRNILEEIVWHKDKEVAQMKERKPLYSLKKALDNVPP 125

Query: 177 ARDFLGALKAAYLRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKGGAACLSVLTD 236
           A+DF+GAL++A+ RT LPGLIAEVKKASPSRG+LREDF+PVEIAQAYEKGGAACLSVLTD
Sbjct: 126 AKDFIGALRSAHQRTGLPGLIAEVKKASPSRGILREDFNPVEIAQAYEKGGAACLSVLTD 185

Query: 237 QKFFQGSFENLEKIRNAGVKCPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKY 296
            K+F+GS+ENL+ I  AGVKCPLL KEF+V+AWQIYY RSKGADA+LLIA+VLPDLDIKY
Sbjct: 186 DKYFKGSYENLQAIMEAGVKCPLLLKEFIVEAWQIYYGRSKGADAVLLIASVLPDLDIKY 245

Query: 297 MTKICKMVGLTPLVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERG 356
           M KICK++G+  LVEVHDE+EMDR+LAI+G+ELIGINNRNLETFEVD+  TK+LLEGERG
Sbjct: 246 MIKICKILGMATLVEVHDEREMDRVLAIEGVELIGINNRNLETFEVDLGITKKLLEGERG 305

Query: 357 QKIRKKNVTIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGITGLF 373
           + IR+K++ +VGESGLFTP+DIA+VQEAGVKAVLVGES++KQSDP K I+ LF
Sbjct: 306 ELIRQKDILVVGESGLFTPEDIAFVQEAGVKAVLVGESLIKQSDPGKAISTLF 358

BLAST of CaUC02G028810 vs. TAIR 10
Match: AT2G04400.1 (Aldolase-type TIM barrel family protein )

HSP 1 Score: 438.7 bits (1127), Expect = 9.9e-123
Identity = 240/406 (59.11%), Positives = 291/406 (71.67%), Query Frame = 0

Query: 1   MEGLASLRTIP-KVSFPPISSSTRRSKFLSRRLNFSPSMDTSLRNSISLASSSSSSPIST 60
           MEGL  ++ +P KV+ P +            R N S S+  S+         +  +P   
Sbjct: 1   MEGLVPVQRLPIKVASPSL-----------YRCNNSVSIRRSISGFAMDRKINFRAPSQF 60

Query: 61  AIRAQQIESEAGSAAASPVTESEENALKVKEWEVGMFQDEVAASQGIRIRRRPPTGPPLH 120
           +IRAQQ + +   A +S   E + N L++KEWEV M+Q+E+A SQGIRIRR+PP+  PL 
Sbjct: 61  SIRAQQSDLKESLAVSSSSVEDKGNVLRIKEWEVEMYQEELAISQGIRIRRKPPSKAPLG 120

Query: 121 Y---------------------------------MKERRPLVSLKRDLERAPPARDFLGA 180
           Y                                 MKE  PL  LK+ +E APP RDF+GA
Sbjct: 121 YSGPFELRLHNNDADSPRNILEEITWYKDVEVSRMKELNPLDVLKKAVEDAPPTRDFVGA 180

Query: 181 LKAAYLRTNLPGLIAEVKKASPSRGVLREDFDPVEIAQAYEKGGAACLSVLTDQKFFQGS 240
           L+ A+ RT  PGLIAEVKKASPSRG+L+E+FDPVEIAQAYEKGGAACLSVLTDQK+FQG 
Sbjct: 181 LRMAHKRTGFPGLIAEVKKASPSRGILKENFDPVEIAQAYEKGGAACLSVLTDQKYFQGG 240

Query: 241 FENLEKIRNAGVKCPLLCKEFVVDAWQIYYARSKGADAILLIAAVLPDLDIKYMTKICKM 300
           FENLE IR+AGVKCPLLCKEFVVD WQIYYAR+KGADA+LLIAAVL DL+I ++ KICK 
Sbjct: 241 FENLEAIRSAGVKCPLLCKEFVVDPWQIYYARTKGADAVLLIAAVLADLEITFLLKICKK 300

Query: 301 VGLTPLVEVHDEKEMDRMLAIDGIELIGINNRNLETFEVDISNTKRLLEGERGQKIRKKN 360
           + L  LVEVHDE+EM R+L I+GIEL+GINNR+LETFEVDISNTK+LLEGE G++IR+++
Sbjct: 301 LSLAALVEVHDEREMGRVLGIEGIELVGINNRSLETFEVDISNTKKLLEGEHGRQIRERD 360

Query: 361 VTIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQSDPTKGITGLF 373
           + +VGESGLFTPDDIAYVQ AGVKAVLVGESIVKQ+DP KGI GLF
Sbjct: 361 MIVVGESGLFTPDDIAYVQAAGVKAVLVGESIVKQNDPEKGIAGLF 395

BLAST of CaUC02G028810 vs. TAIR 10
Match: AT5G48220.2 (Aldolase-type TIM barrel family protein )

HSP 1 Score: 434.9 bits (1117), Expect = 1.4e-121
Identity = 215/310 (69.35%), Positives = 254/310 (81.94%), Query Frame = 0

Query: 95  MFQDEVAASQGIRIRRRPPTGPPLHY--------------------------------MK 154
           M+Q+EV  SQGIRIRRRPPTGPPLHY                                MK
Sbjct: 1   MYQNEVVESQGIRIRRRPPTGPPLHYVGPFEFRLQNEGNTPRNILEEIVWHKDKEVAQMK 60

Query: 155 ERRPLVSLKRDLERAPPARDFLGALKAAYLRTNLPGLIAEVKKASPSRGVLREDFDPVEI 214
           ER+PL SLK+ L+  PPA+DF+GAL++A+ RT LPGLIAEVKKASPSRG+LREDF+PVEI
Sbjct: 61  ERKPLYSLKKALDNVPPAKDFIGALRSAHQRTGLPGLIAEVKKASPSRGILREDFNPVEI 120

Query: 215 AQAYEKGGAACLSVLTDQKFFQGSFENLEKIRNAGVKCPLLCKEFVVDAWQIYYARSKGA 274
           AQAYEKGGAACLSVLTD K+F+GS+ENL+ I  AGVKCPLL KEF+V+AWQIYY RSKGA
Sbjct: 121 AQAYEKGGAACLSVLTDDKYFKGSYENLQAIMEAGVKCPLLLKEFIVEAWQIYYGRSKGA 180

Query: 275 DAILLIAAVLPDLDIKYMTKICKMVGLTPLVEVHDEKEMDRMLAIDGIELIGINNRNLET 334
           DA+LLIA+VLPDLDIKYM KICK++G+  LVEVHDE+EMDR+LAI+G+ELIGINNRNLET
Sbjct: 181 DAVLLIASVLPDLDIKYMIKICKILGMATLVEVHDEREMDRVLAIEGVELIGINNRNLET 240

Query: 335 FEVDISNTKRLLEGERGQKIRKKNVTIVGESGLFTPDDIAYVQEAGVKAVLVGESIVKQS 373
           FEVD+  TK+LLEGERG+ IR+K++ +VGESGLFTP+DIA+VQEAGVKAVLVGES++KQS
Sbjct: 241 FEVDLGITKKLLEGERGELIRQKDILVVGESGLFTPEDIAFVQEAGVKAVLVGESLIKQS 300

BLAST of CaUC02G028810 vs. TAIR 10
Match: AT4G25330.1 (unknown protein; Has 21 Blast hits to 21 proteins in 10 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 21; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 118.6 bits (296), Expect = 2.3e-26
Identity = 66/175 (37.71%), Positives = 96/175 (54.86%), Query Frame = 0

Query: 396 ELEFR-WDDDAWYNVTVKLEGDDLRISYCEFDKEHDNVFNANHFRSLSELSDFEARFRPL 455
           ELEFR  +D+AWY V      D L IS+  F  EHD  + A+ F++  E+ +FE RFR  
Sbjct: 35  ELEFRSAEDEAWYAVEFSDICDALWISFNGFSYEHDEFYPADDFKNSDEIQEFEERFRAC 94

Query: 456 SRQLQDSECPNVVPGMPVCASHSSRADDVRFYDALVEGVDYLEHSY-ANGEEECLCNFIL 515
           S Q+QD ECP V  G  VCA+  S  ++V+FYDA+V  V+  +H     G E C C+F L
Sbjct: 95  SEQMQDIECPKVHEGTQVCATFPSVTEEVKFYDAIVVTVERTKHERDEEGNEICGCDFKL 154

Query: 516 LWQHGPNSGNLTFASIANLCQIQFDEINDTVLATFFAKVREKIQTRMNRGGTCSE 569
            W+ GP    +T A + ++C    D   +  + +F  + R K+      G TC++
Sbjct: 155 FWKQGPWINQVTTAKVGDICLRAKDNRINPKVVSFLKEARRKL-----HGETCNQ 204

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_038888477.1  4.1e-219  90.52         uncharacterized protein LOC120078315 isoform X1 [Benincasa hispida]
TYJ09753.1      3.7e-204  49.88         hypothetical protein E1A91_A11G161700v1 [Gossypium mustelinum]
TYJ09754.1      7.7e-202  49.35         hypothetical protein E1A91_A11G161700v1 [Gossypium mustelinum]
XP_008466441.1  8.5e-201  83.65         PREDICTED: uncharacterized protein LOC103503848 isoform X1 [Cucumis melo] >TYK31...
OVA19762.1      2.7e-199  50.19         Indole-3-glycerol phosphate synthase [Macleaya cordata]

Match Name      E-value   Identity (%)  Description
P49572          1.4e-121  59.11         Indole-3-glycerol phosphate synthase, chloroplastic OS=Arabidopsis thaliana OX=3...
B0JTM2          1.0e-79   61.18         Indole-3-glycerol phosphate synthase OS=Microcystis aeruginosa (strain NIES-843)...
B7K0H0          1.1e-78   54.80         Indole-3-glycerol phosphate synthase OS=Rippkaea orientalis (strain PCC 8801) OX...
B1WQE4          1.8e-76   58.33         Indole-3-glycerol phosphate synthase OS=Crocosphaera subtropica (strain ATCC 511...
Q55508          2.7e-72   50.17         Indole-3-glycerol phosphate synthase OS=Synechocystis sp. (strain PCC 6803 / Kaz...

Match Name      E-value   Identity (%)  Description
A0A5D2X7G9      1.8e-204  49.88         Indole-3-glycerol-phosphate synthase OS=Gossypium mustelinum OX=34275 GN=E1A91_A...
A0A5D2X7H3      3.7e-202  49.35         Indole-3-glycerol-phosphate synthase OS=Gossypium mustelinum OX=34275 GN=E1A91_A...
A0A5D3E5V0      4.1e-201  83.65         SAWADEE domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E567...
A0A1S3CR98      4.1e-201  83.65         uncharacterized protein LOC103503848 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A200RAM3      1.3e-199  50.19         Indole-3-glycerol-phosphate synthase OS=Macleaya cordata OX=56857 GN=BVC80_9059g...

Match Name      E-value   Identity (%)  Description
AT5G48220.1     7.4e-126  67.96         Aldolase-type TIM barrel family protein
AT5G48220.3     4.8e-125  65.16         Aldolase-type TIM barrel family protein
AT2G04400.1     9.9e-123  59.11         Aldolase-type TIM barrel family protein
AT5G48220.2     1.4e-121  69.35         Aldolase-type TIM barrel family protein
AT4G25330.1     2.3e-26   37.71         unknown protein; Has 21 Blast hits to 21 proteins in 10 species: Archae - 0; Bac...

InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL246-FR2) v1
Date Performed: 2022-01-31
IPR Term   IPR Description                                       Source       Source Term     Source Description                               Alignment
IPR013785  Aldolase-type TIM barrel                              GENE3D       3.20.20.70      Aldolase class I                                 coord: 117..378; e-value: 3.2E-95; score: 320.2
IPR013798  Indole-3-glycerol phosphate synthase                  PFAM         PF00218         IGPS                                             coord: 120..369; e-value: 4.1E-77; score: 258.8
IPR013798  Indole-3-glycerol phosphate synthase                  HAMAP        MF_00134_B      IGPS_B                                           coord: 95..373; score: 32.871384
IPR013798  Indole-3-glycerol phosphate synthase                  CDD          cd00331         IGPS                                             coord: 147..371; e-value: 5.42659E-104; score: 317.098
IPR032001  SAWADEE domain                                        PFAM         PF16719         SAWADEE                                          coord: 396..533; e-value: 1.9E-37; score: 128.4
None       No IPR available                                      PANTHER      PTHR22854       TRYPTOPHAN BIOSYNTHESIS PROTEIN                  coord: 121..374
None       No IPR available                                      PANTHER      PTHR22854:SF18  ALDOLASE-TYPE TIM BARREL FAMILY PROTEIN-RELATED  coord: 121..374
None       No IPR available                                      PANTHER      PTHR22854       TRYPTOPHAN BIOSYNTHESIS PROTEIN                  coord: 45..121
None       No IPR available                                      PANTHER      PTHR22854:SF18  ALDOLASE-TYPE TIM BARREL FAMILY PROTEIN-RELATED  coord: 45..121
IPR001468  Indole-3-glycerol phosphate synthase, conserved site  PROSITE      PS00614         IGPS                                             coord: 159..177
IPR011060  Ribulose-phosphate binding barrel                     SUPERFAMILY  51366           Ribulose-phoshate binding barrel                 coord: 119..372
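
The `coord:` values give the start and end of each domain hit on the protein. A minimal sketch of how those ranges could be parsed and compared is shown below; the helper names `parse_coord` and `overlaps` are hypothetical, and the three sample entries are copied from the PFAM and CDD rows of the InterPro table on this page.

```python
import re

# Matches the "coord: start..end" notation used in the Alignment column.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Return (start, end) as integers from a 'coord: a..b' string."""
    match = COORD.search(text)
    if match is None:
        raise ValueError("no coordinate range in %r" % text)
    return int(match.group(1)), int(match.group(2))


def overlaps(a, b):
    """True if two inclusive (start, end) ranges share at least one residue."""
    return a[0] <= b[1] and b[0] <= a[1]


# Sample rows taken from the InterPro table (source term and description only).
hits = {
    "PF00218 IGPS": "coord: 120..369",
    "cd00331 IGPS": "coord: 147..371",
    "PF16719 SAWADEE": "coord: 396..533",
}
ranges = {name: parse_coord(coord) for name, coord in hits.items()}

if __name__ == "__main__":
    for name, (start, end) in ranges.items():
        print(name, start, end, end - start + 1)
```

With these coordinates, the two IGPS hits overlap each other, while the SAWADEE hit (396..533) lies entirely downstream of them. This matches the BLAST results, where the indole-3-glycerol phosphate synthase matches end near query position 373 and the SAWADEE-containing matches start after it.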

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CaUC02G028810.1   CaUC02G028810.1   mRNA

GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0000162      tryptophan biosynthetic process
biological_process  GO:0006568      tryptophan metabolic process
molecular_function  GO:0003682      chromatin binding
molecular_function  GO:0004425      indole-3-glycerol-phosphate synthase activity
molecular_function  GO:0004640      phosphoribosylanthranilate isomerase activity
molecular_function  GO:0003824      catalytic activity