Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GAAACAGTCAGTTTGATGAAATGGAAAAAAGAGTACGCACATTTCTAATCCTCATCTTCTCCATCAATCTTCTTCTTCACGTCACAGCTTCCAAAGTCCAATCTCTCGTACAATCCTACACTCTTCGCTTCTTCAAGTTCAATCCATTTTCATTTTCATCTTCCTCATTCAAAATGTTATTTCCTACCGTTCATTCAATTTTGATTTTTACTTCGTCTGTGATTTCTTTGTATACGCATAGAAGCTGAGGAATCGAGTGATTTTCTTCAGTTCAAGGTCACTGGCTCCGCCTCCGTTTTCTATTTACGGTAAAGCACCCCATCTTTCGAAAGATGTTTTGTAAATTTCGTTTGGTTAATGGGTCTAGAATGCAATGGCGCAGCGGTCAACTATTTCGGTTGCAAGTGTAATTTGATGAATGTAGTTTTGTTGATGTGAGATTTTCAATTTGTTGGACACTAAAGATTGCGCACGTGTTCTGATATTTTGAAATAATGGCGAATTGGTAGAGACTTCTATTTTTCGTTTCCCTCGGCAAAATTGATGTGTAAGAAAACGAATTTGAAGTGAAGTTGAAAAACTGCGACTGGGTTAGTTATAGGAGATTGAACTTTTAAATCTCTGAAGTTCAACGGCTGGCGTAAAGAGGTACCAGCTGTTTGAAAGCTCTTCCTTGTTCCCGAGCTCTGCGCTAATTCTCCTTCCAGTTTCTTCGTTTTACCTGCATTTTATTTATGGGGTGGTTTCTTGGATTAATCATACTTTTTTAGGGTAGCTTAGCTTCGTCATTGATTCCTTGTATACATGTGTCTGCATTTAAGAGCTAGGCAGCTGTATTAGATAAAAGTGATTACAATAAAGTCATCATGTCTCACTTGCCTTGGGCTTTTACTCCCAAGATTCCCCATGTAATAGAGACTAATGGATCTATCAGCCCTCTAGTCATTGTTATCAAGATATCTTCCAACAACTTAGGCATGTTGCAACACCCAACACAAATCAATGATCGGAACCAAAATTTTCCTAGGTGTTAACCAAAGAGAAGCCACAAAGAGGTGACGTATCTTTCAAAAAAGTGTCACGTAAAACATCCTTGCAAATTTTGGGTACATGGAAGGATGTTTACTTTGATATTGCTCGTTAGAATTGATTATTCTTAGGTGAATAATGATTAATGTGGCAAAGTGCTATTTTAATTTATTTATTTTGAGCTTTGGCCTTTCAAATGAGTTTTATGGTGGAGCCGTAGAGATGAGGAAAGTTATCTTAATTCTTATTAGGTGTTCTATAAAATATTGAGCATATTTTTAGCTGAGCCTTGACATTGGGTGGATGGGGGAGGGAAGAGAGAGAGATTGGGAGAGAAAAGAGAAGATATAAACAAATTTGTACTCCAAATTTTTCTTGGATGACTCGATTCTTCATTAATTTTTCATCTTGTTGATGTACAAGAGGCTAATAGTTTCAAAGTACATTAGTGCCGCAATTTATTAAAAACATGAAGTTGGAGAACTATTTCAGTTACGAAAGGACTCCTAGTTATGTTCAATATAGATAATCCGATTAAAATTAGAACTAGGATATTGACCACATTTAAAAATATAAGATTTCTAATGCACCAATGACAGTCAGGCCCCCTTTCTTCCAAGTGTGTAGGCTCATGTTGAGTCTCATTGTTTCTGTAAGGCTTACAAACTAAGTTATCTTACCTTATATTGTACAACATGAAAATATTGCTGCCTACGATCACAATTACTTCACCCTTATATGTATTGACTGAAAGCGATGGACTTTCTACAGATGGCAGAGCATGTGACTGTCACACCATCTATCGAGCTGCAAGTTTGGAGAACTCCCTTCATAGTCAAAAGCTTTGCACCATGCAATTTCAGTTTTAAAAGAGAACAGAGGTTAACTTATTTTGCTTATTTCCTTACCTTCAATTTTATTTTATCTCAAGACGATACTTTTTACATTTCCTACATACATTTAATATTTGCCCTAATTATGTTAGCCATTTTTTGGAATGTCTAGAAGTCTTGATAAGGTTAGGAAGTGACTATTAAAGTTCAAACTTGTTTTTGTGTGCAACAGTCCTGAGACATTAGAAAGAGAAAATAAACTGAACGGTGTGTCTCAGTGGTGATTATCTATGGTTCCTTTTGCTCTCTTAGTGAAAATGTTATTGCATAGTTTCTTTTTAGGGGAGGCTCTCAAAAATGATTTCATCTTTTTGTAAGCTTTGGTTTGTCTTAAGAATAAGTATAAACTGAATGCCTTGTTCTTATTGGTTCTTTATGTGTAGAAAGTACAACTTGATGTTCTTATTTCAACTTTTGTATTCTAGAAAATCCTCTTGTGAGAGCTATAAGTTCATAAGGATCTCAACTTGGAGAAGGCGTGAGCTTAGTGGTTTTTGTGGCTCAAACTTAATTGTAAATCCTGTTCCCAGGAAGACCTTCAGAGAACATGCTTACCTAAGGTCTTTGGTAAACGTTGATGGAACAACAGCCTCTGAGGTACTTTTTGTTGATCAATTGCTTCTGATGACCAGTATATTTCTAACATATATGGCTGGAGTAATACCTTTACCAAAGTCTAATCAACCTGGAAATATCATCTCTCAAACCGATTCAGCCTCAGATAACCCAACCTTTTCTGGTAGGTAAGGGTGCACAGTTTACAAGGTTGAGATATGTTGTCGCTAACTAACATAATTTTCTGTGTGCTGCGTTGTTATTTGATGTGCATATCTGCACTTTATGTCTACTCCTGTTTACTTTGAGGTTTTATATTTTATTTATATATTTTTTTATTGCAGTGGCATGAAGGCTGATGATCAAATTAATCCAAAGCATGCATTAGATGTAGTTAAAGGAAAGATTTTGGATTTTCTAGATGCTTTTGAACGTAGGAGAAGTATGGAAAATGAGGTATTTGAATTTGCAGAATGTCATGCCAAGCAACCTCTAAGCTTGAATGCAATTGCTGAAGGTCCGAGGTTAAGATTGCTTTGGGCTTCTTTTCAACTAATTGAGGAAGAGGCAGGCATCTAACTGTACTGTACTCTCTTGCGACATTATATCCTGCATGTTCTTTTTAATCTTTGGGTGGTTGTTACTTGCTAGTTGCTATGTGGTTATTATATTTGGCATTATGAGATAAATATGCAAACTGGTGTTGAAACTTGAATATTGATGAAAAGAGGAATGTATTTTTCCCTTGTCATTTTAGCCTTCTGGGCATCTTCAACCATGATAAGTTATTTTCTTTAATAGTATTTATTTAATTTTTAATAATTGCCTATTGTTACGAGTTCTAACCTTACATGGGATCGGCAGATTTACAATGAGCTGTCTGTATTTTTTATGGGAGGTTATATTATGATGATATTTGCATAAGTGCTTGCTTCTATTTCTGCTTTTTCCATCACGAGCTTCTCCATCTCTCTATTGCTAGGGAATAATTATTTACAAGAAAGTTTAGCCAAAGGGGACAAATTGGAAGCTAGAAACTTTGCCTAGTTCCATATTTCCTCTTTCCTCCTTGTCTGAGAAGATTCTGTGCATACATCGTAGCCACAATTTCCTAAGCAGGGTTTTCAGTCCATTCATCGAAAGAATTCTGTCTTGTTTTTTAAGTATATGTATGCTAAGTAGTTGAGTGACAAACACTCCTTTTTCCATACTGTTTGAGCGAAGGAAGAATAAATGATCTTGGCATTCTTCCTTTTTGTCTCACAGTACATATCAACTTGGTCTGACCATCATTTTGGGCCACCTTCGTTGTAAGATTTCCCTAGTGTTCATGGATCTAAGAAGAAAATCCCACATAATAAAGAAACTTGGTTTGCTTAGGTAACCTCTTTCTCTAGACTGCTTTCGCCACCTTGATTGGAAATTCCTCTTCTGGTTTCTGTTAAAAGACTACACGGAGATTTTTGACAGAGAAGTTACCCAAATTTAAGGAGCCTCACCCGTTTATCTGGGGTCTCTGTAATTCTTTATTCAAAGTTGCAAATAAAATTTTCAGCTTCTTTGGAACTCCCTGCTTTTCAGCTCAAATATAGGTTCCAGTTCTCAGTCTCACTCCGAGTTGGCATCCACCAAAAGCTTACTCTCCTTGATTTCATGGGATTTTTATTTACCAACTAATTTGATAATTCATTTTTAGGAATTGAAGATTCTCATTATTTAACTTCACTGTTGAAAATTTTGCATCTTTTGTTAATTTGACATCTTTTTTGCAGGTCAACAATATTTCTGATGCTACTATTCAGAACATGGACGATTTGTCTAAAATATTTTCTAAATTTATACAAAAATCCTCTCAACCTGTTTGCATGTCTTGGCTGAAAAGCGAACTGTCTATGGAAAATAATGATTCTAGTAAGGTACTTCTAGGAGCAAATTTTATTTTCTAAATGTGTTTTTTGTCATTTTCTCATAGCTTTACTAGTTAACACAATTGCCTCTTCCCGTCCTTAGGCGTTTCTTTCTTCGATGTCTGAGAAGCTTAAAGCAGAAGACAATATTTTACAAGGAATTAAAAAGTCTGGCAAGGAAGAGCTCTATGCAGAATTGATGCACTTTCTTAGTTTTGGTGCTCGCAGGTATTTTGTTTTCTTACTCTGTCTAAGTGTAGGAAGTACCCTCACATTTCACGAAAGGAATTGTGATACCTAGGCCTTTCTCTTTAGGCAAAGGATATCATAATTTTAACTTTGGTATTCTAATTCTTGCTTGATTGGTAGGCCTTCAATATTGTATCTGTGTTTAATTCGTGGCCTTTGCCATTAGAAAGCTTAGATAAAATCTTGGAAATATGATGATGGTCTTTGTTGGGTAATGTTCGAATGCACTGGTTGAATGCCTCTTTTGTGTTCTTTCATTTCTTCTCAATGAAAGCTCGGTTTCTTAAAAAAGAAAAAAAGCGCACTGGCTGCAGTTACCTGTGGAATTCAGAGTGGGCAGTTGGAAGGTTTGTGTGCTTGGGGATACAAGTTTATGAAAAATTCAAGTTTCACTCAAAGCTATGACGAAGTCAATATAGAAGGAGATATAGCAGAAAAAGAAGCATGGGATTGATGCAGCTAAATTCGGTAGAATTAGGTAACAGAAACTCTGTGAACATGAGATGGAGGGTATTCCTAAGGAAATTAAACCAGTATTGGAGTATTTCATTGGTGTTAGTGTTCTTGAAATAGCATGTGTTGCCTCCTATTAGGCATTGTGATAGATGATTGATTTTGTCTTAGGCGCTGCCCCTATAAACATTCATTCCTATTGTTCTCCACAATTCTAGAAAGAAGAGATCAAAAAGTTGGTGAGGAGATGTTGTTTGGGTTTATGCAACCAAGCTTTGGTTCTTTATCTAAGCCTTATTCTACTTATAAGAAAAAATGACGGCAGGGTCTTGTGTGGACTATTGTGCATTAGACCAATTCATATTTTTTCGAGAAGGGAAACAAATTTCATTGATAAATGAAATAAATGAAGTAAAACTCCAACATCTAAGAGGTGATTACAAGAAAGACTATAGGTCTTCTAAGAAGGAAGGTGTGATTTTCAAAATTGATTTTGAGAAGGCCTACGATCATGTGGACTGGAACTTTCTTGACAAAATCTTGGCTAAAAAGGGCTTTGGATACCGGTGGAGATCGTGGGTTTGGAGTTGTACGAGATCGGTGAGGTATTCGATTCTTGTGAATGGTAGCCCCAAAGGTAAAATCGTGGCGTCTTGAGGCCTTAGGCAAGGTGATCCCCTATCGCCCTTTCTCTTTCTCTTAGTGGTGGATGCTCTTAGTAGAATGATAGCAAAAGGAGTTGAGGGTGGAATTATTGAGGGCTTCAAGGTGGGTAGGGAAGGTGTGTCTCTGTCCCATCTGCAGTTTGCCGATGATACATTATTCTTTTGCTCTAGTAAAGAGGAGTCCTTCCTGATCTTGAACCATATGCTGGGATTCTTTGAGTTAATGTCAGGATTGAAGATCAATAGAAGTAATGCCAAGTGATAGGTTTGAATTGTGAAGATACCAAAGTCGGTAGGTGGGCCTCCTTAGTGGGTTGTGATGTGGGTGTGTTCCCGATCTCCTATTTAGGTCTTTCTCTTGGTCACAATCCTAGGAGTCTTTCTTTCTAGAGTCCTGTGGAAGACAAAGTGAGAAAGAGGCTTGCCTCTTGGAAAAGGAGTTTCTTTTCTAAAGGAGGGAGGTTAACTCTTATTCGATCTGTTTTGAGTGGGATCCCTACTTACTTCTTCTCGTTGTTTAGAGCTCCCCAGAAGGTGTGTAGGAGAATGGAGAAGCTCATGAGGGATTTCTTATGGAAGGGGGGAGAGGGAGGTAAAGGGGCCCATCTAGTGAGTTGGGAGGTTGTGGGTAAGCCGATTAGTTTGGGAGGTTTAGAATTGGGGAATCTTAAGGTTTGCAACAAAGCCCTGTTAGCTAAATGGCTCTGGCGGTTCCCCCTAGAGTCTTCTTCCTTTTGGCATAGGATCATTGTGAGCAAATACGGTCCTCATCCTAATGAGTGACTGACGAGTGGGGGTAAAGGCACTTTCAGAAATCCATGGAAAGATATAGCTCTCGAGATTCCTACCTTTTCTAGCCTTGTTCATTGTTTGGTGGGGGACGGGAAGGATACGTATTTCTGGGAAGATAGATGGTTGGGGGATAAACCCCTATGTCTATCTTTCCCTCGTTTGTTCTATTTATCCTCCATGAAAAACCGTTCTGTGGCTGATGTTTTGTTTCATTCAGGGAGCTCTCCTTCGTTTTCTTTTGGCTTCAGTCGTCCGTTGTCCAATAGGGAAACGACGAATGTTATGTCCCTTATGTCTTTGATTGAGGAGTTTGATTTCAGGGTGGGGAGGAGGGATATTCGTTGTTGGAGCCCTAGTCCTTCTGCCGGGTTCTCTTGTAGCTCCCTTTTCCGTTGGCTCTTGCTCCCTTCTCCCCAAAGTGAGTCTATTTTTTCGCGTGTGTGGAAGGTTAAAGTCCCGAAGAAGATCAGGTTTTTTATGTGGCAAGTTATCCATGGTAGGGTTAATACCTATGATCGTCTGTTGAAAAGGATGCCTTCTTTGGTTGGTCCGTTTTGCTGTATTCTGTGTCGGAAGGCGGAGGAAGATCTGGATCATATTCTGTGGAGTTGTGCCTTTGCGCGTGCTGTGTGAGATCGATTTGGTCAGGTTTTTGGGTTGCAAGGGATATCTTTTGTTGACCACAGGCAAATGATTGAGGAGTTCCTCCTCCATCCGTCATTTCGAGAGAGAGCGAGATTTTTGTGGCTCAGGGGGATGTGTGCTTTATTGTGGAATCTTTGGGGTGAAAGGAACAATAGAGTGTTTAGGGAGAGAGAGAGGGATCCTGAGGATGTGTGGTCCCTCACTCGTTATCATGTTTCTCTTTGGGCTTCGGTCTCTAAGATGTTTTGTAATTATCCGTTAAGTTGTATTTTGCTTGATTGGGGTCCCTTTCTCTAGTTGGACTCCTTTTTGTGGGCTTGTTTTTTGTATGTCCGTGTATTCTTTCATTTTTTCTCAATGAAAGTTGGTTTTTCTATAAAAAAAAAAGAAAGATCTCCAGTTTAGAAATTAGACCAATTCATATTACCTAATAAGTTTCTAATTCTGGTCATGGAGGAGTTACTGAATGAATTGTATGGAGCAACCATGTTCTTGAAGATCAATTAAGTCAAGATGCCACCAGATCTGGCTAAAGCTAGGGGATGTGCATAAAATGGTTTTCTAACCCATGAAGGTCACAGTGAGTTCTTACTCTTAGTAATGCCCTTACAGCGCGGCTTCTACCTTCCTGTCCATTATGAATGATGGGTTGTGCCCATTCCCGCAGAAAGTTGCCTTGGTTTTCTTTGATGATTTCTTGTGTATGGTGCTACCATTCAGGACATCAAGGTTCACTTAATGGTTGTGTTAGTTCTTGGTTTGGGGTTTATTGTTAGAAAGAAATCAATGTGTTTTTGAAGGTAAAAAGATAGTCGAGAGGAAAGATTTGATCTCATCAATCAAGTTTTCACATAGCAGTTATTCTTCTTCAGAAATTACTTTTCAGTTGGAAAGCTTTATTGAGTTGGAGTTTTGATTGAAATGTGTATTGGCTCTTCCATGAAGGTATTTTACTTCTCTAGAAAGAACTGTTTGGAGCCAGCTGGTTGGTCAAATAAAAATTGACAGCGAATGTAGTAGAGTAAATTTGAAACTAACGGCGTGCGAAAATTTCCTTGCATTTTTCTTTAGAATTTCTCTATCTTCTGGAAATTGTTATCAGCTTAGGACATGTTGGTTTGGTTGTTATGTCACGTGTAGTTACTAGTTTCCTAAGTTACGTGTTTATTACTTGTGTTACATGTCCCAGAAGCCCACTCTTTGGTCGGACAATCCTGAAGCGTGAAGGTGGGGAGAGTTGATCTCTTGAAGGTGCTTCTGTTTTTATACCAGTATTGGAATCAAACCTTTTATGGAGATGGATTGATGAATCCCTGTATTACTTGTTACGTGCCATCCTTGAGATTTTGGCACAGTACTTTAAGGAGATTGCTTGTTATTTTTTTTCAAATATTTAATATGGACTCAGACTAGATTCATATAACTCTTCATAACTTGTCTTATAACTGGGCTTTTTTTTAGCAGGGATTATTGCTATTATGACCATAGCCTGTACGTCAAGCATGGGATTTCAATATTAGAAGATTTGCTAATAACCTTTGCTGACGGGATTGCAAGTATGTATCTGGAATTTATTTCTGTTGACAGCACTTTCTTCGATGAAGTGGATAACATTGGCCTGGCATTGTGTACCCTATCAACACGGGCACTCCAAAGGTTGCGTAATGAGGTACTCTTTTTTTTCACTGGGGGTTCATATATAGTTATGAACCCTTATAGCTCCTTTAATTTATCTTTTCATGAAAATTTTGGGAACTCAAAGTGTTGTTTTTTCCTAATATTTTAGAGCCAAAGAGTGGAAAAAAAAATCCAGTCGATTGTCATTTTGGAAACAAAAACCACAATACTTCACTGTCTGGTGTTTGATTATTTTGTTCTTGCACAACATTTTTCTCTAGCTTTATAATTTGTCTGCAATTTCCCTGTTAACGTTTTTCTGCTTCGAAGATGTATGAAGAGACTTAATGCAATTGACCAATTTATCGAAACAAAATACTTCCCTTCATTTTTGACTTCCCCCTTTTAGTCGCAGAAGGCTAATTTATCGCTTTAAATTCCCTCGTGGAAATCTCTAGGTGATTTGGGGACAAAGAATGGGTTTAGGGTAACAATAATATGAGTGGTATTGGTTTAAGTATGAAAGAAATACATTAGTCAGATTAAAATCTGATGGTAAACAGAAATGGATGATTTCAATTCTTCAAGATTGCATAGTATAGTTTTTGCAGCAGGAATAGACTGCTCTAGATTATTTTCTGCTGCAAAATTATTTTCATATATTGGAGAAAATTTTAAGGGTGAATCTTGTTGATAGGTAGCTATGAACCAATGGTTGTATCAAAACGTCGAGGCAATTGTATCGATGTATGAAGACCGATTTGATCTATGTACACTTGGTAGTCAACAGATTGAGATACCAGGCAGTAGACAGGTCAATATTGATAATTGGTGGATGAAACATATCCTCAGAAGAACTGAAACTTTGTCCTCTCAGTTACATTATGTTGTGATACGCTCCTTCTCCATGCCTGTAAAGCGGACCAAGGAGTTGAGAGCTTTAAGGGGATGGTACGTTCTTTCAATCTAATCCATATGATTCTCTAATTTTCATGCACAATGCTCAAGTCCAATTCTGAGGTGGACAGTCACTTTTCTGTAACAGATTCTGAAATAGCTTTAACAGGAGAAATCTGTTGCATGCTTTGATCATAGTAGTCGAACTGTTTTGCTGAAGAGAAAAGAATTCCAATGATACTTTTGGTGGACCTTGATATAATATGGGTTAAATAACAAATTTAGTTCTTGAAGTTTGATAGTTGTGTTTATTTAGCCTTTGAACTCTACAAATTTTAATTTTAATTCTAATAACTTTTTGTTGTTAACATCGTTGTTTAAATATTAATGTAACGTGTTAACTGAACTAACAGCTAATAGTTAAAGTTCGTTGGAACCATTTGACTAGTTGTGTGGAAGGTGTTATATTGGGTTAGAAGGGAGTATATGATAAAGATGTTGAGTAGTGTTCTGAAGGGAGGTATTGGGATTTGGTCCCTCTTTTTGTACATGTTTAGTCAATAAAGTTTCACTTTGACTATGATCACTTTGGTTTATACCTGATGTTTTATGGATTGTGTGGGTTGAGAGGAATAATAGAATCTTTAGAGAGCTTGAGAGGACATCAATGGATGGTTGGTTTCTTGTAAGGTTTTATATTTCTCTAGGCTTAGATGGCAAAGTTTTTTTTCCCCTTCTTCTTAATTATCCTTGAAGTCTCATTTTACTTGATTGAAGCCTCTTTTTATAGCTTGACCTCCTTTTTTTGTGGGCTTTGTTTTTTTGCATGTCATTGTATTCTTTCATTTTTTTTTCTCCATAAAAGTCAGTTTATTCATAAAAAAAATCACTTTGATTCATACATATGCTTTCTCTTGTGCATAATGAGTTCCCTAATGATAAAGAGGTCAATCCATCGATGGTGTTCTCCATGTTATCTATAAGATTTATTGTCTTTGGTGACTTAAATTAACAGAATTGTAATTTCATTACTGCAGGAGGTATTACTTCAGCCTGTTGATTGAATTATCCGACATTACGATGCCATTGATAAGAGTAGTAATCGATAAAATCAGTAGCGGAATATCATTCTTTCTAGTTTGCCTGATTGGAAGATCTTTAGGGCTCATCTATACAGGAATTAGGCAGTCACTAAGGTGGAAGTAATTGATAATTCTGATTTCTATTTGGTTTTTTCTCCATTATTTTGGGTTCCTTCTCATTTTTGGTGTATCAACTTGAATTTTGGTAGTTTTAACAATTTTTTAGAGTTTATCCATTGCAATATGAGATAGCATAGAATGTAGAAGTACATATTTTTGCTTTCTATCATATGAATTTCACTGAAAATCCATATTGCAGATTGAACTTAT
mRNA sequence
GAAACAGTCAGTTTGATGAAATGGAAAAAAGAGTACGCACATTTCTAATCCTCATCTTCTCCATCAATCTTCTTCTTCACGTCACAGCTTCCAAAGTCCAATCTCTCAAGCTGAGGAATCGAGTGATTTTCTTCAGTTCAAGGTCACTGGCTCCGCCTCCGTTTTCTATTTACGATGGCAGAGCATGTGACTGTCACACCATCTATCGAGCTGCAAGTTTGGAGAACTCCCTTCATAGTCAAAAGCTTTGCACCATGCAATTTCAGTTTTAAAAGAGAACAGAGAAAATCCTCTTGTGAGAGCTATAAGTTCATAAGGATCTCAACTTGGAGAAGGCGTGAGCTTAGTGGTTTTTGTGGCTCAAACTTAATTGTAAATCCTGTTCCCAGGAAGACCTTCAGAGAACATGCTTACCTAAGGTCTTTGGTAAACGTTGATGGAACAACAGCCTCTGAGGTACTTTTTGTTGATCAATTGCTTCTGATGACCAGTATATTTCTAACATATATGGCTGGAGTAATACCTTTACCAAAGTCTAATCAACCTGGAAATATCATCTCTCAAACCGATTCAGCCTCAGATAACCCAACCTTTTCTGGTAGTGGCATGAAGGCTGATGATCAAATTAATCCAAAGCATGCATTAGATGTAGTTAAAGGAAAGATTTTGGATTTTCTAGATGCTTTTGAACGTAGGAGAAGTATGGAAAATGAGGTATTTGAATTTGCAGAATGTCATGCCAAGCAACCTCTAAGCTTGAATGCAATTGCTGAAGGTCCGAGGTTAAGATTGCTTTGGGCTTCTTTTCAACTAATTGAGGAAGAGGTCAACAATATTTCTGATGCTACTATTCAGAACATGGACGATTTGTCTAAAATATTTTCTAAATTTATACAAAAATCCTCTCAACCTGTTTGCATGTCTTGGCTGAAAAGCGAACTGTCTATGGAAAATAATGATTCTAGTAAGGCGTTTCTTTCTTCGATGTCTGAGAAGCTTAAAGCAGAAGACAATATTTTACAAGGAATTAAAAAGTCTGGCAAGGAAGAGCTCTATGCAGAATTGATGCACTTTCTTAGTTTTGGTGCTCGCAGGGATTATTGCTATTATGACCATAGCCTGTACGTCAAGCATGGGATTTCAATATTAGAAGATTTGCTAATAACCTTTGCTGACGGGATTGCAAGTATGTATCTGGAATTTATTTCTGTTGACAGCACTTTCTTCGATGAAGTGGATAACATTGGCCTGGCATTGTGTACCCTATCAACACGGGCACTCCAAAGGTTGCGTAATGAGGTAGCTATGAACCAATGGTTGTATCAAAACGTCGAGGCAATTGTATCGATGTATGAAGACCGATTTGATCTATGTACACTTGGTAGTCAACAGATTGAGATACCAGGCAGTAGACAGGTCAATATTGATAATTGGTGGATGAAACATATCCTCAGAAGAACTGAAACTTTGTCCTCTCAGTTACATTATGTTGTGATACGCTCCTTCTCCATGCCTGTAAAGCGGACCAAGGAGTTGAGAGCTTTAAGGGGATGGAGGTATTACTTCAGCCTGTTGATTGAATTATCCGACATTACGATGCCATTGATAAGAGTAGTAATCGATAAAATCAGTAGCGGAATATCATTCTTTCTAGTTTGCCTGATTGGAAGATCTTTAGGGCTCATCTATACAGGAATTAGGCAGTCACTAAGGTGGAAGTAATTGATAATTCTGATTTCTATTTGGTTTTTTCTCCATTATTTTGGGTTCCTTCTCATTTTTGGTGTATCAACTTGAATTTTGGTAGTTTTAACAATTTTTTAGAGTTTATCCATTGCAATATGAGATAGCATAGAATGTAGAAGTACATATTTTTGCTTTCTATCATATGAATTTCACTGAAAATCCATATTGCAGATTGAACTTAT
Coding sequence (CDS)
ATGGCAGAGCATGTGACTGTCACACCATCTATCGAGCTGCAAGTTTGGAGAACTCCCTTCATAGTCAAAAGCTTTGCACCATGCAATTTCAGTTTTAAAAGAGAACAGAGAAAATCCTCTTGTGAGAGCTATAAGTTCATAAGGATCTCAACTTGGAGAAGGCGTGAGCTTAGTGGTTTTTGTGGCTCAAACTTAATTGTAAATCCTGTTCCCAGGAAGACCTTCAGAGAACATGCTTACCTAAGGTCTTTGGTAAACGTTGATGGAACAACAGCCTCTGAGGTACTTTTTGTTGATCAATTGCTTCTGATGACCAGTATATTTCTAACATATATGGCTGGAGTAATACCTTTACCAAAGTCTAATCAACCTGGAAATATCATCTCTCAAACCGATTCAGCCTCAGATAACCCAACCTTTTCTGGTAGTGGCATGAAGGCTGATGATCAAATTAATCCAAAGCATGCATTAGATGTAGTTAAAGGAAAGATTTTGGATTTTCTAGATGCTTTTGAACGTAGGAGAAGTATGGAAAATGAGGTATTTGAATTTGCAGAATGTCATGCCAAGCAACCTCTAAGCTTGAATGCAATTGCTGAAGGTCCGAGGTTAAGATTGCTTTGGGCTTCTTTTCAACTAATTGAGGAAGAGGTCAACAATATTTCTGATGCTACTATTCAGAACATGGACGATTTGTCTAAAATATTTTCTAAATTTATACAAAAATCCTCTCAACCTGTTTGCATGTCTTGGCTGAAAAGCGAACTGTCTATGGAAAATAATGATTCTAGTAAGGCGTTTCTTTCTTCGATGTCTGAGAAGCTTAAAGCAGAAGACAATATTTTACAAGGAATTAAAAAGTCTGGCAAGGAAGAGCTCTATGCAGAATTGATGCACTTTCTTAGTTTTGGTGCTCGCAGGGATTATTGCTATTATGACCATAGCCTGTACGTCAAGCATGGGATTTCAATATTAGAAGATTTGCTAATAACCTTTGCTGACGGGATTGCAAGTATGTATCTGGAATTTATTTCTGTTGACAGCACTTTCTTCGATGAAGTGGATAACATTGGCCTGGCATTGTGTACCCTATCAACACGGGCACTCCAAAGGTTGCGTAATGAGGTAGCTATGAACCAATGGTTGTATCAAAACGTCGAGGCAATTGTATCGATGTATGAAGACCGATTTGATCTATGTACACTTGGTAGTCAACAGATTGAGATACCAGGCAGTAGACAGGTCAATATTGATAATTGGTGGATGAAACATATCCTCAGAAGAACTGAAACTTTGTCCTCTCAGTTACATTATGTTGTGATACGCTCCTTCTCCATGCCTGTAAAGCGGACCAAGGAGTTGAGAGCTTTAAGGGGATGGAGGTATTACTTCAGCCTGTTGATTGAATTATCCGACATTACGATGCCATTGATAAGAGTAGTAATCGATAAAATCAGTAGCGGAATATCATTCTTTCTAGTTTGCCTGATTGGAAGATCTTTAGGGCTCATCTATACAGGAATTAGGCAGTCACTAAGGTGGAAGTAA
Protein sequence
MAEHVTVTPSIELQVWRTPFIVKSFAPCNFSFKREQRKSSCESYKFIRISTWRRRELSGFCGSNLIVNPVPRKTFREHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMAGVIPLPKSNQPGNIISQTDSASDNPTFSGSGMKADDQINPKHALDVVKGKILDFLDAFERRRSMENEVFEFAECHAKQPLSLNAIAEGPRLRLLWASFQLIEEEVNNISDATIQNMDDLSKIFSKFIQKSSQPVCMSWLKSELSMENNDSSKAFLSSMSEKLKAEDNILQGIKKSGKEELYAELMHFLSFGARRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSTFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLCTLGSQQIEIPGSRQVNIDNWWMKHILRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRVVIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK
Homology
BLAST of Lcy05g009360 vs. ExPASy TrEMBL
Match:
A0A6J1DDE2 (uncharacterized protein LOC111019333 isoform X3 OS=Momordica charantia OX=3673 GN=LOC111019333 PE=4 SV=1)
HSP 1 Score: 911.8 bits (2355), Expect = 1.3e-261
Identity = 464/515 (90.10%), Postives = 480/515 (93.20%), Query Frame = 0
Query: 1 MAEHVTVTPSIELQVWRTPFIVKSFAPCNFSFKREQRKSSCESYKFIRISTWRRRELSGF 60
MAEHV VTP I+LQ+ RTPF +KS PCNFSFK EQRKSSCE+ KFIRIS WRR +LSGF
Sbjct: 1 MAEHVAVTPCIKLQIRRTPFKMKSSTPCNFSFKIEQRKSSCENNKFIRISAWRRCQLSGF 60
Query: 61 CGSNLIVNPVPRKTFREHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMAGVIPLPK 120
GS LIVNP PRKTFREHAYLRSLVNVDGT ASEVL VDQLLLM SIFLTYMAGVIP+PK
Sbjct: 61 GGSKLIVNPAPRKTFREHAYLRSLVNVDGTAASEVLIVDQLLLMISIFLTYMAGVIPVPK 120
Query: 121 SNQPGNIISQTDSASDNPTFSGSGMKADDQINPKHALDVVKGKILDFLDAFERRRSMENE 180
SNQPG+IIS T +ASDNPTFSGSGMK +DQINPK+AL VVKGKILDFLDAFERR+SMENE
Sbjct: 121 SNQPGSIISHTSAASDNPTFSGSGMKTEDQINPKNALHVVKGKILDFLDAFERRKSMENE 180
Query: 181 VFEFAECHAKQPLSLNAIAEGPRLRLLWASFQLIEEEVNNISDATIQNMDDLSKIFSKFI 240
VFEFAECH KQPLSLNAIAEGPRLRLLWASFQLIEEEVNNIS+ TIQNMDDLSKIFSKFI
Sbjct: 181 VFEFAECHVKQPLSLNAIAEGPRLRLLWASFQLIEEEVNNISNTTIQNMDDLSKIFSKFI 240
Query: 241 QKSSQPVCMSWLKSELSMENNDSSKAFLSSMSEKLKAEDNILQGIKKSGKEELYAELMHF 300
QKSS PVC SWLK ELSME NDSSKAFLS MSEKLKAEDNILQGIKKSGKEELYAELMHF
Sbjct: 241 QKSSHPVCRSWLKKELSMEKNDSSKAFLSLMSEKLKAEDNILQGIKKSGKEELYAELMHF 300
Query: 301 LSFGARRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSTFFDEVDNIGLA 360
LSFGARRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDS+F DEVDN+GLA
Sbjct: 301 LSFGARRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSSFSDEVDNVGLA 360
Query: 361 LCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLCTLGSQQIEIPGSRQVNIDNW 420
LC LSTRALQRLRNEV MNQWLYQNVEAIVSMYEDRFDLCTLGSQ IE+PGSRQ IDNW
Sbjct: 361 LCNLSTRALQRLRNEVVMNQWLYQNVEAIVSMYEDRFDLCTLGSQLIELPGSRQAKIDNW 420
Query: 421 WMKHILRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRV 480
WM+ LRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRV
Sbjct: 421 WMRQFLRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRV 480
Query: 481 VIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK 516
VIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK
Sbjct: 481 VIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK 515
BLAST of Lcy05g009360 vs. ExPASy TrEMBL
Match:
A0A6J1DC25 (uncharacterized protein LOC111019333 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111019333 PE=4 SV=1)
HSP 1 Score: 911.8 bits (2355), Expect = 1.3e-261
Identity = 464/515 (90.10%), Postives = 480/515 (93.20%), Query Frame = 0
Query: 1 MAEHVTVTPSIELQVWRTPFIVKSFAPCNFSFKREQRKSSCESYKFIRISTWRRRELSGF 60
MAEHV VTP I+LQ+ RTPF +KS PCNFSFK EQRKSSCE+ KFIRIS WRR +LSGF
Sbjct: 6 MAEHVAVTPCIKLQIRRTPFKMKSSTPCNFSFKIEQRKSSCENNKFIRISAWRRCQLSGF 65
Query: 61 CGSNLIVNPVPRKTFREHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMAGVIPLPK 120
GS LIVNP PRKTFREHAYLRSLVNVDGT ASEVL VDQLLLM SIFLTYMAGVIP+PK
Sbjct: 66 GGSKLIVNPAPRKTFREHAYLRSLVNVDGTAASEVLIVDQLLLMISIFLTYMAGVIPVPK 125
Query: 121 SNQPGNIISQTDSASDNPTFSGSGMKADDQINPKHALDVVKGKILDFLDAFERRRSMENE 180
SNQPG+IIS T +ASDNPTFSGSGMK +DQINPK+AL VVKGKILDFLDAFERR+SMENE
Sbjct: 126 SNQPGSIISHTSAASDNPTFSGSGMKTEDQINPKNALHVVKGKILDFLDAFERRKSMENE 185
Query: 181 VFEFAECHAKQPLSLNAIAEGPRLRLLWASFQLIEEEVNNISDATIQNMDDLSKIFSKFI 240
VFEFAECH KQPLSLNAIAEGPRLRLLWASFQLIEEEVNNIS+ TIQNMDDLSKIFSKFI
Sbjct: 186 VFEFAECHVKQPLSLNAIAEGPRLRLLWASFQLIEEEVNNISNTTIQNMDDLSKIFSKFI 245
Query: 241 QKSSQPVCMSWLKSELSMENNDSSKAFLSSMSEKLKAEDNILQGIKKSGKEELYAELMHF 300
QKSS PVC SWLK ELSME NDSSKAFLS MSEKLKAEDNILQGIKKSGKEELYAELMHF
Sbjct: 246 QKSSHPVCRSWLKKELSMEKNDSSKAFLSLMSEKLKAEDNILQGIKKSGKEELYAELMHF 305
Query: 301 LSFGARRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSTFFDEVDNIGLA 360
LSFGARRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDS+F DEVDN+GLA
Sbjct: 306 LSFGARRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSSFSDEVDNVGLA 365
Query: 361 LCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLCTLGSQQIEIPGSRQVNIDNW 420
LC LSTRALQRLRNEV MNQWLYQNVEAIVSMYEDRFDLCTLGSQ IE+PGSRQ IDNW
Sbjct: 366 LCNLSTRALQRLRNEVVMNQWLYQNVEAIVSMYEDRFDLCTLGSQLIELPGSRQAKIDNW 425
Query: 421 WMKHILRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRV 480
WM+ LRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRV
Sbjct: 426 WMRQFLRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRV 485
Query: 481 VIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK 516
VIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK
Sbjct: 486 VIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK 520
BLAST of Lcy05g009360 vs. ExPASy TrEMBL
Match:
A0A6J1DCX7 (uncharacterized protein LOC111019333 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111019333 PE=4 SV=1)
HSP 1 Score: 911.8 bits (2355), Expect = 1.3e-261
Identity = 464/515 (90.10%), Postives = 480/515 (93.20%), Query Frame = 0
Query: 1 MAEHVTVTPSIELQVWRTPFIVKSFAPCNFSFKREQRKSSCESYKFIRISTWRRRELSGF 60
MAEHV VTP I+LQ+ RTPF +KS PCNFSFK EQRKSSCE+ KFIRIS WRR +LSGF
Sbjct: 16 MAEHVAVTPCIKLQIRRTPFKMKSSTPCNFSFKIEQRKSSCENNKFIRISAWRRCQLSGF 75
Query: 61 CGSNLIVNPVPRKTFREHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMAGVIPLPK 120
GS LIVNP PRKTFREHAYLRSLVNVDGT ASEVL VDQLLLM SIFLTYMAGVIP+PK
Sbjct: 76 GGSKLIVNPAPRKTFREHAYLRSLVNVDGTAASEVLIVDQLLLMISIFLTYMAGVIPVPK 135
Query: 121 SNQPGNIISQTDSASDNPTFSGSGMKADDQINPKHALDVVKGKILDFLDAFERRRSMENE 180
SNQPG+IIS T +ASDNPTFSGSGMK +DQINPK+AL VVKGKILDFLDAFERR+SMENE
Sbjct: 136 SNQPGSIISHTSAASDNPTFSGSGMKTEDQINPKNALHVVKGKILDFLDAFERRKSMENE 195
Query: 181 VFEFAECHAKQPLSLNAIAEGPRLRLLWASFQLIEEEVNNISDATIQNMDDLSKIFSKFI 240
VFEFAECH KQPLSLNAIAEGPRLRLLWASFQLIEEEVNNIS+ TIQNMDDLSKIFSKFI
Sbjct: 196 VFEFAECHVKQPLSLNAIAEGPRLRLLWASFQLIEEEVNNISNTTIQNMDDLSKIFSKFI 255
Query: 241 QKSSQPVCMSWLKSELSMENNDSSKAFLSSMSEKLKAEDNILQGIKKSGKEELYAELMHF 300
QKSS PVC SWLK ELSME NDSSKAFLS MSEKLKAEDNILQGIKKSGKEELYAELMHF
Sbjct: 256 QKSSHPVCRSWLKKELSMEKNDSSKAFLSLMSEKLKAEDNILQGIKKSGKEELYAELMHF 315
Query: 301 LSFGARRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSTFFDEVDNIGLA 360
LSFGARRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDS+F DEVDN+GLA
Sbjct: 316 LSFGARRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSSFSDEVDNVGLA 375
Query: 361 LCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLCTLGSQQIEIPGSRQVNIDNW 420
LC LSTRALQRLRNEV MNQWLYQNVEAIVSMYEDRFDLCTLGSQ IE+PGSRQ IDNW
Sbjct: 376 LCNLSTRALQRLRNEVVMNQWLYQNVEAIVSMYEDRFDLCTLGSQLIELPGSRQAKIDNW 435
Query: 421 WMKHILRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRV 480
WM+ LRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRV
Sbjct: 436 WMRQFLRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRV 495
Query: 481 VIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK 516
VIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK
Sbjct: 496 VIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK 530
BLAST of Lcy05g009360 vs. ExPASy TrEMBL
Match:
A0A6J1JVH5 (uncharacterized protein LOC111488215 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111488215 PE=4 SV=1)
HSP 1 Score: 907.9 bits (2345), Expect = 1.9e-260
Identity = 463/515 (89.90%), Postives = 484/515 (93.98%), Query Frame = 0
Query: 1 MAEHVTVTPSIELQVWRTPFIVKSFAPCNFSFKREQRKSSCESYKFIRISTWRRRELSGF 60
MAE+V V P I+LQ+ RTPF KS APC+FSFKRE+RKSSC SYKF RISTWRRR LSGF
Sbjct: 1 MAENVVVAPCIKLQIGRTPFEAKSSAPCSFSFKREERKSSCGSYKFTRISTWRRRALSGF 60
Query: 61 CGSNLIVNPVPRKTFREHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMAGVIPLPK 120
GSNLIV+P PRK FREHA LRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMAGVIP+PK
Sbjct: 61 RGSNLIVSPAPRKIFREHACLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMAGVIPVPK 120
Query: 121 SNQPGNIISQTDSASDNPTFSGSGMKADDQINPKHALDVVKGKILDFLDAFERRRSMENE 180
SNQPGNIIS T+SASDNPTFSGSGMK DDQIN K+ALDVVKGKILDFLDAFE R+S+ENE
Sbjct: 121 SNQPGNIISNTNSASDNPTFSGSGMKTDDQINSKYALDVVKGKILDFLDAFEHRKSVENE 180
Query: 181 VFEFAECHAKQPLSLNAIAEGPRLRLLWASFQLIEEEVNNISDATIQNMDDLSKIFSKFI 240
V EFAE HAKQPLSLNAI EGPRLRLLWASFQLIEEEVNN+S ATIQNMDDLS IFSKFI
Sbjct: 181 VLEFAESHAKQPLSLNAIGEGPRLRLLWASFQLIEEEVNNLSTATIQNMDDLSIIFSKFI 240
Query: 241 QKSSQPVCMSWLKSELSMENNDSSKAFLSSMSEKLKAEDNILQGIKKSGKEELYAELMHF 300
QKSSQPVCMSWLK+ELSM+NNDSSKAFLS MSEKLKAEDNIL GIKKSGKEELYAELMHF
Sbjct: 241 QKSSQPVCMSWLKNELSMKNNDSSKAFLSLMSEKLKAEDNILPGIKKSGKEELYAELMHF 300
Query: 301 LSFGARRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSTFFDEVDNIGLA 360
LSFG RRDYCYYD+SL+VKHGISILEDLLITFADGIASMYLEFISVDS+FFDEVDNIGLA
Sbjct: 301 LSFGPRRDYCYYDYSLFVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLA 360
Query: 361 LCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLCTLGSQQIEIPGSRQVNIDNW 420
LCTLSTRALQRLRNEVAMNQWLYQN+EAIVSMYEDRFDLCTL SQQIE+PGSRQ NIDNW
Sbjct: 361 LCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRFDLCTLSSQQIELPGSRQANIDNW 420
Query: 421 WMKHILRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRV 480
WMKHILRR ETLSS+L YVVI SF+MPVKRTKELRALRGWRYYFSLLIELSDITMPLIRV
Sbjct: 421 WMKHILRRRETLSSELRYVVIDSFAMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRV 480
Query: 481 VIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK 516
VIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK
Sbjct: 481 VIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK 515
BLAST of Lcy05g009360 vs. ExPASy TrEMBL
Match:
A0A6J1GQ87 (uncharacterized protein LOC111456110 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111456110 PE=4 SV=1)
HSP 1 Score: 907.1 bits (2343), Expect = 3.3e-260
Identity = 462/515 (89.71%), Postives = 484/515 (93.98%), Query Frame = 0
Query: 1 MAEHVTVTPSIELQVWRTPFIVKSFAPCNFSFKREQRKSSCESYKFIRISTWRRRELSGF 60
MAE+V V P I+LQ+ RTPF KS APC+FSFKREQR+SSC SYKF RISTWRRR LSGF
Sbjct: 1 MAENVVVAPCIKLQIGRTPFEAKSAAPCSFSFKREQRESSCGSYKFTRISTWRRRALSGF 60
Query: 61 CGSNLIVNPVPRKTFREHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMAGVIPLPK 120
GSNLIV+P PRK FREHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMAGVIP+PK
Sbjct: 61 RGSNLIVSPAPRKIFREHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMAGVIPVPK 120
Query: 121 SNQPGNIISQTDSASDNPTFSGSGMKADDQINPKHALDVVKGKILDFLDAFERRRSMENE 180
SNQPGNIIS T+SASDNPTFSGSGMK DDQIN K+ALDVVKGKILDFLDAFERR+S+ENE
Sbjct: 121 SNQPGNIISNTNSASDNPTFSGSGMKTDDQINSKYALDVVKGKILDFLDAFERRKSVENE 180
Query: 181 VFEFAECHAKQPLSLNAIAEGPRLRLLWASFQLIEEEVNNISDATIQNMDDLSKIFSKFI 240
V EFAE HAKQPLSLNAI EGPRLRLLWASFQLIEEEVNN+S ATIQNMDDLS IFSKFI
Sbjct: 181 VLEFAESHAKQPLSLNAIGEGPRLRLLWASFQLIEEEVNNLSTATIQNMDDLSIIFSKFI 240
Query: 241 QKSSQPVCMSWLKSELSMENNDSSKAFLSSMSEKLKAEDNILQGIKKSGKEELYAELMHF 300
QKSS PVCMSWLK+ELSM+NNDSSKAFLS MSEKLKAEDNIL GIKKSGKEELYAELMHF
Sbjct: 241 QKSSLPVCMSWLKNELSMKNNDSSKAFLSLMSEKLKAEDNILPGIKKSGKEELYAELMHF 300
Query: 301 LSFGARRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSTFFDEVDNIGLA 360
LSFG RRDYCYYD+SL+VKHGISILEDLLITFADGIASMYLEFISVDS+FFDEVDNIGLA
Sbjct: 301 LSFGPRRDYCYYDYSLFVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLA 360
Query: 361 LCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLCTLGSQQIEIPGSRQVNIDNW 420
LCTLSTRALQRLRNEVAMNQWLYQN+EAIVSMYEDRFDLCTL SQQIE+PGSRQ NIDNW
Sbjct: 361 LCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRFDLCTLSSQQIELPGSRQANIDNW 420
Query: 421 WMKHILRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRV 480
WMKHILRR ETLSS+L YVVI SF+MPVKRTKELRALRGWRYYFSLLIELSDIT P+IRV
Sbjct: 421 WMKHILRRRETLSSELRYVVIDSFAMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRV 480
Query: 481 VIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK 516
VIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK
Sbjct: 481 VIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK 515
BLAST of Lcy05g009360 vs. NCBI nr
Match:
XP_022151393.1 (uncharacterized protein LOC111019333 isoform X2 [Momordica charantia])
HSP 1 Score: 911.8 bits (2355), Expect = 2.7e-261
Identity = 464/515 (90.10%), Postives = 480/515 (93.20%), Query Frame = 0
Query: 1 MAEHVTVTPSIELQVWRTPFIVKSFAPCNFSFKREQRKSSCESYKFIRISTWRRRELSGF 60
MAEHV VTP I+LQ+ RTPF +KS PCNFSFK EQRKSSCE+ KFIRIS WRR +LSGF
Sbjct: 6 MAEHVAVTPCIKLQIRRTPFKMKSSTPCNFSFKIEQRKSSCENNKFIRISAWRRCQLSGF 65
Query: 61 CGSNLIVNPVPRKTFREHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMAGVIPLPK 120
GS LIVNP PRKTFREHAYLRSLVNVDGT ASEVL VDQLLLM SIFLTYMAGVIP+PK
Sbjct: 66 GGSKLIVNPAPRKTFREHAYLRSLVNVDGTAASEVLIVDQLLLMISIFLTYMAGVIPVPK 125
Query: 121 SNQPGNIISQTDSASDNPTFSGSGMKADDQINPKHALDVVKGKILDFLDAFERRRSMENE 180
SNQPG+IIS T +ASDNPTFSGSGMK +DQINPK+AL VVKGKILDFLDAFERR+SMENE
Sbjct: 126 SNQPGSIISHTSAASDNPTFSGSGMKTEDQINPKNALHVVKGKILDFLDAFERRKSMENE 185
Query: 181 VFEFAECHAKQPLSLNAIAEGPRLRLLWASFQLIEEEVNNISDATIQNMDDLSKIFSKFI 240
VFEFAECH KQPLSLNAIAEGPRLRLLWASFQLIEEEVNNIS+ TIQNMDDLSKIFSKFI
Sbjct: 186 VFEFAECHVKQPLSLNAIAEGPRLRLLWASFQLIEEEVNNISNTTIQNMDDLSKIFSKFI 245
Query: 241 QKSSQPVCMSWLKSELSMENNDSSKAFLSSMSEKLKAEDNILQGIKKSGKEELYAELMHF 300
QKSS PVC SWLK ELSME NDSSKAFLS MSEKLKAEDNILQGIKKSGKEELYAELMHF
Sbjct: 246 QKSSHPVCRSWLKKELSMEKNDSSKAFLSLMSEKLKAEDNILQGIKKSGKEELYAELMHF 305
Query: 301 LSFGARRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSTFFDEVDNIGLA 360
LSFGARRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDS+F DEVDN+GLA
Sbjct: 306 LSFGARRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSSFSDEVDNVGLA 365
Query: 361 LCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLCTLGSQQIEIPGSRQVNIDNW 420
LC LSTRALQRLRNEV MNQWLYQNVEAIVSMYEDRFDLCTLGSQ IE+PGSRQ IDNW
Sbjct: 366 LCNLSTRALQRLRNEVVMNQWLYQNVEAIVSMYEDRFDLCTLGSQLIELPGSRQAKIDNW 425
Query: 421 WMKHILRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRV 480
WM+ LRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRV
Sbjct: 426 WMRQFLRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRV 485
Query: 481 VIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK 516
VIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK
Sbjct: 486 VIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK 520
BLAST of Lcy05g009360 vs. NCBI nr
Match:
XP_022151402.1 (uncharacterized protein LOC111019333 isoform X3 [Momordica charantia])
HSP 1 Score: 911.8 bits (2355), Expect = 2.7e-261
Identity = 464/515 (90.10%), Postives = 480/515 (93.20%), Query Frame = 0
Query: 1 MAEHVTVTPSIELQVWRTPFIVKSFAPCNFSFKREQRKSSCESYKFIRISTWRRRELSGF 60
MAEHV VTP I+LQ+ RTPF +KS PCNFSFK EQRKSSCE+ KFIRIS WRR +LSGF
Sbjct: 1 MAEHVAVTPCIKLQIRRTPFKMKSSTPCNFSFKIEQRKSSCENNKFIRISAWRRCQLSGF 60
Query: 61 CGSNLIVNPVPRKTFREHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMAGVIPLPK 120
GS LIVNP PRKTFREHAYLRSLVNVDGT ASEVL VDQLLLM SIFLTYMAGVIP+PK
Sbjct: 61 GGSKLIVNPAPRKTFREHAYLRSLVNVDGTAASEVLIVDQLLLMISIFLTYMAGVIPVPK 120
Query: 121 SNQPGNIISQTDSASDNPTFSGSGMKADDQINPKHALDVVKGKILDFLDAFERRRSMENE 180
SNQPG+IIS T +ASDNPTFSGSGMK +DQINPK+AL VVKGKILDFLDAFERR+SMENE
Sbjct: 121 SNQPGSIISHTSAASDNPTFSGSGMKTEDQINPKNALHVVKGKILDFLDAFERRKSMENE 180
Query: 181 VFEFAECHAKQPLSLNAIAEGPRLRLLWASFQLIEEEVNNISDATIQNMDDLSKIFSKFI 240
VFEFAECH KQPLSLNAIAEGPRLRLLWASFQLIEEEVNNIS+ TIQNMDDLSKIFSKFI
Sbjct: 181 VFEFAECHVKQPLSLNAIAEGPRLRLLWASFQLIEEEVNNISNTTIQNMDDLSKIFSKFI 240
Query: 241 QKSSQPVCMSWLKSELSMENNDSSKAFLSSMSEKLKAEDNILQGIKKSGKEELYAELMHF 300
QKSS PVC SWLK ELSME NDSSKAFLS MSEKLKAEDNILQGIKKSGKEELYAELMHF
Sbjct: 241 QKSSHPVCRSWLKKELSMEKNDSSKAFLSLMSEKLKAEDNILQGIKKSGKEELYAELMHF 300
Query: 301 LSFGARRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSTFFDEVDNIGLA 360
LSFGARRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDS+F DEVDN+GLA
Sbjct: 301 LSFGARRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSSFSDEVDNVGLA 360
Query: 361 LCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLCTLGSQQIEIPGSRQVNIDNW 420
LC LSTRALQRLRNEV MNQWLYQNVEAIVSMYEDRFDLCTLGSQ IE+PGSRQ IDNW
Sbjct: 361 LCNLSTRALQRLRNEVVMNQWLYQNVEAIVSMYEDRFDLCTLGSQLIELPGSRQAKIDNW 420
Query: 421 WMKHILRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRV 480
WM+ LRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRV
Sbjct: 421 WMRQFLRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRV 480
Query: 481 VIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK 516
VIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK
Sbjct: 481 VIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK 515
BLAST of Lcy05g009360 vs. NCBI nr
Match:
XP_022151384.1 (uncharacterized protein LOC111019333 isoform X1 [Momordica charantia])
HSP 1 Score: 911.8 bits (2355), Expect = 2.7e-261
Identity = 464/515 (90.10%), Postives = 480/515 (93.20%), Query Frame = 0
Query: 1 MAEHVTVTPSIELQVWRTPFIVKSFAPCNFSFKREQRKSSCESYKFIRISTWRRRELSGF 60
MAEHV VTP I+LQ+ RTPF +KS PCNFSFK EQRKSSCE+ KFIRIS WRR +LSGF
Sbjct: 16 MAEHVAVTPCIKLQIRRTPFKMKSSTPCNFSFKIEQRKSSCENNKFIRISAWRRCQLSGF 75
Query: 61 CGSNLIVNPVPRKTFREHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMAGVIPLPK 120
GS LIVNP PRKTFREHAYLRSLVNVDGT ASEVL VDQLLLM SIFLTYMAGVIP+PK
Sbjct: 76 GGSKLIVNPAPRKTFREHAYLRSLVNVDGTAASEVLIVDQLLLMISIFLTYMAGVIPVPK 135
Query: 121 SNQPGNIISQTDSASDNPTFSGSGMKADDQINPKHALDVVKGKILDFLDAFERRRSMENE 180
SNQPG+IIS T +ASDNPTFSGSGMK +DQINPK+AL VVKGKILDFLDAFERR+SMENE
Sbjct: 136 SNQPGSIISHTSAASDNPTFSGSGMKTEDQINPKNALHVVKGKILDFLDAFERRKSMENE 195
Query: 181 VFEFAECHAKQPLSLNAIAEGPRLRLLWASFQLIEEEVNNISDATIQNMDDLSKIFSKFI 240
VFEFAECH KQPLSLNAIAEGPRLRLLWASFQLIEEEVNNIS+ TIQNMDDLSKIFSKFI
Sbjct: 196 VFEFAECHVKQPLSLNAIAEGPRLRLLWASFQLIEEEVNNISNTTIQNMDDLSKIFSKFI 255
Query: 241 QKSSQPVCMSWLKSELSMENNDSSKAFLSSMSEKLKAEDNILQGIKKSGKEELYAELMHF 300
QKSS PVC SWLK ELSME NDSSKAFLS MSEKLKAEDNILQGIKKSGKEELYAELMHF
Sbjct: 256 QKSSHPVCRSWLKKELSMEKNDSSKAFLSLMSEKLKAEDNILQGIKKSGKEELYAELMHF 315
Query: 301 LSFGARRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSTFFDEVDNIGLA 360
LSFGARRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDS+F DEVDN+GLA
Sbjct: 316 LSFGARRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSSFSDEVDNVGLA 375
Query: 361 LCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLCTLGSQQIEIPGSRQVNIDNW 420
LC LSTRALQRLRNEV MNQWLYQNVEAIVSMYEDRFDLCTLGSQ IE+PGSRQ IDNW
Sbjct: 376 LCNLSTRALQRLRNEVVMNQWLYQNVEAIVSMYEDRFDLCTLGSQLIELPGSRQAKIDNW 435
Query: 421 WMKHILRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRV 480
WM+ LRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRV
Sbjct: 436 WMRQFLRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRV 495
Query: 481 VIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK 516
VIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK
Sbjct: 496 VIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK 530
BLAST of Lcy05g009360 vs. NCBI nr
Match:
XP_022991669.1 (uncharacterized protein LOC111488215 isoform X2 [Cucurbita maxima])
HSP 1 Score: 907.9 bits (2345), Expect = 4.0e-260
Identity = 463/515 (89.90%), Postives = 484/515 (93.98%), Query Frame = 0
Query: 1 MAEHVTVTPSIELQVWRTPFIVKSFAPCNFSFKREQRKSSCESYKFIRISTWRRRELSGF 60
MAE+V V P I+LQ+ RTPF KS APC+FSFKRE+RKSSC SYKF RISTWRRR LSGF
Sbjct: 1 MAENVVVAPCIKLQIGRTPFEAKSSAPCSFSFKREERKSSCGSYKFTRISTWRRRALSGF 60
Query: 61 CGSNLIVNPVPRKTFREHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMAGVIPLPK 120
GSNLIV+P PRK FREHA LRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMAGVIP+PK
Sbjct: 61 RGSNLIVSPAPRKIFREHACLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMAGVIPVPK 120
Query: 121 SNQPGNIISQTDSASDNPTFSGSGMKADDQINPKHALDVVKGKILDFLDAFERRRSMENE 180
SNQPGNIIS T+SASDNPTFSGSGMK DDQIN K+ALDVVKGKILDFLDAFE R+S+ENE
Sbjct: 121 SNQPGNIISNTNSASDNPTFSGSGMKTDDQINSKYALDVVKGKILDFLDAFEHRKSVENE 180
Query: 181 VFEFAECHAKQPLSLNAIAEGPRLRLLWASFQLIEEEVNNISDATIQNMDDLSKIFSKFI 240
V EFAE HAKQPLSLNAI EGPRLRLLWASFQLIEEEVNN+S ATIQNMDDLS IFSKFI
Sbjct: 181 VLEFAESHAKQPLSLNAIGEGPRLRLLWASFQLIEEEVNNLSTATIQNMDDLSIIFSKFI 240
Query: 241 QKSSQPVCMSWLKSELSMENNDSSKAFLSSMSEKLKAEDNILQGIKKSGKEELYAELMHF 300
QKSSQPVCMSWLK+ELSM+NNDSSKAFLS MSEKLKAEDNIL GIKKSGKEELYAELMHF
Sbjct: 241 QKSSQPVCMSWLKNELSMKNNDSSKAFLSLMSEKLKAEDNILPGIKKSGKEELYAELMHF 300
Query: 301 LSFGARRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSTFFDEVDNIGLA 360
LSFG RRDYCYYD+SL+VKHGISILEDLLITFADGIASMYLEFISVDS+FFDEVDNIGLA
Sbjct: 301 LSFGPRRDYCYYDYSLFVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLA 360
Query: 361 LCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLCTLGSQQIEIPGSRQVNIDNW 420
LCTLSTRALQRLRNEVAMNQWLYQN+EAIVSMYEDRFDLCTL SQQIE+PGSRQ NIDNW
Sbjct: 361 LCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRFDLCTLSSQQIELPGSRQANIDNW 420
Query: 421 WMKHILRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRV 480
WMKHILRR ETLSS+L YVVI SF+MPVKRTKELRALRGWRYYFSLLIELSDITMPLIRV
Sbjct: 421 WMKHILRRRETLSSELRYVVIDSFAMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRV 480
Query: 481 VIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK 516
VIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK
Sbjct: 481 VIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK 515
BLAST of Lcy05g009360 vs. NCBI nr
Match:
XP_022953640.1 (uncharacterized protein LOC111456110 isoform X2 [Cucurbita moschata])
HSP 1 Score: 907.1 bits (2343), Expect = 6.8e-260
Identity = 462/515 (89.71%), Postives = 484/515 (93.98%), Query Frame = 0
Query: 1 MAEHVTVTPSIELQVWRTPFIVKSFAPCNFSFKREQRKSSCESYKFIRISTWRRRELSGF 60
MAE+V V P I+LQ+ RTPF KS APC+FSFKREQR+SSC SYKF RISTWRRR LSGF
Sbjct: 1 MAENVVVAPCIKLQIGRTPFEAKSAAPCSFSFKREQRESSCGSYKFTRISTWRRRALSGF 60
Query: 61 CGSNLIVNPVPRKTFREHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMAGVIPLPK 120
GSNLIV+P PRK FREHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMAGVIP+PK
Sbjct: 61 RGSNLIVSPAPRKIFREHAYLRSLVNVDGTTASEVLFVDQLLLMTSIFLTYMAGVIPVPK 120
Query: 121 SNQPGNIISQTDSASDNPTFSGSGMKADDQINPKHALDVVKGKILDFLDAFERRRSMENE 180
SNQPGNIIS T+SASDNPTFSGSGMK DDQIN K+ALDVVKGKILDFLDAFERR+S+ENE
Sbjct: 121 SNQPGNIISNTNSASDNPTFSGSGMKTDDQINSKYALDVVKGKILDFLDAFERRKSVENE 180
Query: 181 VFEFAECHAKQPLSLNAIAEGPRLRLLWASFQLIEEEVNNISDATIQNMDDLSKIFSKFI 240
V EFAE HAKQPLSLNAI EGPRLRLLWASFQLIEEEVNN+S ATIQNMDDLS IFSKFI
Sbjct: 181 VLEFAESHAKQPLSLNAIGEGPRLRLLWASFQLIEEEVNNLSTATIQNMDDLSIIFSKFI 240
Query: 241 QKSSQPVCMSWLKSELSMENNDSSKAFLSSMSEKLKAEDNILQGIKKSGKEELYAELMHF 300
QKSS PVCMSWLK+ELSM+NNDSSKAFLS MSEKLKAEDNIL GIKKSGKEELYAELMHF
Sbjct: 241 QKSSLPVCMSWLKNELSMKNNDSSKAFLSLMSEKLKAEDNILPGIKKSGKEELYAELMHF 300
Query: 301 LSFGARRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSTFFDEVDNIGLA 360
LSFG RRDYCYYD+SL+VKHGISILEDLLITFADGIASMYLEFISVDS+FFDEVDNIGLA
Sbjct: 301 LSFGPRRDYCYYDYSLFVKHGISILEDLLITFADGIASMYLEFISVDSSFFDEVDNIGLA 360
Query: 361 LCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLCTLGSQQIEIPGSRQVNIDNW 420
LCTLSTRALQRLRNEVAMNQWLYQN+EAIVSMYEDRFDLCTL SQQIE+PGSRQ NIDNW
Sbjct: 361 LCTLSTRALQRLRNEVAMNQWLYQNIEAIVSMYEDRFDLCTLSSQQIELPGSRQANIDNW 420
Query: 421 WMKHILRRTETLSSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDITMPLIRV 480
WMKHILRR ETLSS+L YVVI SF+MPVKRTKELRALRGWRYYFSLLIELSDIT P+IRV
Sbjct: 421 WMKHILRRRETLSSELRYVVIDSFAMPVKRTKELRALRGWRYYFSLLIELSDITTPMIRV 480
Query: 481 VIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK 516
VIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK
Sbjct: 481 VIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK 515
BLAST of Lcy05g009360 vs. TAIR 10
Match:
AT5G48830.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )
HSP 1 Score: 399.8 bits (1026), Expect = 3.3e-111
Identity = 245/522 (46.93%), Postives = 323/522 (61.88%), Query Frame = 0
Query: 1 MAEHVTVTPSIELQVWRTPFIVKSFAPC---NFSFKREQRKSSCESYKFIRISTWRRREL 60
M HV V+PS +Q+ R + S C N K QR S K + + L
Sbjct: 1 MVGHVVVSPSSSVQL-RMHNVHSSQTSCFSTNPRVKFSQRSCKIVSRKAYKFNVSTSLNL 60
Query: 61 SGFCGSNLIVNPVPRKTFREHAYLRSLVNVDGTTASE-VLFVDQLLLMTSIFLTYMAGVI 120
C + + SL + DG S V DQ+LL SIFLTYMAGVI
Sbjct: 61 GSSCSQG--------DSTCKCTCFASLADFDGVAGSGWVPIGDQVLLTASIFLTYMAGVI 120
Query: 121 PLPK-SNQPGNIISQTDSASDNPTFSGSGMKADDQINPKHALDVVKGKILDFLDAFERRR 180
P+ K S + + + T SG + D + + K DVVK K+LD LDA +R
Sbjct: 121 PVQKTSTYSSGKSTIVEEIPEVGTSKSSGRETDFEGDLKSVWDVVKVKLLDSLDAIKREN 180
Query: 181 SMENEVFEFAECHAKQPLSLNAIAEGPRLRLLWASFQLIEEEVNNISDATIQNMDDLSKI 240
++ ++V + K PLSL AI+EGP+L LLW+ FQ +EEE N IS TI N D+
Sbjct: 181 TLGSKVLKPKPPQGKPPLSLYAISEGPQLYLLWSCFQKLEEETNKIS-GTI-NSDEWMGS 240
Query: 241 FSKFIQKSSQPVCMSWLKSELSMENNDSSKAFLSSMSEKLKAEDNILQGIKKSGKEELYA 300
F++ ++++ Q C +WLK EL +EN DS A + L +D I I+KSGKE+L+A
Sbjct: 241 FTQIVREAYQAACTAWLKEELYVENTDSDNAITPLLIRMLNEKDAIFDKIRKSGKEDLFA 300
Query: 301 ELMHFLSFGARRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDSTFFDEVD 360
E ++F FG+ YD S + HG++ILED +IT ADG+AS+YLE ISVDS F +E++
Sbjct: 301 EFLYFHKFGSPGKAFCYDLSFFRTHGVAILEDFMITLADGVASIYLELISVDSKFSNEMN 360
Query: 361 NIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLCTLGSQQI-EIPGSRQ 420
+ GL++C+LS+RALQ+LRNEVA+ QWL+QN+EA+VSMYEDRFDL L +Q I + GS
Sbjct: 361 SGGLSICSLSSRALQKLRNEVALYQWLHQNLEAVVSMYEDRFDLYILQTQVINNLDGSDD 420
Query: 421 VNIDNWWMKHILRRTETL-SSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSLLIELSDI 480
+WW K L +T+ SS L Y +I FS+PVKRTKEL+AL GWRYYFSL +ELSDI
Sbjct: 421 TESLSWWRKFTLGKTKAAPSSPLRYSIISDFSLPVKRTKELKALSGWRYYFSLFLELSDI 480
Query: 481 TMPLIRVVIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK 516
MP+IRVV+DK+SS ISFFLV LIGRS+GLI+TGIRQSLRWK
Sbjct: 481 GMPIIRVVLDKVSSVISFFLVTLIGRSVGLIFTGIRQSLRWK 511
BLAST of Lcy05g009360 vs. TAIR 10
Match:
AT5G48830.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages. )
HSP 1 Score: 386.7 bits (992), Expect = 2.9e-107
Identity = 244/529 (46.12%), Postives = 323/529 (61.06%), Query Frame = 0
Query: 1 MAEHVTVTPSIELQVWRTPFIVKSFAPC---NFSFKREQRKSSCESYKFIRISTWRRREL 60
M HV V+PS +Q+ R + S C N K QR S K + + L
Sbjct: 1 MVGHVVVSPSSSVQL-RMHNVHSSQTSCFSTNPRVKFSQRSCKIVSRKAYKFNVSTSLNL 60
Query: 61 SGFCGSNLIVNPVPRKTFREHAYLRSLVNVDGTTASE-VLFVDQLLLMTSIFLTYMAGVI 120
C + + SL + DG S V DQ+LL SIFLTYMAGVI
Sbjct: 61 GSSCSQG--------DSTCKCTCFASLADFDGVAGSGWVPIGDQVLLTASIFLTYMAGVI 120
Query: 121 PLPK-SNQPGNIISQTDSASDNPTFSGSGMKADDQINPKHALDVVKGKILDFLDAFERRR 180
P+ K S + + + T SG + D + + K DVVK K+LD LDA +R
Sbjct: 121 PVQKTSTYSSGKSTIVEEIPEVGTSKSSGRETDFEGDLKSVWDVVKVKLLDSLDAIKREN 180
Query: 181 SMENEVFEFAECHAKQPLSLNAIAEGPRLRLLWASFQLIEEEVNNISDATIQNMDDLSKI 240
++ ++V + K PLSL AI+EGP+L LLW+ FQ +EEE N IS TI N D+
Sbjct: 181 TLGSKVLKPKPPQGKPPLSLYAISEGPQLYLLWSCFQKLEEETNKIS-GTI-NSDEWMGS 240
Query: 241 FSKFIQKSSQPVCMSWLKSELSMENNDSS-------KAFLSSMSEKLKAEDNILQGIKKS 300
F++ ++++ Q C +WLK EL +EN DS +A + L +D I I+KS
Sbjct: 241 FTQIVREAYQAACTAWLKEELYVENTDSDNNLARDLQAITPLLIRMLNEKDAIFDKIRKS 300
Query: 301 GKEELYAELMHFLSFGARRDYCYYDHSLYVKHGISILEDLLITFADGIASMYLEFISVDS 360
GKE+L+AE ++F FG+ YD S + HG++ILED +IT ADG+AS+YLE ISVDS
Sbjct: 301 GKEDLFAEFLYFHKFGSPGKAFCYDLSFFRTHGVAILEDFMITLADGVASIYLELISVDS 360
Query: 361 TFFDEVDNIGLALCTLSTRALQRLRNEVAMNQWLYQNVEAIVSMYEDRFDLCTLGSQQI- 420
F +E+++ GL++C+LS+RALQ+LRNEVA+ QWL+QN+EA+VSMYEDRFDL L +Q I
Sbjct: 361 KFSNEMNSGGLSICSLSSRALQKLRNEVALYQWLHQNLEAVVSMYEDRFDLYILQTQVIN 420
Query: 421 EIPGSRQVNIDNWWMKHILRRTETL-SSQLHYVVIRSFSMPVKRTKELRALRGWRYYFSL 480
+ GS +WW K L +T+ SS L Y +I FS+PVKRTKEL+AL GW YYFSL
Sbjct: 421 NLDGSDDTESLSWWRKFTLGKTKAAPSSPLRYSIISDFSLPVKRTKELKALSGW-YYFSL 480
Query: 481 LIELSDITMPLIRVVIDKISSGISFFLVCLIGRSLGLIYTGIRQSLRWK 516
+ELSDI MP+IRVV+DK+SS ISFFLV LIGRS+GLI+TGIRQSLRWK
Sbjct: 481 FLELSDIGMPIIRVVLDKVSSVISFFLVTLIGRSVGLIFTGIRQSLRWK 517
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1DDE2 | 1.3e-261 | 90.10 | uncharacterized protein LOC111019333 isoform X3 OS=Momordica charantia OX=3673 G... | [more] |
A0A6J1DC25 | 1.3e-261 | 90.10 | uncharacterized protein LOC111019333 isoform X2 OS=Momordica charantia OX=3673 G... | [more] |
A0A6J1DCX7 | 1.3e-261 | 90.10 | uncharacterized protein LOC111019333 isoform X1 OS=Momordica charantia OX=3673 G... | [more] |
A0A6J1JVH5 | 1.9e-260 | 89.90 | uncharacterized protein LOC111488215 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1GQ87 | 3.3e-260 | 89.71 | uncharacterized protein LOC111456110 isoform X2 OS=Cucurbita moschata OX=3662 GN... | [more] |
Match Name | E-value | Identity | Description | |
XP_022151393.1 | 2.7e-261 | 90.10 | uncharacterized protein LOC111019333 isoform X2 [Momordica charantia] | [more] |
XP_022151402.1 | 2.7e-261 | 90.10 | uncharacterized protein LOC111019333 isoform X3 [Momordica charantia] | [more] |
XP_022151384.1 | 2.7e-261 | 90.10 | uncharacterized protein LOC111019333 isoform X1 [Momordica charantia] | [more] |
XP_022991669.1 | 4.0e-260 | 89.90 | uncharacterized protein LOC111488215 isoform X2 [Cucurbita maxima] | [more] |
XP_022953640.1 | 6.8e-260 | 89.71 | uncharacterized protein LOC111456110 isoform X2 [Cucurbita moschata] | [more] |
Match Name | E-value | Identity | Description | |
AT5G48830.1 | 3.3e-111 | 46.93 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |
AT5G48830.2 | 2.9e-107 | 46.12 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |