Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAAATATTACATGTATTGTATGCCTTAGAAATCCGCGCCAATTTGCCAATCTTCTTCCCGCTGAAAAATGAACCTCAAATTTAGGGCACAGAAATGGCTTAATCCCGGTCTGAACTTGAATTGAACTCGGTGATTCTCAAGAATGTCTTCTCGGGGCCGAAATTTCAACTCGGACGACGCCGGCGGAAACTCGGCCATGGAGTTAGATGGTGGCCGACAGCTTCAGGAAGAAGAAGATGATGATCCGTTTCTTAAATTTGTCGATTACGCGAGGTCTGTGCTAGCATTTGAAGACGAAGAAGACTTCGACCCTAATGTTAATGGAACGGAGACCAATACGCCGGGTTGGAGTTGGATCGCCTCTCGGGTCCTCAGAACTTGTATCGCCTACTCGAGTTCTGTTACCCCTGCGATTTTGCTATCTGAGCTCTCGCAGGTACTGTGAATTCAAACTTCATTAAGGTTTCTTCTTTCGATTTTAGATAAGCTTGGAGCAATCCAATGGAAATTTTTTATTAATTGCCTGATTTTCCAGTCTAAGGTAACTGCAAAGTGAGAACTTCCTGAATTGTTGATGTTTCACAATTAGACTAAAGAGTTCACCGTAATTGATTACACGAGTTTAATGGAATCTCTTAGTTACAAGTTAAATTTGTACCAATAGGCCATCTGAGTTCCTTTTTATTTTATACACCCACTTATTACCAAGTGATCGTAGTGAGGAGAGGGTGAAACAGGAGCCCATGTTTTATTCTAAATGAGGGTTGAATACTCAGTACTCATAGCTTGCTTCAAGGCTCTTGAGGTTCATGTCTCCTTAAAACTGTTTAAACCAATAGTCCTATTGGTTTTTGAAGTACTAGGCACCATCGTGTGGGCCAACTTTGAGCGATCTAGATTGCACCCATGTTCCTTTCCAGTTCAGCATCTCATCAGCTGAGAGGCTATTTCAACTGTCAATTATGCAATTATCCAAGTGAGAAATAAGCGGTTGCCTCCCAAGCCTCATATGTTCGATTGGGAATAAGAAATCCACAGGTGATGAAGGCATTCGTTTCTTCTCTAGGAGGCAAATACATTCTGACCTTTCCTTATCTCCCCTATCAAGTGTCCCTATAGTTTGTAACTCATTAGCATAAGAAGAAAACATTTTTGCCAAAGCATAAAATTGTTTACACCGATGTCACCTGATTCACCAGGTTTCCTTTTGATGATTAAAGTCTGGTATCGAGAGATGTTTTGGGATGAAAGTGATTTAGTTGAAATAAAACATGGTGGATAATGTGGCATCTTGGGGAGAGTTAATGCTGAATGGTTGGTAAGACCAGACAAAGTTTTATCTATACCCTTGAACAACTACATCTAACCCTTATGTTAAGGGGTCAAAGGATTGCACCTGATTTAGATGGAGAAGACAAAGCACAGTAGTTGCTAGATATTGTACCACTTCAGTAAGTCTGAAAAGGACAATAACTAAATGTTTTATATAAACAATGACTGGTTCCACTTCTGATGTATCTTATTAAAAAAAAACGGTTCCACTTCTAGTTACCGACTAAGATAATATAAATTAAACTTTCACAGTTCTATTCCACAAGACAGTCATTTGGCTAGTGATTTTGTTTTGAAGGCCTGGTATGAGCAACACAGAGTTGGGGCTCCCAAGAAAGTACCTGAATGTATTAATCAGTTGAAGAAGAAAAATAGGAGAAAGAAGCTCCCAAAAACAGTTACTATTGACTCCATATATGCGAAGAATTTCCTAGCTTTAAGTAGTGTATTGGAAGCTGTCATTGTTGATGCATTTATTCTTCCAGGTACTTCTATTTCCAATTTTAAGCCGATGATTGTAATTTTAGCAATGTTCTTGATGTTCCCAATTAGTCATACATGATGCATGAACCTAGTATATTTGTAGCTTGAGGAAGGGTTGAAGAAATTGAAACATAAAGATTGTACTTGTTCAATTACTCAGTAAGATATATACCTTTTTTCAGAGAACTGATAACTTCTTTTATTTTTATATTTTAGATCTACTTAACTTCAGGTGTTGTTTCCCTAGTTCAACTGTTTAGATGTATTACGGACAAGGATCTGAATCTTTTATCTGCACGATTTTCTTAAGGTACAAATATACACATGCTTACTTTAGGGGATTTTTGGAGCTCTAATACAATCGATCTTTATCTCCATCGTAGGTAAACCATTTACTATCTGATGCTACCTTTATGATCGTTCATGCAATAAAACCATTTGGAATGAACGTGTTCTGACTGGCCATTTGTTTTCTTAATTAGACGATCCTTACAGTAGTATTACTGCACTGCACTTTTTGTGACACTAATTTGAAAAAACCAGCTGCACTGCACTGCACTTTTTGTGACGTATTACTGCATTGCACTTTCTTCGTTGCACTGCACTTTTTGTAGTTTTCTTTCACTTCCTTTTGGGAGTTTATTTCCCTTAAACTTTTTTGTATCTTTTCATTATTTCAACGAGAAGTTCTTATCTTGTTCAAAAGTGAAGAATGATAGTATACAGTGGGCCAGCTAAAAGCGAGAACACAAGAAAAAGTTTACAAAGAAGAACCCCCCCCAGCTGCTGTTGATAAATAATAAAGTGTAATTGCAAAGAAGTTTTTTAAGTATGAACCGAAAAAGAAGCGTTAACATGCACAAAAGCCCACACCTCCTCCTACAAGCTATCTAACACCTGAAAGTTATGGGTTTCTTTCCACCCAAAGATGCCATAGAGTGATGAAAAAGCTAGCTTGCCAAAGTGTATGACCTTTCTCCTCAAACAATGAATTCAAAATGACTTCCACCAGTACTGAGACATCATTACCTCTCTACGATCTGATTTGGAAAGGCCACTATCTAAAGAAGGTGAAGTTTTTCCTAGGGAGCAATCTCACAAGGCAATCAACACACATGAGAAGCTCCAGAGAAGACTGCCCTACATGGCTCTATCTCCACATTGGCGTACAATTTGCAAACAGCACTCTTAATCACAGAGTCACCTCTTCTTCTCTTGTGAATTTGCTAAGACCTTTCGGAGCTTTCTCCTCCCGACATTCAATTGGCACACAGTTCTCCCAGAAGACCCTTATCTTTTCCTCTCTCACAACCTTGTTGGACACACTTTCATAAAAGCCAAGAAGATTCCGTGGGAGAATTTTGTTAGAGCATTCTGCTGGAACATTTGGCAAGAAAGAAAAAGAAGAATCTTTCAATCAAAGGAATTCTTCTACTATCGATTTTTTTTTGAGTTCCCTACTATAGATTTTTTGAGGCTATAGTTGATCGAAGAAAATTCCTAATCTAAGTTTTTCATTTAAAGCCAAGGCTTTTTTTATTGATAACCTCTTCCAAGAATTCCTAGTAACATGATCAAAGGCTTTTTTGAAATCTATTTTAAAAGAACAACTTCCTTCTTCCTAACTCAGTATTATTCCAAAGCCCCATTGCAATGATTTATTCTTTTTATTGCCTACTATCTCTCAATATGTCACTGAATGCCTACATTTTTTTTTTTTGAAAAGCAATAGAGAAGGCCTGGATTTGAATGCTGGGGTTTTTTAGAACTAAGGATACAGTTGTACACCCTGTAGAATTGGTCTTGCTCAATTAAGATTGTGAACTCAAGGACACGGTTGCTCGATATGAATGCTGGATTTTGTATTGAATGTTGACATTTAAGCAGTGTTTATTAAAGACTCAGATGTATATGCCCAAATCTTTTTTTTTTCTTTCTTCTTCTTCTTCTTCTTCTTTTTTTATTTTATTTTATTTTTTTTTATGCCTAAACATCGAATTCCGTCTCTGTCTTTTCATCTTGATTTGAGGTTGTGTTACAACTCTATAAGCTGCATAGAACTGGCAATCTTCATGTTTCATGGAAGTACAAGTAAAACAGAATCTTTGATGGCAGATTCTATGACTTAGTGGATGGGATTCTGAAGAAAGGGAGGCAAATATTTTTAACTGGATGCTATCTTCGTGCTGCCAGCGGTGGATCTGGTCATCCACGACTTCTACCAACTGAATACCTTATCATATTATTAGATGAGGTTGTCAAAAATGGTTTTTCAAACTACCAGAATCAATATTACATTTATTTTGCTTATACGCAGTTAGTATCTTCTGTTTATTGTAGGAAGAGGACGATGATGTAATACTTCTAGGAGCTCAATTTTGTTCCGATTCCTTTTCTTCTGTTTCTTTTGATGCCGTCAATCAAGGGACTACATATTCATTATATGCAAGGCAAGATATAACTGTTGTTTGTTGGCCTATTTTTCTCCAGTTTTTTGCAATCTCTATCTTAACTGGTTTCTTAACCTGCAATGAGCTGAATGTAGGATTTTGGTCTTCTTTTGTTTTCCCCTCTTCCTAAAATGGTCATTGCTTAAACTCTAGAAAGGCTGAATCAACCAAATAAATCGATTTTCACGAATGATGGATTTTGGATGTCCAATGAGCTAACTATGGGTTGCCTTATCTTCAACATTTGCAGGATTGAGTCTATTGGTCCAATGGAAATTCATGAAAAGACTAATGGCGTACAGATGATACAAATCATTCTTGTTGATAATGATGGTTTCAAGTTAAAGTTTCTCTTATGGGGCGAACAGGTGATGCTAGCCAATCTTTTAAGGTGAATCTTTTAAGTTGATCTTAATTATTCAAGAATACTCTTATATGATTATATCAGCATAACAATTGAATTATCATTTCAACAGTGTTGGTAGCTTGCTTGCACTTGATAGACCATATATTGCAACTGTAAACGAGAATGGCATTGGAACAAGTGATGAACTTTGTCTTGAATATGGTAGTGCAACACAGTTATATTTGGTGCCTTGCATTCAGCATGAGGAGCAAGTGTGTTGTAGATTCCCACCTCCCCCACACCAAATGCACACACGCAGGCACAAACTTTTAGTTTTCTGTTTCTTGGATCCCATCAATATGTAAATATTCTACTACACAGGTATGTGTTTTAACACAGAATATAAACCAAGCTTCAAGGATGCTTAGTACGTCAAATCCTACTCAGGGTCCCCGAGTTTCTCAAGTTTCCTTGCCCTGCGATTCACATGGGACAATTGATTTTGGTAATTATCCTTTTCGGGTGAGTATAACTTCTAGATTATCATAATATGTGTGTATACATATGTTTGGATAGGAAACAGTTTTTATTGATAATATAAAATTTACAAAAGGGAGATAACCCATCCAAATGAAGTTACAAAAAACTTCTTCAATTGGATAACGGGGAGTTTAAGCTATAGTCAATAAAGGGATGTGCTTGTTTACACCAATATAAAGCTAGAAAAAGTATGTTGTCAATTAAAGCTTCATAAGATCTTTCCTTGTCTTGAAAAATCCTTGAATTCCTTTATGTCCAGATGGTCCAATATATATTTGTAGATAAATAGATAGATATTAATTTTTGGATAGCATTGGTATACCTTTTCTCTTTGAATTTATAATGTTGCAATCCCATTATTCCATGTAAAGCTCCATACACCTGTATCAATAATACCCGACTCTGGTTATATCGTAAATGGATTCGGACTGGTCCTACGTATTTAGCATCCTGTAAGTTTTTTGCAATCCGTTTGTAGTATCAAATTCATAATTTCAGGTTGTACAAGATGCAAAAAGTAAATGTCAATTTGTGCCTTGCACAGATCTGATGTTAAAATTTATTCACTTGACATATTATGAATGACAGGTTGCAATGCTACAGAATTAAAGTTCAGGTCCCCTACGGCCCTACCATCCAAGCTTTTGATTTTTCTTTGTTTGCAAAACAATTAACTGCAATTACATGAATTGTGATGCTTTAAGAAACTTCCTTCCATCTTGAAGACCGTCTCGTTAATCTTTATTGAGATGAATCTAAATTTTTCATGCTATTTGTGAAGTTTACAGTCTTTTGTGATCGACCTTCAAGACAAGATGACTGGCATTAGCTTATATGGTATCATTTTAGATATAGTTAATGAAAGAAATACCACAGAAGCTGTTTTCTCTATGAAAATTGAAGATAACACTGGAGAAATTTCGGCAAAGTTGCACTTTGTGGGATCTTGGTATGGACATTTTCTTTGCCTAACTTCTAATTTCTGTGTTTATGAGTCTACTTTTACATTGCTCTTCAAACAACAGAAACTTCTGATCATATATACTTCCACTTATTTTCTGAGTTCCATTTTCCTTTTTCTCTTCTTTGTGCCCCCTCTTGTAAAGGTCGCTGGGAAGGGTAGGCGTTGGACATACAGTATATATAAGTGGCCTGACATGCACAATGAACAAGAATCGGTGAGCTAATTCAAACTCTTTACCCAGCTATCTAGTAAATGAATGGGAAGTCTTGGACAATTTGAAAATGCTTCTGGACAAAAAAAATAATAACTTGTTTTTGATTGAGGTTTCTCTCCTTTCAGATATTTTGGCGATTTCTTCATGGCATTTCTAATTTGATTTGAGTTTTTTTCTTGTGAAGCTTTGAGGCTTTATGGATTGAGAATCATGTTGGAGCTTCTTTTGTCAACCTTAGCTGCTTGCCAGCATTGTTAACTTCATCTTGTCTACATAAACTTTCACGACTTTCTGATCTTACCAGCAACGCTCATGGTACAAAGGTTTGGTTTTTTTCTTTTTTCCTTCTCTTTCTTTCTGTCTATTATGTTATGTTTGATGTGGAACTTTTGAAAAGAGAAAGGGAAAACTAACCGGAAGAGGGAGAGGGGTCCCTTGTTGAAGAGCCTTTCCAAAAAAAAAAAAAAACTGATATTGAGGTTAATAATTCAAAATCACAAAAAAAAAAAAAAAGGCTCTCTTTAATGTTGAAGGTTTGAGTTCAACAACAATTGCTTTAATGCTGAAGGTTTCAGATTTCTTTTCATCCATCAGGATCAATGGGTGGTCTAATCGCACATCTTCATGAAACCCTAGCTCTCCATAATCTACCATGCTGAAAGGATTTTGGGAGTCATTCATCTATATCCTTTGGAGAACAAGTGAAGCAAATAAGCTCCATCCCTTTGTTGCTTAGGACCAGCGAATTCTCATTCTTATAACATAATACATAAACACCAAACCTTAGGAGAGAGCCCAATGACCAATGAGGCTTCTCTTTTTGCAACCTTTATTGGAAGCTTAGGGAAAATTTTAATTTTCGTTCTCACCTGAACATCAAAATTAGGAATTTTCTTCCCACATCTTCTTTCCATAAAACATCATCACTGCGCTTTCAAAACACAACTTTGCGGCCTTCAAATAATTTATGTATTTTTTCCACTCTCATTTCGAGTATTCTTTCCTAGAGAACACAGCAGTTATTTAAGTTGGCCCTTAATTTATTCCCTGACGATCCTGTTGATTAATCATACCCCTCCCAAGGTGGAAAATGGGAAGGGAAATTTTTCCTGTAACCAGGTAATCTACTGTTGGAGTTATCACCACCATATATACATCGAAAATAGATAAAGAAAATTATCATTGTCGATGCAGTCTTACATCGCTTGCTTGTTATTTCTTGAGAATTTTAAGCATTCCAATCATAATATCAGTACTTCAACTCATAGTGCCTGCCCAATTCAATGCAGTTGTATGATATACAATCCTATACCGAAATTTTGGTGAACCTATAAGAGCATGATTTTTACTTTTTTTAACTCTTCCAATAGTCTTTGTCATATCCCATATCTATCAGCTGTGATTTTTATTTGTTTTCCCAAAATCTCCTGGCCAGGTCTGTCAAGTTCAGCTTGACCAAGTTTCACATTGTCATGTTAGTACGAAATTTTTGCATGCAATTTGTGGTCATTTTGTCGAGGAGACACCTGGCAGGATCGAGTGCAGCTTCTGTCGTTGTGAATGCGAGTCTGAGCTTGTGCGTACATTCGACCTAAAAATCACCCTTGCAGACGATAGTGCAAAAATCTTTGCATGGTGTACAGGTCAAACTGCTGCAGAGTTGTTGCAAATATCTCCTGATGAATTCTGTGAACTACCTGAGGTAGAGACTAAACTTAAAATAATGCTTTAAAAAGCAAAAAAAGCTTATGCTTCAGAACAACCAATACAGGTAGCTTTTGTTTAAATAACAAAGAATAAGTTGCTAGTATTACCACAAGTTGTCCATAGATCAAATTATTTCGTGGATTAGATTTTACTTGATTGTCTACCGCTCCATTGCTTTGCCATGGTATTAAGTATTTTGATTTGAGCTTTTTTCTTACTGAGAGATGGAATAGAATAACCTGATTGAAGGGTAATGATCGTCCTGTAATGCATTGTATATATAAACTACTATCATGCGTGTGTGATGACCTAATCACGAGAATCATAATTTAATTTGAAGTCTTATGTTTAGTCAGATTCTTCTATCGGTGAGTGTTCTTAAAGAGGTGGTTCTAACTTTGCAGGAAGAACAAGTAATGTACCCATCTTCACTCGAGAATGAAAGTTTTGTGGTTGCAATAGTGAATTGCAGGAGGCAAACCAGCAGATGTGGAGATAATGTCTATTCTGTTCATGATCCACTTTCATGGGAGATTACTCGTGCACTAAAGTGTGATTGATGTTGCATTATCTTTCATGAAGACTTGTTCATTGATTTGACCTTTCAGAGGTTCGAATATCCTTACAACTCGAAAAATAGCCCATTCATTTTGAGTTCACCAATCGAAAGGTGAGTGAAGAGTCCATATCTCTATGAAGGTACACATTACTAGTCTCTCTATTTTGACTTTGGCAATGTTTTGGCCCCTTGGGCCACTATGTTAATAGTGATTAAGTCTTTAATATAGATGTCTTTTTAGTTACTCTCATTATCAAATATTAAACAAG
mRNA sequence
AAAATATTACATGTATTGTATGCCTTAGAAATCCGCGCCAATTTGCCAATCTTCTTCCCGCTGAAAAATGAACCTCAAATTTAGGGCACAGAAATGGCTTAATCCCGGTCTGAACTTGAATTGAACTCGGTGATTCTCAAGAATGTCTTCTCGGGGCCGAAATTTCAACTCGGACGACGCCGGCGGAAACTCGGCCATGGAGTTAGATGGTGGCCGACAGCTTCAGGAAGAAGAAGATGATGATCCGTTTCTTAAATTTGTCGATTACGCGAGGTCTGTGCTAGCATTTGAAGACGAAGAAGACTTCGACCCTAATGTTAATGGAACGGAGACCAATACGCCGGGTTGGAGTTGGATCGCCTCTCGGGTCCTCAGAACTTGTATCGCCTACTCGAGTTCTGTTACCCCTGCGATTTTGCTATCTGAGCTCTCGCAGGCCTGGTATGAGCAACACAGAGTTGGGGCTCCCAAGAAAGTACCTGAATGTATTAATCAGTTGAAGAAGAAAAATAGGAGAAAGAAGCTCCCAAAAACAGTTACTATTGACTCCATATATGCGAAGAATTTCCTAGCTTTAAGTAGTGTATTGGAAGCTGTCATTGTTGATGCATTTATTCTTCCAGGTACAAATATACACATGCTTACTTTAGGGGATTTTTGGAGCTCTAATACAATCGATCTTTATCTCCATCGTAGATTCTATGACTTAGTGGATGGGATTCTGAAGAAAGGGAGGCAAATATTTTTAACTGGATGCTATCTTCGTGCTGCCAGCGGTGGATCTGGTCATCCACGACTTCTACCAACTGAATACCTTATCATATTATTAGATGAGGAAGAGGACGATGATGTAATACTTCTAGGAGCTCAATTTTGTTCCGATTCCTTTTCTTCTGTTTCTTTTGATGCCGTCAATCAAGGGACTACATATTCATTATATGCAAGGATTGAGTCTATTGGTCCAATGGAAATTCATGAAAAGACTAATGGCGTACAGATGATACAAATCATTCTTGTTGATAATGATGGTTTCAAGTTAAAGTTTCTCTTATGGGGCGAACAGGTGATGCTAGCCAATCTTTTAAGTGTTGGTAGCTTGCTTGCACTTGATAGACCATATATTGCAACTGTAAACGAGAATGGCATTGGAACAAGTGATGAACTTTGTCTTGAATATGGTAGTGCAACACAGTTATATTTGGTGCCTTGCATTCAGCATGAGGAGCAAGTATGTGTTTTAACACAGAATATAAACCAAGCTTCAAGGATGCTTAGTACGTCAAATCCTACTCAGGGTCCCCGAGTTTCTCAAGTTTCCTTGCCCTGCGATTCACATGGGACAATTGATTTTGGTAATTATCCTTTTCGGTCTTTTGTGATCGACCTTCAAGACAAGATGACTGGCATTAGCTTATATGGTATCATTTTAGATATAGTTAATGAAAGAAATACCACAGAAGCTGTTTTCTCTATGAAAATTGAAGATAACACTGGAGAAATTTCGGCAAAGTTGCACTTTGTGGGATCTTGGTCGCTGGGAAGGGTAGGCGTTGGACATACAGTATATATAAGTGGCCTGACATGCACAATGAACAAGAATCGCTTTGAGGCTTTATGGATTGAGAATCATGTTGGAGCTTCTTTTGTCAACCTTAGCTGCTTGCCAGCATTGTTAACTTCATCTTGTCTACATAAACTTTCACGACTTTCTGATCTTACCAGCAACGCTCATGGTACAAAGGTCTGTCAAGTTCAGCTTGACCAAGTTTCACATTGTCATGTTAGTACGAAATTTTTGCATGCAATTTGTGGTCATTTTGTCGAGGAGACACCTGGCAGGATCGAGTGCAGCTTCTGTCGTTGTGAATGCGAGTCTGAGCTTGTGCGTACATTCGACCTAAAAATCACCCTTGCAGACGATAGTGCAAAAATCTTTGCATGGTGTACAGGTCAAACTGCTGCAGAGTTGTTGCAAATATCTCCTGATGAATTCTGTGAACTACCTGAGGAAGAACAAGTAATGTACCCATCTTCACTCGAGAATGAAAGTTTTGTGGTTGCAATAGTGAATTGCAGGAGGCAAACCAGCAGATGTGGAGATAATGTCTATTCTGTTCATGATCCACTTTCATGGGAGATTACTCGTGCACTAAAGTGTGATTGATGTTGCATTATCTTTCATGAAGACTTGTTCATTGATTTGACCTTTCAGAGGTTCGAATATCCTTACAACTCGAAAAATAGCCCATTCATTTTGAGTTCACCAATCGAAAGGTGAGTGAAGAGTCCATATCTCTATGAAGGTACACATTACTAGTCTCTCTATTTTGACTTTGGCAATGTTTTGGCCCCTTGGGCCACTATGTTAATAGTGATTAAGTCTTTAATATAGATGTCTTTTTAGTTACTCTCATTATCAAATATTAAACAAG
Coding sequence (CDS)
ATGTCTTCTCGGGGCCGAAATTTCAACTCGGACGACGCCGGCGGAAACTCGGCCATGGAGTTAGATGGTGGCCGACAGCTTCAGGAAGAAGAAGATGATGATCCGTTTCTTAAATTTGTCGATTACGCGAGGTCTGTGCTAGCATTTGAAGACGAAGAAGACTTCGACCCTAATGTTAATGGAACGGAGACCAATACGCCGGGTTGGAGTTGGATCGCCTCTCGGGTCCTCAGAACTTGTATCGCCTACTCGAGTTCTGTTACCCCTGCGATTTTGCTATCTGAGCTCTCGCAGGCCTGGTATGAGCAACACAGAGTTGGGGCTCCCAAGAAAGTACCTGAATGTATTAATCAGTTGAAGAAGAAAAATAGGAGAAAGAAGCTCCCAAAAACAGTTACTATTGACTCCATATATGCGAAGAATTTCCTAGCTTTAAGTAGTGTATTGGAAGCTGTCATTGTTGATGCATTTATTCTTCCAGGTACAAATATACACATGCTTACTTTAGGGGATTTTTGGAGCTCTAATACAATCGATCTTTATCTCCATCGTAGATTCTATGACTTAGTGGATGGGATTCTGAAGAAAGGGAGGCAAATATTTTTAACTGGATGCTATCTTCGTGCTGCCAGCGGTGGATCTGGTCATCCACGACTTCTACCAACTGAATACCTTATCATATTATTAGATGAGGAAGAGGACGATGATGTAATACTTCTAGGAGCTCAATTTTGTTCCGATTCCTTTTCTTCTGTTTCTTTTGATGCCGTCAATCAAGGGACTACATATTCATTATATGCAAGGATTGAGTCTATTGGTCCAATGGAAATTCATGAAAAGACTAATGGCGTACAGATGATACAAATCATTCTTGTTGATAATGATGGTTTCAAGTTAAAGTTTCTCTTATGGGGCGAACAGGTGATGCTAGCCAATCTTTTAAGTGTTGGTAGCTTGCTTGCACTTGATAGACCATATATTGCAACTGTAAACGAGAATGGCATTGGAACAAGTGATGAACTTTGTCTTGAATATGGTAGTGCAACACAGTTATATTTGGTGCCTTGCATTCAGCATGAGGAGCAAGTATGTGTTTTAACACAGAATATAAACCAAGCTTCAAGGATGCTTAGTACGTCAAATCCTACTCAGGGTCCCCGAGTTTCTCAAGTTTCCTTGCCCTGCGATTCACATGGGACAATTGATTTTGGTAATTATCCTTTTCGGTCTTTTGTGATCGACCTTCAAGACAAGATGACTGGCATTAGCTTATATGGTATCATTTTAGATATAGTTAATGAAAGAAATACCACAGAAGCTGTTTTCTCTATGAAAATTGAAGATAACACTGGAGAAATTTCGGCAAAGTTGCACTTTGTGGGATCTTGGTCGCTGGGAAGGGTAGGCGTTGGACATACAGTATATATAAGTGGCCTGACATGCACAATGAACAAGAATCGCTTTGAGGCTTTATGGATTGAGAATCATGTTGGAGCTTCTTTTGTCAACCTTAGCTGCTTGCCAGCATTGTTAACTTCATCTTGTCTACATAAACTTTCACGACTTTCTGATCTTACCAGCAACGCTCATGGTACAAAGGTCTGTCAAGTTCAGCTTGACCAAGTTTCACATTGTCATGTTAGTACGAAATTTTTGCATGCAATTTGTGGTCATTTTGTCGAGGAGACACCTGGCAGGATCGAGTGCAGCTTCTGTCGTTGTGAATGCGAGTCTGAGCTTGTGCGTACATTCGACCTAAAAATCACCCTTGCAGACGATAGTGCAAAAATCTTTGCATGGTGTACAGGTCAAACTGCTGCAGAGTTGTTGCAAATATCTCCTGATGAATTCTGTGAACTACCTGAGGAAGAACAAGTAATGTACCCATCTTCACTCGAGAATGAAAGTTTTGTGGTTGCAATAGTGAATTGCAGGAGGCAAACCAGCAGATGTGGAGATAATGTCTATTCTGTTCATGATCCACTTTCATGGGAGATTACTCGTGCACTAAAGTGTGATTGA
Protein sequence
MSSRGRNFNSDDAGGNSAMELDGGRQLQEEEDDDPFLKFVDYARSVLAFEDEEDFDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKVPECINQLKKKNRRKKLPKTVTIDSIYAKNFLALSSVLEAVIVDAFILPGTNIHMLTLGDFWSSNTIDLYLHRRFYDLVDGILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILLGAQFCSDSFSSVSFDAVNQGTTYSLYARIESIGPMEIHEKTNGVQMIQIILVDNDGFKLKFLLWGEQVMLANLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRMLSTSNPTQGPRVSQVSLPCDSHGTIDFGNYPFRSFVIDLQDKMTGISLYGIILDIVNERNTTEAVFSMKIEDNTGEISAKLHFVGSWSLGRVGVGHTVYISGLTCTMNKNRFEALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSNAHGTKVCQVQLDQVSHCHVSTKFLHAICGHFVEETPGRIECSFCRCECESELVRTFDLKITLADDSAKIFAWCTGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTSRCGDNVYSVHDPLSWEITRALKCD
Homology
BLAST of Tan0020038 vs. NCBI nr
Match:
XP_023538883.1 (uncharacterized protein LOC111799677 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1260.7 bits (3261), Expect = 0.0e+00
Identity = 620/672 (92.26%), Postives = 650/672 (96.73%), Query Frame = 0
Query: 2 SSRGRNFNSDDAGGNSAMELDGGRQLQEEEDDDPFLKFVDYARSVLAFEDEEDFDPNVNG 61
SSRGR+FNSD+AGGNSAMEL+ R+LQEEEDDDPFLKF+DYARSVLAFEDEEDFDPNV G
Sbjct: 3 SSRGRHFNSDEAGGNSAMELNDRRRLQEEEDDDPFLKFIDYARSVLAFEDEEDFDPNVKG 62
Query: 62 TETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKVPECINQLKK 121
TET TPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAW EQHR+GAPKK+PECINQLKK
Sbjct: 63 TETYTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWSEQHRIGAPKKIPECINQLKK 122
Query: 122 KNRRKKLPKTVTIDSIYAKNFLALSSVLEAVIVDAFILPGTNIHMLTLGDFWSSNTIDLY 181
KNRRKKLPKTVTIDSIY KNFL+LSSVLEAVIV+ FILPGTNIHMLTLGDFWSSNTIDLY
Sbjct: 123 KNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVIVEEFILPGTNIHMLTLGDFWSSNTIDLY 182
Query: 182 LHRRFYDLVDGILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILLG 241
LH RFYDLV GILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILLG
Sbjct: 183 LHCRFYDLVGGILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILLG 242
Query: 242 AQFCSDSFSSVSFDAVNQGTTYSLYARIESIGPMEIHEKTNGVQMIQIILVDNDGFKLKF 301
AQFCSDSFSSVS DAVN+GTTYSLYARIESIGP+EIHEKTNG+QMIQI L+DNDGFKLKF
Sbjct: 243 AQFCSDSFSSVSLDAVNKGTTYSLYARIESIGPIEIHEKTNGLQMIQISLLDNDGFKLKF 302
Query: 302 LLWGEQVMLANLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEE 361
LLWGEQV+LANLLSVGSLLALDRPYIATVNENG+GTSDELCLEYGSATQLYLVPCIQHEE
Sbjct: 303 LLWGEQVILANLLSVGSLLALDRPYIATVNENGLGTSDELCLEYGSATQLYLVPCIQHEE 362
Query: 362 QVCVLTQNINQASRMLSTSNPTQGPRVSQVSLPCDSHGTIDFGNYPFRSFVIDLQDKMTG 421
QVCVLTQNI+QASR L TS PTQ PRVSQVSLPCDSHGTIDFGNYPFRSFV+DLQDKMTG
Sbjct: 363 QVCVLTQNISQASRTLGTSYPTQDPRVSQVSLPCDSHGTIDFGNYPFRSFVVDLQDKMTG 422
Query: 422 ISLYGIILDIVNERNTTEAVFSMKIEDNTGEISAKLHFVGSWSLGRVGVGHTVYISGLTC 481
ISLYGII+DIVNERNTTEAVFSM+IEDNTG+ISAKLHFV SWSLGRVGVGHTVYISGLTC
Sbjct: 423 ISLYGIIIDIVNERNTTEAVFSMRIEDNTGQISAKLHFVRSWSLGRVGVGHTVYISGLTC 482
Query: 482 TMNKNRFEALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSNAHGTKVCQVQLDQ 541
T+ KN EALWIENHVGASFVNLSCLPALLTSSCLHK+SRLSDLTSN+HGTKVC+V+LDQ
Sbjct: 483 TIKKNNLEALWIENHVGASFVNLSCLPALLTSSCLHKISRLSDLTSNSHGTKVCRVRLDQ 542
Query: 542 VSHCHVSTKFLHAICGHFVEETPGRIECSFCRCECESELVRTFDLKITLADDSAKIFAWC 601
VSHCHVSTKFLHA CGHFVEETPGR ECSFCRCEC+SELVRTFDLKITLADD+AKIFA C
Sbjct: 543 VSHCHVSTKFLHANCGHFVEETPGRTECSFCRCECKSELVRTFDLKITLADDTAKIFACC 602
Query: 602 TGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTSRCGDNVYSVHDP 661
TGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTS+CGDNVYSV+DP
Sbjct: 603 TGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTSKCGDNVYSVNDP 662
Query: 662 LSWEITRALKCD 674
LSWEITRALKC+
Sbjct: 663 LSWEITRALKCE 674
BLAST of Tan0020038 vs. NCBI nr
Match:
KAG7029015.1 (hypothetical protein SDJN02_10198 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 1255.7 bits (3248), Expect = 0.0e+00
Identity = 618/672 (91.96%), Postives = 648/672 (96.43%), Query Frame = 0
Query: 2 SSRGRNFNSDDAGGNSAMELDGGRQLQEEEDDDPFLKFVDYARSVLAFEDEEDFDPNVNG 61
SSRGR+F SD+AGGNSAMEL+ R+LQEEEDDDPFLKF+DYARSVLAFEDEEDFDPNV G
Sbjct: 3 SSRGRHFKSDEAGGNSAMELNDRRRLQEEEDDDPFLKFIDYARSVLAFEDEEDFDPNVKG 62
Query: 62 TETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKVPECINQLKK 121
TET TPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAW EQHR+GAPKK+PECINQLKK
Sbjct: 63 TETYTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWSEQHRIGAPKKIPECINQLKK 122
Query: 122 KNRRKKLPKTVTIDSIYAKNFLALSSVLEAVIVDAFILPGTNIHMLTLGDFWSSNTIDLY 181
KNRRKKLPKTVTIDSIY KNFL+LSSVLEAVIV+ FILPGTNIHMLTLGDFWSSNTIDLY
Sbjct: 123 KNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVIVEEFILPGTNIHMLTLGDFWSSNTIDLY 182
Query: 182 LHRRFYDLVDGILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILLG 241
LH RFYDLV GILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLI LLDEEEDDDVILLG
Sbjct: 183 LHCRFYDLVGGILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLITLLDEEEDDDVILLG 242
Query: 242 AQFCSDSFSSVSFDAVNQGTTYSLYARIESIGPMEIHEKTNGVQMIQIILVDNDGFKLKF 301
AQFCSDSFSSVS DAVN+GTTYSLYARIESIGP+EIHEKTNG+QMIQI L+DNDGFKLKF
Sbjct: 243 AQFCSDSFSSVSLDAVNKGTTYSLYARIESIGPIEIHEKTNGLQMIQISLLDNDGFKLKF 302
Query: 302 LLWGEQVMLANLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEE 361
LLWGEQV+LANLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEE
Sbjct: 303 LLWGEQVILANLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEE 362
Query: 362 QVCVLTQNINQASRMLSTSNPTQGPRVSQVSLPCDSHGTIDFGNYPFRSFVIDLQDKMTG 421
QVCVLTQNI+QASR L TS PTQ PRVSQVSLPCDSHGTIDFGNYPFRSFV+DLQDKMTG
Sbjct: 363 QVCVLTQNISQASRTLGTSYPTQDPRVSQVSLPCDSHGTIDFGNYPFRSFVVDLQDKMTG 422
Query: 422 ISLYGIILDIVNERNTTEAVFSMKIEDNTGEISAKLHFVGSWSLGRVGVGHTVYISGLTC 481
ISLYGII+DIVNERNTTEAVFSM+IEDNTG+ISAKLHFV SWSLGRVGVGHTVYISGLTC
Sbjct: 423 ISLYGIIIDIVNERNTTEAVFSMRIEDNTGQISAKLHFVRSWSLGRVGVGHTVYISGLTC 482
Query: 482 TMNKNRFEALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSNAHGTKVCQVQLDQ 541
T+ KN EALWIENHVGASFVNLSCLPALLTSSCLHK+SRLSDLT N+HGTKVC+V+LDQ
Sbjct: 483 TIKKNNLEALWIENHVGASFVNLSCLPALLTSSCLHKISRLSDLTCNSHGTKVCRVRLDQ 542
Query: 542 VSHCHVSTKFLHAICGHFVEETPGRIECSFCRCECESELVRTFDLKITLADDSAKIFAWC 601
VSHCHVSTKFLHA CGHFVEETPGRIECSFCRC+C+SELVRTFDLKITLADD+AKIFA C
Sbjct: 543 VSHCHVSTKFLHANCGHFVEETPGRIECSFCRCKCKSELVRTFDLKITLADDTAKIFACC 602
Query: 602 TGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTSRCGDNVYSVHDP 661
TGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTS+CGDNVYSV+DP
Sbjct: 603 TGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTSKCGDNVYSVNDP 662
Query: 662 LSWEITRALKCD 674
LSWEITRALKC+
Sbjct: 663 LSWEITRALKCE 674
BLAST of Tan0020038 vs. NCBI nr
Match:
XP_022938337.1 (uncharacterized protein LOC111444466 [Cucurbita moschata])
HSP 1 Score: 1251.1 bits (3236), Expect = 0.0e+00
Identity = 617/672 (91.82%), Postives = 645/672 (95.98%), Query Frame = 0
Query: 2 SSRGRNFNSDDAGGNSAMELDGGRQLQEEEDDDPFLKFVDYARSVLAFEDEEDFDPNVNG 61
SSRGR+F SD+AGGNSAMEL+ R+LQEEEDDDPFLKF+DYARSVLAFEDEEDFDPNV G
Sbjct: 3 SSRGRHFKSDEAGGNSAMELNDRRRLQEEEDDDPFLKFIDYARSVLAFEDEEDFDPNVKG 62
Query: 62 TETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKVPECINQLKK 121
TET TPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAW EQHR+GAPKK+PECINQLKK
Sbjct: 63 TETYTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWSEQHRIGAPKKIPECINQLKK 122
Query: 122 KNRRKKLPKTVTIDSIYAKNFLALSSVLEAVIVDAFILPGTNIHMLTLGDFWSSNTIDLY 181
KNRRKKLPKTVTIDSIY KNFL+LSSVLEAVIV+ FILPGTNIHMLTLGDFWSSNTIDLY
Sbjct: 123 KNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVIVEEFILPGTNIHMLTLGDFWSSNTIDLY 182
Query: 182 LHRRFYDLVDGILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILLG 241
LH RFYDLV GILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILLG
Sbjct: 183 LHCRFYDLVGGILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILLG 242
Query: 242 AQFCSDSFSSVSFDAVNQGTTYSLYARIESIGPMEIHEKTNGVQMIQIILVDNDGFKLKF 301
AQFCSDSFSSVS DAVN+GT YSLYARIESIGP+EIHEKTNG+QMIQI L+DNDGFKLKF
Sbjct: 243 AQFCSDSFSSVSLDAVNKGTAYSLYARIESIGPIEIHEKTNGLQMIQISLLDNDGFKLKF 302
Query: 302 LLWGEQVMLANLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEE 361
LLWGEQV+LANLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEE
Sbjct: 303 LLWGEQVILANLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEE 362
Query: 362 QVCVLTQNINQASRMLSTSNPTQGPRVSQVSLPCDSHGTIDFGNYPFRSFVIDLQDKMTG 421
QVCVLTQNI+QASR L TS PTQ PRVSQVSLPCDSHGTIDFGNYPFRSFV+DLQDKMTG
Sbjct: 363 QVCVLTQNISQASRTLGTSYPTQDPRVSQVSLPCDSHGTIDFGNYPFRSFVVDLQDKMTG 422
Query: 422 ISLYGIILDIVNERNTTEAVFSMKIEDNTGEISAKLHFVGSWSLGRVGVGHTVYISGLTC 481
ISLYGII+DIVNERNTTEAVFSM+IEDNTG+ISAKLHFV SWSLGRVGVGHTVYISGLTC
Sbjct: 423 ISLYGIIIDIVNERNTTEAVFSMRIEDNTGQISAKLHFVKSWSLGRVGVGHTVYISGLTC 482
Query: 482 TMNKNRFEALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSNAHGTKVCQVQLDQ 541
T+ KN EALWIENHVGASFVNLSCLPALLTSSCLHK+SRLSDLTSN+HGTKVC+V+LDQ
Sbjct: 483 TIKKNNLEALWIENHVGASFVNLSCLPALLTSSCLHKISRLSDLTSNSHGTKVCRVRLDQ 542
Query: 542 VSHCHVSTKFLHAICGHFVEETPGRIECSFCRCECESELVRTFDLKITLADDSAKIFAWC 601
VSHCHVSTKFLHA CGHFVEETP RIEC FCRCEC SELVRTFDLKITLADD+AKIFAWC
Sbjct: 543 VSHCHVSTKFLHANCGHFVEETPRRIECCFCRCECMSELVRTFDLKITLADDTAKIFAWC 602
Query: 602 TGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTSRCGDNVYSVHDP 661
TGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCR QTS+ GDNVYSV+DP
Sbjct: 603 TGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRWQTSKSGDNVYSVNDP 662
Query: 662 LSWEITRALKCD 674
LSWEITRALKC+
Sbjct: 663 LSWEITRALKCE 674
BLAST of Tan0020038 vs. NCBI nr
Match:
XP_022972298.1 (uncharacterized protein LOC111470879 isoform X1 [Cucurbita maxima] >XP_022972299.1 uncharacterized protein LOC111470879 isoform X2 [Cucurbita maxima] >XP_022972300.1 uncharacterized protein LOC111470879 isoform X3 [Cucurbita maxima])
HSP 1 Score: 1244.6 bits (3219), Expect = 0.0e+00
Identity = 612/672 (91.07%), Postives = 644/672 (95.83%), Query Frame = 0
Query: 2 SSRGRNFNSDDAGGNSAMELDGGRQLQEEEDDDPFLKFVDYARSVLAFEDEEDFDPNVNG 61
SSR R F S +AGG SAMEL+ R+LQEEEDDDPFLKF+DYARSVLAFEDEEDFDPNV G
Sbjct: 3 SSRDRYFKSYEAGGKSAMELNDRRRLQEEEDDDPFLKFIDYARSVLAFEDEEDFDPNVKG 62
Query: 62 TETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKVPECINQLKK 121
T+T TPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAW EQHR+GAPKK+PECINQLKK
Sbjct: 63 TKTYTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWSEQHRIGAPKKIPECINQLKK 122
Query: 122 KNRRKKLPKTVTIDSIYAKNFLALSSVLEAVIVDAFILPGTNIHMLTLGDFWSSNTIDLY 181
KNRRKKLPKTVTIDSIY KNFL+LSSVLEAVIV+ FILPGTNIHMLTLGDFWSSNTIDLY
Sbjct: 123 KNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVIVEEFILPGTNIHMLTLGDFWSSNTIDLY 182
Query: 182 LHRRFYDLVDGILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILLG 241
LH RFYDLV GILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILLG
Sbjct: 183 LHCRFYDLVGGILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILLG 242
Query: 242 AQFCSDSFSSVSFDAVNQGTTYSLYARIESIGPMEIHEKTNGVQMIQIILVDNDGFKLKF 301
AQFCSDSFSSVS DAV++GTTYSLYARIESIGP EIHEKTNG+QMIQI+L+DNDGFKLKF
Sbjct: 243 AQFCSDSFSSVSLDAVDKGTTYSLYARIESIGPKEIHEKTNGLQMIQILLLDNDGFKLKF 302
Query: 302 LLWGEQVMLANLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEE 361
LLWGEQV+LANLLSVGSLLALDRPYIATVNENGIG+SDELCLEYGSATQLYLVPCIQHEE
Sbjct: 303 LLWGEQVILANLLSVGSLLALDRPYIATVNENGIGSSDELCLEYGSATQLYLVPCIQHEE 362
Query: 362 QVCVLTQNINQASRMLSTSNPTQGPRVSQVSLPCDSHGTIDFGNYPFRSFVIDLQDKMTG 421
QVCVLTQNINQASR L TS PTQ PRVSQVSLPCDSHGTIDFGNYPFRSFV+DLQDKMTG
Sbjct: 363 QVCVLTQNINQASRTLGTSYPTQDPRVSQVSLPCDSHGTIDFGNYPFRSFVVDLQDKMTG 422
Query: 422 ISLYGIILDIVNERNTTEAVFSMKIEDNTGEISAKLHFVGSWSLGRVGVGHTVYISGLTC 481
ISLYGI++DIVNERNTTEAVFSM+IEDNTG+ISAKLHFV SWSLGRVGVGHTVYISGLTC
Sbjct: 423 ISLYGIVIDIVNERNTTEAVFSMRIEDNTGQISAKLHFVRSWSLGRVGVGHTVYISGLTC 482
Query: 482 TMNKNRFEALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSNAHGTKVCQVQLDQ 541
T+ KN EALWIENHVGASFVNLSCLPALLTSSCLHK+SRLSDLTSN+HGTKVC+ +LDQ
Sbjct: 483 TIKKNNLEALWIENHVGASFVNLSCLPALLTSSCLHKISRLSDLTSNSHGTKVCRARLDQ 542
Query: 542 VSHCHVSTKFLHAICGHFVEETPGRIECSFCRCECESELVRTFDLKITLADDSAKIFAWC 601
VSHCHVSTKFLHA CGHFVEETPGRIEC FCR EC+SELVRTFDLKITLADD+AKIFAWC
Sbjct: 543 VSHCHVSTKFLHANCGHFVEETPGRIECCFCRSECKSELVRTFDLKITLADDTAKIFAWC 602
Query: 602 TGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTSRCGDNVYSVHDP 661
TGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTS+CG+NVYSV+DP
Sbjct: 603 TGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTSKCGNNVYSVNDP 662
Query: 662 LSWEITRALKCD 674
LSWEITRALKC+
Sbjct: 663 LSWEITRALKCE 674
BLAST of Tan0020038 vs. NCBI nr
Match:
XP_004134503.1 (uncharacterized protein LOC101215087 [Cucumis sativus] >KGN57165.1 hypothetical protein Csa_009918 [Cucumis sativus])
HSP 1 Score: 1223.4 bits (3164), Expect = 0.0e+00
Identity = 598/673 (88.86%), Postives = 639/673 (94.95%), Query Frame = 0
Query: 1 MSSRGRNFNSDDAGGNSAMELDGGRQLQEEEDDDPFLKFVDYARSVLAFEDEEDFDPNVN 60
MSS ++FNS DA NSAMELD ++LQEE DDDPFLKFVDYARSVLAFED+EDFDPN+N
Sbjct: 1 MSSHSKHFNSHDAARNSAMELDDPQKLQEEGDDDPFLKFVDYARSVLAFEDDEDFDPNIN 60
Query: 61 GTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKVPECINQLK 120
GTET+TPGW+WIASRVLRTC+AYSSSVTPAILLSELSQAWYEQHRVGAPKK+PECINQLK
Sbjct: 61 GTETHTPGWTWIASRVLRTCMAYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQLK 120
Query: 121 KKNRRKKLPKTVTIDSIYAKNFLALSSVLEAVIVDAFILPGTNIHMLTLGDFWSSNTIDL 180
KKNRRKKLPKTVTIDSIY KNFLALSSVLEAVI+D FILPGTNIHMLTLGDFWSSNTIDL
Sbjct: 121 KKNRRKKLPKTVTIDSIYEKNFLALSSVLEAVILDEFILPGTNIHMLTLGDFWSSNTIDL 180
Query: 181 YLHRRFYDLVDGILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILL 240
YLHRRFYDLV+GILKKGRQIF+TGCYLRAASGGSG+PRLLPTEYLIILLDEEEDDDV+LL
Sbjct: 181 YLHRRFYDLVNGILKKGRQIFVTGCYLRAASGGSGYPRLLPTEYLIILLDEEEDDDVMLL 240
Query: 241 GAQFCSDSFSSVSFDAVNQGTTYSLYARIESIGPMEIHEKTNGVQMIQIILVDNDGFKLK 300
GAQFCSD+FSSVS D+VN+GTTYSLYARIESIGP+EIHE NG++MIQIILVDNDGFKLK
Sbjct: 241 GAQFCSDTFSSVSLDSVNEGTTYSLYARIESIGPLEIHEMMNGLRMIQIILVDNDGFKLK 300
Query: 301 FLLWGEQVMLANLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHE 360
FLLWGEQV+LANLLSVGS+LALDRPY+ATVNENG+GTSDELCLEYGSATQLYLVPCIQHE
Sbjct: 301 FLLWGEQVLLANLLSVGSVLALDRPYVATVNENGVGTSDELCLEYGSATQLYLVPCIQHE 360
Query: 361 EQVCVLTQNINQASRMLSTSNPTQGPRVSQVSLPCDSHGTIDFGNYPFRSFVIDLQDKMT 420
EQVCVLTQNINQASR +S S PTQ P+VSQVSLPCDSHG IDFGNYPFRSFVIDLQDKMT
Sbjct: 361 EQVCVLTQNINQASRTVSMSYPTQSPQVSQVSLPCDSHGAIDFGNYPFRSFVIDLQDKMT 420
Query: 421 GISLYGIILDIVNERNTTEAVFSMKIEDNTGEISAKLHFVGSWSLGRVGVGHTVYISGLT 480
GISLYG +LDI NERNTTEA FSM+IEDNTGE+ AKL FV SWSLGRV VGHTV+ISGLT
Sbjct: 421 GISLYGNVLDIANERNTTEAGFSMRIEDNTGEVLAKLRFVRSWSLGRVSVGHTVFISGLT 480
Query: 481 CTMNKNRFEALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSNAHGTKVCQVQLD 540
CT NKNR EALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSN HGTKVCQV+LD
Sbjct: 481 CTKNKNRLEALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSNTHGTKVCQVRLD 540
Query: 541 QVSHCHVSTKFLHAICGHFVEETPGRIECSFCRCECESELVRTFDLKITLADDSAKIFAW 600
QVSHCHVSTKFLHAICGHFVEETP RIECSFCRCEC+SEL+RTFDLKITLADDSAKIFAW
Sbjct: 541 QVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELMRTFDLKITLADDSAKIFAW 600
Query: 601 CTGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTSRCGDNVYSVHD 660
CTGQTAAELLQISPDEFCELPEEEQVMYPSSLENE+FVVAIVNCRR++S G+N+ +D
Sbjct: 601 CTGQTAAELLQISPDEFCELPEEEQVMYPSSLENENFVVAIVNCRRRSSTYGNNLNFAND 660
Query: 661 PLSWEITRALKCD 674
PLSWEITRALKC+
Sbjct: 661 PLSWEITRALKCE 673
BLAST of Tan0020038 vs. ExPASy TrEMBL
Match:
A0A6J1FDS0 (uncharacterized protein LOC111444466 OS=Cucurbita moschata OX=3662 GN=LOC111444466 PE=4 SV=1)
HSP 1 Score: 1251.1 bits (3236), Expect = 0.0e+00
Identity = 617/672 (91.82%), Postives = 645/672 (95.98%), Query Frame = 0
Query: 2 SSRGRNFNSDDAGGNSAMELDGGRQLQEEEDDDPFLKFVDYARSVLAFEDEEDFDPNVNG 61
SSRGR+F SD+AGGNSAMEL+ R+LQEEEDDDPFLKF+DYARSVLAFEDEEDFDPNV G
Sbjct: 3 SSRGRHFKSDEAGGNSAMELNDRRRLQEEEDDDPFLKFIDYARSVLAFEDEEDFDPNVKG 62
Query: 62 TETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKVPECINQLKK 121
TET TPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAW EQHR+GAPKK+PECINQLKK
Sbjct: 63 TETYTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWSEQHRIGAPKKIPECINQLKK 122
Query: 122 KNRRKKLPKTVTIDSIYAKNFLALSSVLEAVIVDAFILPGTNIHMLTLGDFWSSNTIDLY 181
KNRRKKLPKTVTIDSIY KNFL+LSSVLEAVIV+ FILPGTNIHMLTLGDFWSSNTIDLY
Sbjct: 123 KNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVIVEEFILPGTNIHMLTLGDFWSSNTIDLY 182
Query: 182 LHRRFYDLVDGILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILLG 241
LH RFYDLV GILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILLG
Sbjct: 183 LHCRFYDLVGGILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILLG 242
Query: 242 AQFCSDSFSSVSFDAVNQGTTYSLYARIESIGPMEIHEKTNGVQMIQIILVDNDGFKLKF 301
AQFCSDSFSSVS DAVN+GT YSLYARIESIGP+EIHEKTNG+QMIQI L+DNDGFKLKF
Sbjct: 243 AQFCSDSFSSVSLDAVNKGTAYSLYARIESIGPIEIHEKTNGLQMIQISLLDNDGFKLKF 302
Query: 302 LLWGEQVMLANLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEE 361
LLWGEQV+LANLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEE
Sbjct: 303 LLWGEQVILANLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEE 362
Query: 362 QVCVLTQNINQASRMLSTSNPTQGPRVSQVSLPCDSHGTIDFGNYPFRSFVIDLQDKMTG 421
QVCVLTQNI+QASR L TS PTQ PRVSQVSLPCDSHGTIDFGNYPFRSFV+DLQDKMTG
Sbjct: 363 QVCVLTQNISQASRTLGTSYPTQDPRVSQVSLPCDSHGTIDFGNYPFRSFVVDLQDKMTG 422
Query: 422 ISLYGIILDIVNERNTTEAVFSMKIEDNTGEISAKLHFVGSWSLGRVGVGHTVYISGLTC 481
ISLYGII+DIVNERNTTEAVFSM+IEDNTG+ISAKLHFV SWSLGRVGVGHTVYISGLTC
Sbjct: 423 ISLYGIIIDIVNERNTTEAVFSMRIEDNTGQISAKLHFVKSWSLGRVGVGHTVYISGLTC 482
Query: 482 TMNKNRFEALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSNAHGTKVCQVQLDQ 541
T+ KN EALWIENHVGASFVNLSCLPALLTSSCLHK+SRLSDLTSN+HGTKVC+V+LDQ
Sbjct: 483 TIKKNNLEALWIENHVGASFVNLSCLPALLTSSCLHKISRLSDLTSNSHGTKVCRVRLDQ 542
Query: 542 VSHCHVSTKFLHAICGHFVEETPGRIECSFCRCECESELVRTFDLKITLADDSAKIFAWC 601
VSHCHVSTKFLHA CGHFVEETP RIEC FCRCEC SELVRTFDLKITLADD+AKIFAWC
Sbjct: 543 VSHCHVSTKFLHANCGHFVEETPRRIECCFCRCECMSELVRTFDLKITLADDTAKIFAWC 602
Query: 602 TGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTSRCGDNVYSVHDP 661
TGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCR QTS+ GDNVYSV+DP
Sbjct: 603 TGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRWQTSKSGDNVYSVNDP 662
Query: 662 LSWEITRALKCD 674
LSWEITRALKC+
Sbjct: 663 LSWEITRALKCE 674
BLAST of Tan0020038 vs. ExPASy TrEMBL
Match:
A0A6J1IB36 (uncharacterized protein LOC111470879 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111470879 PE=4 SV=1)
HSP 1 Score: 1244.6 bits (3219), Expect = 0.0e+00
Identity = 612/672 (91.07%), Postives = 644/672 (95.83%), Query Frame = 0
Query: 2 SSRGRNFNSDDAGGNSAMELDGGRQLQEEEDDDPFLKFVDYARSVLAFEDEEDFDPNVNG 61
SSR R F S +AGG SAMEL+ R+LQEEEDDDPFLKF+DYARSVLAFEDEEDFDPNV G
Sbjct: 3 SSRDRYFKSYEAGGKSAMELNDRRRLQEEEDDDPFLKFIDYARSVLAFEDEEDFDPNVKG 62
Query: 62 TETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKVPECINQLKK 121
T+T TPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAW EQHR+GAPKK+PECINQLKK
Sbjct: 63 TKTYTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWSEQHRIGAPKKIPECINQLKK 122
Query: 122 KNRRKKLPKTVTIDSIYAKNFLALSSVLEAVIVDAFILPGTNIHMLTLGDFWSSNTIDLY 181
KNRRKKLPKTVTIDSIY KNFL+LSSVLEAVIV+ FILPGTNIHMLTLGDFWSSNTIDLY
Sbjct: 123 KNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVIVEEFILPGTNIHMLTLGDFWSSNTIDLY 182
Query: 182 LHRRFYDLVDGILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILLG 241
LH RFYDLV GILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILLG
Sbjct: 183 LHCRFYDLVGGILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILLG 242
Query: 242 AQFCSDSFSSVSFDAVNQGTTYSLYARIESIGPMEIHEKTNGVQMIQIILVDNDGFKLKF 301
AQFCSDSFSSVS DAV++GTTYSLYARIESIGP EIHEKTNG+QMIQI+L+DNDGFKLKF
Sbjct: 243 AQFCSDSFSSVSLDAVDKGTTYSLYARIESIGPKEIHEKTNGLQMIQILLLDNDGFKLKF 302
Query: 302 LLWGEQVMLANLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEE 361
LLWGEQV+LANLLSVGSLLALDRPYIATVNENGIG+SDELCLEYGSATQLYLVPCIQHEE
Sbjct: 303 LLWGEQVILANLLSVGSLLALDRPYIATVNENGIGSSDELCLEYGSATQLYLVPCIQHEE 362
Query: 362 QVCVLTQNINQASRMLSTSNPTQGPRVSQVSLPCDSHGTIDFGNYPFRSFVIDLQDKMTG 421
QVCVLTQNINQASR L TS PTQ PRVSQVSLPCDSHGTIDFGNYPFRSFV+DLQDKMTG
Sbjct: 363 QVCVLTQNINQASRTLGTSYPTQDPRVSQVSLPCDSHGTIDFGNYPFRSFVVDLQDKMTG 422
Query: 422 ISLYGIILDIVNERNTTEAVFSMKIEDNTGEISAKLHFVGSWSLGRVGVGHTVYISGLTC 481
ISLYGI++DIVNERNTTEAVFSM+IEDNTG+ISAKLHFV SWSLGRVGVGHTVYISGLTC
Sbjct: 423 ISLYGIVIDIVNERNTTEAVFSMRIEDNTGQISAKLHFVRSWSLGRVGVGHTVYISGLTC 482
Query: 482 TMNKNRFEALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSNAHGTKVCQVQLDQ 541
T+ KN EALWIENHVGASFVNLSCLPALLTSSCLHK+SRLSDLTSN+HGTKVC+ +LDQ
Sbjct: 483 TIKKNNLEALWIENHVGASFVNLSCLPALLTSSCLHKISRLSDLTSNSHGTKVCRARLDQ 542
Query: 542 VSHCHVSTKFLHAICGHFVEETPGRIECSFCRCECESELVRTFDLKITLADDSAKIFAWC 601
VSHCHVSTKFLHA CGHFVEETPGRIEC FCR EC+SELVRTFDLKITLADD+AKIFAWC
Sbjct: 543 VSHCHVSTKFLHANCGHFVEETPGRIECCFCRSECKSELVRTFDLKITLADDTAKIFAWC 602
Query: 602 TGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTSRCGDNVYSVHDP 661
TGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTS+CG+NVYSV+DP
Sbjct: 603 TGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTSKCGNNVYSVNDP 662
Query: 662 LSWEITRALKCD 674
LSWEITRALKC+
Sbjct: 663 LSWEITRALKCE 674
BLAST of Tan0020038 vs. ExPASy TrEMBL
Match:
A0A0A0L5D2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G166300 PE=4 SV=1)
HSP 1 Score: 1223.4 bits (3164), Expect = 0.0e+00
Identity = 598/673 (88.86%), Postives = 639/673 (94.95%), Query Frame = 0
Query: 1 MSSRGRNFNSDDAGGNSAMELDGGRQLQEEEDDDPFLKFVDYARSVLAFEDEEDFDPNVN 60
MSS ++FNS DA NSAMELD ++LQEE DDDPFLKFVDYARSVLAFED+EDFDPN+N
Sbjct: 1 MSSHSKHFNSHDAARNSAMELDDPQKLQEEGDDDPFLKFVDYARSVLAFEDDEDFDPNIN 60
Query: 61 GTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKVPECINQLK 120
GTET+TPGW+WIASRVLRTC+AYSSSVTPAILLSELSQAWYEQHRVGAPKK+PECINQLK
Sbjct: 61 GTETHTPGWTWIASRVLRTCMAYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQLK 120
Query: 121 KKNRRKKLPKTVTIDSIYAKNFLALSSVLEAVIVDAFILPGTNIHMLTLGDFWSSNTIDL 180
KKNRRKKLPKTVTIDSIY KNFLALSSVLEAVI+D FILPGTNIHMLTLGDFWSSNTIDL
Sbjct: 121 KKNRRKKLPKTVTIDSIYEKNFLALSSVLEAVILDEFILPGTNIHMLTLGDFWSSNTIDL 180
Query: 181 YLHRRFYDLVDGILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILL 240
YLHRRFYDLV+GILKKGRQIF+TGCYLRAASGGSG+PRLLPTEYLIILLDEEEDDDV+LL
Sbjct: 181 YLHRRFYDLVNGILKKGRQIFVTGCYLRAASGGSGYPRLLPTEYLIILLDEEEDDDVMLL 240
Query: 241 GAQFCSDSFSSVSFDAVNQGTTYSLYARIESIGPMEIHEKTNGVQMIQIILVDNDGFKLK 300
GAQFCSD+FSSVS D+VN+GTTYSLYARIESIGP+EIHE NG++MIQIILVDNDGFKLK
Sbjct: 241 GAQFCSDTFSSVSLDSVNEGTTYSLYARIESIGPLEIHEMMNGLRMIQIILVDNDGFKLK 300
Query: 301 FLLWGEQVMLANLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHE 360
FLLWGEQV+LANLLSVGS+LALDRPY+ATVNENG+GTSDELCLEYGSATQLYLVPCIQHE
Sbjct: 301 FLLWGEQVLLANLLSVGSVLALDRPYVATVNENGVGTSDELCLEYGSATQLYLVPCIQHE 360
Query: 361 EQVCVLTQNINQASRMLSTSNPTQGPRVSQVSLPCDSHGTIDFGNYPFRSFVIDLQDKMT 420
EQVCVLTQNINQASR +S S PTQ P+VSQVSLPCDSHG IDFGNYPFRSFVIDLQDKMT
Sbjct: 361 EQVCVLTQNINQASRTVSMSYPTQSPQVSQVSLPCDSHGAIDFGNYPFRSFVIDLQDKMT 420
Query: 421 GISLYGIILDIVNERNTTEAVFSMKIEDNTGEISAKLHFVGSWSLGRVGVGHTVYISGLT 480
GISLYG +LDI NERNTTEA FSM+IEDNTGE+ AKL FV SWSLGRV VGHTV+ISGLT
Sbjct: 421 GISLYGNVLDIANERNTTEAGFSMRIEDNTGEVLAKLRFVRSWSLGRVSVGHTVFISGLT 480
Query: 481 CTMNKNRFEALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSNAHGTKVCQVQLD 540
CT NKNR EALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSN HGTKVCQV+LD
Sbjct: 481 CTKNKNRLEALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSNTHGTKVCQVRLD 540
Query: 541 QVSHCHVSTKFLHAICGHFVEETPGRIECSFCRCECESELVRTFDLKITLADDSAKIFAW 600
QVSHCHVSTKFLHAICGHFVEETP RIECSFCRCEC+SEL+RTFDLKITLADDSAKIFAW
Sbjct: 541 QVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELMRTFDLKITLADDSAKIFAW 600
Query: 601 CTGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTSRCGDNVYSVHD 660
CTGQTAAELLQISPDEFCELPEEEQVMYPSSLENE+FVVAIVNCRR++S G+N+ +D
Sbjct: 601 CTGQTAAELLQISPDEFCELPEEEQVMYPSSLENENFVVAIVNCRRRSSTYGNNLNFAND 660
Query: 661 PLSWEITRALKCD 674
PLSWEITRALKC+
Sbjct: 661 PLSWEITRALKCE 673
BLAST of Tan0020038 vs. ExPASy TrEMBL
Match:
A0A5A7U7H0 (Nucleic acid-binding proteins superfamily isoform 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold171G008000 PE=4 SV=1)
HSP 1 Score: 1223.0 bits (3163), Expect = 0.0e+00
Identity = 600/673 (89.15%), Postives = 638/673 (94.80%), Query Frame = 0
Query: 1 MSSRGRNFNSDDAGGNSAMELDGGRQLQEEEDDDPFLKFVDYARSVLAFEDEEDFDPNVN 60
MSS ++FNS DAG SAMELD R+LQEE DDDPFLKFVDYARSVLAFED+EDFDPNVN
Sbjct: 1 MSSLSKHFNSHDAGRYSAMELDDPRKLQEEGDDDPFLKFVDYARSVLAFEDDEDFDPNVN 60
Query: 61 GTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKVPECINQLK 120
GTET+TPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKK+PECINQLK
Sbjct: 61 GTETDTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQLK 120
Query: 121 KKNRRKKLPKTVTIDSIYAKNFLALSSVLEAVIVDAFILPGTNIHMLTLGDFWSSNTIDL 180
KKNRRKKLPKTVTIDSIY KNFL++SSVLEAVI+D FILPGTNIHMLTLGDFWSSNTIDL
Sbjct: 121 KKNRRKKLPKTVTIDSIYEKNFLSISSVLEAVILDEFILPGTNIHMLTLGDFWSSNTIDL 180
Query: 181 YLHRRFYDLVDGILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILL 240
YLHRRFYDLVDGILKKGRQIF+TGCYLRAASGGSG+PRLLPTEYL+ILLDEEEDDDV+LL
Sbjct: 181 YLHRRFYDLVDGILKKGRQIFVTGCYLRAASGGSGYPRLLPTEYLVILLDEEEDDDVMLL 240
Query: 241 GAQFCSDSFSSVSFDAVNQGTTYSLYARIESIGPMEIHEKTNGVQMIQIILVDNDGFKLK 300
GAQFCSD+FSSVS D+VN+GTTYSLYARIESIGP+EIHEK NG++MIQIILVDNDGFKLK
Sbjct: 241 GAQFCSDTFSSVSLDSVNEGTTYSLYARIESIGPLEIHEKINGLRMIQIILVDNDGFKLK 300
Query: 301 FLLWGEQVMLANLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHE 360
FLLWGEQV+LANLLSVGS+LALDRPY+ATVNENG+GTS+ELCLEYGSATQLYLVPCIQHE
Sbjct: 301 FLLWGEQVLLANLLSVGSVLALDRPYVATVNENGVGTSEELCLEYGSATQLYLVPCIQHE 360
Query: 361 EQVCVLTQNINQASRMLSTSNPTQGPRVSQVSLPCDSHGTIDFGNYPFRSFVIDLQDKMT 420
EQVCVLTQNINQASR +S S PTQGP+VSQVSLPCDSHG IDFGNYPFRSFVIDLQDKMT
Sbjct: 361 EQVCVLTQNINQASRTVSMSYPTQGPQVSQVSLPCDSHGAIDFGNYPFRSFVIDLQDKMT 420
Query: 421 GISLYGIILDIVNERNTTEAVFSMKIEDNTGEISAKLHFVGSWSLGRVGVGHTVYISGLT 480
GISLYG +LDI NERNTTEA FSM+IEDNTGEI AKL F SWSLGRV VGHTV+ISGLT
Sbjct: 421 GISLYGNVLDIANERNTTEAGFSMRIEDNTGEILAKLRFERSWSLGRVSVGHTVFISGLT 480
Query: 481 CTMNKNRFEALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSNAHGTKVCQVQLD 540
CT NKNR EALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSN HGTKVC+V+LD
Sbjct: 481 CTKNKNRLEALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSNTHGTKVCRVRLD 540
Query: 541 QVSHCHVSTKFLHAICGHFVEETPGRIECSFCRCECESELVRTFDLKITLADDSAKIFAW 600
QVSHCHVSTKFLHAICGHFVEETP RIECSFC CEC+SELVRTFDLKITLADDSAKIFAW
Sbjct: 541 QVSHCHVSTKFLHAICGHFVEETPARIECSFCCCECKSELVRTFDLKITLADDSAKIFAW 600
Query: 601 CTGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTSRCGDNVYSVHD 660
C GQTAAELLQISPDEFCELPEEEQVMYPSSLENE+FVVAIVNCRRQ+ + G+NV +D
Sbjct: 601 CMGQTAAELLQISPDEFCELPEEEQVMYPSSLENENFVVAIVNCRRQSRKYGNNVNFAND 660
Query: 661 PLSWEITRALKCD 674
PLSWEITRALKC+
Sbjct: 661 PLSWEITRALKCE 673
BLAST of Tan0020038 vs. ExPASy TrEMBL
Match:
A0A1S3AX73 (uncharacterized protein LOC103483891 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103483891 PE=4 SV=1)
HSP 1 Score: 1221.5 bits (3159), Expect = 0.0e+00
Identity = 600/673 (89.15%), Postives = 637/673 (94.65%), Query Frame = 0
Query: 1 MSSRGRNFNSDDAGGNSAMELDGGRQLQEEEDDDPFLKFVDYARSVLAFEDEEDFDPNVN 60
MSS ++FNS DAG SAMELD R+LQEE DDDPFLKFVDYARSVLAFED+EDFDPNVN
Sbjct: 1 MSSLSKHFNSHDAGRYSAMELDDPRKLQEEGDDDPFLKFVDYARSVLAFEDDEDFDPNVN 60
Query: 61 GTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKVPECINQLK 120
GTET+TPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKK+PECINQLK
Sbjct: 61 GTETDTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQLK 120
Query: 121 KKNRRKKLPKTVTIDSIYAKNFLALSSVLEAVIVDAFILPGTNIHMLTLGDFWSSNTIDL 180
KKNRRKKLPKTVTIDSIY KNFL+LSSVLEAVI+D FILPGTNIHMLTLGDFWSSNTIDL
Sbjct: 121 KKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILPGTNIHMLTLGDFWSSNTIDL 180
Query: 181 YLHRRFYDLVDGILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILL 240
YLHRRFYDLVDGILKKGRQIF+TGCYLRAASGGSG+PRLLPTEYL+ILLDEEEDDDV+LL
Sbjct: 181 YLHRRFYDLVDGILKKGRQIFVTGCYLRAASGGSGYPRLLPTEYLVILLDEEEDDDVMLL 240
Query: 241 GAQFCSDSFSSVSFDAVNQGTTYSLYARIESIGPMEIHEKTNGVQMIQIILVDNDGFKLK 300
GAQFCSD+FSSVS D+VN+GTTYSLYARIESIGP+EIHEK NG++MIQIILVDNDGFKLK
Sbjct: 241 GAQFCSDTFSSVSLDSVNEGTTYSLYARIESIGPLEIHEKINGLRMIQIILVDNDGFKLK 300
Query: 301 FLLWGEQVMLANLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHE 360
FLLWGEQV+LA LLSVGS+LALDRPY+ATVNENG+GTS+ELCLEYGSATQLYLVPCIQHE
Sbjct: 301 FLLWGEQVLLAKLLSVGSVLALDRPYVATVNENGVGTSEELCLEYGSATQLYLVPCIQHE 360
Query: 361 EQVCVLTQNINQASRMLSTSNPTQGPRVSQVSLPCDSHGTIDFGNYPFRSFVIDLQDKMT 420
EQVCVLTQNINQASR +S S PTQGP+VSQVSLPCDSHG IDFGNYPFRSFVIDLQDKMT
Sbjct: 361 EQVCVLTQNINQASRTVSMSYPTQGPQVSQVSLPCDSHGAIDFGNYPFRSFVIDLQDKMT 420
Query: 421 GISLYGIILDIVNERNTTEAVFSMKIEDNTGEISAKLHFVGSWSLGRVGVGHTVYISGLT 480
GISLYG +LDI NERNTTEA FSM+IEDNTGEI AKL F SWSLGRV VGHTV+ISGLT
Sbjct: 421 GISLYGNVLDIANERNTTEAGFSMRIEDNTGEILAKLRFERSWSLGRVSVGHTVFISGLT 480
Query: 481 CTMNKNRFEALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSNAHGTKVCQVQLD 540
CT NKNR EALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSN HGTKVC+V+LD
Sbjct: 481 CTKNKNRLEALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSNTHGTKVCRVRLD 540
Query: 541 QVSHCHVSTKFLHAICGHFVEETPGRIECSFCRCECESELVRTFDLKITLADDSAKIFAW 600
QVSHCHVSTKFLHAICGHFVEETP RIECSFC CEC+SELVRTFDLKITLADDSAKIFAW
Sbjct: 541 QVSHCHVSTKFLHAICGHFVEETPARIECSFCCCECKSELVRTFDLKITLADDSAKIFAW 600
Query: 601 CTGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTSRCGDNVYSVHD 660
C GQTAAELLQISPDEFCELPEEEQVMYPSSLENE+FVVAIVNCRRQ+ + G+NV +D
Sbjct: 601 CMGQTAAELLQISPDEFCELPEEEQVMYPSSLENENFVVAIVNCRRQSRKYGNNVNFAND 660
Query: 661 PLSWEITRALKCD 674
PLSWEITRALKC+
Sbjct: 661 PLSWEITRALKCE 673
BLAST of Tan0020038 vs. TAIR 10
Match:
AT3G17030.1 (Nucleic acid-binding proteins superfamily )
HSP 1 Score: 750.4 bits (1936), Expect = 1.3e-216
Identity = 384/680 (56.47%), Postives = 497/680 (73.09%), Query Frame = 0
Query: 12 DAGGNSAMELDGGRQLQEEEDDDPFLKFVDYARSVLAFEDEED------FDPNVNGTETN 71
D G S +E+ +EE +DPFL F+DYAR+V++ ED+ED P TE +
Sbjct: 3 DTNGASLIEIG-----DQEEVEDPFLAFLDYARTVISPEDDEDEKEESKRGPGEAMTEAS 62
Query: 72 TPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKVPECINQLKKKNRR 131
PGW W+ASR+L+TC AYSS VT AILLS+LSQAW+EQ++ G KK PE I+QLKK +RR
Sbjct: 63 GPGWGWVASRILKTCTAYSSGVTAAILLSDLSQAWHEQNKPGMSKKKPELIDQLKKGHRR 122
Query: 132 KKLPKTVTIDSIYAKNFLALSSVLEAVIVDAFILPGTNIHMLTLGDFWSSNTIDLYLHRR 191
++L TVTIDSIY KNFL+++SVLEAVI++A +LPGTNI MLTLGDFWSSNTIDLYLHRR
Sbjct: 123 RRLANTVTIDSIYEKNFLSMNSVLEAVIINADVLPGTNIFMLTLGDFWSSNTIDLYLHRR 182
Query: 192 FYDLVD---GILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILLGA 251
+Y+LV+ GIL+KGR++ +TGCYLR A G G PRLLPTEYL++LLDE++DDD IL+ A
Sbjct: 183 YYELVETPNGILRKGREVLITGCYLRTAREGFGTPRLLPTEYLVVLLDEDQDDDAILIAA 242
Query: 252 QFCSDSFSSVSFDAVNQGTTYSLYARIESIGPMEIHEKTNGVQMIQIILVDNDGFKLKFL 311
QFCSD+FSSVS DA N G +YSLYARIESIGP+E + + QI LVD DG +LKF+
Sbjct: 243 QFCSDTFSSVSLDAFNDGASYSLYARIESIGPLESELTFSTARRRQISLVDGDGDRLKFI 302
Query: 312 LWGEQVMLANLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQ 371
LWGEQV++ANLLSVGS+L ++RPYI+++ E+ + + E CLEYGSAT LYLVP EE+
Sbjct: 303 LWGEQVIVANLLSVGSVLGIERPYISSLEESAMEGNYEFCLEYGSATHLYLVPSTLQEER 362
Query: 372 VCV-LTQNINQASRMLSTSNPTQGPRVSQVSLPCDSHGTIDFGNYPFRSFVIDLQDKMTG 431
VCV L+Q+ Q S++L + VSQV+LP D+ G++DF NYPFR+ + D++DK TG
Sbjct: 363 VCVALSQHQCQGSKLLGSVG------VSQVTLPRDADGSVDFSNYPFRTMITDIRDKTTG 422
Query: 432 ISLYGIILDIVNERNTTEAVFSMKIEDNTGEISAKLHFVGSWSLGRVGVGHTVYISGLTC 491
ISLYG++ DI + N T VFS+KIED TG I AKLHF WSLGR+G+GH VY+SGL+C
Sbjct: 423 ISLYGVVTDISCDPNATGVVFSLKIEDTTGAIWAKLHFTNYWSLGRLGLGHVVYVSGLSC 482
Query: 492 TMNK-NRFEALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSNAH-GTKVCQVQL 551
+ K N E LW E A+FVNLSCLPA LTSSC+H +S LS ++ +C+V+L
Sbjct: 483 KITKENCIEMLWHEKDEKATFVNLSCLPAFLTSSCIHLISTLSQISKQRKPAINICRVKL 542
Query: 552 DQVSHCH-VSTKFLHAICGHFVEETP-----GRIECSFCRCECES--ELVRTFDLKITLA 611
D++ CH ++T+ H++CGHF++E + CSFCR C S E+VRTF + ITLA
Sbjct: 543 DEIDQCHNINTRLAHSLCGHFIDEESSSSYGANLHCSFCRVSCNSNTEVVRTFHITITLA 602
Query: 612 DDSAKIFAWCTGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTSRC 671
D+ K++AWCTGQ+A+ +LQISPDEFC+LPE++Q+MYPSSLENE F+V + N +
Sbjct: 603 DEETKLYAWCTGQSASAILQISPDEFCDLPEDDQLMYPSSLENEWFLVILANSGSRNLGS 662
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
XP_023538883.1 | 0.0e+00 | 92.26 | uncharacterized protein LOC111799677 [Cucurbita pepo subsp. pepo] | [more] |
KAG7029015.1 | 0.0e+00 | 91.96 | hypothetical protein SDJN02_10198 [Cucurbita argyrosperma subsp. argyrosperma] | [more] |
XP_022938337.1 | 0.0e+00 | 91.82 | uncharacterized protein LOC111444466 [Cucurbita moschata] | [more] |
XP_022972298.1 | 0.0e+00 | 91.07 | uncharacterized protein LOC111470879 isoform X1 [Cucurbita maxima] >XP_022972299... | [more] |
XP_004134503.1 | 0.0e+00 | 88.86 | uncharacterized protein LOC101215087 [Cucumis sativus] >KGN57165.1 hypothetical ... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1FDS0 | 0.0e+00 | 91.82 | uncharacterized protein LOC111444466 OS=Cucurbita moschata OX=3662 GN=LOC1114444... | [more] |
A0A6J1IB36 | 0.0e+00 | 91.07 | uncharacterized protein LOC111470879 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A0A0L5D2 | 0.0e+00 | 88.86 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G166300 PE=4 SV=1 | [more] |
A0A5A7U7H0 | 0.0e+00 | 89.15 | Nucleic acid-binding proteins superfamily isoform 1 OS=Cucumis melo var. makuwa ... | [more] |
A0A1S3AX73 | 0.0e+00 | 89.15 | uncharacterized protein LOC103483891 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
Match Name | E-value | Identity | Description | |
AT3G17030.1 | 1.3e-216 | 56.47 | Nucleic acid-binding proteins superfamily | [more] |