Tan0020038 (gene) Snake gourd v1

Overview
Name: Tan0020038
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Nucleic acid-binding proteins superfamily isoform 1
Location: LG10: 9935506 .. 9944369 (-)
RNA-Seq Expression: Tan0020038
Synteny: Tan0020038
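
The Location field packs the linkage group, the start and end coordinates, and the strand into one string. As a minimal sketch (the function name `parse_location` is hypothetical, and the coordinates are assumed to be 1-based and inclusive, which this page does not state), the field can be split and the gene span computed like this:

```python
import re


def parse_location(text):
    """Split a location string such as 'LG10: 9935506 .. 9944369 (-)'.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }


loc = parse_location("LG10: 9935506 .. 9944369 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the span is 8864 bp, and the `(-)` marker indicates that the gene lies on the reverse strand of LG10.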
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

AAAATATTACATGTATTGTATGCCTTAGAAATCCGCGCCAATTTGCCAATCTTCTTCCCGCTGAAAAATGAACCTCAAATTTAGGGCACAGAAATGGCTTAATCCCGGTCTGAACTTGAATTGAACTCGGTGATTCTCAAGAATGTCTTCTCGGGGCCGAAATTTCAACTCGGACGACGCCGGCGGAAACTCGGCCATGGAGTTAGATGGTGGCCGACAGCTTCAGGAAGAAGAAGATGATGATCCGTTTCTTAAATTTGTCGATTACGCGAGGTCTGTGCTAGCATTTGAAGACGAAGAAGACTTCGACCCTAATGTTAATGGAACGGAGACCAATACGCCGGGTTGGAGTTGGATCGCCTCTCGGGTCCTCAGAACTTGTATCGCCTACTCGAGTTCTGTTACCCCTGCGATTTTGCTATCTGAGCTCTCGCAGGTACTGTGAATTCAAACTTCATTAAGGTTTCTTCTTTCGATTTTAGATAAGCTTGGAGCAATCCAATGGAAATTTTTTATTAATTGCCTGATTTTCCAGTCTAAGGTAACTGCAAAGTGAGAACTTCCTGAATTGTTGATGTTTCACAATTAGACTAAAGAGTTCACCGTAATTGATTACACGAGTTTAATGGAATCTCTTAGTTACAAGTTAAATTTGTACCAATAGGCCATCTGAGTTCCTTTTTATTTTATACACCCACTTATTACCAAGTGATCGTAGTGAGGAGAGGGTGAAACAGGAGCCCATGTTTTATTCTAAATGAGGGTTGAATACTCAGTACTCATAGCTTGCTTCAAGGCTCTTGAGGTTCATGTCTCCTTAAAACTGTTTAAACCAATAGTCCTATTGGTTTTTGAAGTACTAGGCACCATCGTGTGGGCCAACTTTGAGCGATCTAGATTGCACCCATGTTCCTTTCCAGTTCAGCATCTCATCAGCTGAGAGGCTATTTCAACTGTCAATTATGCAATTATCCAAGTGAGAAATAAGCGGTTGCCTCCCAAGCCTCATATGTTCGATTGGGAATAAGAAATCCACAGGTGATGAAGGCATTCGTTTCTTCTCTAGGAGGCAAATACATTCTGACCTTTCCTTATCTCCCCTATCAAGTGTCCCTATAGTTTGTAACTCATTAGCATAAGAAGAAAACATTTTTGCCAAAGCATAAAATTGTTTACACCGATGTCACCTGATTCACCAGGTTTCCTTTTGATGATTAAAGTCTGGTATCGAGAGATGTTTTGGGATGAAAGTGATTTAGTTGAAATAAAACATGGTGGATAATGTGGCATCTTGGGGAGAGTTAATGCTGAATGGTTGGTAAGACCAGACAAAGTTTTATCTATACCCTTGAACAACTACATCTAACCCTTATGTTAAGGGGTCAAAGGATTGCACCTGATTTAGATGGAGAAGACAAAGCACAGTAGTTGCTAGATATTGTACCACTTCAGTAAGTCTGAAAAGGACAATAACTAAATGTTTTATATAAACAATGACTGGTTCCACTTCTGATGTATCTTATTAAAAAAAAACGGTTCCACTTCTAGTTACCGACTAAGATAATATAAATTAAACTTTCACAGTTCTATTCCACAAGACAGTCATTTGGCTAGTGATTTTGTTTTGAAGGCCTGGTATGAGCAACACAGAGTTGGGGCTCCCAAGAAAGTACCTGAATGTATTAATCAGTTGAAGAAGAAAAATAGGAGAAAGAAGCTCCCAAAAACAGTTACTATTGACTCCATATATGCGAAGAATTTCCTAGCTTTAAGTAGTGTATTGGAAGCTGTCATTGTTGATGCATTTATTCTTCCAGGTACTTCTATTTCCAATTTTAAGCCGATGATTGTAATTTTAGCAATGTTCTTGATGTTCCCAATTAGTCATACATGATGCATGAACCTAGTATATTTGTAGCTTGAGGAAGGGTTGAAGAAATTGAAACATAAAGATTGTACTTGTTCAATTACTCAGTAAGATATATACCTTTTTTCAGAGA
ACTGATAACTTCTTTTATTTTTATATTTTAGATCTACTTAACTTCAGGTGTTGTTTCCCTAGTTCAACTGTTTAGATGTATTACGGACAAGGATCTGAATCTTTTATCTGCACGATTTTCTTAAGGTACAAATATACACATGCTTACTTTAGGGGATTTTTGGAGCTCTAATACAATCGATCTTTATCTCCATCGTAGGTAAACCATTTACTATCTGATGCTACCTTTATGATCGTTCATGCAATAAAACCATTTGGAATGAACGTGTTCTGACTGGCCATTTGTTTTCTTAATTAGACGATCCTTACAGTAGTATTACTGCACTGCACTTTTTGTGACACTAATTTGAAAAAACCAGCTGCACTGCACTGCACTTTTTGTGACGTATTACTGCATTGCACTTTCTTCGTTGCACTGCACTTTTTGTAGTTTTCTTTCACTTCCTTTTGGGAGTTTATTTCCCTTAAACTTTTTTGTATCTTTTCATTATTTCAACGAGAAGTTCTTATCTTGTTCAAAAGTGAAGAATGATAGTATACAGTGGGCCAGCTAAAAGCGAGAACACAAGAAAAAGTTTACAAAGAAGAACCCCCCCCAGCTGCTGTTGATAAATAATAAAGTGTAATTGCAAAGAAGTTTTTTAAGTATGAACCGAAAAAGAAGCGTTAACATGCACAAAAGCCCACACCTCCTCCTACAAGCTATCTAACACCTGAAAGTTATGGGTTTCTTTCCACCCAAAGATGCCATAGAGTGATGAAAAAGCTAGCTTGCCAAAGTGTATGACCTTTCTCCTCAAACAATGAATTCAAAATGACTTCCACCAGTACTGAGACATCATTACCTCTCTACGATCTGATTTGGAAAGGCCACTATCTAAAGAAGGTGAAGTTTTTCCTAGGGAGCAATCTCACAAGGCAATCAACACACATGAGAAGCTCCAGAGAAGACTGCCCTACATGGCTCTATCTCCACATTGGCGTACAATTTGCAAACAGCACTCTTAATCACAGAGTCACCTCTTCTTCTCTTGTGAATTTGCTAAGACCTTTCGGAGCTTTCTCCTCCCGACATTCAATTGGCACACAGTTCTCCCAGAAGACCCTTATCTTTTCCTCTCTCACAACCTTGTTGGACACACTTTCATAAAAGCCAAGAAGATTCCGTGGGAGAATTTTGTTAGAGCATTCTGCTGGAACATTTGGCAAGAAAGAAAAAGAAGAATCTTTCAATCAAAGGAATTCTTCTACTATCGATTTTTTTTTGAGTTCCCTACTATAGATTTTTTGAGGCTATAGTTGATCGAAGAAAATTCCTAATCTAAGTTTTTCATTTAAAGCCAAGGCTTTTTTTATTGATAACCTCTTCCAAGAATTCCTAGTAACATGATCAAAGGCTTTTTTGAAATCTATTTTAAAAGAACAACTTCCTTCTTCCTAACTCAGTATTATTCCAAAGCCCCATTGCAATGATTTATTCTTTTTATTGCCTACTATCTCTCAATATGTCACTGAATGCCTACATTTTTTTTTTTTGAAAAGCAATAGAGAAGGCCTGGATTTGAATGCTGGGGTTTTTTAGAACTAAGGATACAGTTGTACACCCTGTAGAATTGGTCTTGCTCAATTAAGATTGTGAACTCAAGGACACGGTTGCTCGATATGAATGCTGGATTTTGTATTGAATGTTGACATTTAAGCAGTGTTTATTAAAGACTCAGATGTATATGCCCAAATCTTTTTTTTTTCTTTCTTCTTCTTCTTCTTCTTCTTTTTTTATTTTATTTTATTTTTTTTTATGCCTAAACATCGAATTCCGTCTCTGTCTTTTCATCTTGATTTGAGGTTGTGTTACAACTCTATAAGCTGCATAGAACTGGCAATCTTCATGTTTCATGGAAGTACAAGTAAAACAGAATCTTTGATGGCAGATTCTATGACTTAGTGGATGGGATTCTGAAGAAAGGGAGGCAAATATTTTTAACTGGATGCTATCTTCGT
GCTGCCAGCGGTGGATCTGGTCATCCACGACTTCTACCAACTGAATACCTTATCATATTATTAGATGAGGTTGTCAAAAATGGTTTTTCAAACTACCAGAATCAATATTACATTTATTTTGCTTATACGCAGTTAGTATCTTCTGTTTATTGTAGGAAGAGGACGATGATGTAATACTTCTAGGAGCTCAATTTTGTTCCGATTCCTTTTCTTCTGTTTCTTTTGATGCCGTCAATCAAGGGACTACATATTCATTATATGCAAGGCAAGATATAACTGTTGTTTGTTGGCCTATTTTTCTCCAGTTTTTTGCAATCTCTATCTTAACTGGTTTCTTAACCTGCAATGAGCTGAATGTAGGATTTTGGTCTTCTTTTGTTTTCCCCTCTTCCTAAAATGGTCATTGCTTAAACTCTAGAAAGGCTGAATCAACCAAATAAATCGATTTTCACGAATGATGGATTTTGGATGTCCAATGAGCTAACTATGGGTTGCCTTATCTTCAACATTTGCAGGATTGAGTCTATTGGTCCAATGGAAATTCATGAAAAGACTAATGGCGTACAGATGATACAAATCATTCTTGTTGATAATGATGGTTTCAAGTTAAAGTTTCTCTTATGGGGCGAACAGGTGATGCTAGCCAATCTTTTAAGGTGAATCTTTTAAGTTGATCTTAATTATTCAAGAATACTCTTATATGATTATATCAGCATAACAATTGAATTATCATTTCAACAGTGTTGGTAGCTTGCTTGCACTTGATAGACCATATATTGCAACTGTAAACGAGAATGGCATTGGAACAAGTGATGAACTTTGTCTTGAATATGGTAGTGCAACACAGTTATATTTGGTGCCTTGCATTCAGCATGAGGAGCAAGTGTGTTGTAGATTCCCACCTCCCCCACACCAAATGCACACACGCAGGCACAAACTTTTAGTTTTCTGTTTCTTGGATCCCATCAATATGTAAATATTCTACTACACAGGTATGTGTTTTAACACAGAATATAAACCAAGCTTCAAGGATGCTTAGTACGTCAAATCCTACTCAGGGTCCCCGAGTTTCTCAAGTTTCCTTGCCCTGCGATTCACATGGGACAATTGATTTTGGTAATTATCCTTTTCGGGTGAGTATAACTTCTAGATTATCATAATATGTGTGTATACATATGTTTGGATAGGAAACAGTTTTTATTGATAATATAAAATTTACAAAAGGGAGATAACCCATCCAAATGAAGTTACAAAAAACTTCTTCAATTGGATAACGGGGAGTTTAAGCTATAGTCAATAAAGGGATGTGCTTGTTTACACCAATATAAAGCTAGAAAAAGTATGTTGTCAATTAAAGCTTCATAAGATCTTTCCTTGTCTTGAAAAATCCTTGAATTCCTTTATGTCCAGATGGTCCAATATATATTTGTAGATAAATAGATAGATATTAATTTTTGGATAGCATTGGTATACCTTTTCTCTTTGAATTTATAATGTTGCAATCCCATTATTCCATGTAAAGCTCCATACACCTGTATCAATAATACCCGACTCTGGTTATATCGTAAATGGATTCGGACTGGTCCTACGTATTTAGCATCCTGTAAGTTTTTTGCAATCCGTTTGTAGTATCAAATTCATAATTTCAGGTTGTACAAGATGCAAAAAGTAAATGTCAATTTGTGCCTTGCACAGATCTGATGTTAAAATTTATTCACTTGACATATTATGAATGACAGGTTGCAATGCTACAGAATTAAAGTTCAGGTCCCCTACGGCCCTACCATCCAAGCTTTTGATTTTTCTTTGTTTGCAAAACAATTAACTGCAATTACATGAATTGTGATGCTTTAAGAAACTTCCTTCCATCTTGAAGACCGTCTCGTTAATCTTTATTGAGATGAATCTAAATTTTTCATGCTATTTGTGAAGTTTACAGTCTTTTGTGATCGACCTTCAAGACAAGATGACTGGCATTAGCTTATATGGTATCATTTTAG
ATATAGTTAATGAAAGAAATACCACAGAAGCTGTTTTCTCTATGAAAATTGAAGATAACACTGGAGAAATTTCGGCAAAGTTGCACTTTGTGGGATCTTGGTATGGACATTTTCTTTGCCTAACTTCTAATTTCTGTGTTTATGAGTCTACTTTTACATTGCTCTTCAAACAACAGAAACTTCTGATCATATATACTTCCACTTATTTTCTGAGTTCCATTTTCCTTTTTCTCTTCTTTGTGCCCCCTCTTGTAAAGGTCGCTGGGAAGGGTAGGCGTTGGACATACAGTATATATAAGTGGCCTGACATGCACAATGAACAAGAATCGGTGAGCTAATTCAAACTCTTTACCCAGCTATCTAGTAAATGAATGGGAAGTCTTGGACAATTTGAAAATGCTTCTGGACAAAAAAAATAATAACTTGTTTTTGATTGAGGTTTCTCTCCTTTCAGATATTTTGGCGATTTCTTCATGGCATTTCTAATTTGATTTGAGTTTTTTTCTTGTGAAGCTTTGAGGCTTTATGGATTGAGAATCATGTTGGAGCTTCTTTTGTCAACCTTAGCTGCTTGCCAGCATTGTTAACTTCATCTTGTCTACATAAACTTTCACGACTTTCTGATCTTACCAGCAACGCTCATGGTACAAAGGTTTGGTTTTTTTCTTTTTTCCTTCTCTTTCTTTCTGTCTATTATGTTATGTTTGATGTGGAACTTTTGAAAAGAGAAAGGGAAAACTAACCGGAAGAGGGAGAGGGGTCCCTTGTTGAAGAGCCTTTCCAAAAAAAAAAAAAAACTGATATTGAGGTTAATAATTCAAAATCACAAAAAAAAAAAAAAAGGCTCTCTTTAATGTTGAAGGTTTGAGTTCAACAACAATTGCTTTAATGCTGAAGGTTTCAGATTTCTTTTCATCCATCAGGATCAATGGGTGGTCTAATCGCACATCTTCATGAAACCCTAGCTCTCCATAATCTACCATGCTGAAAGGATTTTGGGAGTCATTCATCTATATCCTTTGGAGAACAAGTGAAGCAAATAAGCTCCATCCCTTTGTTGCTTAGGACCAGCGAATTCTCATTCTTATAACATAATACATAAACACCAAACCTTAGGAGAGAGCCCAATGACCAATGAGGCTTCTCTTTTTGCAACCTTTATTGGAAGCTTAGGGAAAATTTTAATTTTCGTTCTCACCTGAACATCAAAATTAGGAATTTTCTTCCCACATCTTCTTTCCATAAAACATCATCACTGCGCTTTCAAAACACAACTTTGCGGCCTTCAAATAATTTATGTATTTTTTCCACTCTCATTTCGAGTATTCTTTCCTAGAGAACACAGCAGTTATTTAAGTTGGCCCTTAATTTATTCCCTGACGATCCTGTTGATTAATCATACCCCTCCCAAGGTGGAAAATGGGAAGGGAAATTTTTCCTGTAACCAGGTAATCTACTGTTGGAGTTATCACCACCATATATACATCGAAAATAGATAAAGAAAATTATCATTGTCGATGCAGTCTTACATCGCTTGCTTGTTATTTCTTGAGAATTTTAAGCATTCCAATCATAATATCAGTACTTCAACTCATAGTGCCTGCCCAATTCAATGCAGTTGTATGATATACAATCCTATACCGAAATTTTGGTGAACCTATAAGAGCATGATTTTTACTTTTTTTAACTCTTCCAATAGTCTTTGTCATATCCCATATCTATCAGCTGTGATTTTTATTTGTTTTCCCAAAATCTCCTGGCCAGGTCTGTCAAGTTCAGCTTGACCAAGTTTCACATTGTCATGTTAGTACGAAATTTTTGCATGCAATTTGTGGTCATTTTGTCGAGGAGACACCTGGCAGGATCGAGTGCAGCTTCTGTCGTTGTGAATGCGAGTCTGAGCTTGTGCGTACATTCGACCTAAAAATCACCCTTGCAGACGATAGTGCAAAAATCTTTGCATGGTGTACAGGTCAAACTGCTGCAGAGTTGTTGCAAAT
ATCTCCTGATGAATTCTGTGAACTACCTGAGGTAGAGACTAAACTTAAAATAATGCTTTAAAAAGCAAAAAAAGCTTATGCTTCAGAACAACCAATACAGGTAGCTTTTGTTTAAATAACAAAGAATAAGTTGCTAGTATTACCACAAGTTGTCCATAGATCAAATTATTTCGTGGATTAGATTTTACTTGATTGTCTACCGCTCCATTGCTTTGCCATGGTATTAAGTATTTTGATTTGAGCTTTTTTCTTACTGAGAGATGGAATAGAATAACCTGATTGAAGGGTAATGATCGTCCTGTAATGCATTGTATATATAAACTACTATCATGCGTGTGTGATGACCTAATCACGAGAATCATAATTTAATTTGAAGTCTTATGTTTAGTCAGATTCTTCTATCGGTGAGTGTTCTTAAAGAGGTGGTTCTAACTTTGCAGGAAGAACAAGTAATGTACCCATCTTCACTCGAGAATGAAAGTTTTGTGGTTGCAATAGTGAATTGCAGGAGGCAAACCAGCAGATGTGGAGATAATGTCTATTCTGTTCATGATCCACTTTCATGGGAGATTACTCGTGCACTAAAGTGTGATTGATGTTGCATTATCTTTCATGAAGACTTGTTCATTGATTTGACCTTTCAGAGGTTCGAATATCCTTACAACTCGAAAAATAGCCCATTCATTTTGAGTTCACCAATCGAAAGGTGAGTGAAGAGTCCATATCTCTATGAAGGTACACATTACTAGTCTCTCTATTTTGACTTTGGCAATGTTTTGGCCCCTTGGGCCACTATGTTAATAGTGATTAAGTCTTTAATATAGATGTCTTTTTAGTTACTCTCATTATCAAATATTAAACAAG

mRNA sequence

AAAATATTACATGTATTGTATGCCTTAGAAATCCGCGCCAATTTGCCAATCTTCTTCCCGCTGAAAAATGAACCTCAAATTTAGGGCACAGAAATGGCTTAATCCCGGTCTGAACTTGAATTGAACTCGGTGATTCTCAAGAATGTCTTCTCGGGGCCGAAATTTCAACTCGGACGACGCCGGCGGAAACTCGGCCATGGAGTTAGATGGTGGCCGACAGCTTCAGGAAGAAGAAGATGATGATCCGTTTCTTAAATTTGTCGATTACGCGAGGTCTGTGCTAGCATTTGAAGACGAAGAAGACTTCGACCCTAATGTTAATGGAACGGAGACCAATACGCCGGGTTGGAGTTGGATCGCCTCTCGGGTCCTCAGAACTTGTATCGCCTACTCGAGTTCTGTTACCCCTGCGATTTTGCTATCTGAGCTCTCGCAGGCCTGGTATGAGCAACACAGAGTTGGGGCTCCCAAGAAAGTACCTGAATGTATTAATCAGTTGAAGAAGAAAAATAGGAGAAAGAAGCTCCCAAAAACAGTTACTATTGACTCCATATATGCGAAGAATTTCCTAGCTTTAAGTAGTGTATTGGAAGCTGTCATTGTTGATGCATTTATTCTTCCAGGTACAAATATACACATGCTTACTTTAGGGGATTTTTGGAGCTCTAATACAATCGATCTTTATCTCCATCGTAGATTCTATGACTTAGTGGATGGGATTCTGAAGAAAGGGAGGCAAATATTTTTAACTGGATGCTATCTTCGTGCTGCCAGCGGTGGATCTGGTCATCCACGACTTCTACCAACTGAATACCTTATCATATTATTAGATGAGGAAGAGGACGATGATGTAATACTTCTAGGAGCTCAATTTTGTTCCGATTCCTTTTCTTCTGTTTCTTTTGATGCCGTCAATCAAGGGACTACATATTCATTATATGCAAGGATTGAGTCTATTGGTCCAATGGAAATTCATGAAAAGACTAATGGCGTACAGATGATACAAATCATTCTTGTTGATAATGATGGTTTCAAGTTAAAGTTTCTCTTATGGGGCGAACAGGTGATGCTAGCCAATCTTTTAAGTGTTGGTAGCTTGCTTGCACTTGATAGACCATATATTGCAACTGTAAACGAGAATGGCATTGGAACAAGTGATGAACTTTGTCTTGAATATGGTAGTGCAACACAGTTATATTTGGTGCCTTGCATTCAGCATGAGGAGCAAGTATGTGTTTTAACACAGAATATAAACCAAGCTTCAAGGATGCTTAGTACGTCAAATCCTACTCAGGGTCCCCGAGTTTCTCAAGTTTCCTTGCCCTGCGATTCACATGGGACAATTGATTTTGGTAATTATCCTTTTCGGTCTTTTGTGATCGACCTTCAAGACAAGATGACTGGCATTAGCTTATATGGTATCATTTTAGATATAGTTAATGAAAGAAATACCACAGAAGCTGTTTTCTCTATGAAAATTGAAGATAACACTGGAGAAATTTCGGCAAAGTTGCACTTTGTGGGATCTTGGTCGCTGGGAAGGGTAGGCGTTGGACATACAGTATATATAAGTGGCCTGACATGCACAATGAACAAGAATCGCTTTGAGGCTTTATGGATTGAGAATCATGTTGGAGCTTCTTTTGTCAACCTTAGCTGCTTGCCAGCATTGTTAACTTCATCTTGTCTACATAAACTTTCACGACTTTCTGATCTTACCAGCAACGCTCATGGTACAAAGGTCTGTCAAGTTCAGCTTGACCAAGTTTCACATTGTCATGTTAGTACGAAATTTTTGCATGCAATTTGTGGTCATTTTGTCGAGGAGACACCTGGCAGGATCGAGTGCAGCTTCTGTCGTTGTGAATGCGAGTCTGAGCTTGTGCGTACATTCGACCTAAAAATCACCCTTGCAGACGATAGTGCAAAAATCTTTGCATGGTGTACAGGTCAAACTGCTGCAGAGTTGTTGCAAATATCTCCTGATGAATTCTGTGAAC
TACCTGAGGAAGAACAAGTAATGTACCCATCTTCACTCGAGAATGAAAGTTTTGTGGTTGCAATAGTGAATTGCAGGAGGCAAACCAGCAGATGTGGAGATAATGTCTATTCTGTTCATGATCCACTTTCATGGGAGATTACTCGTGCACTAAAGTGTGATTGATGTTGCATTATCTTTCATGAAGACTTGTTCATTGATTTGACCTTTCAGAGGTTCGAATATCCTTACAACTCGAAAAATAGCCCATTCATTTTGAGTTCACCAATCGAAAGGTGAGTGAAGAGTCCATATCTCTATGAAGGTACACATTACTAGTCTCTCTATTTTGACTTTGGCAATGTTTTGGCCCCTTGGGCCACTATGTTAATAGTGATTAAGTCTTTAATATAGATGTCTTTTTAGTTACTCTCATTATCAAATATTAAACAAG

Coding sequence (CDS)

ATGTCTTCTCGGGGCCGAAATTTCAACTCGGACGACGCCGGCGGAAACTCGGCCATGGAGTTAGATGGTGGCCGACAGCTTCAGGAAGAAGAAGATGATGATCCGTTTCTTAAATTTGTCGATTACGCGAGGTCTGTGCTAGCATTTGAAGACGAAGAAGACTTCGACCCTAATGTTAATGGAACGGAGACCAATACGCCGGGTTGGAGTTGGATCGCCTCTCGGGTCCTCAGAACTTGTATCGCCTACTCGAGTTCTGTTACCCCTGCGATTTTGCTATCTGAGCTCTCGCAGGCCTGGTATGAGCAACACAGAGTTGGGGCTCCCAAGAAAGTACCTGAATGTATTAATCAGTTGAAGAAGAAAAATAGGAGAAAGAAGCTCCCAAAAACAGTTACTATTGACTCCATATATGCGAAGAATTTCCTAGCTTTAAGTAGTGTATTGGAAGCTGTCATTGTTGATGCATTTATTCTTCCAGGTACAAATATACACATGCTTACTTTAGGGGATTTTTGGAGCTCTAATACAATCGATCTTTATCTCCATCGTAGATTCTATGACTTAGTGGATGGGATTCTGAAGAAAGGGAGGCAAATATTTTTAACTGGATGCTATCTTCGTGCTGCCAGCGGTGGATCTGGTCATCCACGACTTCTACCAACTGAATACCTTATCATATTATTAGATGAGGAAGAGGACGATGATGTAATACTTCTAGGAGCTCAATTTTGTTCCGATTCCTTTTCTTCTGTTTCTTTTGATGCCGTCAATCAAGGGACTACATATTCATTATATGCAAGGATTGAGTCTATTGGTCCAATGGAAATTCATGAAAAGACTAATGGCGTACAGATGATACAAATCATTCTTGTTGATAATGATGGTTTCAAGTTAAAGTTTCTCTTATGGGGCGAACAGGTGATGCTAGCCAATCTTTTAAGTGTTGGTAGCTTGCTTGCACTTGATAGACCATATATTGCAACTGTAAACGAGAATGGCATTGGAACAAGTGATGAACTTTGTCTTGAATATGGTAGTGCAACACAGTTATATTTGGTGCCTTGCATTCAGCATGAGGAGCAAGTATGTGTTTTAACACAGAATATAAACCAAGCTTCAAGGATGCTTAGTACGTCAAATCCTACTCAGGGTCCCCGAGTTTCTCAAGTTTCCTTGCCCTGCGATTCACATGGGACAATTGATTTTGGTAATTATCCTTTTCGGTCTTTTGTGATCGACCTTCAAGACAAGATGACTGGCATTAGCTTATATGGTATCATTTTAGATATAGTTAATGAAAGAAATACCACAGAAGCTGTTTTCTCTATGAAAATTGAAGATAACACTGGAGAAATTTCGGCAAAGTTGCACTTTGTGGGATCTTGGTCGCTGGGAAGGGTAGGCGTTGGACATACAGTATATATAAGTGGCCTGACATGCACAATGAACAAGAATCGCTTTGAGGCTTTATGGATTGAGAATCATGTTGGAGCTTCTTTTGTCAACCTTAGCTGCTTGCCAGCATTGTTAACTTCATCTTGTCTACATAAACTTTCACGACTTTCTGATCTTACCAGCAACGCTCATGGTACAAAGGTCTGTCAAGTTCAGCTTGACCAAGTTTCACATTGTCATGTTAGTACGAAATTTTTGCATGCAATTTGTGGTCATTTTGTCGAGGAGACACCTGGCAGGATCGAGTGCAGCTTCTGTCGTTGTGAATGCGAGTCTGAGCTTGTGCGTACATTCGACCTAAAAATCACCCTTGCAGACGATAGTGCAAAAATCTTTGCATGGTGTACAGGTCAAACTGCTGCAGAGTTGTTGCAAATATCTCCTGATGAATTCTGTGAACTACCTGAGGAAGAACAAGTAATGTACCCATCTTCACTCGAGAATGAAAGTTTTGTGGTTGCAATAGTGAATTGCAGGAGGCAAACCAGCAGATGTGGAGATAATGTCTATTCTGTTCATGATCCACTTTCATGGGAGATTAC
TCGTGCACTAAAGTGTGATTGA

Protein sequence

MSSRGRNFNSDDAGGNSAMELDGGRQLQEEEDDDPFLKFVDYARSVLAFEDEEDFDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKVPECINQLKKKNRRKKLPKTVTIDSIYAKNFLALSSVLEAVIVDAFILPGTNIHMLTLGDFWSSNTIDLYLHRRFYDLVDGILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILLGAQFCSDSFSSVSFDAVNQGTTYSLYARIESIGPMEIHEKTNGVQMIQIILVDNDGFKLKFLLWGEQVMLANLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRMLSTSNPTQGPRVSQVSLPCDSHGTIDFGNYPFRSFVIDLQDKMTGISLYGIILDIVNERNTTEAVFSMKIEDNTGEISAKLHFVGSWSLGRVGVGHTVYISGLTCTMNKNRFEALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSNAHGTKVCQVQLDQVSHCHVSTKFLHAICGHFVEETPGRIECSFCRCECESELVRTFDLKITLADDSAKIFAWCTGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTSRCGDNVYSVHDPLSWEITRALKCD
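
The protein sequence above corresponds to a codon-by-codon translation of the coding sequence. The sketch below is illustrative only: it assumes the standard genetic code (NCBI translation table 1), the helper name `translate` is hypothetical, and only the first and last few codons of the CDS shown on this page are used so that the example stays short.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), built from the
# conventional T, C, A, G ordering of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last eight codons of the CDS listed above.
cds_start = "ATGTCTTCTCGGGGCCGAAATTTCAACTCG"
cds_end = "ACTCGTGCACTAAAGTGTGATTGA"

print(translate(cds_start))  # MSSRGRNFNS
print(translate(cds_end))    # TRALKCD
```

The two fragments reproduce the first ten and last seven residues of the protein sequence, with the final TGA read as the stop codon.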
Homology
BLAST of Tan0020038 vs. NCBI nr
Match: XP_023538883.1 (uncharacterized protein LOC111799677 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1260.7 bits (3261), Expect = 0.0e+00
Identity = 620/672 (92.26%), Positives = 650/672 (96.73%), Query Frame = 0

Query: 2   SSRGRNFNSDDAGGNSAMELDGGRQLQEEEDDDPFLKFVDYARSVLAFEDEEDFDPNVNG 61
           SSRGR+FNSD+AGGNSAMEL+  R+LQEEEDDDPFLKF+DYARSVLAFEDEEDFDPNV G
Sbjct: 3   SSRGRHFNSDEAGGNSAMELNDRRRLQEEEDDDPFLKFIDYARSVLAFEDEEDFDPNVKG 62

Query: 62  TETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKVPECINQLKK 121
           TET TPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAW EQHR+GAPKK+PECINQLKK
Sbjct: 63  TETYTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWSEQHRIGAPKKIPECINQLKK 122

Query: 122 KNRRKKLPKTVTIDSIYAKNFLALSSVLEAVIVDAFILPGTNIHMLTLGDFWSSNTIDLY 181
           KNRRKKLPKTVTIDSIY KNFL+LSSVLEAVIV+ FILPGTNIHMLTLGDFWSSNTIDLY
Sbjct: 123 KNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVIVEEFILPGTNIHMLTLGDFWSSNTIDLY 182

Query: 182 LHRRFYDLVDGILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILLG 241
           LH RFYDLV GILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILLG
Sbjct: 183 LHCRFYDLVGGILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILLG 242

Query: 242 AQFCSDSFSSVSFDAVNQGTTYSLYARIESIGPMEIHEKTNGVQMIQIILVDNDGFKLKF 301
           AQFCSDSFSSVS DAVN+GTTYSLYARIESIGP+EIHEKTNG+QMIQI L+DNDGFKLKF
Sbjct: 243 AQFCSDSFSSVSLDAVNKGTTYSLYARIESIGPIEIHEKTNGLQMIQISLLDNDGFKLKF 302

Query: 302 LLWGEQVMLANLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEE 361
           LLWGEQV+LANLLSVGSLLALDRPYIATVNENG+GTSDELCLEYGSATQLYLVPCIQHEE
Sbjct: 303 LLWGEQVILANLLSVGSLLALDRPYIATVNENGLGTSDELCLEYGSATQLYLVPCIQHEE 362

Query: 362 QVCVLTQNINQASRMLSTSNPTQGPRVSQVSLPCDSHGTIDFGNYPFRSFVIDLQDKMTG 421
           QVCVLTQNI+QASR L TS PTQ PRVSQVSLPCDSHGTIDFGNYPFRSFV+DLQDKMTG
Sbjct: 363 QVCVLTQNISQASRTLGTSYPTQDPRVSQVSLPCDSHGTIDFGNYPFRSFVVDLQDKMTG 422

Query: 422 ISLYGIILDIVNERNTTEAVFSMKIEDNTGEISAKLHFVGSWSLGRVGVGHTVYISGLTC 481
           ISLYGII+DIVNERNTTEAVFSM+IEDNTG+ISAKLHFV SWSLGRVGVGHTVYISGLTC
Sbjct: 423 ISLYGIIIDIVNERNTTEAVFSMRIEDNTGQISAKLHFVRSWSLGRVGVGHTVYISGLTC 482

Query: 482 TMNKNRFEALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSNAHGTKVCQVQLDQ 541
           T+ KN  EALWIENHVGASFVNLSCLPALLTSSCLHK+SRLSDLTSN+HGTKVC+V+LDQ
Sbjct: 483 TIKKNNLEALWIENHVGASFVNLSCLPALLTSSCLHKISRLSDLTSNSHGTKVCRVRLDQ 542

Query: 542 VSHCHVSTKFLHAICGHFVEETPGRIECSFCRCECESELVRTFDLKITLADDSAKIFAWC 601
           VSHCHVSTKFLHA CGHFVEETPGR ECSFCRCEC+SELVRTFDLKITLADD+AKIFA C
Sbjct: 543 VSHCHVSTKFLHANCGHFVEETPGRTECSFCRCECKSELVRTFDLKITLADDTAKIFACC 602

Query: 602 TGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTSRCGDNVYSVHDP 661
           TGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTS+CGDNVYSV+DP
Sbjct: 603 TGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTSKCGDNVYSVNDP 662

Query: 662 LSWEITRALKCD 674
           LSWEITRALKC+
Sbjct: 663 LSWEITRALKCE 674
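
The percentages in each HSP summary line are the matched count divided by the alignment length. As a rough sketch (the helper name `hsp_fractions` is hypothetical, and the regular expression simply picks up every `count/length` pair on the line rather than relying on the field labels), the figures for the first hit can be recomputed as follows:

```python
import re


def hsp_fractions(summary_line):
    """Return the percentage for each 'count/length' pair, in order of appearance."""
    pairs = re.findall(r"(\d+)/(\d+)", summary_line)
    return [round(100 * int(count) / int(length), 2) for count, length in pairs]


# Counts taken from the summary line of the first hit (XP_023538883.1).
line = "Identity = 620/672, Positives = 650/672"
identity_pct, positives_pct = hsp_fractions(line)
print(identity_pct, positives_pct)  # 92.26 96.73
```

The first value is the share of aligned positions with identical residues; the second also counts the conservative substitutions marked with `+` in the alignment midline.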

BLAST of Tan0020038 vs. NCBI nr
Match: KAG7029015.1 (hypothetical protein SDJN02_10198 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1255.7 bits (3248), Expect = 0.0e+00
Identity = 618/672 (91.96%), Positives = 648/672 (96.43%), Query Frame = 0

Query: 2   SSRGRNFNSDDAGGNSAMELDGGRQLQEEEDDDPFLKFVDYARSVLAFEDEEDFDPNVNG 61
           SSRGR+F SD+AGGNSAMEL+  R+LQEEEDDDPFLKF+DYARSVLAFEDEEDFDPNV G
Sbjct: 3   SSRGRHFKSDEAGGNSAMELNDRRRLQEEEDDDPFLKFIDYARSVLAFEDEEDFDPNVKG 62

Query: 62  TETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKVPECINQLKK 121
           TET TPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAW EQHR+GAPKK+PECINQLKK
Sbjct: 63  TETYTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWSEQHRIGAPKKIPECINQLKK 122

Query: 122 KNRRKKLPKTVTIDSIYAKNFLALSSVLEAVIVDAFILPGTNIHMLTLGDFWSSNTIDLY 181
           KNRRKKLPKTVTIDSIY KNFL+LSSVLEAVIV+ FILPGTNIHMLTLGDFWSSNTIDLY
Sbjct: 123 KNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVIVEEFILPGTNIHMLTLGDFWSSNTIDLY 182

Query: 182 LHRRFYDLVDGILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILLG 241
           LH RFYDLV GILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLI LLDEEEDDDVILLG
Sbjct: 183 LHCRFYDLVGGILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLITLLDEEEDDDVILLG 242

Query: 242 AQFCSDSFSSVSFDAVNQGTTYSLYARIESIGPMEIHEKTNGVQMIQIILVDNDGFKLKF 301
           AQFCSDSFSSVS DAVN+GTTYSLYARIESIGP+EIHEKTNG+QMIQI L+DNDGFKLKF
Sbjct: 243 AQFCSDSFSSVSLDAVNKGTTYSLYARIESIGPIEIHEKTNGLQMIQISLLDNDGFKLKF 302

Query: 302 LLWGEQVMLANLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEE 361
           LLWGEQV+LANLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEE
Sbjct: 303 LLWGEQVILANLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEE 362

Query: 362 QVCVLTQNINQASRMLSTSNPTQGPRVSQVSLPCDSHGTIDFGNYPFRSFVIDLQDKMTG 421
           QVCVLTQNI+QASR L TS PTQ PRVSQVSLPCDSHGTIDFGNYPFRSFV+DLQDKMTG
Sbjct: 363 QVCVLTQNISQASRTLGTSYPTQDPRVSQVSLPCDSHGTIDFGNYPFRSFVVDLQDKMTG 422

Query: 422 ISLYGIILDIVNERNTTEAVFSMKIEDNTGEISAKLHFVGSWSLGRVGVGHTVYISGLTC 481
           ISLYGII+DIVNERNTTEAVFSM+IEDNTG+ISAKLHFV SWSLGRVGVGHTVYISGLTC
Sbjct: 423 ISLYGIIIDIVNERNTTEAVFSMRIEDNTGQISAKLHFVRSWSLGRVGVGHTVYISGLTC 482

Query: 482 TMNKNRFEALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSNAHGTKVCQVQLDQ 541
           T+ KN  EALWIENHVGASFVNLSCLPALLTSSCLHK+SRLSDLT N+HGTKVC+V+LDQ
Sbjct: 483 TIKKNNLEALWIENHVGASFVNLSCLPALLTSSCLHKISRLSDLTCNSHGTKVCRVRLDQ 542

Query: 542 VSHCHVSTKFLHAICGHFVEETPGRIECSFCRCECESELVRTFDLKITLADDSAKIFAWC 601
           VSHCHVSTKFLHA CGHFVEETPGRIECSFCRC+C+SELVRTFDLKITLADD+AKIFA C
Sbjct: 543 VSHCHVSTKFLHANCGHFVEETPGRIECSFCRCKCKSELVRTFDLKITLADDTAKIFACC 602

Query: 602 TGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTSRCGDNVYSVHDP 661
           TGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTS+CGDNVYSV+DP
Sbjct: 603 TGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTSKCGDNVYSVNDP 662

Query: 662 LSWEITRALKCD 674
           LSWEITRALKC+
Sbjct: 663 LSWEITRALKCE 674

BLAST of Tan0020038 vs. NCBI nr
Match: XP_022938337.1 (uncharacterized protein LOC111444466 [Cucurbita moschata])

HSP 1 Score: 1251.1 bits (3236), Expect = 0.0e+00
Identity = 617/672 (91.82%), Positives = 645/672 (95.98%), Query Frame = 0

Query: 2   SSRGRNFNSDDAGGNSAMELDGGRQLQEEEDDDPFLKFVDYARSVLAFEDEEDFDPNVNG 61
           SSRGR+F SD+AGGNSAMEL+  R+LQEEEDDDPFLKF+DYARSVLAFEDEEDFDPNV G
Sbjct: 3   SSRGRHFKSDEAGGNSAMELNDRRRLQEEEDDDPFLKFIDYARSVLAFEDEEDFDPNVKG 62

Query: 62  TETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKVPECINQLKK 121
           TET TPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAW EQHR+GAPKK+PECINQLKK
Sbjct: 63  TETYTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWSEQHRIGAPKKIPECINQLKK 122

Query: 122 KNRRKKLPKTVTIDSIYAKNFLALSSVLEAVIVDAFILPGTNIHMLTLGDFWSSNTIDLY 181
           KNRRKKLPKTVTIDSIY KNFL+LSSVLEAVIV+ FILPGTNIHMLTLGDFWSSNTIDLY
Sbjct: 123 KNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVIVEEFILPGTNIHMLTLGDFWSSNTIDLY 182

Query: 182 LHRRFYDLVDGILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILLG 241
           LH RFYDLV GILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILLG
Sbjct: 183 LHCRFYDLVGGILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILLG 242

Query: 242 AQFCSDSFSSVSFDAVNQGTTYSLYARIESIGPMEIHEKTNGVQMIQIILVDNDGFKLKF 301
           AQFCSDSFSSVS DAVN+GT YSLYARIESIGP+EIHEKTNG+QMIQI L+DNDGFKLKF
Sbjct: 243 AQFCSDSFSSVSLDAVNKGTAYSLYARIESIGPIEIHEKTNGLQMIQISLLDNDGFKLKF 302

Query: 302 LLWGEQVMLANLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEE 361
           LLWGEQV+LANLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEE
Sbjct: 303 LLWGEQVILANLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEE 362

Query: 362 QVCVLTQNINQASRMLSTSNPTQGPRVSQVSLPCDSHGTIDFGNYPFRSFVIDLQDKMTG 421
           QVCVLTQNI+QASR L TS PTQ PRVSQVSLPCDSHGTIDFGNYPFRSFV+DLQDKMTG
Sbjct: 363 QVCVLTQNISQASRTLGTSYPTQDPRVSQVSLPCDSHGTIDFGNYPFRSFVVDLQDKMTG 422

Query: 422 ISLYGIILDIVNERNTTEAVFSMKIEDNTGEISAKLHFVGSWSLGRVGVGHTVYISGLTC 481
           ISLYGII+DIVNERNTTEAVFSM+IEDNTG+ISAKLHFV SWSLGRVGVGHTVYISGLTC
Sbjct: 423 ISLYGIIIDIVNERNTTEAVFSMRIEDNTGQISAKLHFVKSWSLGRVGVGHTVYISGLTC 482

Query: 482 TMNKNRFEALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSNAHGTKVCQVQLDQ 541
           T+ KN  EALWIENHVGASFVNLSCLPALLTSSCLHK+SRLSDLTSN+HGTKVC+V+LDQ
Sbjct: 483 TIKKNNLEALWIENHVGASFVNLSCLPALLTSSCLHKISRLSDLTSNSHGTKVCRVRLDQ 542

Query: 542 VSHCHVSTKFLHAICGHFVEETPGRIECSFCRCECESELVRTFDLKITLADDSAKIFAWC 601
           VSHCHVSTKFLHA CGHFVEETP RIEC FCRCEC SELVRTFDLKITLADD+AKIFAWC
Sbjct: 543 VSHCHVSTKFLHANCGHFVEETPRRIECCFCRCECMSELVRTFDLKITLADDTAKIFAWC 602

Query: 602 TGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTSRCGDNVYSVHDP 661
           TGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCR QTS+ GDNVYSV+DP
Sbjct: 603 TGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRWQTSKSGDNVYSVNDP 662

Query: 662 LSWEITRALKCD 674
           LSWEITRALKC+
Sbjct: 663 LSWEITRALKCE 674

BLAST of Tan0020038 vs. NCBI nr
Match: XP_022972298.1 (uncharacterized protein LOC111470879 isoform X1 [Cucurbita maxima] >XP_022972299.1 uncharacterized protein LOC111470879 isoform X2 [Cucurbita maxima] >XP_022972300.1 uncharacterized protein LOC111470879 isoform X3 [Cucurbita maxima])

HSP 1 Score: 1244.6 bits (3219), Expect = 0.0e+00
Identity = 612/672 (91.07%), Positives = 644/672 (95.83%), Query Frame = 0

Query: 2   SSRGRNFNSDDAGGNSAMELDGGRQLQEEEDDDPFLKFVDYARSVLAFEDEEDFDPNVNG 61
           SSR R F S +AGG SAMEL+  R+LQEEEDDDPFLKF+DYARSVLAFEDEEDFDPNV G
Sbjct: 3   SSRDRYFKSYEAGGKSAMELNDRRRLQEEEDDDPFLKFIDYARSVLAFEDEEDFDPNVKG 62

Query: 62  TETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKVPECINQLKK 121
           T+T TPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAW EQHR+GAPKK+PECINQLKK
Sbjct: 63  TKTYTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWSEQHRIGAPKKIPECINQLKK 122

Query: 122 KNRRKKLPKTVTIDSIYAKNFLALSSVLEAVIVDAFILPGTNIHMLTLGDFWSSNTIDLY 181
           KNRRKKLPKTVTIDSIY KNFL+LSSVLEAVIV+ FILPGTNIHMLTLGDFWSSNTIDLY
Sbjct: 123 KNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVIVEEFILPGTNIHMLTLGDFWSSNTIDLY 182

Query: 182 LHRRFYDLVDGILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILLG 241
           LH RFYDLV GILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILLG
Sbjct: 183 LHCRFYDLVGGILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILLG 242

Query: 242 AQFCSDSFSSVSFDAVNQGTTYSLYARIESIGPMEIHEKTNGVQMIQIILVDNDGFKLKF 301
           AQFCSDSFSSVS DAV++GTTYSLYARIESIGP EIHEKTNG+QMIQI+L+DNDGFKLKF
Sbjct: 243 AQFCSDSFSSVSLDAVDKGTTYSLYARIESIGPKEIHEKTNGLQMIQILLLDNDGFKLKF 302

Query: 302 LLWGEQVMLANLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEE 361
           LLWGEQV+LANLLSVGSLLALDRPYIATVNENGIG+SDELCLEYGSATQLYLVPCIQHEE
Sbjct: 303 LLWGEQVILANLLSVGSLLALDRPYIATVNENGIGSSDELCLEYGSATQLYLVPCIQHEE 362

Query: 362 QVCVLTQNINQASRMLSTSNPTQGPRVSQVSLPCDSHGTIDFGNYPFRSFVIDLQDKMTG 421
           QVCVLTQNINQASR L TS PTQ PRVSQVSLPCDSHGTIDFGNYPFRSFV+DLQDKMTG
Sbjct: 363 QVCVLTQNINQASRTLGTSYPTQDPRVSQVSLPCDSHGTIDFGNYPFRSFVVDLQDKMTG 422

Query: 422 ISLYGIILDIVNERNTTEAVFSMKIEDNTGEISAKLHFVGSWSLGRVGVGHTVYISGLTC 481
           ISLYGI++DIVNERNTTEAVFSM+IEDNTG+ISAKLHFV SWSLGRVGVGHTVYISGLTC
Sbjct: 423 ISLYGIVIDIVNERNTTEAVFSMRIEDNTGQISAKLHFVRSWSLGRVGVGHTVYISGLTC 482

Query: 482 TMNKNRFEALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSNAHGTKVCQVQLDQ 541
           T+ KN  EALWIENHVGASFVNLSCLPALLTSSCLHK+SRLSDLTSN+HGTKVC+ +LDQ
Sbjct: 483 TIKKNNLEALWIENHVGASFVNLSCLPALLTSSCLHKISRLSDLTSNSHGTKVCRARLDQ 542

Query: 542 VSHCHVSTKFLHAICGHFVEETPGRIECSFCRCECESELVRTFDLKITLADDSAKIFAWC 601
           VSHCHVSTKFLHA CGHFVEETPGRIEC FCR EC+SELVRTFDLKITLADD+AKIFAWC
Sbjct: 543 VSHCHVSTKFLHANCGHFVEETPGRIECCFCRSECKSELVRTFDLKITLADDTAKIFAWC 602

Query: 602 TGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTSRCGDNVYSVHDP 661
           TGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTS+CG+NVYSV+DP
Sbjct: 603 TGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTSKCGNNVYSVNDP 662

Query: 662 LSWEITRALKCD 674
           LSWEITRALKC+
Sbjct: 663 LSWEITRALKCE 674

BLAST of Tan0020038 vs. NCBI nr
Match: XP_004134503.1 (uncharacterized protein LOC101215087 [Cucumis sativus] >KGN57165.1 hypothetical protein Csa_009918 [Cucumis sativus])

HSP 1 Score: 1223.4 bits (3164), Expect = 0.0e+00
Identity = 598/673 (88.86%), Positives = 639/673 (94.95%), Query Frame = 0

Query: 1   MSSRGRNFNSDDAGGNSAMELDGGRQLQEEEDDDPFLKFVDYARSVLAFEDEEDFDPNVN 60
           MSS  ++FNS DA  NSAMELD  ++LQEE DDDPFLKFVDYARSVLAFED+EDFDPN+N
Sbjct: 1   MSSHSKHFNSHDAARNSAMELDDPQKLQEEGDDDPFLKFVDYARSVLAFEDDEDFDPNIN 60

Query: 61  GTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKVPECINQLK 120
           GTET+TPGW+WIASRVLRTC+AYSSSVTPAILLSELSQAWYEQHRVGAPKK+PECINQLK
Sbjct: 61  GTETHTPGWTWIASRVLRTCMAYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQLK 120

Query: 121 KKNRRKKLPKTVTIDSIYAKNFLALSSVLEAVIVDAFILPGTNIHMLTLGDFWSSNTIDL 180
           KKNRRKKLPKTVTIDSIY KNFLALSSVLEAVI+D FILPGTNIHMLTLGDFWSSNTIDL
Sbjct: 121 KKNRRKKLPKTVTIDSIYEKNFLALSSVLEAVILDEFILPGTNIHMLTLGDFWSSNTIDL 180

Query: 181 YLHRRFYDLVDGILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILL 240
           YLHRRFYDLV+GILKKGRQIF+TGCYLRAASGGSG+PRLLPTEYLIILLDEEEDDDV+LL
Sbjct: 181 YLHRRFYDLVNGILKKGRQIFVTGCYLRAASGGSGYPRLLPTEYLIILLDEEEDDDVMLL 240

Query: 241 GAQFCSDSFSSVSFDAVNQGTTYSLYARIESIGPMEIHEKTNGVQMIQIILVDNDGFKLK 300
           GAQFCSD+FSSVS D+VN+GTTYSLYARIESIGP+EIHE  NG++MIQIILVDNDGFKLK
Sbjct: 241 GAQFCSDTFSSVSLDSVNEGTTYSLYARIESIGPLEIHEMMNGLRMIQIILVDNDGFKLK 300

Query: 301 FLLWGEQVMLANLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHE 360
           FLLWGEQV+LANLLSVGS+LALDRPY+ATVNENG+GTSDELCLEYGSATQLYLVPCIQHE
Sbjct: 301 FLLWGEQVLLANLLSVGSVLALDRPYVATVNENGVGTSDELCLEYGSATQLYLVPCIQHE 360

Query: 361 EQVCVLTQNINQASRMLSTSNPTQGPRVSQVSLPCDSHGTIDFGNYPFRSFVIDLQDKMT 420
           EQVCVLTQNINQASR +S S PTQ P+VSQVSLPCDSHG IDFGNYPFRSFVIDLQDKMT
Sbjct: 361 EQVCVLTQNINQASRTVSMSYPTQSPQVSQVSLPCDSHGAIDFGNYPFRSFVIDLQDKMT 420

Query: 421 GISLYGIILDIVNERNTTEAVFSMKIEDNTGEISAKLHFVGSWSLGRVGVGHTVYISGLT 480
           GISLYG +LDI NERNTTEA FSM+IEDNTGE+ AKL FV SWSLGRV VGHTV+ISGLT
Sbjct: 421 GISLYGNVLDIANERNTTEAGFSMRIEDNTGEVLAKLRFVRSWSLGRVSVGHTVFISGLT 480

Query: 481 CTMNKNRFEALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSNAHGTKVCQVQLD 540
           CT NKNR EALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSN HGTKVCQV+LD
Sbjct: 481 CTKNKNRLEALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSNTHGTKVCQVRLD 540

Query: 541 QVSHCHVSTKFLHAICGHFVEETPGRIECSFCRCECESELVRTFDLKITLADDSAKIFAW 600
           QVSHCHVSTKFLHAICGHFVEETP RIECSFCRCEC+SEL+RTFDLKITLADDSAKIFAW
Sbjct: 541 QVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELMRTFDLKITLADDSAKIFAW 600

Query: 601 CTGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTSRCGDNVYSVHD 660
           CTGQTAAELLQISPDEFCELPEEEQVMYPSSLENE+FVVAIVNCRR++S  G+N+   +D
Sbjct: 601 CTGQTAAELLQISPDEFCELPEEEQVMYPSSLENENFVVAIVNCRRRSSTYGNNLNFAND 660

Query: 661 PLSWEITRALKCD 674
           PLSWEITRALKC+
Sbjct: 661 PLSWEITRALKCE 673

BLAST of Tan0020038 vs. ExPASy TrEMBL
Match: A0A6J1FDS0 (uncharacterized protein LOC111444466 OS=Cucurbita moschata OX=3662 GN=LOC111444466 PE=4 SV=1)

HSP 1 Score: 1251.1 bits (3236), Expect = 0.0e+00
Identity = 617/672 (91.82%), Positives = 645/672 (95.98%), Query Frame = 0

Query: 2   SSRGRNFNSDDAGGNSAMELDGGRQLQEEEDDDPFLKFVDYARSVLAFEDEEDFDPNVNG 61
           SSRGR+F SD+AGGNSAMEL+  R+LQEEEDDDPFLKF+DYARSVLAFEDEEDFDPNV G
Sbjct: 3   SSRGRHFKSDEAGGNSAMELNDRRRLQEEEDDDPFLKFIDYARSVLAFEDEEDFDPNVKG 62

Query: 62  TETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKVPECINQLKK 121
           TET TPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAW EQHR+GAPKK+PECINQLKK
Sbjct: 63  TETYTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWSEQHRIGAPKKIPECINQLKK 122

Query: 122 KNRRKKLPKTVTIDSIYAKNFLALSSVLEAVIVDAFILPGTNIHMLTLGDFWSSNTIDLY 181
           KNRRKKLPKTVTIDSIY KNFL+LSSVLEAVIV+ FILPGTNIHMLTLGDFWSSNTIDLY
Sbjct: 123 KNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVIVEEFILPGTNIHMLTLGDFWSSNTIDLY 182

Query: 182 LHRRFYDLVDGILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILLG 241
           LH RFYDLV GILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILLG
Sbjct: 183 LHCRFYDLVGGILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILLG 242

Query: 242 AQFCSDSFSSVSFDAVNQGTTYSLYARIESIGPMEIHEKTNGVQMIQIILVDNDGFKLKF 301
           AQFCSDSFSSVS DAVN+GT YSLYARIESIGP+EIHEKTNG+QMIQI L+DNDGFKLKF
Sbjct: 243 AQFCSDSFSSVSLDAVNKGTAYSLYARIESIGPIEIHEKTNGLQMIQISLLDNDGFKLKF 302

Query: 302 LLWGEQVMLANLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEE 361
           LLWGEQV+LANLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEE
Sbjct: 303 LLWGEQVILANLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEE 362

Query: 362 QVCVLTQNINQASRMLSTSNPTQGPRVSQVSLPCDSHGTIDFGNYPFRSFVIDLQDKMTG 421
           QVCVLTQNI+QASR L TS PTQ PRVSQVSLPCDSHGTIDFGNYPFRSFV+DLQDKMTG
Sbjct: 363 QVCVLTQNISQASRTLGTSYPTQDPRVSQVSLPCDSHGTIDFGNYPFRSFVVDLQDKMTG 422

Query: 422 ISLYGIILDIVNERNTTEAVFSMKIEDNTGEISAKLHFVGSWSLGRVGVGHTVYISGLTC 481
           ISLYGII+DIVNERNTTEAVFSM+IEDNTG+ISAKLHFV SWSLGRVGVGHTVYISGLTC
Sbjct: 423 ISLYGIIIDIVNERNTTEAVFSMRIEDNTGQISAKLHFVKSWSLGRVGVGHTVYISGLTC 482

Query: 482 TMNKNRFEALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSNAHGTKVCQVQLDQ 541
           T+ KN  EALWIENHVGASFVNLSCLPALLTSSCLHK+SRLSDLTSN+HGTKVC+V+LDQ
Sbjct: 483 TIKKNNLEALWIENHVGASFVNLSCLPALLTSSCLHKISRLSDLTSNSHGTKVCRVRLDQ 542

Query: 542 VSHCHVSTKFLHAICGHFVEETPGRIECSFCRCECESELVRTFDLKITLADDSAKIFAWC 601
           VSHCHVSTKFLHA CGHFVEETP RIEC FCRCEC SELVRTFDLKITLADD+AKIFAWC
Sbjct: 543 VSHCHVSTKFLHANCGHFVEETPRRIECCFCRCECMSELVRTFDLKITLADDTAKIFAWC 602

Query: 602 TGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTSRCGDNVYSVHDP 661
           TGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCR QTS+ GDNVYSV+DP
Sbjct: 603 TGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRWQTSKSGDNVYSVNDP 662

Query: 662 LSWEITRALKCD 674
           LSWEITRALKC+
Sbjct: 663 LSWEITRALKCE 674

BLAST of Tan0020038 vs. ExPASy TrEMBL
Match: A0A6J1IB36 (uncharacterized protein LOC111470879 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111470879 PE=4 SV=1)

HSP 1 Score: 1244.6 bits (3219), Expect = 0.0e+00
Identity = 612/672 (91.07%), Positives = 644/672 (95.83%), Query Frame = 0

Query: 2   SSRGRNFNSDDAGGNSAMELDGGRQLQEEEDDDPFLKFVDYARSVLAFEDEEDFDPNVNG 61
           SSR R F S +AGG SAMEL+  R+LQEEEDDDPFLKF+DYARSVLAFEDEEDFDPNV G
Sbjct: 3   SSRDRYFKSYEAGGKSAMELNDRRRLQEEEDDDPFLKFIDYARSVLAFEDEEDFDPNVKG 62

Query: 62  TETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKVPECINQLKK 121
           T+T TPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAW EQHR+GAPKK+PECINQLKK
Sbjct: 63  TKTYTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWSEQHRIGAPKKIPECINQLKK 122

Query: 122 KNRRKKLPKTVTIDSIYAKNFLALSSVLEAVIVDAFILPGTNIHMLTLGDFWSSNTIDLY 181
           KNRRKKLPKTVTIDSIY KNFL+LSSVLEAVIV+ FILPGTNIHMLTLGDFWSSNTIDLY
Sbjct: 123 KNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVIVEEFILPGTNIHMLTLGDFWSSNTIDLY 182

Query: 182 LHRRFYDLVDGILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILLG 241
           LH RFYDLV GILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILLG
Sbjct: 183 LHCRFYDLVGGILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILLG 242

Query: 242 AQFCSDSFSSVSFDAVNQGTTYSLYARIESIGPMEIHEKTNGVQMIQIILVDNDGFKLKF 301
           AQFCSDSFSSVS DAV++GTTYSLYARIESIGP EIHEKTNG+QMIQI+L+DNDGFKLKF
Sbjct: 243 AQFCSDSFSSVSLDAVDKGTTYSLYARIESIGPKEIHEKTNGLQMIQILLLDNDGFKLKF 302

Query: 302 LLWGEQVMLANLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEE 361
           LLWGEQV+LANLLSVGSLLALDRPYIATVNENGIG+SDELCLEYGSATQLYLVPCIQHEE
Sbjct: 303 LLWGEQVILANLLSVGSLLALDRPYIATVNENGIGSSDELCLEYGSATQLYLVPCIQHEE 362

Query: 362 QVCVLTQNINQASRMLSTSNPTQGPRVSQVSLPCDSHGTIDFGNYPFRSFVIDLQDKMTG 421
           QVCVLTQNINQASR L TS PTQ PRVSQVSLPCDSHGTIDFGNYPFRSFV+DLQDKMTG
Sbjct: 363 QVCVLTQNINQASRTLGTSYPTQDPRVSQVSLPCDSHGTIDFGNYPFRSFVVDLQDKMTG 422

Query: 422 ISLYGIILDIVNERNTTEAVFSMKIEDNTGEISAKLHFVGSWSLGRVGVGHTVYISGLTC 481
           ISLYGI++DIVNERNTTEAVFSM+IEDNTG+ISAKLHFV SWSLGRVGVGHTVYISGLTC
Sbjct: 423 ISLYGIVIDIVNERNTTEAVFSMRIEDNTGQISAKLHFVRSWSLGRVGVGHTVYISGLTC 482

Query: 482 TMNKNRFEALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSNAHGTKVCQVQLDQ 541
           T+ KN  EALWIENHVGASFVNLSCLPALLTSSCLHK+SRLSDLTSN+HGTKVC+ +LDQ
Sbjct: 483 TIKKNNLEALWIENHVGASFVNLSCLPALLTSSCLHKISRLSDLTSNSHGTKVCRARLDQ 542

Query: 542 VSHCHVSTKFLHAICGHFVEETPGRIECSFCRCECESELVRTFDLKITLADDSAKIFAWC 601
           VSHCHVSTKFLHA CGHFVEETPGRIEC FCR EC+SELVRTFDLKITLADD+AKIFAWC
Sbjct: 543 VSHCHVSTKFLHANCGHFVEETPGRIECCFCRSECKSELVRTFDLKITLADDTAKIFAWC 602

Query: 602 TGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTSRCGDNVYSVHDP 661
           TGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTS+CG+NVYSV+DP
Sbjct: 603 TGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTSKCGNNVYSVNDP 662

Query: 662 LSWEITRALKCD 674
           LSWEITRALKC+
Sbjct: 663 LSWEITRALKCE 674
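
The percentages in an HSP summary line are the ratio of matching alignment columns to the total alignment length. As a minimal sketch (the function names `parse_hsp_ratios` and `percent` are hypothetical helpers, not part of any BLAST tool), the figures reported for A0A6J1IB36 can be re-derived from the raw counts like this:

```python
import re

# Matches fragments such as "Identity = 612/672 (91.07%)".
HSP_RATIO = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+\.\d+)%\)")


def parse_hsp_ratios(line):
    """Return {label: (matches, length, reported_percent)} for each ratio found."""
    return {
        label: (int(matches), int(length), float(pct))
        for label, matches, length, pct in HSP_RATIO.findall(line)
    }


def percent(matches, length):
    """Percentage of alignment columns that match, rounded to two decimals."""
    return round(100.0 * matches / length, 2)


# Counts copied from the HSP summary of the A0A6J1IB36 hit.
ratios = parse_hsp_ratios("Identity = 612/672 (91.07%)")
matches, length, reported = ratios["Identity"]
recomputed = percent(matches, length)

# The positives count for the same HSP is 644 of 672 columns.
positives = percent(644, 672)
```

Here `recomputed` agrees with the reported identity, and `positives` agrees with the reported 95.83%.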

BLAST of Tan0020038 vs. ExPASy TrEMBL
Match: A0A0A0L5D2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G166300 PE=4 SV=1)

HSP 1 Score: 1223.4 bits (3164), Expect = 0.0e+00
Identity = 598/673 (88.86%), Positives = 639/673 (94.95%), Query Frame = 0

Query: 1   MSSRGRNFNSDDAGGNSAMELDGGRQLQEEEDDDPFLKFVDYARSVLAFEDEEDFDPNVN 60
           MSS  ++FNS DA  NSAMELD  ++LQEE DDDPFLKFVDYARSVLAFED+EDFDPN+N
Sbjct: 1   MSSHSKHFNSHDAARNSAMELDDPQKLQEEGDDDPFLKFVDYARSVLAFEDDEDFDPNIN 60

Query: 61  GTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKVPECINQLK 120
           GTET+TPGW+WIASRVLRTC+AYSSSVTPAILLSELSQAWYEQHRVGAPKK+PECINQLK
Sbjct: 61  GTETHTPGWTWIASRVLRTCMAYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQLK 120

Query: 121 KKNRRKKLPKTVTIDSIYAKNFLALSSVLEAVIVDAFILPGTNIHMLTLGDFWSSNTIDL 180
           KKNRRKKLPKTVTIDSIY KNFLALSSVLEAVI+D FILPGTNIHMLTLGDFWSSNTIDL
Sbjct: 121 KKNRRKKLPKTVTIDSIYEKNFLALSSVLEAVILDEFILPGTNIHMLTLGDFWSSNTIDL 180

Query: 181 YLHRRFYDLVDGILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILL 240
           YLHRRFYDLV+GILKKGRQIF+TGCYLRAASGGSG+PRLLPTEYLIILLDEEEDDDV+LL
Sbjct: 181 YLHRRFYDLVNGILKKGRQIFVTGCYLRAASGGSGYPRLLPTEYLIILLDEEEDDDVMLL 240

Query: 241 GAQFCSDSFSSVSFDAVNQGTTYSLYARIESIGPMEIHEKTNGVQMIQIILVDNDGFKLK 300
           GAQFCSD+FSSVS D+VN+GTTYSLYARIESIGP+EIHE  NG++MIQIILVDNDGFKLK
Sbjct: 241 GAQFCSDTFSSVSLDSVNEGTTYSLYARIESIGPLEIHEMMNGLRMIQIILVDNDGFKLK 300

Query: 301 FLLWGEQVMLANLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHE 360
           FLLWGEQV+LANLLSVGS+LALDRPY+ATVNENG+GTSDELCLEYGSATQLYLVPCIQHE
Sbjct: 301 FLLWGEQVLLANLLSVGSVLALDRPYVATVNENGVGTSDELCLEYGSATQLYLVPCIQHE 360

Query: 361 EQVCVLTQNINQASRMLSTSNPTQGPRVSQVSLPCDSHGTIDFGNYPFRSFVIDLQDKMT 420
           EQVCVLTQNINQASR +S S PTQ P+VSQVSLPCDSHG IDFGNYPFRSFVIDLQDKMT
Sbjct: 361 EQVCVLTQNINQASRTVSMSYPTQSPQVSQVSLPCDSHGAIDFGNYPFRSFVIDLQDKMT 420

Query: 421 GISLYGIILDIVNERNTTEAVFSMKIEDNTGEISAKLHFVGSWSLGRVGVGHTVYISGLT 480
           GISLYG +LDI NERNTTEA FSM+IEDNTGE+ AKL FV SWSLGRV VGHTV+ISGLT
Sbjct: 421 GISLYGNVLDIANERNTTEAGFSMRIEDNTGEVLAKLRFVRSWSLGRVSVGHTVFISGLT 480

Query: 481 CTMNKNRFEALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSNAHGTKVCQVQLD 540
           CT NKNR EALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSN HGTKVCQV+LD
Sbjct: 481 CTKNKNRLEALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSNTHGTKVCQVRLD 540

Query: 541 QVSHCHVSTKFLHAICGHFVEETPGRIECSFCRCECESELVRTFDLKITLADDSAKIFAW 600
           QVSHCHVSTKFLHAICGHFVEETP RIECSFCRCEC+SEL+RTFDLKITLADDSAKIFAW
Sbjct: 541 QVSHCHVSTKFLHAICGHFVEETPARIECSFCRCECKSELMRTFDLKITLADDSAKIFAW 600

Query: 601 CTGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTSRCGDNVYSVHD 660
           CTGQTAAELLQISPDEFCELPEEEQVMYPSSLENE+FVVAIVNCRR++S  G+N+   +D
Sbjct: 601 CTGQTAAELLQISPDEFCELPEEEQVMYPSSLENENFVVAIVNCRRRSSTYGNNLNFAND 660

Query: 661 PLSWEITRALKCD 674
           PLSWEITRALKC+
Sbjct: 661 PLSWEITRALKCE 673

BLAST of Tan0020038 vs. ExPASy TrEMBL
Match: A0A5A7U7H0 (Nucleic acid-binding proteins superfamily isoform 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold171G008000 PE=4 SV=1)

HSP 1 Score: 1223.0 bits (3163), Expect = 0.0e+00
Identity = 600/673 (89.15%), Positives = 638/673 (94.80%), Query Frame = 0

Query: 1   MSSRGRNFNSDDAGGNSAMELDGGRQLQEEEDDDPFLKFVDYARSVLAFEDEEDFDPNVN 60
           MSS  ++FNS DAG  SAMELD  R+LQEE DDDPFLKFVDYARSVLAFED+EDFDPNVN
Sbjct: 1   MSSLSKHFNSHDAGRYSAMELDDPRKLQEEGDDDPFLKFVDYARSVLAFEDDEDFDPNVN 60

Query: 61  GTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKVPECINQLK 120
           GTET+TPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKK+PECINQLK
Sbjct: 61  GTETDTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQLK 120

Query: 121 KKNRRKKLPKTVTIDSIYAKNFLALSSVLEAVIVDAFILPGTNIHMLTLGDFWSSNTIDL 180
           KKNRRKKLPKTVTIDSIY KNFL++SSVLEAVI+D FILPGTNIHMLTLGDFWSSNTIDL
Sbjct: 121 KKNRRKKLPKTVTIDSIYEKNFLSISSVLEAVILDEFILPGTNIHMLTLGDFWSSNTIDL 180

Query: 181 YLHRRFYDLVDGILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILL 240
           YLHRRFYDLVDGILKKGRQIF+TGCYLRAASGGSG+PRLLPTEYL+ILLDEEEDDDV+LL
Sbjct: 181 YLHRRFYDLVDGILKKGRQIFVTGCYLRAASGGSGYPRLLPTEYLVILLDEEEDDDVMLL 240

Query: 241 GAQFCSDSFSSVSFDAVNQGTTYSLYARIESIGPMEIHEKTNGVQMIQIILVDNDGFKLK 300
           GAQFCSD+FSSVS D+VN+GTTYSLYARIESIGP+EIHEK NG++MIQIILVDNDGFKLK
Sbjct: 241 GAQFCSDTFSSVSLDSVNEGTTYSLYARIESIGPLEIHEKINGLRMIQIILVDNDGFKLK 300

Query: 301 FLLWGEQVMLANLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHE 360
           FLLWGEQV+LANLLSVGS+LALDRPY+ATVNENG+GTS+ELCLEYGSATQLYLVPCIQHE
Sbjct: 301 FLLWGEQVLLANLLSVGSVLALDRPYVATVNENGVGTSEELCLEYGSATQLYLVPCIQHE 360

Query: 361 EQVCVLTQNINQASRMLSTSNPTQGPRVSQVSLPCDSHGTIDFGNYPFRSFVIDLQDKMT 420
           EQVCVLTQNINQASR +S S PTQGP+VSQVSLPCDSHG IDFGNYPFRSFVIDLQDKMT
Sbjct: 361 EQVCVLTQNINQASRTVSMSYPTQGPQVSQVSLPCDSHGAIDFGNYPFRSFVIDLQDKMT 420

Query: 421 GISLYGIILDIVNERNTTEAVFSMKIEDNTGEISAKLHFVGSWSLGRVGVGHTVYISGLT 480
           GISLYG +LDI NERNTTEA FSM+IEDNTGEI AKL F  SWSLGRV VGHTV+ISGLT
Sbjct: 421 GISLYGNVLDIANERNTTEAGFSMRIEDNTGEILAKLRFERSWSLGRVSVGHTVFISGLT 480

Query: 481 CTMNKNRFEALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSNAHGTKVCQVQLD 540
           CT NKNR EALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSN HGTKVC+V+LD
Sbjct: 481 CTKNKNRLEALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSNTHGTKVCRVRLD 540

Query: 541 QVSHCHVSTKFLHAICGHFVEETPGRIECSFCRCECESELVRTFDLKITLADDSAKIFAW 600
           QVSHCHVSTKFLHAICGHFVEETP RIECSFC CEC+SELVRTFDLKITLADDSAKIFAW
Sbjct: 541 QVSHCHVSTKFLHAICGHFVEETPARIECSFCCCECKSELVRTFDLKITLADDSAKIFAW 600

Query: 601 CTGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTSRCGDNVYSVHD 660
           C GQTAAELLQISPDEFCELPEEEQVMYPSSLENE+FVVAIVNCRRQ+ + G+NV   +D
Sbjct: 601 CMGQTAAELLQISPDEFCELPEEEQVMYPSSLENENFVVAIVNCRRQSRKYGNNVNFAND 660

Query: 661 PLSWEITRALKCD 674
           PLSWEITRALKC+
Sbjct: 661 PLSWEITRALKCE 673

BLAST of Tan0020038 vs. ExPASy TrEMBL
Match: A0A1S3AX73 (uncharacterized protein LOC103483891 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103483891 PE=4 SV=1)

HSP 1 Score: 1221.5 bits (3159), Expect = 0.0e+00
Identity = 600/673 (89.15%), Positives = 637/673 (94.65%), Query Frame = 0

Query: 1   MSSRGRNFNSDDAGGNSAMELDGGRQLQEEEDDDPFLKFVDYARSVLAFEDEEDFDPNVN 60
           MSS  ++FNS DAG  SAMELD  R+LQEE DDDPFLKFVDYARSVLAFED+EDFDPNVN
Sbjct: 1   MSSLSKHFNSHDAGRYSAMELDDPRKLQEEGDDDPFLKFVDYARSVLAFEDDEDFDPNVN 60

Query: 61  GTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKVPECINQLK 120
           GTET+TPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKK+PECINQLK
Sbjct: 61  GTETDTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKIPECINQLK 120

Query: 121 KKNRRKKLPKTVTIDSIYAKNFLALSSVLEAVIVDAFILPGTNIHMLTLGDFWSSNTIDL 180
           KKNRRKKLPKTVTIDSIY KNFL+LSSVLEAVI+D FILPGTNIHMLTLGDFWSSNTIDL
Sbjct: 121 KKNRRKKLPKTVTIDSIYEKNFLSLSSVLEAVILDEFILPGTNIHMLTLGDFWSSNTIDL 180

Query: 181 YLHRRFYDLVDGILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILL 240
           YLHRRFYDLVDGILKKGRQIF+TGCYLRAASGGSG+PRLLPTEYL+ILLDEEEDDDV+LL
Sbjct: 181 YLHRRFYDLVDGILKKGRQIFVTGCYLRAASGGSGYPRLLPTEYLVILLDEEEDDDVMLL 240

Query: 241 GAQFCSDSFSSVSFDAVNQGTTYSLYARIESIGPMEIHEKTNGVQMIQIILVDNDGFKLK 300
           GAQFCSD+FSSVS D+VN+GTTYSLYARIESIGP+EIHEK NG++MIQIILVDNDGFKLK
Sbjct: 241 GAQFCSDTFSSVSLDSVNEGTTYSLYARIESIGPLEIHEKINGLRMIQIILVDNDGFKLK 300

Query: 301 FLLWGEQVMLANLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHE 360
           FLLWGEQV+LA LLSVGS+LALDRPY+ATVNENG+GTS+ELCLEYGSATQLYLVPCIQHE
Sbjct: 301 FLLWGEQVLLAKLLSVGSVLALDRPYVATVNENGVGTSEELCLEYGSATQLYLVPCIQHE 360

Query: 361 EQVCVLTQNINQASRMLSTSNPTQGPRVSQVSLPCDSHGTIDFGNYPFRSFVIDLQDKMT 420
           EQVCVLTQNINQASR +S S PTQGP+VSQVSLPCDSHG IDFGNYPFRSFVIDLQDKMT
Sbjct: 361 EQVCVLTQNINQASRTVSMSYPTQGPQVSQVSLPCDSHGAIDFGNYPFRSFVIDLQDKMT 420

Query: 421 GISLYGIILDIVNERNTTEAVFSMKIEDNTGEISAKLHFVGSWSLGRVGVGHTVYISGLT 480
           GISLYG +LDI NERNTTEA FSM+IEDNTGEI AKL F  SWSLGRV VGHTV+ISGLT
Sbjct: 421 GISLYGNVLDIANERNTTEAGFSMRIEDNTGEILAKLRFERSWSLGRVSVGHTVFISGLT 480

Query: 481 CTMNKNRFEALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSNAHGTKVCQVQLD 540
           CT NKNR EALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSN HGTKVC+V+LD
Sbjct: 481 CTKNKNRLEALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSNTHGTKVCRVRLD 540

Query: 541 QVSHCHVSTKFLHAICGHFVEETPGRIECSFCRCECESELVRTFDLKITLADDSAKIFAW 600
           QVSHCHVSTKFLHAICGHFVEETP RIECSFC CEC+SELVRTFDLKITLADDSAKIFAW
Sbjct: 541 QVSHCHVSTKFLHAICGHFVEETPARIECSFCCCECKSELVRTFDLKITLADDSAKIFAW 600

Query: 601 CTGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTSRCGDNVYSVHD 660
           C GQTAAELLQISPDEFCELPEEEQVMYPSSLENE+FVVAIVNCRRQ+ + G+NV   +D
Sbjct: 601 CMGQTAAELLQISPDEFCELPEEEQVMYPSSLENENFVVAIVNCRRQSRKYGNNVNFAND 660

Query: 661 PLSWEITRALKCD 674
           PLSWEITRALKC+
Sbjct: 661 PLSWEITRALKCE 673

BLAST of Tan0020038 vs. TAIR 10
Match: AT3G17030.1 (Nucleic acid-binding proteins superfamily)

HSP 1 Score: 750.4 bits (1936), Expect = 1.3e-216
Identity = 384/680 (56.47%), Positives = 497/680 (73.09%), Query Frame = 0

Query: 12  DAGGNSAMELDGGRQLQEEEDDDPFLKFVDYARSVLAFEDEED------FDPNVNGTETN 71
           D  G S +E+       +EE +DPFL F+DYAR+V++ ED+ED        P    TE +
Sbjct: 3   DTNGASLIEIG-----DQEEVEDPFLAFLDYARTVISPEDDEDEKEESKRGPGEAMTEAS 62

Query: 72  TPGWSWIASRVLRTCIAYSSSVTPAILLSELSQAWYEQHRVGAPKKVPECINQLKKKNRR 131
            PGW W+ASR+L+TC AYSS VT AILLS+LSQAW+EQ++ G  KK PE I+QLKK +RR
Sbjct: 63  GPGWGWVASRILKTCTAYSSGVTAAILLSDLSQAWHEQNKPGMSKKKPELIDQLKKGHRR 122

Query: 132 KKLPKTVTIDSIYAKNFLALSSVLEAVIVDAFILPGTNIHMLTLGDFWSSNTIDLYLHRR 191
           ++L  TVTIDSIY KNFL+++SVLEAVI++A +LPGTNI MLTLGDFWSSNTIDLYLHRR
Sbjct: 123 RRLANTVTIDSIYEKNFLSMNSVLEAVIINADVLPGTNIFMLTLGDFWSSNTIDLYLHRR 182

Query: 192 FYDLVD---GILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILLGA 251
           +Y+LV+   GIL+KGR++ +TGCYLR A  G G PRLLPTEYL++LLDE++DDD IL+ A
Sbjct: 183 YYELVETPNGILRKGREVLITGCYLRTAREGFGTPRLLPTEYLVVLLDEDQDDDAILIAA 242

Query: 252 QFCSDSFSSVSFDAVNQGTTYSLYARIESIGPMEIHEKTNGVQMIQIILVDNDGFKLKFL 311
           QFCSD+FSSVS DA N G +YSLYARIESIGP+E     +  +  QI LVD DG +LKF+
Sbjct: 243 QFCSDTFSSVSLDAFNDGASYSLYARIESIGPLESELTFSTARRRQISLVDGDGDRLKFI 302

Query: 312 LWGEQVMLANLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQ 371
           LWGEQV++ANLLSVGS+L ++RPYI+++ E+ +  + E CLEYGSAT LYLVP    EE+
Sbjct: 303 LWGEQVIVANLLSVGSVLGIERPYISSLEESAMEGNYEFCLEYGSATHLYLVPSTLQEER 362

Query: 372 VCV-LTQNINQASRMLSTSNPTQGPRVSQVSLPCDSHGTIDFGNYPFRSFVIDLQDKMTG 431
           VCV L+Q+  Q S++L +        VSQV+LP D+ G++DF NYPFR+ + D++DK TG
Sbjct: 363 VCVALSQHQCQGSKLLGSVG------VSQVTLPRDADGSVDFSNYPFRTMITDIRDKTTG 422

Query: 432 ISLYGIILDIVNERNTTEAVFSMKIEDNTGEISAKLHFVGSWSLGRVGVGHTVYISGLTC 491
           ISLYG++ DI  + N T  VFS+KIED TG I AKLHF   WSLGR+G+GH VY+SGL+C
Sbjct: 423 ISLYGVVTDISCDPNATGVVFSLKIEDTTGAIWAKLHFTNYWSLGRLGLGHVVYVSGLSC 482

Query: 492 TMNK-NRFEALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTSNAH-GTKVCQVQL 551
            + K N  E LW E    A+FVNLSCLPA LTSSC+H +S LS ++        +C+V+L
Sbjct: 483 KITKENCIEMLWHEKDEKATFVNLSCLPAFLTSSCIHLISTLSQISKQRKPAINICRVKL 542

Query: 552 DQVSHCH-VSTKFLHAICGHFVEETP-----GRIECSFCRCECES--ELVRTFDLKITLA 611
           D++  CH ++T+  H++CGHF++E         + CSFCR  C S  E+VRTF + ITLA
Sbjct: 543 DEIDQCHNINTRLAHSLCGHFIDEESSSSYGANLHCSFCRVSCNSNTEVVRTFHITITLA 602

Query: 612 DDSAKIFAWCTGQTAAELLQISPDEFCELPEEEQVMYPSSLENESFVVAIVNCRRQTSRC 671
           D+  K++AWCTGQ+A+ +LQISPDEFC+LPE++Q+MYPSSLENE F+V + N   +    
Sbjct: 603 DEETKLYAWCTGQSASAILQISPDEFCDLPEDDQLMYPSSLENEWFLVILANSGSRNLGS 662

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_023538883.1 | 0.0e+00 | 92.26 | uncharacterized protein LOC111799677 [Cucurbita pepo subsp. pepo]
KAG7029015.1 | 0.0e+00 | 91.96 | hypothetical protein SDJN02_10198 [Cucurbita argyrosperma subsp. argyrosperma]
XP_022938337.1 | 0.0e+00 | 91.82 | uncharacterized protein LOC111444466 [Cucurbita moschata]
XP_022972298.1 | 0.0e+00 | 91.07 | uncharacterized protein LOC111470879 isoform X1 [Cucurbita maxima] >XP_022972299...
XP_004134503.1 | 0.0e+00 | 88.86 | uncharacterized protein LOC101215087 [Cucumis sativus] >KGN57165.1 hypothetical ...

Match Name | E-value | Identity | Description
A0A6J1FDS0 | 0.0e+00 | 91.82 | uncharacterized protein LOC111444466 OS=Cucurbita moschata OX=3662 GN=LOC1114444...
A0A6J1IB36 | 0.0e+00 | 91.07 | uncharacterized protein LOC111470879 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A0A0L5D2 | 0.0e+00 | 88.86 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G166300 PE=4 SV=1
A0A5A7U7H0 | 0.0e+00 | 89.15 | Nucleic acid-binding proteins superfamily isoform 1 OS=Cucumis melo var. makuwa ...
A0A1S3AX73 | 0.0e+00 | 89.15 | uncharacterized protein LOC103483891 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...

Match Name | E-value | Identity | Description
AT3G17030.1 | 1.3e-216 | 56.47 | Nucleic acid-binding proteins superfamily

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | GENE3D | 2.40.50.140 |  | coord: 245..377; e-value: 3.1E-5; score: 25.7
None | No IPR available | GENE3D | 2.40.50.140 |  | coord: 534..667; e-value: 6.1E-6; score: 27.6
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..33
None | No IPR available | PANTHER | PTHR36033 | NUCLEIC ACID-BINDING PROTEINS SUPERFAMILY | coord: 35..655
IPR035201 | Cell division control protein 24, OB domain 1 | PFAM | PF17246 | CDC24_OB1 | coord: 36..163; e-value: 2.8E-45; score: 153.4
IPR035200 | Cell division control protein 24, OB domain 2 | PFAM | PF17245 | CDC24_OB2 | coord: 168..295; e-value: 4.2E-38; score: 130.9
IPR035203 | Cell division control protein 24, OB domain 3 | PFAM | PF17244 | CDC24_OB3 | coord: 414..632; e-value: 8.8E-72; score: 241.4
IPR012340 | Nucleic acid-binding, OB-fold | SUPERFAMILY | 50249 | Nucleic acid-binding proteins | coord: 555..654
IPR012340 | Nucleic acid-binding, OB-fold | SUPERFAMILY | 50249 | Nucleic acid-binding proteins | coord: 252..355

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Tan0020038.1 | Tan0020038.1 | mRNA