Tan0007884 (gene) Snake gourd v1

Overview
NameTan0007884
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUnknown protein
LocationLG10: 975486 .. 985602 (+)
RNA-Seq ExpressionTan0007884
SyntenyTan0007884
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CGTAAATTCTTCCCACGTTCTGACTGTACAGTTTTTGCTCAATTCCGTTCACTTTCTTCCTCTTTTGATTCCTTTTCCGTTGAATTAACCCGATCTGACTTCTTCATCCAGTCGTCGTCCGGCTATCTCCAGCCTTCCTTGCCCTTCTATTCTCTCTCCGATCGACGAGTTCAAGCAGAAGAACTGTTCGAAATCAATTCACCCCCAAACCCAGGTTGGTCCTTTTCAAATCTCAAAACCCAGATGGGTTTTCCCTTCACTCGTTCTGAAAGTGAGAACCCGGAAGAGGGTTTTCTTCTTGGCTCTTGAAGAAAGTATTAACTTGTCGTAACAGTTAAGTCAATATGGCACTATCCTGATTCTCAAGAAATTCGATTAAGTTTTGATTTTCTAGTTCTTTTTGAAGATTTCTTGCCCGAAACGGAATATGCAATAAACCTCAGTGGCGATAAATCCTTTTTGAAGTTTTCCCTTTGACCTTACATGGACTGTTATTTGTTTGTTGAAATTTCTTTTGATCGGAGTGTAATGGTGATTGTCCTTTCTCACGTCCCACTTCTCATATTGGTGAAATACTTTTATTTTCTTTTGGTGAAATACCTTTCACTAATTTGACCCTTTCATTTTTGGCTATATGATCTATGTGTTTATTGGTCTTTCAATCTATGCGTTTATTTCCCTTTTCTCTTCGAGAAGAAAGTAATCACTGAACCTCACTATCGATCTTTCTTCACATTTGGAAAGCTTAAACTATTTTTGTGTTCTTTACGTACGTGTGATAAGCTCTGAATAAGTATTGGTGTACGTGTATTGCATTCTGATACTGTTCATTTGTTTAGATGTTATTGCCTGATTGCACAAAAGAGGGCCTTAATTTGGGCCTATCTTAGTATGGGACCTGAAATTGGATCAAAAGTGAAGGAGGAAGCTTTGATGGAGGACAAAAATGCTGTTCAGGATAAGCAGAGTGCAAGTAGTGGTCAGGAAAAAATTCATGACATGGAAGCTCCATCTGTTGAACGAACTATGATGTTAGATAGAAGTGAGGATATGGAGCTTGATATTATTGGGTGTACAGATAATTGTGAGGGAGGTCCTAGTAGTGAATGCAATGTTTCAACTGAAAATTCAAGCTCGTTTGGTGATACTGTTTCTGGGACAGATTATGGTTTGTTATTGGATGATGAAGAAGTTGAATCCCAATTATATGGAGATAATAATTTGCAGCCTATGTCTAATGGATACAGAGAAGTATTTCCAAGGTAGTACATGTATCTTTTCTTTTTCTTTTCCCTTGTCCTGCATTGATTTGTTTCTTTTTTTCGACTTTTTCTCTCTACTTCATTGTGTGCTATTGGTGGATGAACAATTTGAAAGTTGATTTTTATATACCATCAGTGGTGTATAGAGATTGGACTATTTATCCTTTTTTCTTGAGCTTCCAAATTCAAGTTTTCCTTTCAGATTAATATTGATTAACATGTGAAAGATTCTCATCTTTGTTGGTCTTTTGCTTCAAGATTAAAAATTATTGTACAGCCTTCTCACCTCTTCTACTTTTGAGTTCGGATGTGTCAGTAGACTCGTATATCTGAAATACTTTGCTAGTGAATTTCAATCAGGAGTGAAGAAAATAACCTTGTCAAAAGAGAATTGAAGTCCTTGTCTCTGGATGTACTATGAATTACGATCACTGATTCACTGTCCAACCAAACTCCCAAAGGATAGCAACAGTTGCTTGAATCAAATTCTGCTGCGTGAAATTTTTAGCTCAATTATTGTACTTTCCCTGCTAATTGAGTATGACAGGTCTCTATGTTGTGTACTTATTCTAGGAGGGCTTGAAGGGAATTTATGCGTAGTATGGTCATTTAGTTTTGTCTGTTATAGGGGGTAGGAATTAGGATATTACTCCATAGCTTCAAAAATTATGCAAACCATCTTATAATGATCAACCGGCATCTTTAAGGGCGCGAAGTGTTCTGTGGAGGAGGTTTGGTCCTTAGCTAGGTTTAACACCTTGCTTTGGGCAACTGCATCTAAAGAATTTTTAATTACTTCCTTTTTATTTTTCTAAAAAAATTATTCGTTAAGTCCTATCTCTTGGAGTTGAGCCCTTTTCTTTTTTATTAGGCTAGTCCCTAGTTTTTTTAATGGCCATTGTTGGCCCTCTTGTATTCTTTCGTTTTTTCTCAATGAAAGTTTGGTTTTTAATCCAAATATGATATAATGAACAAGTCACTTATGTCAAAATGAAACTATAATTAACACAAAGAAGATTACAAATATCATATATGAATTTCCTAGCCCTAAAAAAGTATCATATACATGTATATGATAATGTTGTATTGGTTGGGTTGCCATCCTTTCCCTTGTGATGGAAGAGGTGGTTTGGATCCTCCTTTTAGGGAGAAGGGTGAAGTATTGTGGCAAGTGAGTTTCTTTGCAGTTTTGTCGGGTATTTGGATTGAGGGGAACAATAGATTTTTTCTAGAGGTTGAGAGGCCGGGTCTGGAGGTTTGGGAGGTGGCTAGGTTTAATATCTCATTGTGGGCTTCGGTCTCTAGTCTTTTTTGTAATTATGATCTTGGTATTGTTATTTTAGATTGGTCGGACTCCCTTTATTTATGTAAGGGCTTGTTTTTTGTATGATATTGTATTCTTTCATTTTTCTCAATGAAAGTACGATTTCTTACCCCCCAAAAAACGCAAGCCATGTCAAGTCAAGCTAGGTTGAGTTGAATTTTGTCTTTTAAAATTTTGTAACTTCTTTAAATATGTATTGTTGGACTAGGACAGATACGACTCAACAATTCAACTTGGTTGAGTTTTGAACCATGGGTTAGGGTGCACTTGCTTACTCCTTAAAATATTAATGATTTCATCTGTCTTCACCTTGCATGGCTGACATAACGAGAACAAGGCAAAAACTTTTGTGATTCATAGTATCAAGTCTACGTTTTCCACAAATATGTTATCGTTTCTCTCAACTTTATATGATAATGATATTCACGTGTTTATATATACATCTTACTTTCTAGGAAGAAAAAATTGACAGATCACTGGAGGAAGTTTATAAGTCCTGTTATGTGGCGGTGTAGATGGTTAGAACTGCAAATTAAGAAACTTCAGTCTCAATCATTAAAATATGATAGAGAACTTGCATTATATGATCAAAGAAAGCAGTCTGTCTACGAACACTTCTCAATGGAAGATTTTGATGTGAAGTCAACAGGATTCTCAATTCACACTCAAAGACACAGGGTTATGAAAAGAAAGAGAAGGAAGAAAACTGAAGAGACAACTGAAGCAGCTTCATATATGGGACACCATAATGTGTTCTCCTACTATGGTATCAAGTCCTCAACTATTTCTATGCAACACTTTTTTCCCTAGTATCTCCTTCTTTGACATTGTAGCCTTCTTTTGTTATTCCAGAGAAGAAGAGGCCCATTGCTGATGACATAACTATGGAAGATACTTCTCTTAAATTAGGTAAGTGAAGTTGATCGTATTATTACTTTGGTTGATTTCATCAAATTGTACACGGCTGAATAGGAGTTTGCTGCAGATTAGTAATAACCGGAGGAGTGAAAGCTATTTTAAGTCTTTTATCGTCAAATAAATTAGTTCTATACTTGATATTTGATTATGAAAAGTTTACACTATCGTTTTTCTTAAGGTATGTGATATTGTTATCCCTTTTTTCCCCGTATGTGGGGTTGACATAGTTGATCCTTATGAACTTACAACAATCCCTATGGAGATGCTTCCAAGAGAATATACCAAAAACTTGTCCTATTTCTCTCATTTTCTCTTAGATGATAGTTTTAACAATATTGAGCCATAAAATCGGCTTTTCTTTTTCTGGGATGGCCTGAAAGAGTCTCAATACCGAAATTCTTTGAAGTTCTAGGGTGTTAGGAAGGACAAGGTGTGTTTTCGGGGTGGACTAGGGTTAGGAATATTAGTAGGGAAATGTTAAGAATATTGATAGGATATTAGGAAGGCATTTTTCTATCAATAGGAAATGGTTTATGAAAGGTATTGAGAGTTTGGAGGTAGAGAGAATTTGGAGTGAAGTTATTTTAGGAGGCTCCATGATTCTCGAAGTGCTTGGGTAATGTATTTTCTTTTATCATTTCCACTTTCTACCAGTTTCCATTGCAAAATTGGTTCCTATCATAGGGAGAACCATTTCCAAGCCAAAGATATTACAAATAAGTTTCATAGATCAAGAGAATGAGAACATCGTAAGAAAAATGATTTGATGCCTTTCAATGGGGACCTGCAAAGAGAGAACCAGCCCAGCAAGATGCTGAAATTACAACATTTTTATCAGACTTCATCATAAGACTGACTTAATAATATTTATAGACCATGCTGTAAATAGGAAAGAGACTCACATGCGAAAGAACCTGACTTGTCACTGTGCCAAAGTCTATTTTGTGGAGATCTTTAATATGACAGTGATGTATAGTTTCTGGAGTGTATGAAATCATCCAATGGATTTCGGATTTTCCAATTTTATCGGTAGATGATATACATCTATTACACTTGGGTTGGAAGAAACATCTTGCGCCACATACATTTCACTCAAGAAATCTACTCTTTCCAAATAGATTGTTAGCTAGAAATTAGAATCCATGTGGATAATGAGGAAGTTTTAGCGTAGGCAACTAGGCAAAAGTGCATTGCTGGTGGTTGGCCATTAACCTACCGAGGTAGCCCTTGGGAGGGAATTTGGTGAAAAATGGGAATCCCATGAACCATGGTAGAAAGGAACCTGAACAAGCTAAATAGATAGAAAAATATGACTCATAAAGGAGGCTATATCACCTTGTCACAATCAAGACTCACCAGCATCCTATTAATAATTTCTTGTCCCTTCTAAATAAAGATGTCCACGGGGCGGGGCGGGGTTGGGGAATGGCTCCCCGTCCCCGTCCCCGCCTCCCATTTTATTCCCCGTCCCCGGGCGGGGATTCGCCGGGATCGGATTCCCCGCGGGGAATAATTTCCCGTTTAGAATTTTATTTTTCTAAATTTTATTTAATGTTTCTTTTTTTTTTTTTTTTTTTGAAAATTTGTATTTCAATGGTTAATTATTCTTTAGAGAATTCTAGTTAAGAGACATTTTTTCAATTTTCTTAAAAAATAAAATGATATTTAATAAACTTTTAGACTTATGGTTTTATTAGATGTATACAAATATATATATTTAAAATTAAATAAATTAATCAGGGCGCGGACGGGGAATGTATTCCCTGTCCCTGTCTCCGCTTAGCCAACGGGGAAAAATTTTATCCCCGCTCCCTCCCCCATTCCCCGCCCCCGCGGGGAAAATGGACATCTCTACTTCTAAACGTATGTGGCACAAGATGTTTCGTTTTTTGATTTTGTTGGTTGTCCCTCCCCCCCGCCCCTTACGACCGTTGTTTTCCCCTCGCTTTGGAAGGTTAAAATTCCTAGGAAGGTAAATTTTTTGTGTGGCCAGTGCGTGCTCCTTTCTCAAACCTGAACAACTCTGAACGTTATGGAAGATAAGGGAAGTCACAAATTTTACATGAGGGAGTCAACACTTTAGATTGTGTTCAACGACATTTCTCTTTGGTATTGTTCCCGTAGTGTGTGTTCTTTGTAGGAGTCAGGAAGAGGATCTCTATCACTTATTATGGGGTTTTCAGTTTGCTCAATCTCTGTGGGCGTGGTGGCTGCAGTTGTTTTGGTGTTGTTTGACTTGGAATAGAGATTGTTATGTTATGGTGGAGGAGGTGCTTCTAAGCTCCTCCTTTTGTTAAAGGAGCAAAGTCTTATGTCATGCTAGCTTATTTGCCATTCTGTGAGGTATTTGGCTTGAGCATAATAATAGGACTTTAGGGAAGTGGAGAAGTTGGGTAAGGTTTGGGAGGTGGCGAGGTTGAATGCTTCGTTGTGGGTTTTGGTTATTAAGCCTTTTTGTAATGATATTGGTTTGATTTTGTTGGACTGGAGTCCAGTTCAGTAGTTAGTTCTTGGACTCTCTTTTTGTTGGGCTTGTTTTTTGTATGCTCTTGTTTTCTTTCATTTTCTCAATGAAAACATCGGTTTCTTACCAAAAAATCATTTCTTGTCCCTTTCCAAGATCTTTGAGTACGGTAGTGAATATTATTGAAAATGGAAAATGTGCAAGTGGCACACTGCTTCTCTCTTCGTTGATGTAGGCAGCTTGGGCTTGAGAAACCTAAGCAAGAATATTGTTGCTTTCTTAATGAATGTGTTGATTTTAATACAAACTGGATTCCTTGTTTGGCTTTCTTTGTAGCTAGCAAATATTGAAGCGGAAGGATTGGCTAGTTTTCAGCTATCAATAATGGGATTAGTGAAAGCAGGTGAAGTTTCTAAAGGTTTATCTAGGAATCCAGCAAGCCTATGAGTGGCTATGGAGATTGGGCAATGGGTGATGTACTGTCTTTTGGGAGACTTGTGGATTTACTAATCAACCTCTGAAGTATGGTTCTCAGATTCTTCCAACTGTGGCTAGTTTGGAAACCTTTTTTTATGAAAGATTCTTGGAATCCTACAGAAGGGGGATGAAGTCTTTCATTCAGAAGAACTTTTTTTTTTTTTTTTTTTGATAAGAAACTGAGCTTTCATTGAGAAAGAATGAAAAAATACAAGGGCGTACAAAAAACAAGCCCACAAAAGAGAGGGGGAGCTTACAACAAAACACTCCACCCAAGCAAAATAAAATCTAGAGGATAATTACAAAAAGACTTAGACACCGAGGCCCAGAGAGAAACATAGAATCTAATAAGGTCCAAATATCAAAGGAGGAATGCTCCAACCCTCGAAAGATTCTATTGTTTCTCTCCCCTAAGTCCCCAAAGGACAGCACAAACCCCTGCCTTCCATAAAAACGCCCCTTTCCCCCTAACTGAGGGTGCAAAAGGAACTCCTCCAACCTCTCCCGACAAGTCCCTTCTCCTCTAGAAAAGGAAAGCCCAAACTCACCGACGAAAAGATTCCAGACATCCTGAGCCAACTCACAACTCCAAAATAAATGGTCGATATTCGCTGCCGCCCTCTTGAAAAGAATACAACACCAGGGATCAACCAAGGACGGCAATCTTCTCGAGACCCTATCCAGAGTATTGACTCGTCCGTGAAGAACCTGCCAAGCAAAAAACTTAACATTTTTTGGGATTTTCACCTTCCAGACTAAGGCAAAGACAGACTCCCCTCGTGAAGCAGAGGAAGAAGGATTACCAAAGAGAATTGGAGGAGTACAAATAGGCAATCTTGGTGAGTATTCTTGAGCGGTTCCTGACTTAACCAACATAGACGCGCCATTTTATGACAGATAGACAAGTCCCAGAATTAATCTCGTTAATCATTTGCTGCCCATCTTACTCAACACCTTACCTTTTAAGGGCTATTTGGAATAGAATCCACAGAAAAGTCAAGGTTATTCTTTGGTCCCTAGTTTTGAGGGGGATAAACACTACGGGCATGGTGCAAATAAAACATCCTTACATGTGATCTCTCCTAACAGGTATTATCTTTGTAACAGAATTTGGACGACCTAGACCTTGTCTTCCTCTACTACACTATGATTACAAATGCTAGTGTTAGAATATGTTGAATGTGTATAGTCTCTCCTGATGCTTACCAAAGAATGTTAATGATTTTTTTGGAAACATGGACTGTCATTTTGTGGGCATCCCTTGAGACGAAAGCCTAAAACCATCTCAAAATGTGAGTTGTGATGACATTTCTTTGAGAAGCTGGTTGGAAAGAGACCAAAGAGATGTTCAAAAGTTCCAATCTTCGCTAGGTCTTTCCTAGAAGCAGTTCAGATGCATTTTTCCCACTCATCTCCGGGACTTTCTTTCGTGGCAATGCTAGAATCCTTTGGTACAATGCGGTGGTGGCTCTCTTTTGGAGAGACTGCTTTGACAGAAACAAAGGATTTTTCAAGGGGAACACTGGCCTCGTTACATTTTTTGAGATTTAGTTAAACTTCACTCTTCTACTTGGCTTTATAGGGATATCTCATCTCCTTTTTGTACATGGTCAATAATATCGTTTTTTCCCTAGAAAAATACAAGGGAAAAAAAATCTAGTGTTTGTAATGTCCTTAAGGATCCTATTTGACTTTTTTATGCTTCACAGCTTACGCTTTTGAACTTCTCTGTAACCATGCCTTCTTTATGCTAATTAGAGTAATATTCATGGTCATGTCTTGTACTTTTAAAACTTTTTTCCGAGTGTCTATCAATATAATTGTAATTAATAAACAAATAATTCTGTTTCTTACAGTTAAAAGAATAGCCCAATTTATAGGATTTTAAGTTTGAAAAGACTATTTATGTGTGGAGAGCTCTTGGCTACACTAGTCAATAGGCTAGCTATTGATTCTCTGCTCTGATCTCTACAGACAAGACAAGGAATATGAAACATGATGCCATCAATGACTTCGGGCCAATTGCAACTGATGGATGGCCATCTTCTATGTTGGGAGATAATGATAATAATTTGGAAGAAATCTTTCTAAAAATTGAAGCTGCGCAGTCAAAAGTTCACGAGTTGAAGAACAGAATTGACAAGGTGGTGAATGAAAATCCCATGAAGTTCTCCTCAATCAGTCAGCTATACTTGCTTGCATCAAGTGATGATCCCGCTTCACCTGAAGACGGAAATGATGTGTTTGTTAGGTCTTTGCATGAAGCATCACAACACATGTCTGAGCATGCATTAGATGTACTTATGCCCGAAAGTGCGACTAGAAGTCATGGAGAGGTCATGCTACTTCCTGATATGATTCAGAGCGTGGATCGTGGAAGTGTAAGTTTCCCCTACTACTTCTATATGCAATATATTTCTTTTCCTTTCAACTATAGAAATGTCACCAAGCATGGATTATTATCTTCTACACTGTTCATTTGTGAAAGTGTTGGTATGCATCCCTCTACTATATGCCATTCTGCTCTCATTTTCAATTTTCAATTAAAGAAATGCTTTAATTATTGGTCACGCAAGTCTCATATATCCTGTCATTGCAATCCTAACACTGTATGATGTATGCTGAGGTTTTCCTATGTTTATTCTGTTTAACAACGTGATCCTCTATCAGAATAGTAGCATTAGCAGTGTTATTTTTTGCTTTTTTTTAACCAAAAACAAAACTTTTCATTGATAGAATGAAAAGAGACTAATGCTCAAAAGTTACCATCTCTACAAGGGAGTGAAAAGAACAAAAGAAAAAAAATCAGCCCACAAAACTCATAAAATCTGAAAAGAATAAAAACCATAAAACACACAAAACTAGGATGGACTCCCAATTCATACTAATATCTTGCAAGTACACTATTGAGAAGTTTTCCATCTAGCCGCCTCGAACCTATCAACCATTGATTTTCCTTGTTTTAAAAAACCCTTTGATTTCGTTTCAACCAAAATTAAAGAAATAATGGCTTTGACTACATTAACCCGTAAGAAATGTGGTCTTCGTACACATCGGGGGACTAATACTGAACTGATTTCTTGTATTGTTATTGTGTTGCAACTGTTATGATTATACGAAAAACATGGTAGTTCTAAACAATTGGATCTTACTTGTATATTTGCAGACTGAGAAAGTTCTGATGCAAGATTCCGCAGTCAAGGAAGAGGTGCAAATTCCTGAAGAGGTTAATACTCAGTTTATTGAGCAGACTCCGAAATTGGAGGAGCAAATCATTTCTCCAGCTGCAGCCTCTCAAGCTGACTTAGCCTCAGAAGACAAGGAGCCTGACATGCAACACAAAACAAAACCCCCTTCTGCTGTCAAACCTAGTTCATCTAAGAGAACAAGAAAGCGGGGAAGGCGAAAAATCAGTTCGAGTAAACAGAAACGGAAAGCAACAGGTTAGCTGGATAAATGGGACTACAGAAGATGGTTATTTTTCTTCTTATGATGTGGTTGATGGAGGTTAGAATTTTGTTAATGGTTATTGCTCTACTTCTACTGTATATTTTCCATGTCATGGTCCTCTCTCGTCTGCTGCACATATCAACTCAATTATTTGTATATGCTGCAAATATTCATTAATTATGTTAATATGAAGAATAGCATTAATCCACAGTGGTTGGCCAC

mRNA sequence

CGTAAATTCTTCCCACGTTCTGACTGTACAGTTTTTGCTCAATTCCGTTCACTTTCTTCCTCTTTTGATTCCTTTTCCGTTGAATTAACCCGATCTGACTTCTTCATCCAGTCGTCGTCCGGCTATCTCCAGCCTTCCTTGCCCTTCTATTCTCTCTCCGATCGACGAGTTCAAGCAGAAGAACTGTTCGAAATCAATTCACCCCCAAACCCAGATGTTATTGCCTGATTGCACAAAAGAGGGCCTTAATTTGGGCCTATCTTAGTATGGGACCTGAAATTGGATCAAAAGTGAAGGAGGAAGCTTTGATGGAGGACAAAAATGCTGTTCAGGATAAGCAGAGTGCAAGTAGTGGTCAGGAAAAAATTCATGACATGGAAGCTCCATCTGTTGAACGAACTATGATGTTAGATAGAAGTGAGGATATGGAGCTTGATATTATTGGGTGTACAGATAATTGTGAGGGAGGTCCTAGTAGTGAATGCAATGTTTCAACTGAAAATTCAAGCTCGTTTGGTGATACTGTTTCTGGGACAGATTATGGTTTGTTATTGGATGATGAAGAAGTTGAATCCCAATTATATGGAGATAATAATTTGCAGCCTATGTCTAATGGATACAGAGAAGTATTTCCAAGGAAGAAAAAATTGACAGATCACTGGAGGAAGTTTATAAGTCCTGTTATGTGGCGGTGTAGATGGTTAGAACTGCAAATTAAGAAACTTCAGTCTCAATCATTAAAATATGATAGAGAACTTGCATTATATGATCAAAGAAAGCAGTCTGTCTACGAACACTTCTCAATGGAAGATTTTGATGTGAAGTCAACAGGATTCTCAATTCACACTCAAAGACACAGGGTTATGAAAAGAAAGAGAAGGAAGAAAACTGAAGAGACAACTGAAGCAGCTTCATATATGGGACACCATAATGTGTTCTCCTACTATGAGAAGAAGAGGCCCATTGCTGATGACATAACTATGGAAGATACTTCTCTTAAATTAGACAAGACAAGGAATATGAAACATGATGCCATCAATGACTTCGGGCCAATTGCAACTGATGGATGGCCATCTTCTATGTTGGGAGATAATGATAATAATTTGGAAGAAATCTTTCTAAAAATTGAAGCTGCGCAGTCAAAAGTTCACGAGTTGAAGAACAGAATTGACAAGGTGGTGAATGAAAATCCCATGAAGTTCTCCTCAATCAGTCAGCTATACTTGCTTGCATCAAGTGATGATCCCGCTTCACCTGAAGACGGAAATGATGTGTTTGTTAGGTCTTTGCATGAAGCATCACAACACATGTCTGAGCATGCATTAGATGTACTTATGCCCGAAAGTGCGACTAGAAGTCATGGAGAGGTCATGCTACTTCCTGATATGATTCAGAGCGTGGATCGTGGAAGTACTGAGAAAGTTCTGATGCAAGATTCCGCAGTCAAGGAAGAGGTGCAAATTCCTGAAGAGGTTAATACTCAGTTTATTGAGCAGACTCCGAAATTGGAGGAGCAAATCATTTCTCCAGCTGCAGCCTCTCAAGCTGACTTAGCCTCAGAAGACAAGGAGCCTGACATGCAACACAAAACAAAACCCCCTTCTGCTGTCAAACCTAGTTCATCTAAGAGAACAAGAAAGCGGGGAAGGCGAAAAATCAGTTCGAGTAAACAGAAACGGAAAGCAACAGGTTAGCTGGATAAATGGGACTACAGAAGATGGTTATTTTTCTTCTTATGATGTGGTTGATGGAGGTTAGAATTTTGTTAATGGTTATTGCTCTACTTCTACTGTATATTTTCCATGTCATGGTCCTCTCTCGTCTGCTGCACATATCAACTCAATTATTTGTATATGCTGCAAATATTCATTAATTATGTTAATATGAAGAATAGCATTAATCCACAGTGGTTGGCCAC

Coding sequence (CDS)

ATGGGACCTGAAATTGGATCAAAAGTGAAGGAGGAAGCTTTGATGGAGGACAAAAATGCTGTTCAGGATAAGCAGAGTGCAAGTAGTGGTCAGGAAAAAATTCATGACATGGAAGCTCCATCTGTTGAACGAACTATGATGTTAGATAGAAGTGAGGATATGGAGCTTGATATTATTGGGTGTACAGATAATTGTGAGGGAGGTCCTAGTAGTGAATGCAATGTTTCAACTGAAAATTCAAGCTCGTTTGGTGATACTGTTTCTGGGACAGATTATGGTTTGTTATTGGATGATGAAGAAGTTGAATCCCAATTATATGGAGATAATAATTTGCAGCCTATGTCTAATGGATACAGAGAAGTATTTCCAAGGAAGAAAAAATTGACAGATCACTGGAGGAAGTTTATAAGTCCTGTTATGTGGCGGTGTAGATGGTTAGAACTGCAAATTAAGAAACTTCAGTCTCAATCATTAAAATATGATAGAGAACTTGCATTATATGATCAAAGAAAGCAGTCTGTCTACGAACACTTCTCAATGGAAGATTTTGATGTGAAGTCAACAGGATTCTCAATTCACACTCAAAGACACAGGGTTATGAAAAGAAAGAGAAGGAAGAAAACTGAAGAGACAACTGAAGCAGCTTCATATATGGGACACCATAATGTGTTCTCCTACTATGAGAAGAAGAGGCCCATTGCTGATGACATAACTATGGAAGATACTTCTCTTAAATTAGACAAGACAAGGAATATGAAACATGATGCCATCAATGACTTCGGGCCAATTGCAACTGATGGATGGCCATCTTCTATGTTGGGAGATAATGATAATAATTTGGAAGAAATCTTTCTAAAAATTGAAGCTGCGCAGTCAAAAGTTCACGAGTTGAAGAACAGAATTGACAAGGTGGTGAATGAAAATCCCATGAAGTTCTCCTCAATCAGTCAGCTATACTTGCTTGCATCAAGTGATGATCCCGCTTCACCTGAAGACGGAAATGATGTGTTTGTTAGGTCTTTGCATGAAGCATCACAACACATGTCTGAGCATGCATTAGATGTACTTATGCCCGAAAGTGCGACTAGAAGTCATGGAGAGGTCATGCTACTTCCTGATATGATTCAGAGCGTGGATCGTGGAAGTACTGAGAAAGTTCTGATGCAAGATTCCGCAGTCAAGGAAGAGGTGCAAATTCCTGAAGAGGTTAATACTCAGTTTATTGAGCAGACTCCGAAATTGGAGGAGCAAATCATTTCTCCAGCTGCAGCCTCTCAAGCTGACTTAGCCTCAGAAGACAAGGAGCCTGACATGCAACACAAAACAAAACCCCCTTCTGCTGTCAAACCTAGTTCATCTAAGAGAACAAGAAAGCGGGGAAGGCGAAAAATCAGTTCGAGTAAACAGAAACGGAAAGCAACAGGTTAG

Protein sequence

MGPEIGSKVKEEALMEDKNAVQDKQSASSGQEKIHDMEAPSVERTMMLDRSEDMELDIIGCTDNCEGGPSSECNVSTENSSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQPMSNGYREVFPRKKKLTDHWRKFISPVMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVYEHFSMEDFDVKSTGFSIHTQRHRVMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRPIADDITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKIEAAQSKVHELKNRIDKVVNENPMKFSSISQLYLLASSDDPASPEDGNDVFVRSLHEASQHMSEHALDVLMPESATRSHGEVMLLPDMIQSVDRGSTEKVLMQDSAVKEEVQIPEEVNTQFIEQTPKLEEQIISPAAASQADLASEDKEPDMQHKTKPPSAVKPSSSKRTRKRGRRKISSSKQKRKATG
Homology
BLAST of Tan0007884 vs. NCBI nr
Match: XP_022974820.1 (uncharacterized protein LOC111473601 isoform X3 [Cucurbita maxima])

HSP 1 Score: 736.9 bits (1901), Expect = 1.1e-208
Identity = 392/481 (81.50%), Postives = 422/481 (87.73%), Query Frame = 0

Query: 1   MGPEIGSKVKEEAL-----MEDKNAVQDKQSASSGQEKIHDMEAPSVERTMMLDRSEDME 60
           M PE GSKV+E  +     +EDKNA QDKQS SSGQEK H MEAPSVE++MM +RSEDME
Sbjct: 1   MRPENGSKVEETLMEISRCVEDKNAAQDKQSTSSGQEKSHGMEAPSVEQSMMYERSEDME 60

Query: 61  LDIIGCTDNCEGGPSSECNVSTENSSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQPMS 120
           +DIIGCTDNCEGGPSSECN STE SSSFGDTVSGTDYGLLLDDEEVESQLY DNNLQP+S
Sbjct: 61  IDIIGCTDNCEGGPSSECNDSTEYSSSFGDTVSGTDYGLLLDDEEVESQLYDDNNLQPIS 120

Query: 121 NGYREVFPRKKKLTDHWRKFISPVMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVY 180
           NG REVFPRKKKLTDHW+KFISPV WRCRWLEL+I+KLQSQS KYDRELALYDQRKQSVY
Sbjct: 121 NGCREVFPRKKKLTDHWKKFISPVTWRCRWLELKIRKLQSQSSKYDRELALYDQRKQSVY 180

Query: 181 EHFSMEDFDVKSTGFSIHTQRHRVMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRPIAD 240
           E FS EDFDVKSTGFS HTQR R+MKRK+RKK EETTE ASYM HHN+FSYYEKKR +AD
Sbjct: 181 EQFSTEDFDVKSTGFSSHTQRDRIMKRKKRKKNEETTEVASYMAHHNLFSYYEKKRSVAD 240

Query: 241 DITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKIEAAQSKVH 300
           DI+ E TSLKL+KT+NM+HD INDF  IATDGWPSSMLGDNDNNLEE+FLKIEAAQS+VH
Sbjct: 241 DISSEGTSLKLNKTKNMRHDGINDFQTIATDGWPSSMLGDNDNNLEEMFLKIEAAQSRVH 300

Query: 301 ELKNRIDKVVNENPMKFSSISQLYLLASSDDPASPEDGNDVFVRSLHEASQHMSEHALDV 360
           ELKNRIDKVVNENPMKFSSI+Q+Y+LASSDDPASPEDGNDVFVRSLHEASQHMSE A DV
Sbjct: 301 ELKNRIDKVVNENPMKFSSINQIYMLASSDDPASPEDGNDVFVRSLHEASQHMSEDAFDV 360

Query: 361 LMPESATRSHGEVMLLPDMIQSVDRG-STEKVLMQDSAVKEEVQIPEEVNTQFIEQTPKL 420
           LMPE+A  SHGEVMLLPDMIQS D G ST+KVL+QDSAVKEE QI EEVN QFIEQT KL
Sbjct: 361 LMPENAITSHGEVMLLPDMIQSADCGRSTKKVLVQDSAVKEEAQIHEEVNGQFIEQTLKL 420

Query: 421 EEQIISPAAASQADLASEDKEPDMQHKTKPPSAVKPSSSKRTRKRGRRKISSSKQKRKAT 476
           EEQI SP     ADLAS  +EPDMQHKT+ PSA KPSSSK+TRKRGRRK    KQKRK T
Sbjct: 421 EEQITSP-----ADLASGIQEPDMQHKTETPSAAKPSSSKKTRKRGRRKFGMRKQKRKVT 476

BLAST of Tan0007884 vs. NCBI nr
Match: XP_022974817.1 (uncharacterized protein LOC111473601 isoform X1 [Cucurbita maxima])

HSP 1 Score: 736.9 bits (1901), Expect = 1.1e-208
Identity = 392/481 (81.50%), Postives = 422/481 (87.73%), Query Frame = 0

Query: 1   MGPEIGSKVKEEAL-----MEDKNAVQDKQSASSGQEKIHDMEAPSVERTMMLDRSEDME 60
           M PE GSKV+E  +     +EDKNA QDKQS SSGQEK H MEAPSVE++MM +RSEDME
Sbjct: 125 MRPENGSKVEETLMEISRCVEDKNAAQDKQSTSSGQEKSHGMEAPSVEQSMMYERSEDME 184

Query: 61  LDIIGCTDNCEGGPSSECNVSTENSSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQPMS 120
           +DIIGCTDNCEGGPSSECN STE SSSFGDTVSGTDYGLLLDDEEVESQLY DNNLQP+S
Sbjct: 185 IDIIGCTDNCEGGPSSECNDSTEYSSSFGDTVSGTDYGLLLDDEEVESQLYDDNNLQPIS 244

Query: 121 NGYREVFPRKKKLTDHWRKFISPVMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVY 180
           NG REVFPRKKKLTDHW+KFISPV WRCRWLEL+I+KLQSQS KYDRELALYDQRKQSVY
Sbjct: 245 NGCREVFPRKKKLTDHWKKFISPVTWRCRWLELKIRKLQSQSSKYDRELALYDQRKQSVY 304

Query: 181 EHFSMEDFDVKSTGFSIHTQRHRVMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRPIAD 240
           E FS EDFDVKSTGFS HTQR R+MKRK+RKK EETTE ASYM HHN+FSYYEKKR +AD
Sbjct: 305 EQFSTEDFDVKSTGFSSHTQRDRIMKRKKRKKNEETTEVASYMAHHNLFSYYEKKRSVAD 364

Query: 241 DITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKIEAAQSKVH 300
           DI+ E TSLKL+KT+NM+HD INDF  IATDGWPSSMLGDNDNNLEE+FLKIEAAQS+VH
Sbjct: 365 DISSEGTSLKLNKTKNMRHDGINDFQTIATDGWPSSMLGDNDNNLEEMFLKIEAAQSRVH 424

Query: 301 ELKNRIDKVVNENPMKFSSISQLYLLASSDDPASPEDGNDVFVRSLHEASQHMSEHALDV 360
           ELKNRIDKVVNENPMKFSSI+Q+Y+LASSDDPASPEDGNDVFVRSLHEASQHMSE A DV
Sbjct: 425 ELKNRIDKVVNENPMKFSSINQIYMLASSDDPASPEDGNDVFVRSLHEASQHMSEDAFDV 484

Query: 361 LMPESATRSHGEVMLLPDMIQSVDRG-STEKVLMQDSAVKEEVQIPEEVNTQFIEQTPKL 420
           LMPE+A  SHGEVMLLPDMIQS D G ST+KVL+QDSAVKEE QI EEVN QFIEQT KL
Sbjct: 485 LMPENAITSHGEVMLLPDMIQSADCGRSTKKVLVQDSAVKEEAQIHEEVNGQFIEQTLKL 544

Query: 421 EEQIISPAAASQADLASEDKEPDMQHKTKPPSAVKPSSSKRTRKRGRRKISSSKQKRKAT 476
           EEQI SP     ADLAS  +EPDMQHKT+ PSA KPSSSK+TRKRGRRK    KQKRK T
Sbjct: 545 EEQITSP-----ADLASGIQEPDMQHKTETPSAAKPSSSKKTRKRGRRKFGMRKQKRKVT 600

BLAST of Tan0007884 vs. NCBI nr
Match: XP_022946550.1 (uncharacterized protein LOC111450568 isoform X3 [Cucurbita moschata])

HSP 1 Score: 733.8 bits (1893), Expect = 9.4e-208
Identity = 390/481 (81.08%), Postives = 419/481 (87.11%), Query Frame = 0

Query: 1   MGPEIGSKVKEEAL-----MEDKNAVQDKQSASSGQEKIHDMEAPSVERTMMLDRSEDME 60
           M PE GSKV+E  +     +EDKNA QDK S SSGQEK H MEAPSVER+MM DR EDME
Sbjct: 1   MRPENGSKVEETLMEISRCVEDKNAAQDKPSTSSGQEKSHGMEAPSVERSMMYDRREDME 60

Query: 61  LDIIGCTDNCEGGPSSECNVSTENSSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQPMS 120
           +DIIGCTDNCEGGPSSECN STENSSSFGDTVSGTDYG LLDDEEVESQLY DNNLQP+S
Sbjct: 61  IDIIGCTDNCEGGPSSECNDSTENSSSFGDTVSGTDYGSLLDDEEVESQLYDDNNLQPIS 120

Query: 121 NGYREVFPRKKKLTDHWRKFISPVMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVY 180
           NG REVFPRKKKLTDHW+KFISPV WRCRWLEL+I+KLQSQS KYDRELALYDQRKQSVY
Sbjct: 121 NGCREVFPRKKKLTDHWKKFISPVTWRCRWLELKIRKLQSQSSKYDRELALYDQRKQSVY 180

Query: 181 EHFSMEDFDVKSTGFSIHTQRHRVMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRPIAD 240
           E FS EDFDVKSTGFS HTQR R+MKRK+RKK EETTE ASYM HHN+FSYYEKKR +AD
Sbjct: 181 EQFSTEDFDVKSTGFSSHTQRDRIMKRKKRKKNEETTEVASYMAHHNLFSYYEKKRSVAD 240

Query: 241 DITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKIEAAQSKVH 300
           DI+ E TSLKL+KT+NM+HD INDF  IATDGWPSSMLGDNDNNLEE+FLKIEAAQS+VH
Sbjct: 241 DISSEGTSLKLNKTKNMRHDGINDFRTIATDGWPSSMLGDNDNNLEEMFLKIEAAQSRVH 300

Query: 301 ELKNRIDKVVNENPMKFSSISQLYLLASSDDPASPEDGNDVFVRSLHEASQHMSEHALDV 360
           ELKNRIDKVVNENPMKFSSI+Q+Y+LASSDDPASPEDGNDVFVRSLHEASQHMSE A DV
Sbjct: 301 ELKNRIDKVVNENPMKFSSINQIYMLASSDDPASPEDGNDVFVRSLHEASQHMSEDAFDV 360

Query: 361 LMPESATRSHGEVMLLPDMIQSVDRG-STEKVLMQDSAVKEEVQIPEEVNTQFIEQTPKL 420
           LMPE+A  SHGEVMLLPDMIQ+ D G ST+KVL+QDSA KEE QI EEVN QFIEQT KL
Sbjct: 361 LMPENAITSHGEVMLLPDMIQNADCGRSTKKVLVQDSAAKEEAQIHEEVNGQFIEQTLKL 420

Query: 421 EEQIISPAAASQADLASEDKEPDMQHKTKPPSAVKPSSSKRTRKRGRRKISSSKQKRKAT 476
           EEQI SP     ADLAS  +EPDMQHKT+ PSA KPSSSK+TRKRGRRK    KQKRK T
Sbjct: 421 EEQITSP-----ADLASGIQEPDMQHKTETPSAAKPSSSKKTRKRGRRKFGMRKQKRKVT 476

BLAST of Tan0007884 vs. NCBI nr
Match: XP_022946534.1 (uncharacterized protein LOC111450568 isoform X1 [Cucurbita moschata])

HSP 1 Score: 733.8 bits (1893), Expect = 9.4e-208
Identity = 390/481 (81.08%), Postives = 419/481 (87.11%), Query Frame = 0

Query: 1   MGPEIGSKVKEEAL-----MEDKNAVQDKQSASSGQEKIHDMEAPSVERTMMLDRSEDME 60
           M PE GSKV+E  +     +EDKNA QDK S SSGQEK H MEAPSVER+MM DR EDME
Sbjct: 125 MRPENGSKVEETLMEISRCVEDKNAAQDKPSTSSGQEKSHGMEAPSVERSMMYDRREDME 184

Query: 61  LDIIGCTDNCEGGPSSECNVSTENSSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQPMS 120
           +DIIGCTDNCEGGPSSECN STENSSSFGDTVSGTDYG LLDDEEVESQLY DNNLQP+S
Sbjct: 185 IDIIGCTDNCEGGPSSECNDSTENSSSFGDTVSGTDYGSLLDDEEVESQLYDDNNLQPIS 244

Query: 121 NGYREVFPRKKKLTDHWRKFISPVMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVY 180
           NG REVFPRKKKLTDHW+KFISPV WRCRWLEL+I+KLQSQS KYDRELALYDQRKQSVY
Sbjct: 245 NGCREVFPRKKKLTDHWKKFISPVTWRCRWLELKIRKLQSQSSKYDRELALYDQRKQSVY 304

Query: 181 EHFSMEDFDVKSTGFSIHTQRHRVMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRPIAD 240
           E FS EDFDVKSTGFS HTQR R+MKRK+RKK EETTE ASYM HHN+FSYYEKKR +AD
Sbjct: 305 EQFSTEDFDVKSTGFSSHTQRDRIMKRKKRKKNEETTEVASYMAHHNLFSYYEKKRSVAD 364

Query: 241 DITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKIEAAQSKVH 300
           DI+ E TSLKL+KT+NM+HD INDF  IATDGWPSSMLGDNDNNLEE+FLKIEAAQS+VH
Sbjct: 365 DISSEGTSLKLNKTKNMRHDGINDFRTIATDGWPSSMLGDNDNNLEEMFLKIEAAQSRVH 424

Query: 301 ELKNRIDKVVNENPMKFSSISQLYLLASSDDPASPEDGNDVFVRSLHEASQHMSEHALDV 360
           ELKNRIDKVVNENPMKFSSI+Q+Y+LASSDDPASPEDGNDVFVRSLHEASQHMSE A DV
Sbjct: 425 ELKNRIDKVVNENPMKFSSINQIYMLASSDDPASPEDGNDVFVRSLHEASQHMSEDAFDV 484

Query: 361 LMPESATRSHGEVMLLPDMIQSVDRG-STEKVLMQDSAVKEEVQIPEEVNTQFIEQTPKL 420
           LMPE+A  SHGEVMLLPDMIQ+ D G ST+KVL+QDSA KEE QI EEVN QFIEQT KL
Sbjct: 485 LMPENAITSHGEVMLLPDMIQNADCGRSTKKVLVQDSAAKEEAQIHEEVNGQFIEQTLKL 544

Query: 421 EEQIISPAAASQADLASEDKEPDMQHKTKPPSAVKPSSSKRTRKRGRRKISSSKQKRKAT 476
           EEQI SP     ADLAS  +EPDMQHKT+ PSA KPSSSK+TRKRGRRK    KQKRK T
Sbjct: 545 EEQITSP-----ADLASGIQEPDMQHKTETPSAAKPSSSKKTRKRGRRKFGMRKQKRKVT 600

BLAST of Tan0007884 vs. NCBI nr
Match: XP_023540268.1 (uncharacterized protein LOC111800693 isoform X3 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 730.3 bits (1884), Expect = 1.0e-206
Identity = 389/481 (80.87%), Postives = 419/481 (87.11%), Query Frame = 0

Query: 1   MGPEIGSKVKEEAL-----MEDKNAVQDKQSASSGQEKIHDMEAPSVERTMMLDRSEDME 60
           M PE GSKV+E  +     +EDKNA QDKQS SSGQEK H MEAPSVER+MM DRSEDME
Sbjct: 1   MRPENGSKVEETLMEISRCVEDKNAAQDKQSTSSGQEKSHGMEAPSVERSMMYDRSEDME 60

Query: 61  LDIIGCTDNCEGGPSSECNVSTENSSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQPMS 120
           +DIIGCTDNCEGGPSSECN STENSSSFGDTVSGTDYG LLDDEEVESQLY DNNLQP+S
Sbjct: 61  IDIIGCTDNCEGGPSSECNDSTENSSSFGDTVSGTDYGSLLDDEEVESQLYDDNNLQPIS 120

Query: 121 NGYREVFPRKKKLTDHWRKFISPVMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVY 180
           NG REVFPRKKKLTDHW+KFISPV WRCRWLEL+I+KLQSQS KYDRELALYDQRKQSVY
Sbjct: 121 NGCREVFPRKKKLTDHWKKFISPVTWRCRWLELKIRKLQSQSSKYDRELALYDQRKQSVY 180

Query: 181 EHFSMEDFDVKSTGFSIHTQRHRVMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRPIAD 240
           E FS EDFDVKSTGF  HTQR R+MKRK+RKK EETTE ASYM HHN+FSYYEKKR +AD
Sbjct: 181 EQFSTEDFDVKSTGFPSHTQRDRIMKRKKRKKNEETTEVASYMAHHNLFSYYEKKRSVAD 240

Query: 241 DITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKIEAAQSKVH 300
           DI+ E TSLKL+KT+NM+HD INDF  IATDGWPSSMLGDNDNNLEE+FLKIEAAQS+VH
Sbjct: 241 DISSEGTSLKLNKTKNMRHDGINDFRTIATDGWPSSMLGDNDNNLEEMFLKIEAAQSRVH 300

Query: 301 ELKNRIDKVVNENPMKFSSISQLYLLASSDDPASPEDGNDVFVRSLHEASQHMSEHALDV 360
           ELKNRIDKVVNENPMKFSSI+Q+Y+LASSDDPASPEDGNDVFVRSLHEASQHMSE A DV
Sbjct: 301 ELKNRIDKVVNENPMKFSSINQIYMLASSDDPASPEDGNDVFVRSLHEASQHMSEDAFDV 360

Query: 361 LMPESATRSHGEVMLLPDMIQSVDRG-STEKVLMQDSAVKEEVQIPEEVNTQFIEQTPKL 420
           LMPE+A  SHGEVMLLPDMIQS D G ST+KVL+QDSA KEE QI EEV+ QFIE+T KL
Sbjct: 361 LMPENAITSHGEVMLLPDMIQSADCGRSTKKVLVQDSAAKEEAQIHEEVDGQFIEKTLKL 420

Query: 421 EEQIISPAAASQADLASEDKEPDMQHKTKPPSAVKPSSSKRTRKRGRRKISSSKQKRKAT 476
           EEQI      S ADLAS  +EPDMQHKT+ PSA KPSSSK+TRKRGRRK    KQKRK T
Sbjct: 421 EEQI-----TSLADLASGIQEPDMQHKTETPSAAKPSSSKKTRKRGRRKFGMRKQKRKVT 476

BLAST of Tan0007884 vs. ExPASy TrEMBL
Match: A0A6J1IEX6 (uncharacterized protein LOC111473601 isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC111473601 PE=4 SV=1)

HSP 1 Score: 736.9 bits (1901), Expect = 5.4e-209
Identity = 392/481 (81.50%), Postives = 422/481 (87.73%), Query Frame = 0

Query: 1   MGPEIGSKVKEEAL-----MEDKNAVQDKQSASSGQEKIHDMEAPSVERTMMLDRSEDME 60
           M PE GSKV+E  +     +EDKNA QDKQS SSGQEK H MEAPSVE++MM +RSEDME
Sbjct: 1   MRPENGSKVEETLMEISRCVEDKNAAQDKQSTSSGQEKSHGMEAPSVEQSMMYERSEDME 60

Query: 61  LDIIGCTDNCEGGPSSECNVSTENSSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQPMS 120
           +DIIGCTDNCEGGPSSECN STE SSSFGDTVSGTDYGLLLDDEEVESQLY DNNLQP+S
Sbjct: 61  IDIIGCTDNCEGGPSSECNDSTEYSSSFGDTVSGTDYGLLLDDEEVESQLYDDNNLQPIS 120

Query: 121 NGYREVFPRKKKLTDHWRKFISPVMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVY 180
           NG REVFPRKKKLTDHW+KFISPV WRCRWLEL+I+KLQSQS KYDRELALYDQRKQSVY
Sbjct: 121 NGCREVFPRKKKLTDHWKKFISPVTWRCRWLELKIRKLQSQSSKYDRELALYDQRKQSVY 180

Query: 181 EHFSMEDFDVKSTGFSIHTQRHRVMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRPIAD 240
           E FS EDFDVKSTGFS HTQR R+MKRK+RKK EETTE ASYM HHN+FSYYEKKR +AD
Sbjct: 181 EQFSTEDFDVKSTGFSSHTQRDRIMKRKKRKKNEETTEVASYMAHHNLFSYYEKKRSVAD 240

Query: 241 DITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKIEAAQSKVH 300
           DI+ E TSLKL+KT+NM+HD INDF  IATDGWPSSMLGDNDNNLEE+FLKIEAAQS+VH
Sbjct: 241 DISSEGTSLKLNKTKNMRHDGINDFQTIATDGWPSSMLGDNDNNLEEMFLKIEAAQSRVH 300

Query: 301 ELKNRIDKVVNENPMKFSSISQLYLLASSDDPASPEDGNDVFVRSLHEASQHMSEHALDV 360
           ELKNRIDKVVNENPMKFSSI+Q+Y+LASSDDPASPEDGNDVFVRSLHEASQHMSE A DV
Sbjct: 301 ELKNRIDKVVNENPMKFSSINQIYMLASSDDPASPEDGNDVFVRSLHEASQHMSEDAFDV 360

Query: 361 LMPESATRSHGEVMLLPDMIQSVDRG-STEKVLMQDSAVKEEVQIPEEVNTQFIEQTPKL 420
           LMPE+A  SHGEVMLLPDMIQS D G ST+KVL+QDSAVKEE QI EEVN QFIEQT KL
Sbjct: 361 LMPENAITSHGEVMLLPDMIQSADCGRSTKKVLVQDSAVKEEAQIHEEVNGQFIEQTLKL 420

Query: 421 EEQIISPAAASQADLASEDKEPDMQHKTKPPSAVKPSSSKRTRKRGRRKISSSKQKRKAT 476
           EEQI SP     ADLAS  +EPDMQHKT+ PSA KPSSSK+TRKRGRRK    KQKRK T
Sbjct: 421 EEQITSP-----ADLASGIQEPDMQHKTETPSAAKPSSSKKTRKRGRRKFGMRKQKRKVT 476

BLAST of Tan0007884 vs. ExPASy TrEMBL
Match: A0A6J1IBA7 (uncharacterized protein LOC111473601 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111473601 PE=4 SV=1)

HSP 1 Score: 736.9 bits (1901), Expect = 5.4e-209
Identity = 392/481 (81.50%), Postives = 422/481 (87.73%), Query Frame = 0

Query: 1   MGPEIGSKVKEEAL-----MEDKNAVQDKQSASSGQEKIHDMEAPSVERTMMLDRSEDME 60
           M PE GSKV+E  +     +EDKNA QDKQS SSGQEK H MEAPSVE++MM +RSEDME
Sbjct: 125 MRPENGSKVEETLMEISRCVEDKNAAQDKQSTSSGQEKSHGMEAPSVEQSMMYERSEDME 184

Query: 61  LDIIGCTDNCEGGPSSECNVSTENSSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQPMS 120
           +DIIGCTDNCEGGPSSECN STE SSSFGDTVSGTDYGLLLDDEEVESQLY DNNLQP+S
Sbjct: 185 IDIIGCTDNCEGGPSSECNDSTEYSSSFGDTVSGTDYGLLLDDEEVESQLYDDNNLQPIS 244

Query: 121 NGYREVFPRKKKLTDHWRKFISPVMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVY 180
           NG REVFPRKKKLTDHW+KFISPV WRCRWLEL+I+KLQSQS KYDRELALYDQRKQSVY
Sbjct: 245 NGCREVFPRKKKLTDHWKKFISPVTWRCRWLELKIRKLQSQSSKYDRELALYDQRKQSVY 304

Query: 181 EHFSMEDFDVKSTGFSIHTQRHRVMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRPIAD 240
           E FS EDFDVKSTGFS HTQR R+MKRK+RKK EETTE ASYM HHN+FSYYEKKR +AD
Sbjct: 305 EQFSTEDFDVKSTGFSSHTQRDRIMKRKKRKKNEETTEVASYMAHHNLFSYYEKKRSVAD 364

Query: 241 DITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKIEAAQSKVH 300
           DI+ E TSLKL+KT+NM+HD INDF  IATDGWPSSMLGDNDNNLEE+FLKIEAAQS+VH
Sbjct: 365 DISSEGTSLKLNKTKNMRHDGINDFQTIATDGWPSSMLGDNDNNLEEMFLKIEAAQSRVH 424

Query: 301 ELKNRIDKVVNENPMKFSSISQLYLLASSDDPASPEDGNDVFVRSLHEASQHMSEHALDV 360
           ELKNRIDKVVNENPMKFSSI+Q+Y+LASSDDPASPEDGNDVFVRSLHEASQHMSE A DV
Sbjct: 425 ELKNRIDKVVNENPMKFSSINQIYMLASSDDPASPEDGNDVFVRSLHEASQHMSEDAFDV 484

Query: 361 LMPESATRSHGEVMLLPDMIQSVDRG-STEKVLMQDSAVKEEVQIPEEVNTQFIEQTPKL 420
           LMPE+A  SHGEVMLLPDMIQS D G ST+KVL+QDSAVKEE QI EEVN QFIEQT KL
Sbjct: 485 LMPENAITSHGEVMLLPDMIQSADCGRSTKKVLVQDSAVKEEAQIHEEVNGQFIEQTLKL 544

Query: 421 EEQIISPAAASQADLASEDKEPDMQHKTKPPSAVKPSSSKRTRKRGRRKISSSKQKRKAT 476
           EEQI SP     ADLAS  +EPDMQHKT+ PSA KPSSSK+TRKRGRRK    KQKRK T
Sbjct: 545 EEQITSP-----ADLASGIQEPDMQHKTETPSAAKPSSSKKTRKRGRRKFGMRKQKRKVT 600

BLAST of Tan0007884 vs. ExPASy TrEMBL
Match: A0A6J1G437 (uncharacterized protein LOC111450568 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111450568 PE=4 SV=1)

HSP 1 Score: 733.8 bits (1893), Expect = 4.6e-208
Identity = 390/481 (81.08%), Postives = 419/481 (87.11%), Query Frame = 0

Query: 1   MGPEIGSKVKEEAL-----MEDKNAVQDKQSASSGQEKIHDMEAPSVERTMMLDRSEDME 60
           M PE GSKV+E  +     +EDKNA QDK S SSGQEK H MEAPSVER+MM DR EDME
Sbjct: 125 MRPENGSKVEETLMEISRCVEDKNAAQDKPSTSSGQEKSHGMEAPSVERSMMYDRREDME 184

Query: 61  LDIIGCTDNCEGGPSSECNVSTENSSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQPMS 120
           +DIIGCTDNCEGGPSSECN STENSSSFGDTVSGTDYG LLDDEEVESQLY DNNLQP+S
Sbjct: 185 IDIIGCTDNCEGGPSSECNDSTENSSSFGDTVSGTDYGSLLDDEEVESQLYDDNNLQPIS 244

Query: 121 NGYREVFPRKKKLTDHWRKFISPVMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVY 180
           NG REVFPRKKKLTDHW+KFISPV WRCRWLEL+I+KLQSQS KYDRELALYDQRKQSVY
Sbjct: 245 NGCREVFPRKKKLTDHWKKFISPVTWRCRWLELKIRKLQSQSSKYDRELALYDQRKQSVY 304

Query: 181 EHFSMEDFDVKSTGFSIHTQRHRVMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRPIAD 240
           E FS EDFDVKSTGFS HTQR R+MKRK+RKK EETTE ASYM HHN+FSYYEKKR +AD
Sbjct: 305 EQFSTEDFDVKSTGFSSHTQRDRIMKRKKRKKNEETTEVASYMAHHNLFSYYEKKRSVAD 364

Query: 241 DITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKIEAAQSKVH 300
           DI+ E TSLKL+KT+NM+HD INDF  IATDGWPSSMLGDNDNNLEE+FLKIEAAQS+VH
Sbjct: 365 DISSEGTSLKLNKTKNMRHDGINDFRTIATDGWPSSMLGDNDNNLEEMFLKIEAAQSRVH 424

Query: 301 ELKNRIDKVVNENPMKFSSISQLYLLASSDDPASPEDGNDVFVRSLHEASQHMSEHALDV 360
           ELKNRIDKVVNENPMKFSSI+Q+Y+LASSDDPASPEDGNDVFVRSLHEASQHMSE A DV
Sbjct: 425 ELKNRIDKVVNENPMKFSSINQIYMLASSDDPASPEDGNDVFVRSLHEASQHMSEDAFDV 484

Query: 361 LMPESATRSHGEVMLLPDMIQSVDRG-STEKVLMQDSAVKEEVQIPEEVNTQFIEQTPKL 420
           LMPE+A  SHGEVMLLPDMIQ+ D G ST+KVL+QDSA KEE QI EEVN QFIEQT KL
Sbjct: 485 LMPENAITSHGEVMLLPDMIQNADCGRSTKKVLVQDSAAKEEAQIHEEVNGQFIEQTLKL 544

Query: 421 EEQIISPAAASQADLASEDKEPDMQHKTKPPSAVKPSSSKRTRKRGRRKISSSKQKRKAT 476
           EEQI SP     ADLAS  +EPDMQHKT+ PSA KPSSSK+TRKRGRRK    KQKRK T
Sbjct: 545 EEQITSP-----ADLASGIQEPDMQHKTETPSAAKPSSSKKTRKRGRRKFGMRKQKRKVT 600

BLAST of Tan0007884 vs. ExPASy TrEMBL
Match: A0A6J1G468 (uncharacterized protein LOC111450568 isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111450568 PE=4 SV=1)

HSP 1 Score: 733.8 bits (1893), Expect = 4.6e-208
Identity = 390/481 (81.08%), Postives = 419/481 (87.11%), Query Frame = 0

Query: 1   MGPEIGSKVKEEAL-----MEDKNAVQDKQSASSGQEKIHDMEAPSVERTMMLDRSEDME 60
           M PE GSKV+E  +     +EDKNA QDK S SSGQEK H MEAPSVER+MM DR EDME
Sbjct: 1   MRPENGSKVEETLMEISRCVEDKNAAQDKPSTSSGQEKSHGMEAPSVERSMMYDRREDME 60

Query: 61  LDIIGCTDNCEGGPSSECNVSTENSSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQPMS 120
           +DIIGCTDNCEGGPSSECN STENSSSFGDTVSGTDYG LLDDEEVESQLY DNNLQP+S
Sbjct: 61  IDIIGCTDNCEGGPSSECNDSTENSSSFGDTVSGTDYGSLLDDEEVESQLYDDNNLQPIS 120

Query: 121 NGYREVFPRKKKLTDHWRKFISPVMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVY 180
           NG REVFPRKKKLTDHW+KFISPV WRCRWLEL+I+KLQSQS KYDRELALYDQRKQSVY
Sbjct: 121 NGCREVFPRKKKLTDHWKKFISPVTWRCRWLELKIRKLQSQSSKYDRELALYDQRKQSVY 180

Query: 181 EHFSMEDFDVKSTGFSIHTQRHRVMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRPIAD 240
           E FS EDFDVKSTGFS HTQR R+MKRK+RKK EETTE ASYM HHN+FSYYEKKR +AD
Sbjct: 181 EQFSTEDFDVKSTGFSSHTQRDRIMKRKKRKKNEETTEVASYMAHHNLFSYYEKKRSVAD 240

Query: 241 DITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKIEAAQSKVH 300
           DI+ E TSLKL+KT+NM+HD INDF  IATDGWPSSMLGDNDNNLEE+FLKIEAAQS+VH
Sbjct: 241 DISSEGTSLKLNKTKNMRHDGINDFRTIATDGWPSSMLGDNDNNLEEMFLKIEAAQSRVH 300

Query: 301 ELKNRIDKVVNENPMKFSSISQLYLLASSDDPASPEDGNDVFVRSLHEASQHMSEHALDV 360
           ELKNRIDKVVNENPMKFSSI+Q+Y+LASSDDPASPEDGNDVFVRSLHEASQHMSE A DV
Sbjct: 301 ELKNRIDKVVNENPMKFSSINQIYMLASSDDPASPEDGNDVFVRSLHEASQHMSEDAFDV 360

Query: 361 LMPESATRSHGEVMLLPDMIQSVDRG-STEKVLMQDSAVKEEVQIPEEVNTQFIEQTPKL 420
           LMPE+A  SHGEVMLLPDMIQ+ D G ST+KVL+QDSA KEE QI EEVN QFIEQT KL
Sbjct: 361 LMPENAITSHGEVMLLPDMIQNADCGRSTKKVLVQDSAAKEEAQIHEEVNGQFIEQTLKL 420

Query: 421 EEQIISPAAASQADLASEDKEPDMQHKTKPPSAVKPSSSKRTRKRGRRKISSSKQKRKAT 476
           EEQI SP     ADLAS  +EPDMQHKT+ PSA KPSSSK+TRKRGRRK    KQKRK T
Sbjct: 421 EEQITSP-----ADLASGIQEPDMQHKTETPSAAKPSSSKKTRKRGRRKFGMRKQKRKVT 476

BLAST of Tan0007884 vs. ExPASy TrEMBL
Match: A0A6J1IHF9 (uncharacterized protein LOC111473601 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111473601 PE=4 SV=1)

HSP 1 Score: 724.2 bits (1868), Expect = 3.6e-205
Identity = 388/481 (80.67%), Postives = 417/481 (86.69%), Query Frame = 0

Query: 1   MGPEIGSKVKEEAL-----MEDKNAVQDKQSASSGQEKIHDMEAPSVERTMMLDRSEDME 60
           M PE GSKV+E  +     +EDKNA QDKQS SSGQEK H MEAPSVE++MM +RSEDME
Sbjct: 125 MRPENGSKVEETLMEISRCVEDKNAAQDKQSTSSGQEKSHGMEAPSVEQSMMYERSEDME 184

Query: 61  LDIIGCTDNCEGGPSSECNVSTENSSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQPMS 120
           +DIIGCTDNCEGGPSSECN STE SSSFGDTVSGTDYGLLLDDEEVESQLY DNNLQP+S
Sbjct: 185 IDIIGCTDNCEGGPSSECNDSTEYSSSFGDTVSGTDYGLLLDDEEVESQLYDDNNLQPIS 244

Query: 121 NGYREVFPRKKKLTDHWRKFISPVMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVY 180
           NG REVFPRKKKLTDHW+KFISPV WRCRWLEL+I+KLQSQS KYDRELALYDQRKQSVY
Sbjct: 245 NGCREVFPRKKKLTDHWKKFISPVTWRCRWLELKIRKLQSQSSKYDRELALYDQRKQSVY 304

Query: 181 EHFSMEDFDVKSTGFSIHTQRHRVMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRPIAD 240
           E FS EDFDVKSTGFS HTQR R+MKRK+RKK EETTE ASYM HHN+FSYYEKKR +AD
Sbjct: 305 EQFSTEDFDVKSTGFSSHTQRDRIMKRKKRKKNEETTEVASYMAHHNLFSYYEKKRSVAD 364

Query: 241 DITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKIEAAQSKVH 300
           DI+ E      DKT+NM+HD INDF  IATDGWPSSMLGDNDNNLEE+FLKIEAAQS+VH
Sbjct: 365 DISSE------DKTKNMRHDGINDFQTIATDGWPSSMLGDNDNNLEEMFLKIEAAQSRVH 424

Query: 301 ELKNRIDKVVNENPMKFSSISQLYLLASSDDPASPEDGNDVFVRSLHEASQHMSEHALDV 360
           ELKNRIDKVVNENPMKFSSI+Q+Y+LASSDDPASPEDGNDVFVRSLHEASQHMSE A DV
Sbjct: 425 ELKNRIDKVVNENPMKFSSINQIYMLASSDDPASPEDGNDVFVRSLHEASQHMSEDAFDV 484

Query: 361 LMPESATRSHGEVMLLPDMIQSVDRG-STEKVLMQDSAVKEEVQIPEEVNTQFIEQTPKL 420
           LMPE+A  SHGEVMLLPDMIQS D G ST+KVL+QDSAVKEE QI EEVN QFIEQT KL
Sbjct: 485 LMPENAITSHGEVMLLPDMIQSADCGRSTKKVLVQDSAVKEEAQIHEEVNGQFIEQTLKL 544

Query: 421 EEQIISPAAASQADLASEDKEPDMQHKTKPPSAVKPSSSKRTRKRGRRKISSSKQKRKAT 476
           EEQI SP     ADLAS  +EPDMQHKT+ PSA KPSSSK+TRKRGRRK    KQKRK T
Sbjct: 545 EEQITSP-----ADLASGIQEPDMQHKTETPSAAKPSSSKKTRKRGRRKFGMRKQKRKVT 594

BLAST of Tan0007884 vs. TAIR 10
Match: AT3G59670.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G37440.2); Has 77 Blast hits to 77 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 73; Viruses - 0; Other Eukaryotes - 4 (source: NCBI BLink). )

HSP 1 Score: 176.0 bits (445), Expect = 7.1e-44
Identity = 133/389 (34.19%), Postives = 211/389 (54.24%), Query Frame = 0

Query: 8   KVKEEALMEDKNAVQDKQSASSGQEKIH---DMEAPSVERTMMLDRSEDMELDIIGCTDN 67
           ++ EE+   D   +  K+  S G E           S E    +   E++++DI+   +N
Sbjct: 9   ELNEESKQFDVERLSPKKEQSPGGEHKECGLTSANTSEETVTSVSGGEELDVDIVESDEN 68

Query: 68  CEGGPSSECNVSTENSSSFGDTVSGTDYGLLLD----DEEVESQLYGDNNLQPMSNGYRE 127
                  + N +TE SSSF DT S  +  +LLD    + EVES  + + +L P  + +  
Sbjct: 69  KTSTTDEDPN-ATEYSSSFSDTAS-ENAEMLLDGLTGEAEVESHYWDETDLGPAYDSFSS 128

Query: 128 VFP-RKKKLTDHWRKFISPVMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVYEHFS 187
           +F  RKK+LT+HWR+FI P+MWR +W+EL+I++L+S++L+Y +EL LYDQ K       S
Sbjct: 129 IFHFRKKRLTNHWRRFIRPLMWRSKWVELRIRELESRALEYPKELELYDQEKLEANIDPS 188

Query: 188 MEDF---DVKSTGFSIHTQRHR-VMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRPIAD 247
           + +     +KS  FS    + R   KR++RKK E T + ASYM  HN+FSY E KR  +D
Sbjct: 189 VLESCGEGIKSLPFSNPCYKKRAAKKRRKRKKVESTDDIASYMACHNLFSYIETKRLSSD 248

Query: 248 DITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSM-LGDNDNNLEEIFLKIEAAQSKV 307
            + + D        R+  ++      P+  D   S     D D+ LEE+  KIE   S+V
Sbjct: 249 GMGLADDFGDAKDPRSDSNE------PVDLDDADSLFHHRDGDSVLEEVLWKIELVHSQV 308

Query: 308 HELKNRIDKVVNENPMKFSSISQLYLLASSDDP----ASPEDGNDVFVRSLHEASQHMSE 367
           H LK ++D V+++N  +FSS   L LLA+S  P    ++  +G+ +   +++ ASQHM++
Sbjct: 309 HRLKTQVDVVLSKNTARFSSSENLSLLAASSAPSPTVSAGGNGDVISFGAIYNASQHMAD 368

Query: 368 HALD--VLMPESATRSHGEVMLLPDMIQS 378
           + L   V   E    S+G+   +PD+I+S
Sbjct: 369 YGLGDIVFSSEGVISSYGDAFHIPDIIES 389

BLAST of Tan0007884 vs. TAIR 10
Match: AT4G37440.2 (unknown protein; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G50040.1); Has 121 Blast hits to 117 proteins in 32 species: Archae - 0; Bacteria - 6; Metazoa - 13; Fungi - 5; Plants - 66; Viruses - 0; Other Eukaryotes - 31 (source: NCBI BLink). )

HSP 1 Score: 170.6 bits (431), Expect = 3.0e-42
Identity = 128/374 (34.22%), Postives = 202/374 (54.01%), Query Frame = 0

Query: 36  DMEAPSVERTMMLDRSEDMELDIIGCTDNCEGGPSSECNVSTEN-SSSFGDTVSGTDYGL 95
           D++A +  +  + +  ED E+DI+ C DN E   S  C+  T+  SSSFG T S  +   
Sbjct: 54  DIDADASIKKEVAEFDED-EVDILECNDNIEIQVSG-CDDGTDGYSSSFGGTDSEHE--- 113

Query: 96  LLDDEEVESQLYGDNNLQPMSNGYREVFPRKKKLTDHWRKFISP-VMWRCRWLELQIKKL 155
             +D+EV+S +  + +L         ++ RK+KLTDHWR+F+ P +MWRC+W+EL+ K+L
Sbjct: 114 --NDQEVDSMICNETSL--------PLWVRKRKLTDHWRRFVQPTLMWRCKWIELKYKEL 173

Query: 156 QSQSLKYDRELALYDQRKQSVYEHFSMEDFDVKS-TGFSIHTQRHRVMKRKRRKKTEETT 215
           Q+Q+ KYD+E+  Y Q K+   E+   E+  VK+      +TQ+ R+MKRK RK+ EET 
Sbjct: 174 QNQAQKYDKEVEEYYQAKKLELENVKSEELGVKALPPLPCYTQKTRLMKRKTRKRVEETA 233

Query: 216 EAASYMGHHNVFSYYEKKRPIADDITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSM 275
           +  SY  +HN+FSYY+ ++ +A DI + D S  LDK      D         ++  P   
Sbjct: 234 DVTSYASNHNLFSYYDCRKSLA-DIALNDNSRNLDKKNKSAKDE-----TAFSEETPPLE 293

Query: 276 LGDNDNNLEEIFLKIEAAQSKVHELKNRIDKVVNENPMKF----------------SSIS 335
             + D  LE+I LKIEAA+S+   LK R+DKV++ENP  F                SS  
Sbjct: 294 FREGDAYLEQILLKIEAAKSEARNLKIRVDKVLSENPSIFPLANTVNPLGAADVYTSSEQ 353

Query: 336 QLYLLASSDDPASPEDGNDVFVRSLHEASQHMS---EHALDVLMPE--SATRSHGEVMLL 386
           Q  LLA  ++        +  V+S   +S H+S   +   D+L+ E  ++ R  G+ ++ 
Sbjct: 354 QKPLLAIKNEDEKSIISEEKPVKSASVSSHHVSPEDDETTDILLSEILASKRREGKSIIP 406

BLAST of Tan0007884 vs. TAIR 10
Match: AT4G37440.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G50040.1); Has 220 Blast hits to 205 proteins in 55 species: Archae - 0; Bacteria - 15; Metazoa - 50; Fungi - 11; Plants - 76; Viruses - 3; Other Eukaryotes - 65 (source: NCBI BLink). )

HSP 1 Score: 163.7 bits (413), Expect = 3.6e-40
Identity = 143/450 (31.78%), Postives = 231/450 (51.33%), Query Frame = 0

Query: 36  DMEAPSVERTMMLDRSEDMELDIIGCTDNCEGGPSSECNVSTEN-SSSFGDTVSGTDYGL 95
           D++A +  +  + +  ED E+DI+ C DN E   S  C+  T+  SSSFG T S  +   
Sbjct: 54  DIDADASIKKEVAEFDED-EVDILECNDNIEIQVSG-CDDGTDGYSSSFGGTDSEHE--- 113

Query: 96  LLDDEEVESQLYGDNNLQPMSNGYREVFPRKKKLTDHWRKFISP-VMWRCRWLELQIKKL 155
             +D+EV+S +  + +L         ++ RK+KLTDHWR+F+ P +MWRC+W+EL+ K+L
Sbjct: 114 --NDQEVDSMICNETSL--------PLWVRKRKLTDHWRRFVQPTLMWRCKWIELKYKEL 173

Query: 156 QSQSLKYDRELALYDQRKQSVYEHFSMEDFDVKS-TGFSIHTQRHRVMKRKRRKKTEETT 215
           Q+Q+ KYD+E+  Y Q K+   E+   E+  VK+      +TQ+ R+MKRK RK+ EET 
Sbjct: 174 QNQAQKYDKEVEEYYQAKKLELENVKSEELGVKALPPLPCYTQKTRLMKRKTRKRVEETA 233

Query: 216 EAASYMGHHNVFSYYEKKRPIADDITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSM 275
           +  SY  +HN+FSYY+ ++ +A DI + D S  LDK      D         ++  P   
Sbjct: 234 DVTSYASNHNLFSYYDCRKSLA-DIALNDNSRNLDKKNKSAKDE-----TAFSEETPPLE 293

Query: 276 LGDNDNNLEEIFLKIEAAQSKVHELKNRIDKVVNENPMKFSSISQLYLLASSDDPASPED 335
             + D  LE+I LKIEAA+S+   LK R+DKV++ENP  F   + +  L ++D   S E 
Sbjct: 294 FREGDAYLEQILLKIEAAKSEARNLKIRVDKVLSENPSIFPLANTVNPLGAADVYTSSEQ 353

Query: 336 GNDVFVRSLHEASQHMSEHALDVLMPESATRSHGEV---------MLLPDMIQSVDRGST 395
              +      +    +SE        +SA+ S   V         +LL +++ S  R   
Sbjct: 354 QKPLLAIKNEDEKSIISEEK----PVKSASVSSHHVSPEDDETTDILLSEILASKRREGK 413

Query: 396 EKVLMQDSAVKEEVQIPEEVNTQFIEQTPKLEEQIISPAAASQADLASEDKEPDMQHKTK 455
             +  ++    E+  I E  +    ++TP+  E I    +  +    S +       K K
Sbjct: 414 SIIPDKNLVKTEQASIEEGPSRPVRKRTPRNREIITKEESNPKRRRVSRE-------KPK 471

Query: 456 PPSAVKPSSSKRTRKRGRRKISSSKQKRKA 474
             + +    S R RKRG+R+  S+  +R++
Sbjct: 474 SNAVMASRFSNRKRKRGKRRSGSAGLRRRS 471

BLAST of Tan0007884 vs. TAIR 10
Match: AT3G50040.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G37440.2); Has 70 Blast hits to 70 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 70; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 137.5 bits (345), Expect = 2.8e-32
Identity = 96/272 (35.29%), Postives = 144/272 (52.94%), Query Frame = 0

Query: 52  EDMELDIIGCTDNCEGGPSSECNVSTENSSSFGDTV---SGTDYGLLLDDEEVESQLYGD 111
           E+  +DI+   D+ E     E      +SSSFGD++    G D+G     +E +S L  D
Sbjct: 35  EEDIVDILKWHDDFE-EEREEVLCGASSSSSFGDSMCARDGDDFGF---GDEAQSMLSND 94

Query: 112 NNLQ-PMSNGYREVFPRKKKLTDHWRKFISPVMWRCRWLELQIKKLQSQSLKYDRELALY 171
             L     +G   +   KKK  D WR+   P+MWRC+W+EL++K++QSQ+  Y++E+  Y
Sbjct: 95  YPLPGTCDDGTEFLGLPKKKTNDRWRRLTKPIMWRCKWIELKVKEIQSQARGYEKEVKDY 154

Query: 172 DQRKQSVYEHFSMEDFDVKSTGFSIHTQRHRVMKRKRRKKTEETTEAASYMGHHNVFSYY 231
              KQ   E   +E FD KS  F  + QR  V KR RRK+ EETT+ A+YM +HN+FSY 
Sbjct: 155 YLTKQFDLEKSKLEGFDGKSIPFRENNQRRNVFKRGRRKRVEETTDVAAYMSNHNLFSYA 214

Query: 232 EKKRPIADDITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKI 291
           +K+ P+       D+     +    K DAI D   I       S L  +D+ L +   KI
Sbjct: 215 DKRVPVNVKGQYLDSDFGTGRKATGKQDAIEDDSLI-------SELDCSDDVLAKFLCKI 274

Query: 292 EAAQSKVHELKNRIDKVV-NENPMKFSSISQL 319
           + AQ K   L+ R+D+++ +  P   SS+ Q+
Sbjct: 275 DEAQGKARRLRKRVDQLMWDSQPAHTSSMPQM 295

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_022974820.11.1e-20881.50uncharacterized protein LOC111473601 isoform X3 [Cucurbita maxima][more]
XP_022974817.11.1e-20881.50uncharacterized protein LOC111473601 isoform X1 [Cucurbita maxima][more]
XP_022946550.19.4e-20881.08uncharacterized protein LOC111450568 isoform X3 [Cucurbita moschata][more]
XP_022946534.19.4e-20881.08uncharacterized protein LOC111450568 isoform X1 [Cucurbita moschata][more]
XP_023540268.11.0e-20680.87uncharacterized protein LOC111800693 isoform X3 [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
A0A6J1IEX65.4e-20981.50uncharacterized protein LOC111473601 isoform X3 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A6J1IBA75.4e-20981.50uncharacterized protein LOC111473601 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A6J1G4374.6e-20881.08uncharacterized protein LOC111450568 isoform X1 OS=Cucurbita moschata OX=3662 GN... [more]
A0A6J1G4684.6e-20881.08uncharacterized protein LOC111450568 isoform X3 OS=Cucurbita moschata OX=3662 GN... [more]
A0A6J1IHF93.6e-20580.67uncharacterized protein LOC111473601 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... [more]
Match NameE-valueIdentityDescription
AT3G59670.17.1e-4434.19unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT4G37440.23.0e-4234.22unknown protein; LOCATED IN: cellular_component unknown; BEST Arabidopsis thalia... [more]
AT4G37440.13.6e-4031.78unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT3G50040.12.8e-3235.29unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 280..307
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 422..475
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..35
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 452..475
NoneNo IPR availablePANTHERPTHR34057:SF10BNAA08G15670D PROTEINcoord: 1..473
NoneNo IPR availablePANTHERPTHR34057ELONGATION FACTORcoord: 1..473
IPR038745AT4G37440-likeCDDcd11650AT4G37440_likecoord: 49..307
e-value: 7.32874E-69
score: 218.446

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0007884.1Tan0007884.1mRNA
Tan0007884.2Tan0007884.2mRNA