Tan0022414.1 (mRNA) Snake gourd v1

Overview
Name: Tan0022414.1
Type: mRNA
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Dynein beta chain, ciliary protein
Location: LG06: 79051036 .. 79057205 (+)
Sequence length: 1037
RNA-Seq Expression: Tan0022414.1
Synteny: Tan0022414.1
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATCAGATCGGCCACGTTCGTTCTCCGACGCTCTCTTATTTTAGTTCTCTTCAAAGCGGTCTGAGTTTTGTACTCAGTAAAACCGAGTCGCGACTCACCGATTCGTCATTTCTTCTCCATTTCTCCCCTGAAAATTGTCCTAAAACCCTCCCACTCATAGGGTTCTTGATTCTATACGCTCGTTTCGTGACGAATTTCGTATGAATCTGAATCTCCGATTCGAGTTCGTTTTGTAAGCTCGTTGCTCGTTTGGAACAAGGAAGCGATGGGACAGGCATTTCGTCGAGCGGCCGGAAGAATCAAGCCGGCTTCGAGCGTGGATTCCACTGCCTCTACGTTGAAAATGGAGAGCGTCGTCGATCGGAACCCTCCGCCGCGTGCGGCCAAGAAGGCTCGGGAGAGTGGCGCTCTTGATTCTGGTAATTGTATTTTGTTAATTGTTTGCGATGTTTCTCTGATTTCTCTGATTTGTTCTTTAGTCTTCTTTCTGGCACCTTGAGAATCGAGGCTGAAGGTTATTGAAGAAGGATTATGTTTCTCTGTTGAATTAAATGCAGTTTCGTAGAATTCGTTTGAGTTGTAAACCATCAGATTTCGATAATGAGCTTGATTTTTGATAACGTCCTTTCCTTCCGGTTTCTCTCAATCGCTTTGGTTGTTTTTTCATTTCATACTGTTATTTATTATCTTCCTCATAATGTAGCAGCATTTATTCATTAGGATTTGGATTTAAGCTCATTTACTACTCATTGATCTGATCTTCCCTGAAGCAACTGTTCTCGTTATTGTTTGTCGGCTTTTGTCTCTCCGAACAGCAATACCTTCTGGTTACTGTTCATTCTTGTTTCTCTTTCCTCTTAGTGTTTGATTTTGTAGAAGCATGGACGAAGTTGCAGTTTACTGTAAGCATTTGCTTCTGGCTACAGTTTTATGTGATCTTTTGGTCCTGAACTTTGAGTTCGACAGTTCACTTGCTTGATCTGAGATTGTGATTTTTCTTCTTCTTATTACACTATTTCTGTTGCATTTATTAAACAAAAGTTGAGAAGTGCTTTGATCTCTGCTTCTATTGCGTATTTTTTTGTTGGCTTACGTTGATTTAGGCTTCCAATGTCTAAGCATGTAAAGAAAGCATAGAGACCAAGGAAAATATGTTCTGGATGCTAGGGATTAATGGAATAATGGATGATGTTTTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTGAAGCATAATAGCATTCCAGGTGGAGTTTTTTTTTGTTCGATTCTCTTACTCCACAGAATCCCTATGAGTTCTATATATAATCAGATGACTAGCTTGATTTTTTTGGGGCGGTCAATCTGGAAGGGAATAAAGATCAAATCACCGCAGAAACTGCCTATAAAATTAGGAACAAAACTGAACAATAACTATCGATTATGCGTGTAAATCGTAAAGCATTGTCCAACAGTTAGATGGAAACTTAAGATCACTCCAAAGTCAGTTTGATGATTTTTTTTTGCTGGCGGGAATTGACTTATAGGATGAAATACAATTGAATGCTAGCTGTTTCCAATTATTTCTTCAGGTGGTGTCTCGGGAAGTAATTCTGGAAATGTGCTTGAAGAACGGGATCCACAGTTCGATGCGATGCTTAGCCAAATGGTGGGTCGAATTAAATCAAAGCCGGGAGGAAAACTCGAGATGGGAGAGGTATTACTCTTAGAATACGAGGCATTGCATTTATTCTTACTATTTTAGTCAGATTCAAATTGAGATGGAAGAGGTAATTTCCATGATCCATTAATAATCTTTGATGTTTGACCAGTCCTACGAGTCAACTGGTGTGATCCTACAGTTCATAGAGATTAATGCATATTGTAAGTAAATAACCCCAAGTTGTTCAATCATTTGATAACTATTCAATCTTTGACAAATGATTACGAATTTAAATTTGTAATTTCAAAAGCATAGAAGTTAAAAGCCTCTCTTTAATTGTTAGGAGGAAAAAAAAATCC
AAGCTACACTACAATTTTTTTTTTTTGATGAGAAACTTAAGAGCATTTCATTAATTGTATGAAATGTACAAAAAGGGCCAATGATAAGAGACCCAATTACAAAAGGTTTTCCCAATTAGAAAAAAGGGAAGTGAGATTATAATCACCAAAAAAAAAGGGGGGTGAATTTACACCAAGAGAAAGCGGAAGCGGAGTAAAGAATAAGCTCTAAAAGTACGTGAGGGTCTTTCGCAGTGTCATTAAAGAGCCTGTTGTTTCGCTCAAACCAAATGGACTAAAGTACAACTTTAATAATGTTTTCTCACATGCAAGCCTTTCCTTGTGAGTAAGGGTGACCCAACAAAAGGAGCTCAAGGACTGATTTCACATGATTTGGGAAGACAATCTGAAGCTTGGAAGCCTTGAAAAGCTGTGACCAAATGTATGAAATGTAGGGGCAGGTTGTAAATAAACGACACACTGATTCACAAGCATTTTTGCACATAACACACCAGTTTGGGGCTGAAACTATCCATGGAGCCATCTTTAAAAGAGTGTTTGCTGTGTTTATCCCCTCATGGTTGATTTCCCATAGGAAGAAACAAACTTTTTTGGGGATGAGGCTCTTCCAAAGGTTACGGTAAAGTGAAGAGCTGGTACAATTATTGGTTGAAGACAATCGATGGGTGAGAGATTTGGTTGTGAAGTGACCTTTTTTATCAAGCTTCCAATTCCGAAAATCTTCCATTATAGCGCTTAGTTGGTTAGGCAATTGGGTTGTGAGACTTAGGGGTGAGCAAAAAGACCAAAACCGACATTTATCGATCGAAACCGACCAACCGAGGCAGAGGGCGGTCGGTCGGTTTTGTGAAGGATTTGATCGGGGTCGGTTTTGGCCTTCACAAAACCGACCGAATCCTTAAAAAAAAAAAAAATAATAAAAAAATAGTAATAATAATTAAAAAAAATAAAAAACTAAAAAAATTTCTCAAAATAATTAAATAATAATTAAAAAATAATTATAAATTATTAAATAATAATAAAATAATAATTATAAATATTTAATATTTAATTAAATAAATATTTATATATATATATAAACCGACCCTTGGTCGGTTCGGGGGTCGGTTGGGGTCGAAAACCGACCCCGACCGACCGATGCTCACCCCTAGTGAGACTAACCCATTTCCACAATCTCAGCTTCTTTCAAGTTCCTGCGGAAGCCAAGGTCCCAAGAAAGTTCTTGTGTTATAAACATCTGTCGGATGGGAGCATTTTTTTTTTTTTTGGATAGGGCAAAGAGTTGTGTGTAGGTGTTCTTCAGAGGAGAATCTCCAAGCCAACAATCAAGCCAAAACGATGTTTGATTGCCATTGCCAATAACACAGAAAATATTATCAAAGATAAGTCTCTGCTCTCTAACAATATACTTCCAAGGCCCACGGTGATACTTTAAAAGCTTAGTATCAGGGCGATGATTGCAATGAGTTGAACCATACTTTGATGTCAAAACAGTCTCTAAAGGGTTTTTCAACATGAAATCTCTAAATCCATTTTGTTAATACACTTTGGTTTTTGATCTTGATGCTAAACAAACCAAGCCCTCCTTCATCAATAGGTCGACAAACGATTTTCCAATTAATTGGATGGATCCCATGAGAGTCTTTGCAACCATTCCATAAAAACCTTCGGTAAAGTTTTTCAATTGCATTTGCAGCCCCACGGAGATTTTAAAAAGAGAGAGGTAATAGGTGGGGAGATTGGATAAGGTTGCTTGAATAAGGGTGAGCATACCTCCTTTGGAGATCTGAAAGAACCGTTCCATCCTTTTAGGCGCCTTTCAATTTTTCAATAATGGGGTCCCAAAAAGATTTTTTTGAGGGGTTTATCATTAAAGGGCAACCCTAAATAGGTGGTGGGCCAAGCTGCCACTTTGCATCCAAAAACTTTGGCTAAATCATCAAGCAAAGGTACATGGATTCCAAAGAATTCTGATTTTTGCGGATTGATTCTCAATCCCG
TTGCATCCAAAAACTCTTTCATTGTGTCAAAGAGTGTGTGAATACTGTCTTTGGTTGGTGTAGAGAACAAAATGGTATCATCGACAAAGAGAAGGTGATTAATGGCTAGCTTGTCTTTTCCCACCTCCATGCCACCAATGAGGCCCCGATTGGCTTTTAAAGTCAACATCTTGCTAAGACAATCAATGATGATAATAAAAAGGAAAGGAGATAAGGGGTCTCCTTGCCTTAATCCTCTCGAAGCCTTTATCTTCCTGCGGGGCCTCCCATTAATAATTATAGAGAAATTTGCTGAAGAGATACAATCATGGATCTATTTTCTCCAAGTCTCATCGAATCCCTTTGCAACAAGAACAAAGTCTAAGAACTCCCAGTCGACCATATCGAAAGCCTTTTCTATGTCAAGTTTGATAACCACCCCCTGTTTCTTTCGTCGTGTATTCCTCAATGAGCTCGTTAGCAATAAGGGAGGCATCGATGATCTGCCTATCTGCCACAAAGGCAAATTGAAATTCTGAAATTGTGTGATGAAGCATCGATTTTAGCCTTTCAGAGAGAACTCTGGCAATCACCTTGTAGAGACAAGTGGTAAGGCTAATAGGCCTATAATCACCCACTGTTTTGGCATCAAGTATTTTTGGAATAAGGCAAATGTAAGTTTCATTGACATTGGCGTTAATAATGTCATTCTCGAAAAAATCTTGGAACACCCTCTTGATATCTTCCTTAAGGATGTTCCAATGCTTTTTAAAGAATTCAGTTGTGAAACCATTGGGGCATAGTGATTTGTTGGTGCCAAGATCCTTAACAGCTTTGTGAATCTCTTCTTCAGTGAAAGGAGCCTCAATCATAAGAGCTTGTTCGACAGAAATGGGGCTCCCAGAATCTAAGGATGGAAGACTACACTGGCCAGGGATTTTGGAGTAAAGCTGTGAGTAAAAATCTATGAACTCATCTTCAATATGTTTGTCAATCACTAAACTTTCACCTTTTCTAAATAACAATACAATGAAAAAAAGAAAATAGTAAATTTGCCTCATTCAAAAGTATATAAGTTAAAAGTCTCTCTTTAATTGTTAGAAGAAAAAAATCCCAGCAATGAAAAATAATTAAAAAAAAAAAGCCTCCTCACAGATAAGGAGGGTTCCTTCTTACCCTTTTTTTTCTTCTCTTCTCCTTCTTCTTTTGTCTCTCGATATCCGTGAATGTCATGGGACAACCCTTTGACTCTATTTGATTGCCAAGGAAATTCGTAGAATATTAAATTCTTGGTAGGTGGCCACCATGACTCTTGAACTCATTCCTTCTAAGCCCTTTATTATTTACCTTGCCCTTTATTATTCAAGGTAGTTTCTTATGGAATCAAAAGACTTGGTGGTAGATGGAAGAGATTTTTGTAGTCCTCCTTGGTCAAAACAATTTAGGCAAAGGGATCTTTCTTTTTTCTTTTTGGCTGTTTTCCCCTTCTTTTCTATAATGAAAACTCTGTCTCCTATACGTGCAGTGTATATATGATAATTTTCCTAGAGGCTTTATAAGAATGATTTTCCTGATATGGGTGATACTGATAATGTTACAGGCCTCTGTGGTGGAGAGGTATGACAGACCAATGCCAAAGCTAAGAAATACAGATCTAAAATCCAGTAAATACGAGGATCGCCCAGCCCCACCAGGAACTTTAAACGTAGCACAGATGCGCCACATTATTCTCCTTCATGAAGGAAAGGCTGATGATCATGATGGCCCAATGCCCCTTCACCAAATTGCTGAAAACTATAATGTCAGTGTTGCTCAAATACAGACGATTTTGCAGTTTCTGTCTCTTCCTCCAGAGGACACTCTTAGAGAGAAAAAGAAGGATCCTTAATAACAGGTTTTTAATTATTTCTTTTGGGGGACTAACCATTGAATTTGGTTTTTGTGTACTAAATTTTGAAGGTTATATTTTAACACTTTGAAACTACTTTCAGGAATGGAGCCAAGAAAACAAAGGAAGGGGC
CATTGCTCCCCTAAGATGAAAAAAAATAAAGGTTTTTAAAGTTTTCTACTAACTTTGTCTCTTTTATTTATATCTTATGTGTACACAAAAAATCAGGTTGAGTTATAGCTTTACAAACGGTCTAATGTGTTCTTAAAATTTTCAATCTTATTTTCTTAGAGGTTTCTCAA

mRNA sequence

ATCAGATCGGCCACGTTCGTTCTCCGACGCTCTCTTATTTTAGTTCTCTTCAAAGCGGTCTGAGTTTTGTACTCAGTAAAACCGAGTCGCGACTCACCGATTCGTCATTTCTTCTCCATTTCTCCCCTGAAAATTGTCCTAAAACCCTCCCACTCATAGGGTTCTTGATTCTATACGCTCGTTTCGTGACGAATTTCGTATGAATCTGAATCTCCGATTCGAGTTCGTTTTGTAAGCTCGTTGCTCGTTTGGAACAAGGAAGCGATGGGACAGGCATTTCGTCGAGCGGCCGGAAGAATCAAGCCGGCTTCGAGCGTGGATTCCACTGCCTCTACGTTGAAAATGGAGAGCGTCGTCGATCGGAACCCTCCGCCGCGTGCGGCCAAGAAGGCTCGGGAGAGTGGCGCTCTTGATTCTGGTGGTGTCTCGGGAAGTAATTCTGGAAATGTGCTTGAAGAACGGGATCCACAGTTCGATGCGATGCTTAGCCAAATGGTGGGTCGAATTAAATCAAAGCCGGGAGGAAAACTCGAGATGGGAGAGGCCTCTGTGGTGGAGAGGTATGACAGACCAATGCCAAAGCTAAGAAATACAGATCTAAAATCCAGTAAATACGAGGATCGCCCAGCCCCACCAGGAACTTTAAACGTAGCACAGATGCGCCACATTATTCTCCTTCATGAAGGAAAGGCTGATGATCATGATGGCCCAATGCCCCTTCACCAAATTGCTGAAAACTATAATGTCAGTGTTGCTCAAATACAGACGATTTTGCAGTTTCTGTCTCTTCCTCCAGAGGACACTCTTAGAGAGAAAAAGAAGGATCCTTAATAACAGGAATGGAGCCAAGAAAACAAAGGAAGGGGCCATTGCTCCCCTAAGATGAAAAAAAATAAAGGTTTTTAAAGTTTTCTACTAACTTTGTCTCTTTTATTTATATCTTATGTGTACACAAAAAATCAGGTTGAGTTATAGCTTTACAAACGGTCTAATGTGTTCTTAAAATTTTCAATCTTATTTTCTTAGAGGTTTCTCAA

Coding sequence (CDS)

ATGGGACAGGCATTTCGTCGAGCGGCCGGAAGAATCAAGCCGGCTTCGAGCGTGGATTCCACTGCCTCTACGTTGAAAATGGAGAGCGTCGTCGATCGGAACCCTCCGCCGCGTGCGGCCAAGAAGGCTCGGGAGAGTGGCGCTCTTGATTCTGGTGGTGTCTCGGGAAGTAATTCTGGAAATGTGCTTGAAGAACGGGATCCACAGTTCGATGCGATGCTTAGCCAAATGGTGGGTCGAATTAAATCAAAGCCGGGAGGAAAACTCGAGATGGGAGAGGCCTCTGTGGTGGAGAGGTATGACAGACCAATGCCAAAGCTAAGAAATACAGATCTAAAATCCAGTAAATACGAGGATCGCCCAGCCCCACCAGGAACTTTAAACGTAGCACAGATGCGCCACATTATTCTCCTTCATGAAGGAAAGGCTGATGATCATGATGGCCCAATGCCCCTTCACCAAATTGCTGAAAACTATAATGTCAGTGTTGCTCAAATACAGACGATTTTGCAGTTTCTGTCTCTTCCTCCAGAGGACACTCTTAGAGAGAAAAAGAAGGATCCTTAA

Protein sequence

MGQAFRRAAGRIKPASSVDSTASTLKMESVVDRNPPPRAAKKARESGALDSGGVSGSNSGNVLEERDPQFDAMLSQMVGRIKSKPGGKLEMGEASVVERYDRPMPKLRNTDLKSSKYEDRPAPPGTLNVAQMRHIILLHEGKADDHDGPMPLHQIAENYNVSVAQIQTILQFLSLPPEDTLREKKKDP
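
The protein sequence above should follow from the CDS under the standard genetic code. The snippet below is a minimal sketch of that translation, not the database's own pipeline: the names `CODON_TABLE` and `translate` are illustrative, and only the first 30 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    residues = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nucleotides of the CDS listed above.
cds_prefix = "ATGGGACAGGCATTTCGTCGAGCGGCCGGA"
protein_prefix = translate(cds_prefix)
print(protein_prefix)  # MGQAFRRAAG
```

Passing the full CDS string to `translate` in the same way gives a sequence that can be compared against the protein sequence listed above.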
Homology
BLAST of Tan0022414.1 vs. NCBI nr
Match: XP_023006464.1 (uncharacterized protein LOC111499180 [Cucurbita maxima])

HSP 1 Score: 330.1 bits (845), Expect = 1.2e-86
Identity = 166/186 (89.25%), Positives = 179/186 (96.24%), Query Frame = 0

Query: 1   MGQAFRRAAGRIKPASSVDSTASTLKMESVVDRNPPPRAAKKARESGALDSGGVSGSNSG 60
           MGQAFRRAAGRIKPASS+DS+AS+LKMESVVDR PPPRAA+KARESG+LDSG V GSNSG
Sbjct: 1   MGQAFRRAAGRIKPASSIDSSASSLKMESVVDRKPPPRAAEKARESGSLDSGDVLGSNSG 60

Query: 61  NVLEERDPQFDAMLSQMVGRIKSKPGGKLEMGEASVVERYDRPMPKLRNTDLKSSKYEDR 120
           NVLEERDPQFDAMLSQM GRI+SKPGGKLEMGEASVVERY+RPMPKLRNTDLKSSKYEDR
Sbjct: 61  NVLEERDPQFDAMLSQMAGRIRSKPGGKLEMGEASVVERYNRPMPKLRNTDLKSSKYEDR 120

Query: 121 PAPPGTLNVAQMRHIILLHEGKADDHDGPMPLHQIAENYNVSVAQIQTILQFLSLPPEDT 180
           PAPPGTLNVAQMRHIILLHEGKADDHDGPM ++QIAE YNV V+QI+TILQFLSLPPED+
Sbjct: 121 PAPPGTLNVAQMRHIILLHEGKADDHDGPMGVNQIAERYNVGVSQIRTILQFLSLPPEDS 180

Query: 181 LREKKK 187
           LR+KKK
Sbjct: 181 LRDKKK 186

BLAST of Tan0022414.1 vs. NCBI nr
Match: XP_022958820.1 (uncharacterized protein LOC111459975 [Cucurbita moschata])

HSP 1 Score: 327.4 bits (838), Expect = 8.1e-86
Identity = 165/186 (88.71%), Positives = 178/186 (95.70%), Query Frame = 0

Query: 1   MGQAFRRAAGRIKPASSVDSTASTLKMESVVDRNPPPRAAKKARESGALDSGGVSGSNSG 60
           MGQAFRRAAGRIKPASS+DS+AS+LKMESVVDR PPPRAA+KARESGALDSG V GSNS 
Sbjct: 1   MGQAFRRAAGRIKPASSIDSSASSLKMESVVDRKPPPRAAEKARESGALDSGDVLGSNSE 60

Query: 61  NVLEERDPQFDAMLSQMVGRIKSKPGGKLEMGEASVVERYDRPMPKLRNTDLKSSKYEDR 120
           NVLEERDPQFDAMLSQM GRI+SKPGGKLEMGEASVVERYDRPMP+LRNTDLKSSKYEDR
Sbjct: 61  NVLEERDPQFDAMLSQMAGRIRSKPGGKLEMGEASVVERYDRPMPRLRNTDLKSSKYEDR 120

Query: 121 PAPPGTLNVAQMRHIILLHEGKADDHDGPMPLHQIAENYNVSVAQIQTILQFLSLPPEDT 180
           PAPPGTLNVAQMRHI+LLHEGKA+DHDGPM ++QIAE YNV VAQI+TILQFLSLPPED+
Sbjct: 121 PAPPGTLNVAQMRHIMLLHEGKAEDHDGPMGVNQIAERYNVGVAQIRTILQFLSLPPEDS 180

Query: 181 LREKKK 187
           LR+KKK
Sbjct: 181 LRDKKK 186

BLAST of Tan0022414.1 vs. NCBI nr
Match: KAG7013632.1 (hypothetical protein SDJN02_23799 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 327.0 bits (837), Expect = 1.1e-85
Identity = 165/186 (88.71%), Positives = 178/186 (95.70%), Query Frame = 0

Query: 1   MGQAFRRAAGRIKPASSVDSTASTLKMESVVDRNPPPRAAKKARESGALDSGGVSGSNSG 60
           MGQAFRRAAGRIKPASS+DS+AS+LKMESVVDR PPPRAA+KARESGALDSG V GSNS 
Sbjct: 1   MGQAFRRAAGRIKPASSIDSSASSLKMESVVDRKPPPRAAEKARESGALDSGDVLGSNSE 60

Query: 61  NVLEERDPQFDAMLSQMVGRIKSKPGGKLEMGEASVVERYDRPMPKLRNTDLKSSKYEDR 120
           NVLEERDPQFDAMLSQM GRI+SKPGGKLEMGEASVVERYDRPMP+LRNTDLKSSKYEDR
Sbjct: 61  NVLEERDPQFDAMLSQMAGRIRSKPGGKLEMGEASVVERYDRPMPRLRNTDLKSSKYEDR 120

Query: 121 PAPPGTLNVAQMRHIILLHEGKADDHDGPMPLHQIAENYNVSVAQIQTILQFLSLPPEDT 180
           PAPPGTLNVAQMRHIILLHEGKA+DHDGPM ++QIAE YNV VAQI+TILQFLSLPPED+
Sbjct: 121 PAPPGTLNVAQMRHIILLHEGKAEDHDGPMGVNQIAERYNVGVAQIRTILQFLSLPPEDS 180

Query: 181 LREKKK 187
           L++KKK
Sbjct: 181 LQDKKK 186

BLAST of Tan0022414.1 vs. NCBI nr
Match: XP_023547640.1 (uncharacterized protein LOC111806523 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 324.3 bits (830), Expect = 6.8e-85
Identity = 164/186 (88.17%), Positives = 176/186 (94.62%), Query Frame = 0

Query: 1   MGQAFRRAAGRIKPASSVDSTASTLKMESVVDRNPPPRAAKKARESGALDSGGVSGSNSG 60
           MGQAFRRAAGRIKPASS+DS+AS+LKMESVVDR PPPRA +KARESGALDSG   GSNS 
Sbjct: 1   MGQAFRRAAGRIKPASSIDSSASSLKMESVVDRKPPPRAVEKARESGALDSGDDLGSNSE 60

Query: 61  NVLEERDPQFDAMLSQMVGRIKSKPGGKLEMGEASVVERYDRPMPKLRNTDLKSSKYEDR 120
           NVLEERDPQFDAMLSQM GRI+SKPGGKLEMGEASVVERYDRPMP+LRNTDLKSSKYEDR
Sbjct: 61  NVLEERDPQFDAMLSQMAGRIRSKPGGKLEMGEASVVERYDRPMPRLRNTDLKSSKYEDR 120

Query: 121 PAPPGTLNVAQMRHIILLHEGKADDHDGPMPLHQIAENYNVSVAQIQTILQFLSLPPEDT 180
           PAPPGTLNVAQMRHIILLHEGKA+DHDGPM ++QIAE YNV VAQI+TILQFLSLPPED+
Sbjct: 121 PAPPGTLNVAQMRHIILLHEGKAEDHDGPMGVNQIAERYNVGVAQIRTILQFLSLPPEDS 180

Query: 181 LREKKK 187
           LR+KKK
Sbjct: 181 LRDKKK 186

BLAST of Tan0022414.1 vs. NCBI nr
Match: XP_011657169.1 (uncharacterized protein LOC101223121 [Cucumis sativus] >XP_031743505.1 uncharacterized protein LOC101223121 [Cucumis sativus] >KGN47188.1 hypothetical protein Csa_020957 [Cucumis sativus])

HSP 1 Score: 320.9 bits (821), Expect = 7.6e-84
Identity = 165/188 (87.77%), Positives = 176/188 (93.62%), Query Frame = 0

Query: 1   MGQAFRRAAGRIKPASSVDS-TASTLKMESVVDRNPPPRAAKKARESGALDSGGVSGSNS 60
           MGQAFRRAAGRIKPASS+DS TAS+LKMES+VDR PPPR A+KARESGALDSG V  S+S
Sbjct: 1   MGQAFRRAAGRIKPASSIDSTTASSLKMESIVDRKPPPRVAEKARESGALDSGDVPASDS 60

Query: 61  GNVLEERDPQFDAMLSQMVGRIKSKPGGKLEMGEASVVERYDRPMPKLRNTDLKSSKYED 120
           GN+LEERDPQFDAMLSQMVGRIKSKPGGKLEMGEASVVERY RPMPKLR+T++ SSKYED
Sbjct: 61  GNMLEERDPQFDAMLSQMVGRIKSKPGGKLEMGEASVVERYGRPMPKLRDTNISSSKYED 120

Query: 121 RPAPPGTLNVAQMRHIILLHEGKADDHDGPMPLHQIAENYNVSVAQIQTILQFLSLPPED 180
           RPAPPGTLNVAQMR IILLHEGKADDHDGPM LHQIAE YNVSVAQIQTILQFLSLPPED
Sbjct: 121 RPAPPGTLNVAQMRQIILLHEGKADDHDGPMGLHQIAERYNVSVAQIQTILQFLSLPPED 180

Query: 181 TLREKKKD 188
           +LR+K KD
Sbjct: 181 SLRDKIKD 188

BLAST of Tan0022414.1 vs. ExPASy TrEMBL
Match: A0A6J1L288 (uncharacterized protein LOC111499180 OS=Cucurbita maxima OX=3661 GN=LOC111499180 PE=4 SV=1)

HSP 1 Score: 330.1 bits (845), Expect = 6.0e-87
Identity = 166/186 (89.25%), Positives = 179/186 (96.24%), Query Frame = 0

Query: 1   MGQAFRRAAGRIKPASSVDSTASTLKMESVVDRNPPPRAAKKARESGALDSGGVSGSNSG 60
           MGQAFRRAAGRIKPASS+DS+AS+LKMESVVDR PPPRAA+KARESG+LDSG V GSNSG
Sbjct: 1   MGQAFRRAAGRIKPASSIDSSASSLKMESVVDRKPPPRAAEKARESGSLDSGDVLGSNSG 60

Query: 61  NVLEERDPQFDAMLSQMVGRIKSKPGGKLEMGEASVVERYDRPMPKLRNTDLKSSKYEDR 120
           NVLEERDPQFDAMLSQM GRI+SKPGGKLEMGEASVVERY+RPMPKLRNTDLKSSKYEDR
Sbjct: 61  NVLEERDPQFDAMLSQMAGRIRSKPGGKLEMGEASVVERYNRPMPKLRNTDLKSSKYEDR 120

Query: 121 PAPPGTLNVAQMRHIILLHEGKADDHDGPMPLHQIAENYNVSVAQIQTILQFLSLPPEDT 180
           PAPPGTLNVAQMRHIILLHEGKADDHDGPM ++QIAE YNV V+QI+TILQFLSLPPED+
Sbjct: 121 PAPPGTLNVAQMRHIILLHEGKADDHDGPMGVNQIAERYNVGVSQIRTILQFLSLPPEDS 180

Query: 181 LREKKK 187
           LR+KKK
Sbjct: 181 LRDKKK 186

BLAST of Tan0022414.1 vs. ExPASy TrEMBL
Match: A0A6J1H4J6 (uncharacterized protein LOC111459975 OS=Cucurbita moschata OX=3662 GN=LOC111459975 PE=4 SV=1)

HSP 1 Score: 327.4 bits (838), Expect = 3.9e-86
Identity = 165/186 (88.71%), Positives = 178/186 (95.70%), Query Frame = 0

Query: 1   MGQAFRRAAGRIKPASSVDSTASTLKMESVVDRNPPPRAAKKARESGALDSGGVSGSNSG 60
           MGQAFRRAAGRIKPASS+DS+AS+LKMESVVDR PPPRAA+KARESGALDSG V GSNS 
Sbjct: 1   MGQAFRRAAGRIKPASSIDSSASSLKMESVVDRKPPPRAAEKARESGALDSGDVLGSNSE 60

Query: 61  NVLEERDPQFDAMLSQMVGRIKSKPGGKLEMGEASVVERYDRPMPKLRNTDLKSSKYEDR 120
           NVLEERDPQFDAMLSQM GRI+SKPGGKLEMGEASVVERYDRPMP+LRNTDLKSSKYEDR
Sbjct: 61  NVLEERDPQFDAMLSQMAGRIRSKPGGKLEMGEASVVERYDRPMPRLRNTDLKSSKYEDR 120

Query: 121 PAPPGTLNVAQMRHIILLHEGKADDHDGPMPLHQIAENYNVSVAQIQTILQFLSLPPEDT 180
           PAPPGTLNVAQMRHI+LLHEGKA+DHDGPM ++QIAE YNV VAQI+TILQFLSLPPED+
Sbjct: 121 PAPPGTLNVAQMRHIMLLHEGKAEDHDGPMGVNQIAERYNVGVAQIRTILQFLSLPPEDS 180

Query: 181 LREKKK 187
           LR+KKK
Sbjct: 181 LRDKKK 186

BLAST of Tan0022414.1 vs. ExPASy TrEMBL
Match: A0A0A0KCC5 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G196710 PE=4 SV=1)

HSP 1 Score: 320.9 bits (821), Expect = 3.7e-84
Identity = 165/188 (87.77%), Positives = 176/188 (93.62%), Query Frame = 0

Query: 1   MGQAFRRAAGRIKPASSVDS-TASTLKMESVVDRNPPPRAAKKARESGALDSGGVSGSNS 60
           MGQAFRRAAGRIKPASS+DS TAS+LKMES+VDR PPPR A+KARESGALDSG V  S+S
Sbjct: 1   MGQAFRRAAGRIKPASSIDSTTASSLKMESIVDRKPPPRVAEKARESGALDSGDVPASDS 60

Query: 61  GNVLEERDPQFDAMLSQMVGRIKSKPGGKLEMGEASVVERYDRPMPKLRNTDLKSSKYED 120
           GN+LEERDPQFDAMLSQMVGRIKSKPGGKLEMGEASVVERY RPMPKLR+T++ SSKYED
Sbjct: 61  GNMLEERDPQFDAMLSQMVGRIKSKPGGKLEMGEASVVERYGRPMPKLRDTNISSSKYED 120

Query: 121 RPAPPGTLNVAQMRHIILLHEGKADDHDGPMPLHQIAENYNVSVAQIQTILQFLSLPPED 180
           RPAPPGTLNVAQMR IILLHEGKADDHDGPM LHQIAE YNVSVAQIQTILQFLSLPPED
Sbjct: 121 RPAPPGTLNVAQMRQIILLHEGKADDHDGPMGLHQIAERYNVSVAQIQTILQFLSLPPED 180

Query: 181 TLREKKKD 188
           +LR+K KD
Sbjct: 181 SLRDKIKD 188

BLAST of Tan0022414.1 vs. ExPASy TrEMBL
Match: A0A1S3C957 (uncharacterized protein LOC103497865 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103497865 PE=4 SV=1)

HSP 1 Score: 316.2 bits (809), Expect = 9.0e-83
Identity = 164/188 (87.23%), Positives = 175/188 (93.09%), Query Frame = 0

Query: 1   MGQAFRRAAGRIKPASSVDS-TASTLKMESVVDRNPPPRAAKKARESGALDSGGVSGSNS 60
           MGQAFRRAAGRIKPASSVDS TAS+LKMES+VDR PPPR A+KARESG+LDSG V+ S+S
Sbjct: 1   MGQAFRRAAGRIKPASSVDSTTASSLKMESIVDRKPPPRVAEKARESGSLDSGDVTASDS 60

Query: 61  GNVLEERDPQFDAMLSQMVGRIKSKPGGKLEMGEASVVERYDRPMPKLRNTDLKSSKYED 120
           GN LEERDPQFDAMLSQMVGRIKSKPGGKLEMGEASVVERY RPMPKLR+T++KSSKYED
Sbjct: 61  GNRLEERDPQFDAMLSQMVGRIKSKPGGKLEMGEASVVERYGRPMPKLRDTNIKSSKYED 120

Query: 121 RPAPPGTLNVAQMRHIILLHEGKADDHDGPMPLHQIAENYNVSVAQIQTILQFLSLPPED 180
           RPAPPGTLNVAQMR IILLHEGKADDHDGPM  HQIAE YNVSVAQIQTILQFLSLPPED
Sbjct: 121 RPAPPGTLNVAQMRQIILLHEGKADDHDGPMGPHQIAERYNVSVAQIQTILQFLSLPPED 180

Query: 181 TLREKKKD 188
           +LR+K  D
Sbjct: 181 SLRDKITD 188

BLAST of Tan0022414.1 vs. ExPASy TrEMBL
Match: A0A6J1C9H1 (uncharacterized protein LOC111009550 OS=Momordica charantia OX=3673 GN=LOC111009550 PE=4 SV=1)

HSP 1 Score: 313.2 bits (801), Expect = 7.6e-82
Identity = 160/193 (82.90%), Positives = 173/193 (89.64%), Query Frame = 0

Query: 1   MGQAFRRAAGRIKPASSVDSTASTLKMESVVDRNPPPRAAKKA-----RESGALDSGGVS 60
           MGQAFRRAAGRIKPASS+DSTAS+LKMESVVDR PPPRA +KA     RESGALDSGG+ 
Sbjct: 1   MGQAFRRAAGRIKPASSIDSTASSLKMESVVDRRPPPRAIQKAEVPQPRESGALDSGGIL 60

Query: 61  GSNSGNVLEERDPQFDAMLSQMVGRIKSKPGGKLEMGEASVVERYDRPMPKLRNTDLKSS 120
           GSNS NV EERDPQFDAML QMVGRIKSKPGGKLEMGEA+VVERY+RPMPKLRNTD+KSS
Sbjct: 61  GSNSENVPEERDPQFDAMLGQMVGRIKSKPGGKLEMGEAAVVERYERPMPKLRNTDVKSS 120

Query: 121 KYEDRPAPPGTLNVAQMRHIILLHEGKADDHDGPMPLHQIAENYNVSVAQIQTILQFLSL 180
           +YEDRPAPPGTLNVAQMRH+I LHEGKA+DH+G M + QIA+ YNVSV QI TILQFLSL
Sbjct: 121 RYEDRPAPPGTLNVAQMRHVIQLHEGKAEDHNGTMAVQQIAQRYNVSVTQIHTILQFLSL 180

Query: 181 PPEDTLREKKKDP 189
           PPEDTLR K KDP
Sbjct: 181 PPEDTLRAKNKDP 193

BLAST of Tan0022414.1 vs. TAIR 10
Match: AT3G21400.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-terminal protein myristoylation; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; Has 29 Blast hits to 29 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 2; Fungi - 0; Plants - 27; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 181.0 bits (458), Expect = 8.7e-46
Identity = 97/188 (51.60%), Positives = 129/188 (68.62%), Query Frame = 0

Query: 1   MGQAFRRAAGRIKPASSVDSTASTLKMESVVDRNPPPRAAKKARESGALDSGGVSG--SN 60
           MGQ  RRA G+IK       +  ++   S+        A K +  + A+D     G  ++
Sbjct: 1   MGQQLRRAVGKIKEVERSSPSRVSIDRRSLPTEE--LSAVKSSPSTAAVDGVSDKGRRTS 60

Query: 61  SGNVLEERDPQFDAMLSQMVGRIKSKPGGKLEMGEASVVERYDRPMPKLRNTDLKSSKYE 120
             NVLEERDP++D ML+QMVGRIK+KPGGK EMGEASVVE   RP+PKLRNT  +S++YE
Sbjct: 61  EDNVLEERDPKYDTMLNQMVGRIKAKPGGKAEMGEASVVETSKRPLPKLRNTTPESTRYE 120

Query: 121 DRPAPPGTLNVAQMRHIILLHEGKADDHDGPMPLHQIAENYNVSVAQIQTILQFLSLPPE 180
           + P P GTLNVAQ+RHI+LL +GK+ DH GPM +++IAE Y + V+Q+Q I QFLSLP E
Sbjct: 121 ENPVPQGTLNVAQVRHIMLLFQGKSQDHHGPMGVNEIAEKYRIDVSQVQKITQFLSLPQE 180

Query: 181 DTLREKKK 187
            T ++KK+
Sbjct: 181 ITDKQKKQ 186

The following BLAST results are available for this feature:

vs. NCBI nr
Match Name      E-value  Identity (%)  Description
XP_023006464.1  1.2e-86  89.25         uncharacterized protein LOC111499180 [Cucurbita maxima]
XP_022958820.1  8.1e-86  88.71         uncharacterized protein LOC111459975 [Cucurbita moschata]
KAG7013632.1    1.1e-85  88.71         hypothetical protein SDJN02_23799 [Cucurbita argyrosperma subsp. argyrosperma]
XP_023547640.1  6.8e-85  88.17         uncharacterized protein LOC111806523 [Cucurbita pepo subsp. pepo]
XP_011657169.1  7.6e-84  87.77         uncharacterized protein LOC101223121 [Cucumis sativus] >XP_031743505.1 uncharact...

vs. ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A6J1L288      6.0e-87  89.25         uncharacterized protein LOC111499180 OS=Cucurbita maxima OX=3661 GN=LOC111499180...
A0A6J1H4J6      3.9e-86  88.71         uncharacterized protein LOC111459975 OS=Cucurbita moschata OX=3662 GN=LOC1114599...
A0A0A0KCC5      3.7e-84  87.77         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G196710 PE=4 SV=1
A0A1S3C957      9.0e-83  87.23         uncharacterized protein LOC103497865 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A6J1C9H1      7.6e-82  82.90         uncharacterized protein LOC111009550 OS=Momordica charantia OX=3673 GN=LOC111009...

vs. TAIR 10
Match Name      E-value  Identity (%)  Description
AT3G21400.1     8.7e-46  51.60         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-termin...

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source       Source Term  Source Description                  Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction                 coord: 1..70
None      No IPR available  PANTHER      PTHR36759    DYNEIN BETA CHAIN, CILIARY PROTEIN  coord: 1..186

Relationships

This mRNA is a part of the following gene feature(s):

Feature Name  Unique Name  Type
Tan0022414    Tan0022414   gene


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name                 Unique Name                                          Type
Tan0022414.1-five_prime_utr  Tan0022414.1-five_prime_utr-LG06:79051036..79051299  five_prime_UTR


The following exon feature(s) are a part of this mRNA:

Feature Name       Unique Name                                Type
Tan0022414.1-exon  Tan0022414.1-exon-LG06:79051036..79051453  exon
Tan0022414.1-exon  Tan0022414.1-exon-LG06:79052606..79052730  exon
Tan0022414.1-exon  Tan0022414.1-exon-LG06:79056615..79056908  exon
Tan0022414.1-exon  Tan0022414.1-exon-LG06:79057006..79057205  exon


The following CDS feature(s) are a part of this mRNA:

Feature Name      Unique Name                               Type
Tan0022414.1-cds  Tan0022414.1-cds-LG06:79051300..79051453  CDS
Tan0022414.1-cds  Tan0022414.1-cds-LG06:79052606..79052730  CDS
Tan0022414.1-cds  Tan0022414.1-cds-LG06:79056615..79056902  CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name                  Unique Name                                           Type
Tan0022414.1-three_prime_utr  Tan0022414.1-three_prime_utr-LG06:79056903..79056908  three_prime_UTR
Tan0022414.1-three_prime_utr  Tan0022414.1-three_prime_utr-LG06:79057006..79057205  three_prime_UTR
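
The feature coordinates listed above can be cross-checked against the reported sequence length of 1037. The following is a rough sketch that assumes the coordinates are 1-based and inclusive, as is usual for GFF-style annotations; the variable and function names are illustrative only.

```python
# Coordinates on LG06, copied from the feature tables (assumed 1-based, inclusive).
exons = [(79051036, 79051453), (79052606, 79052730),
         (79056615, 79056908), (79057006, 79057205)]
cds = [(79051300, 79051453), (79052606, 79052730), (79056615, 79056902)]
five_prime_utr = [(79051036, 79051299)]
three_prime_utr = [(79056903, 79056908), (79057006, 79057205)]


def total_length(intervals):
    """Sum the lengths of inclusive (start, end) intervals."""
    return sum(end - start + 1 for start, end in intervals)


mrna_length = total_length(exons)             # spliced transcript length
cds_length = total_length(cds)                # coding bases, stop codon included
utr_length = total_length(five_prime_utr) + total_length(three_prime_utr)
protein_length = cds_length // 3 - 1          # codons minus the stop codon
gene_span = exons[-1][1] - exons[0][0] + 1    # first exon start to last exon end

print(mrna_length, cds_length, utr_length, protein_length, gene_span)
# 1037 567 470 188 6170
```

Under that assumption the exon total equals the reported sequence length, and the UTR and CDS lengths together account for the whole transcript.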


The following polypeptide feature(s) derive from this mRNA:

Feature Name  Unique Name           Type
Tan0022414.1  Tan0022414.1-protein  polypeptide