Tan0016255 (gene) Snake gourd v1

Overview
NameTan0016255
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionhomeobox protein 6
LocationLG05: 39628593 .. 39631956 (-)
RNA-Seq ExpressionTan0016255
SyntenyTan0016255
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGATGAAATATTATATAGAGAAATTCATCACAGTATCGAGACAGTGCCCCAAAAGTTATTGTAATATGAAAATTTGGAAATGTAGCATCTTAATTAATGTTGAGCATAGCATTTGAGACGCTACGGATGAGATTACTCAATGAAAGCGCATGCATGGAAGTGTCTTGACGAATCTTGTTTTTCCGTAGAGTTAAGACACTCTAATATAGCATCTCGATGCTACCTACTGCTTGTTGGCTTGTTTGTGGAAACATTGTAACGTTGTTCCCTCTGTTCTTTTGGCTTTATTTCTTCTTCTATTTACTTGATTTCTTCAATCCTTACTTAAATCATCTCCTTCTCATTATAGAATCATCATAAACTTTGTATCTCCTCTATATACTTGATTGTTGAGAGTTAAACATATTTTATTCGTACGTTATGTAGATTTAAGTTTCAGTATGTTAATGTATTTAATTATAAGTTTTTTGGATAGGAGTTAAAAGAAGAATTTATTCATTCATAGCAAAGTTTTACGTTAATTGAGTTTATGTTATTATATATTAGAGTTTTGAGTTCATAAAGTTAAGTGTAGGAGAGTTACGGATAGGATCACTTTAGCAAATCCTATAAATAGGATTTCGTTAGTACGATTGTAACACAAGCAAGTAAAGTGAATAAAAAAACAAAGTGAGTTTAGAGAAAAAAGTGTGAACCAAGTTTGTAACTTTGTGAAAAGTATTTTTTAATACAAATGAATTGTTTCTTGTGAAATATATTCCTTGTTTGATTATTTATTTGTGAGTGTCCATCACACGCTTCCAAGATTGATATCTGCAAAACAAACAAATATGTGTGAAAATGATTAACTATTTTAGCATTAAAATTAGTGGTCGTTAAGAATTATAGAAATAACTCTCTTTTTGCACTATCTTTAGTATATGAGCCAAATTGACTATTGACATAATAGTTCAAATGCATAAACTAATTTTACTTTAATAAACTTAACAGCTGGGTTGTATTATATAAAAGAGCACAATTTGGGTGGTGCCGAAGCAACACTCAAATGTTTCGTAGCTCCTTTTTTAATCACAAGTTTTTTTAAAAAAATAAATAAATAAAAAGAAATTATGTGACTACCAAGTGGGAAGTAAAATATAGGGCAACAACCAAATATTTTCCCATATAAAAGTATTATCTCTTTTACTTTATGAAGTTAAATTAATCATCATTAAAGCTAAAAAAAAAATCAAAATTTATAAAACCATACAAAAAAATTGAACTTTTTAAAATATATTCTTTGAACAATATTGTTAAAATCATTTAAAAAAAAAAAACAATAATGTTAAAATAATCCATAGAGATATTGATTAGACCCTTCATGTACAGTTTGGGCATATTTATTCCTTGTTTCTCTCTCTCTTACTAGATTCTCCCTATCTCTCTCTCATCATACTTTGCTTTCACTTTATCCCTCTCTCTCTCACACCCCATATCCCTTTACTCCATTAGCCATTACCCACAAATATATATAAATCTCAATCTCAAATCTCAAAGAAAAATTTGCAAGTGAAATTGAGAAGAGGCCATGGGAAGAAGCTGTCATGGAGATGAAAGATGGGAAGAAGATGAAGAAGAAGCATTATCTTTCTGTGATCTTCCTGTAAAAGATCAAAAGCAGCAGCTTTTGATTGTAAATGAAAACCCCATCGAAGAGTCATCAGTCGCCCTCCAAACCGAAACGGAGGACTTTGATTTCAACCACTGGCCGCCGCCGCCGCCGCCCATGTGCGCCGCTGATGACCTCTTCTTTCAAGGCCATATGCTCCCTCTTCGTCTCTCTTTCAGCTCTGATAATTCTCACAATCACTTCTTTTCTAAAAATTTGTCCATCAGGTAATTCGTGCACAGATGGAGTCCACCTACAATTAATTTTTATACATATATAATTTTAAATTATTATTATTATTATTTAATCTAAAATTAGATTGGCTTTTCTTTCCTTTTCCCCATTATTTACTTCTTCAGGTCGGAGTCAATGGATCATAATATGTTGAGGTTTAGAAATGGAAGCAGCAGTAGCAGTAGCAGTAGAAGTTATTATTCAAGGTAATAAGCTTTTAATTAAATTTCTCGAAATATCCAGATTTAATTTCTCACATATCAAAGGAATAATTGATTAAAAACGGAACAAAATTTATCGACCGCAGATATGTTTTCTTCCCAAGAAAAAAAAAAAAATTATTTTATTTTTTATAAAATAAATAAAAAATTTGGGATATTAAAGCTGCACTAAGACATACCAATTGCTTGGACACAGATACGTTTTTCTTTCTTAAAAATGAGGTTGATAGATATTTTTAAAACACCCATCCAAATATTTAAATTAACAAATAATTTTTTTTAATATTACTTTGATGTATTCTTTTAGTATTTTATTATCCATATTAATATTGTATAATCGAAATATGAATTAATTTCCAACCGATTGATATATTTTATATATATATATGAAATTATTTTTAATTATATATGTTTTTTCAAACAAAAAGTTTAAAATTTTTCAAAAGTAGCATCACCGAAGTCCAACTGAATTTCAAACATAAATATAAACTACCGTTGAAAAAAAGGTTAAATTTGAATTATATTTATAGAAAAAGTTGTAACGTTCGAAAATTTTCATATTGAATTTATTTATGATGATCTATCAAATATTTTCTATTACAGGTCATCAAGTATTAGTAACAATTCCATTTCCATTCCGACGAATTCAAAGCCGAGAACTCAAAAGAACGTTTTCCACTCTCACCCAAGTCCGACGCCCCAAATTAGATCCTTCTCGACTTCCGGCCGCCGGAGCCGGAGCTCTTCCCGTTGGGACTTTTTCCGGTTGGGCCTTCTCCGAACGCCGGGAATGGAACTTCACGACCTCAAAACTCGCACCAGCACTACCGCCACCGCCGCCGCGGCAGCACCCACGGCGGCGCAAAAAACAACGGGCTCGTTTCTGGGTGTGGTCAGCTGCAAAAAATCCGTGGATACAATACCGGCGCCGGCGAGTACGAGGAAGAACAGAAGCAATTGGAGTGAGAGTATTGCGAACAAAAGGAGTGAAAATGGAAAGGGAAATGGGATTAGGGAAAAAGTGATGGGGGAATTGAAGATGATGAAGAGATTACAGAAACAACAGAAGGTTTTGAATAGTAATAATAAAAATAATAATGATGATGAAATTAGCGAAAAGGAAAAGGAAAAGGAAAAGGAAACTCGGCTGTCACATCGTCGAACATTTGAATGGGTAAAGCAGCTCTCGCATGCAAGCTTTGCTGAAGAACAATGATCTTTAACCGTTTTTGACTTCTCACCAATTTAATTGCATTCCCC

mRNA sequence

ATGATGAAATATTATATAGAGAAATTCATCACAGTATCGAGACAGTGCCCCAAAAAAGAGGCCATGGGAAGAAGCTGTCATGGAGATGAAAGATGGGAAGAAGATGAAGAAGAAGCATTATCTTTCTGTGATCTTCCTGTAAAAGATCAAAAGCAGCAGCTTTTGATTGTAAATGAAAACCCCATCGAAGAGTCATCAGTCGCCCTCCAAACCGAAACGGAGGACTTTGATTTCAACCACTGGCCGCCGCCGCCGCCGCCCATGTGCGCCGCTGATGACCTCTTCTTTCAAGGCCATATGCTCCCTCTTCGTCTCTCTTTCAGCTCTGATAATTCTCACAATCACTTCTTTTCTAAAAATTTGTCCATCAGGTCGGAGTCAATGGATCATAATATGTTGAGGTTTAGAAATGGAAGCAGCAGTAGCAGTAGCAGTAGAAGTTATTATTCAAGGTCATCAAGTATTAGTAACAATTCCATTTCCATTCCGACGAATTCAAAGCCGAGAACTCAAAAGAACGTTTTCCACTCTCACCCAAGTCCGACGCCCCAAATTAGATCCTTCTCGACTTCCGGCCGCCGGAGCCGGAGCTCTTCCCGTTGGGACTTTTTCCGGTTGGGCCTTCTCCGAACGCCGGGAATGGAACTTCACGACCTCAAAACTCGCACCAGCACTACCGCCACCGCCGCCGCGGCAGCACCCACGGCGGCGCAAAAAACAACGGGCTCGTTTCTGGGTGTGGTCAGCTGCAAAAAATCCGTGGATACAATACCGGCGCCGGCGAGTACGAGGAAGAACAGAAGCAATTGGAGTGAGAGTATTGCGAACAAAAGGAGTGAAAATGGAAAGGGAAATGGGATTAGGGAAAAAGTGATGGGGGAATTGAAGATGATGAAGAGATTACAGAAACAACAGAAGGTTTTGAATAGTAATAATAAAAATAATAATGATGATGAAATTAGCGAAAAGGAAAAGGAAAAGGAAAAGGAAACTCGGCTGTCACATCGTCGAACATTTGAATGGGTAAAGCAGCTCTCGCATGCAAGCTTTGCTGAAGAACAATGATCTTTAACCGTTTTTGACTTCTCACCAATTTAATTGCATTCCCC

Coding sequence (CDS)

ATGATGAAATATTATATAGAGAAATTCATCACAGTATCGAGACAGTGCCCCAAAAAAGAGGCCATGGGAAGAAGCTGTCATGGAGATGAAAGATGGGAAGAAGATGAAGAAGAAGCATTATCTTTCTGTGATCTTCCTGTAAAAGATCAAAAGCAGCAGCTTTTGATTGTAAATGAAAACCCCATCGAAGAGTCATCAGTCGCCCTCCAAACCGAAACGGAGGACTTTGATTTCAACCACTGGCCGCCGCCGCCGCCGCCCATGTGCGCCGCTGATGACCTCTTCTTTCAAGGCCATATGCTCCCTCTTCGTCTCTCTTTCAGCTCTGATAATTCTCACAATCACTTCTTTTCTAAAAATTTGTCCATCAGGTCGGAGTCAATGGATCATAATATGTTGAGGTTTAGAAATGGAAGCAGCAGTAGCAGTAGCAGTAGAAGTTATTATTCAAGGTCATCAAGTATTAGTAACAATTCCATTTCCATTCCGACGAATTCAAAGCCGAGAACTCAAAAGAACGTTTTCCACTCTCACCCAAGTCCGACGCCCCAAATTAGATCCTTCTCGACTTCCGGCCGCCGGAGCCGGAGCTCTTCCCGTTGGGACTTTTTCCGGTTGGGCCTTCTCCGAACGCCGGGAATGGAACTTCACGACCTCAAAACTCGCACCAGCACTACCGCCACCGCCGCCGCGGCAGCACCCACGGCGGCGCAAAAAACAACGGGCTCGTTTCTGGGTGTGGTCAGCTGCAAAAAATCCGTGGATACAATACCGGCGCCGGCGAGTACGAGGAAGAACAGAAGCAATTGGAGTGAGAGTATTGCGAACAAAAGGAGTGAAAATGGAAAGGGAAATGGGATTAGGGAAAAAGTGATGGGGGAATTGAAGATGATGAAGAGATTACAGAAACAACAGAAGGTTTTGAATAGTAATAATAAAAATAATAATGATGATGAAATTAGCGAAAAGGAAAAGGAAAAGGAAAAGGAAACTCGGCTGTCACATCGTCGAACATTTGAATGGGTAAAGCAGCTCTCGCATGCAAGCTTTGCTGAAGAACAATGA

Protein sequence

MMKYYIEKFITVSRQCPKKEAMGRSCHGDERWEEDEEEALSFCDLPVKDQKQQLLIVNENPIEESSVALQTETEDFDFNHWPPPPPPMCAADDLFFQGHMLPLRLSFSSDNSHNHFFSKNLSIRSESMDHNMLRFRNGSSSSSSSRSYYSRSSSISNNSISIPTNSKPRTQKNVFHSHPSPTPQIRSFSTSGRRSRSSSRWDFFRLGLLRTPGMELHDLKTRTSTTATAAAAAPTAAQKTTGSFLGVVSCKKSVDTIPAPASTRKNRSNWSESIANKRSENGKGNGIREKVMGELKMMKRLQKQQKVLNSNNKNNNDDEISEKEKEKEKETRLSHRRTFEWVKQLSHASFAEEQ
Homology
BLAST of Tan0016255 vs. NCBI nr
Match: XP_022947585.1 (uncharacterized protein LOC111451408 [Cucurbita moschata])

HSP 1 Score: 393.3 bits (1009), Expect = 2.3e-105
Identity = 233/341 (68.33%), Postives = 257/341 (75.37%), Query Frame = 0

Query: 22  MGRSCHGDE--RWEEDEEEALSFCDLPVKDQKQQLLIVNENPIEESSVALQTETEDFDFN 81
           M  + HG+E  + +EDEEEALS CDLPVK +KQQ  ++NENPI          TEDFDFN
Sbjct: 1   MATTIHGEEHDKQDEDEEEALSLCDLPVK-EKQQPSLLNENPI----------TEDFDFN 60

Query: 82  HWPPP--PPPMCAADDLFFQGHMLPLRLSFSSDNSHNHFFSKNLSIRSESMDHNMLRFRN 141
           HWPPP  PPPMCAADD+FFQGH+LPLRLS SSDN+HNHFFSK LS RSESMDHNMLRFRN
Sbjct: 61  HWPPPPSPPPMCAADDIFFQGHLLPLRLSVSSDNTHNHFFSKPLSARSESMDHNMLRFRN 120

Query: 142 GSSSSS--SSRSYYSRSSSISNNSISIPTNSKPRTQKNVFHSHPSPTPQIRSFSTSGR-- 201
           GSSSSS  SSRS+YSR SSISNNSISIPTNSKPRTQ NVFHSHPSPTPQIRSFST GR  
Sbjct: 121 GSSSSSSNSSRSHYSRCSSISNNSISIPTNSKPRTQNNVFHSHPSPTPQIRSFSTFGRRS 180

Query: 202 RSRSSSRWDFFRLGLLRTPGMELHDLKTRTSTTATAAAAAPTAAQKTTGSFLGVVSCKKS 261
           RSRSSSRWDFFR+GLLRTPGMELHDLKTRT    T + A PT  QKTT SFLGVVSCKKS
Sbjct: 181 RSRSSSRWDFFRVGLLRTPGMELHDLKTRT----TVSNAVPTVGQKTTASFLGVVSCKKS 240

Query: 262 VDTIPAPASTRKNRSNWSESIANKRSENGKGNGIREKVMGELKMMKRLQKQQKVLNSNNK 321
           V+TIPA     K   NWS +I  KR+E GKG GIREK +                     
Sbjct: 241 VETIPA----AKKIKNWSGNIGKKRNEKGKGIGIREKEL--------------------- 298

Query: 322 NNNDDEISEKEKEKEKETRLSHRRTFEWVKQLSHASFAEEQ 355
              +D +  +EKEKEK TRLSHRRTFEW+KQLSHA+FA++Q
Sbjct: 301 ---NDNVEIREKEKEKATRLSHRRTFEWLKQLSHATFADQQ 298

BLAST of Tan0016255 vs. NCBI nr
Match: KAG6605199.1 (hypothetical protein SDJN03_02516, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 387.1 bits (993), Expect = 1.6e-103
Identity = 233/345 (67.54%), Postives = 256/345 (74.20%), Query Frame = 0

Query: 22  MGRSCHGDERW-----EEDEEEALSFCDLPVKDQKQQLLIVNENPIEESSVALQTETEDF 81
           M  + HG E       +EDEEEALS CDLPVK +KQQ L++NENPI          TEDF
Sbjct: 1   MATTIHGGEEHDKQDEDEDEEEALSLCDLPVK-EKQQPLLLNENPI----------TEDF 60

Query: 82  DFNHWPPP--PPPMCAADDLFFQGHMLPLRLSFSSDNSHNHFFSKNLSIRSESMDHNMLR 141
           DFNHWPPP  PPPMCAADD+FFQGH+LPLRLS SSDN+HNHFFSK LS RSESMDHNMLR
Sbjct: 61  DFNHWPPPPSPPPMCAADDIFFQGHLLPLRLSVSSDNTHNHFFSKPLSARSESMDHNMLR 120

Query: 142 FRNGSSSSS--SSRSYYSRSSSISNNSISIPTNSKPRTQKNVFHSHPSPTPQIRSFSTSG 201
           FRNGSSSSS  SSRS+YSR SSISNNSISIPTNSKPRTQ NVFHSHPSPTPQIRSFST G
Sbjct: 121 FRNGSSSSSSNSSRSHYSRCSSISNNSISIPTNSKPRTQNNVFHSHPSPTPQIRSFSTFG 180

Query: 202 R--RSRSSSRWDFFRLGLLRTPGMELHDLKTRTSTTATAAAAAPTAAQKTTGSFLGVVSC 261
           R  RSRSSSRWDFFR+GLLRTPGMELHDLKTRT+ T     A PT  QKTT SFLGVVSC
Sbjct: 181 RRSRSRSSSRWDFFRVGLLRTPGMELHDLKTRTTVT----NAVPTVGQKTTASFLGVVSC 240

Query: 262 KKSVDTIPAPASTRKNRSNWSESIANKRSENGKGNGIREKVMGELKMMKRLQKQQKVLNS 321
           KKSV+TIPA    R    NWS +I  KR+E G+G GIREK +                  
Sbjct: 241 KKSVETIPAAKKIR----NWSGNIGKKRNEKGEGIGIREKEL------------------ 300

Query: 322 NNKNNNDDEISEKEKEKEKETRLSHRRTFEWVKQLSHA-SFAEEQ 355
                 +D +  +EKEKEK TRLSHRRTFEW+KQLSHA +FA++Q
Sbjct: 301 ------NDNVEIREKEKEKATRLSHRRTFEWLKQLSHATTFADQQ 302

BLAST of Tan0016255 vs. NCBI nr
Match: XP_023006931.1 (uncharacterized protein LOC111499574 [Cucurbita maxima])

HSP 1 Score: 380.9 bits (977), Expect = 1.2e-101
Identity = 230/341 (67.45%), Postives = 251/341 (73.61%), Query Frame = 0

Query: 22  MGRSCHGDE--------RWEEDEEEALSFCDLPVKDQKQQLLIVNENPIEESSVALQTET 81
           M  + HGDE          EE+EEEALS CDLPVK +KQQ L+++ENPI          T
Sbjct: 1   MATTIHGDEHDNQDEEDEEEEEEEEALSMCDLPVK-EKQQPLLLDENPI----------T 60

Query: 82  EDFDFNHWPPPPPPMCAADDLFFQGHMLPLRLSFSSDNSHNHFFSKNLSIRSESMDHNML 141
           EDFDFNHW  PPPPMCAADD+FFQGH+LPLRLS SSDN+HNHFFSK LS RSESMDHNML
Sbjct: 61  EDFDFNHW--PPPPMCAADDIFFQGHLLPLRLSVSSDNTHNHFFSKPLSARSESMDHNML 120

Query: 142 RFRNGSSSSS-SSRSYYSRSSSISNNSISIPTNSKPRTQKNVFHSHPSPTPQIRSFSTSG 201
           RFRNGSSSSS SSRS+YSR SSISNNSISIPTNSKPRTQ NVFHSHPSPTPQIRSFST G
Sbjct: 121 RFRNGSSSSSNSSRSHYSRCSSISNNSISIPTNSKPRTQNNVFHSHPSPTPQIRSFSTFG 180

Query: 202 RRSRSSSRWDFFRLGLLRTPGMELHDLKTRTSTTATAAAAAPTAAQKTTGSFLGVVSCKK 261
           R  RSSSRWDFFR+GLLRTPGMELHDLKTRT T   A  A PT  QKT G+FLGVVSCKK
Sbjct: 181 R--RSSSRWDFFRVGLLRTPGMELHDLKTRT-TNGAAFEAEPTVGQKTRGTFLGVVSCKK 240

Query: 262 SVDTIPAPASTRKNRSNWSESIANKRSENGKGNGIREKVMGELKMMKRLQKQQKVLNSNN 321
           SVDTIPA     K   NWS +I  KR+E GKG GIRE                       
Sbjct: 241 SVDTIPA----AKKIKNWSGNIVKKRNEKGKGIGIRE----------------------- 297

Query: 322 KNNNDDEISEKEKEKEKETRLSHRRTFEWVKQLSHASFAEE 354
            N  +D +  +EKEKEK TRLSHRRTFEW+KQLSHA+FA++
Sbjct: 301 -NELNDNVEIREKEKEKATRLSHRRTFEWLKQLSHATFADQ 297

BLAST of Tan0016255 vs. NCBI nr
Match: XP_023534091.1 (uncharacterized protein LOC111795757 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 380.6 bits (976), Expect = 1.5e-101
Identity = 234/347 (67.44%), Postives = 258/347 (74.35%), Query Frame = 0

Query: 22  MGRSCHG------DERWEEDEEEALSFCDLPVKDQKQQLLIVNENPIEESSVALQTETED 81
           M  + HG      DE  ++DEEEALS CDLPVK +KQQ L++NENPI        TE  D
Sbjct: 1   MATTIHGEEHDKQDEDEDDDEEEALSLCDLPVK-EKQQPLLLNENPI--------TEDFD 60

Query: 82  FDFNHWPPP--PPPMCAADDLFFQGHMLPLRLSFSSDNSHNHFFSKNLSIRSESMDHNML 141
           F+FNHWPPP  PPPMCAADD+FFQGH+LPLRLS SSDN+HN FFSK LS RSESMDHNML
Sbjct: 61  FNFNHWPPPPSPPPMCAADDIFFQGHLLPLRLSVSSDNTHNQFFSKPLSARSESMDHNML 120

Query: 142 RFRNGSSSSS-SSRSYYSRSSSISNNSISIPTNSKPRTQKNVFHSHPSPTPQIRSFSTSG 201
           RFRNGSSSSS SSRS+YSR SSISNNSISIPTNSKPRTQ NVFHSHPSPTPQIRSFST G
Sbjct: 121 RFRNGSSSSSNSSRSHYSRCSSISNNSISIPTNSKPRTQNNVFHSHPSPTPQIRSFSTFG 180

Query: 202 R--RSRSSSRWDFFRLGLLRTPGMELHDLKTRTSTTATAAAAAPTAAQKTTGSFLGVVSC 261
           R  RSRSSSRWDFFR+G+LRTPGMELHDLKTRT T A A  AAPT  QKTT SFLGVVSC
Sbjct: 181 RRSRSRSSSRWDFFRVGVLRTPGMELHDLKTRT-TNAGAFEAAPTVGQKTTASFLGVVSC 240

Query: 262 KKSVDTIPAPASTRKNRSNWSESIANKRSENGK--GNGIREKVMGELKMMKRLQKQQKVL 321
           KKSV+TIPA    R    NWS +I  KR+E GK  G GIREK +                
Sbjct: 241 KKSVETIPAAKKIR----NWSGNIVKKRNEKGKGIGIGIREKEL---------------- 300

Query: 322 NSNNKNNNDDEISEKEKEKEKETRLSHRRTFEWVKQLSHA-SFAEEQ 355
                   +D +  +EKEKEK TRLSHRRTFEW+KQLSHA +FA++Q
Sbjct: 301 --------NDNVEIREKEKEKATRLSHRRTFEWLKQLSHATTFADQQ 309

BLAST of Tan0016255 vs. NCBI nr
Match: XP_038902148.1 (uncharacterized protein LOC120088781 [Benincasa hispida])

HSP 1 Score: 355.9 bits (912), Expect = 4.0e-94
Identity = 219/346 (63.29%), Postives = 242/346 (69.94%), Query Frame = 0

Query: 22  MGRSCHGDERWE---------EDEEEALSFCDLPVKDQKQQLLIVNENPIEESSVALQTE 81
           MGRS HGDERWE         E+EEEALSFCDLPVK+++Q        P+  +S A+  E
Sbjct: 4   MGRSFHGDERWEEADKEETEIEEEEEALSFCDLPVKEKQQ--------PMRSASAAV--E 63

Query: 82  TEDFDFNHWPPPPPPMCAADDLFFQGHMLPLRLSFSSDNSHN----HFFSKNLSIRSESM 141
           TEDFDFNHW PPPPPM AAD+LFFQG MLPLRLSFSS+NS+N    H F  NL  RSESM
Sbjct: 64  TEDFDFNHWRPPPPPMFAADELFFQGQMLPLRLSFSSENSNNNNIHHLFGGNLWQRSESM 123

Query: 142 DHNMLRFRNGSSSSSSSRSYYSRSSSISNNSISIPTNSKPRTQKNVFHSHPSPTPQIRSF 201
           DHNMLRF NGSSSSSSSRS+YSRSSS+SNNS+SIPTNSK R QKNVFHSHPSPTPQIRSF
Sbjct: 124 DHNMLRFGNGSSSSSSSRSHYSRSSSVSNNSVSIPTNSKARPQKNVFHSHPSPTPQIRSF 183

Query: 202 STSGRRSRSSSRWDFFRLGLLRTPGMELHDLKTRTSTTATAAAAAPTAAQKTTGSFLGVV 261
           S S  RSRSSSRW+FFRLGLLRTPGMELHDLKTRT+T     AAA T   KTT S LGVV
Sbjct: 184 SNSSHRSRSSSRWEFFRLGLLRTPGMELHDLKTRTTT-----AAAATTTHKTTTSILGVV 243

Query: 262 SCKKSVDTIPAPASTRKNRSNWSESIANKRSENGKGNGIREKVMGELKMMKRLQKQQKVL 321
           SCK+SVDT+    +  KNR N +    NK+ E      IREK                  
Sbjct: 244 SCKRSVDTV--ATTVAKNRRNENVKKNNKKVE------IREKE----------------- 297

Query: 322 NSNNKNNNDDEISEKEKEKEKETRLSHRRTFEWVKQLSHASFAEEQ 355
                        EKEKEKEKE R+SHRRTFEW+KQLSHA+F EEQ
Sbjct: 304 ------------KEKEKEKEKERRVSHRRTFEWLKQLSHATFGEEQ 297

BLAST of Tan0016255 vs. ExPASy TrEMBL
Match: A0A6J1G7B7 (uncharacterized protein LOC111451408 OS=Cucurbita moschata OX=3662 GN=LOC111451408 PE=4 SV=1)

HSP 1 Score: 393.3 bits (1009), Expect = 1.1e-105
Identity = 233/341 (68.33%), Postives = 257/341 (75.37%), Query Frame = 0

Query: 22  MGRSCHGDE--RWEEDEEEALSFCDLPVKDQKQQLLIVNENPIEESSVALQTETEDFDFN 81
           M  + HG+E  + +EDEEEALS CDLPVK +KQQ  ++NENPI          TEDFDFN
Sbjct: 1   MATTIHGEEHDKQDEDEEEALSLCDLPVK-EKQQPSLLNENPI----------TEDFDFN 60

Query: 82  HWPPP--PPPMCAADDLFFQGHMLPLRLSFSSDNSHNHFFSKNLSIRSESMDHNMLRFRN 141
           HWPPP  PPPMCAADD+FFQGH+LPLRLS SSDN+HNHFFSK LS RSESMDHNMLRFRN
Sbjct: 61  HWPPPPSPPPMCAADDIFFQGHLLPLRLSVSSDNTHNHFFSKPLSARSESMDHNMLRFRN 120

Query: 142 GSSSSS--SSRSYYSRSSSISNNSISIPTNSKPRTQKNVFHSHPSPTPQIRSFSTSGR-- 201
           GSSSSS  SSRS+YSR SSISNNSISIPTNSKPRTQ NVFHSHPSPTPQIRSFST GR  
Sbjct: 121 GSSSSSSNSSRSHYSRCSSISNNSISIPTNSKPRTQNNVFHSHPSPTPQIRSFSTFGRRS 180

Query: 202 RSRSSSRWDFFRLGLLRTPGMELHDLKTRTSTTATAAAAAPTAAQKTTGSFLGVVSCKKS 261
           RSRSSSRWDFFR+GLLRTPGMELHDLKTRT    T + A PT  QKTT SFLGVVSCKKS
Sbjct: 181 RSRSSSRWDFFRVGLLRTPGMELHDLKTRT----TVSNAVPTVGQKTTASFLGVVSCKKS 240

Query: 262 VDTIPAPASTRKNRSNWSESIANKRSENGKGNGIREKVMGELKMMKRLQKQQKVLNSNNK 321
           V+TIPA     K   NWS +I  KR+E GKG GIREK +                     
Sbjct: 241 VETIPA----AKKIKNWSGNIGKKRNEKGKGIGIREKEL--------------------- 298

Query: 322 NNNDDEISEKEKEKEKETRLSHRRTFEWVKQLSHASFAEEQ 355
              +D +  +EKEKEK TRLSHRRTFEW+KQLSHA+FA++Q
Sbjct: 301 ---NDNVEIREKEKEKATRLSHRRTFEWLKQLSHATFADQQ 298

BLAST of Tan0016255 vs. ExPASy TrEMBL
Match: A0A6J1KZ54 (uncharacterized protein LOC111499574 OS=Cucurbita maxima OX=3661 GN=LOC111499574 PE=4 SV=1)

HSP 1 Score: 380.9 bits (977), Expect = 5.6e-102
Identity = 230/341 (67.45%), Postives = 251/341 (73.61%), Query Frame = 0

Query: 22  MGRSCHGDE--------RWEEDEEEALSFCDLPVKDQKQQLLIVNENPIEESSVALQTET 81
           M  + HGDE          EE+EEEALS CDLPVK +KQQ L+++ENPI          T
Sbjct: 1   MATTIHGDEHDNQDEEDEEEEEEEEALSMCDLPVK-EKQQPLLLDENPI----------T 60

Query: 82  EDFDFNHWPPPPPPMCAADDLFFQGHMLPLRLSFSSDNSHNHFFSKNLSIRSESMDHNML 141
           EDFDFNHW  PPPPMCAADD+FFQGH+LPLRLS SSDN+HNHFFSK LS RSESMDHNML
Sbjct: 61  EDFDFNHW--PPPPMCAADDIFFQGHLLPLRLSVSSDNTHNHFFSKPLSARSESMDHNML 120

Query: 142 RFRNGSSSSS-SSRSYYSRSSSISNNSISIPTNSKPRTQKNVFHSHPSPTPQIRSFSTSG 201
           RFRNGSSSSS SSRS+YSR SSISNNSISIPTNSKPRTQ NVFHSHPSPTPQIRSFST G
Sbjct: 121 RFRNGSSSSSNSSRSHYSRCSSISNNSISIPTNSKPRTQNNVFHSHPSPTPQIRSFSTFG 180

Query: 202 RRSRSSSRWDFFRLGLLRTPGMELHDLKTRTSTTATAAAAAPTAAQKTTGSFLGVVSCKK 261
           R  RSSSRWDFFR+GLLRTPGMELHDLKTRT T   A  A PT  QKT G+FLGVVSCKK
Sbjct: 181 R--RSSSRWDFFRVGLLRTPGMELHDLKTRT-TNGAAFEAEPTVGQKTRGTFLGVVSCKK 240

Query: 262 SVDTIPAPASTRKNRSNWSESIANKRSENGKGNGIREKVMGELKMMKRLQKQQKVLNSNN 321
           SVDTIPA     K   NWS +I  KR+E GKG GIRE                       
Sbjct: 241 SVDTIPA----AKKIKNWSGNIVKKRNEKGKGIGIRE----------------------- 297

Query: 322 KNNNDDEISEKEKEKEKETRLSHRRTFEWVKQLSHASFAEE 354
            N  +D +  +EKEKEK TRLSHRRTFEW+KQLSHA+FA++
Sbjct: 301 -NELNDNVEIREKEKEKATRLSHRRTFEWLKQLSHATFADQ 297

BLAST of Tan0016255 vs. ExPASy TrEMBL
Match: A0A6J1JCH9 (probable membrane-associated kinase regulator 1 OS=Cucurbita maxima OX=3661 GN=LOC111483795 PE=4 SV=1)

HSP 1 Score: 347.8 bits (891), Expect = 5.3e-92
Identity = 218/342 (63.74%), Postives = 242/342 (70.76%), Query Frame = 0

Query: 22  MGRSCHGDERWEE-------DEEEALSFCDLPVKDQKQQLLIVNENPIEESSVALQTETE 81
           MGRSCHGDE WEE       ++EEALSFCDLP+K + Q  L +NE PI  S+     ++E
Sbjct: 1   MGRSCHGDETWEEQNQIQEQEDEEALSFCDLPLK-ETQPPLNLNETPIRSSAA---VQSE 60

Query: 82  DFDFNHWPPP-PPPMCAADDLFFQGHMLPLRLSFSSDNSHNH-FFSKNLSIRSESMDHNM 141
           DFDFNH PPP P PMCAAD++FFQGH+LPL  SFSS+NSHN+ FF +N S RSES D  M
Sbjct: 61  DFDFNHCPPPLPQPMCAADEVFFQGHILPLCHSFSSENSHNNPFFPRNSSTRSESSD--M 120

Query: 142 LRFRNGSSSSSSSRSYYSRSSSISNNSISIPTNSKPRTQKNVFHSHPSPTPQIRSFSTSG 201
           LRFRNGS+SSSSSRS+YSRSSSISNNSISIPTNSKPR   NVFHSHPSPTPQIRS STSG
Sbjct: 121 LRFRNGSTSSSSSRSHYSRSSSISNNSISIPTNSKPRPHNNVFHSHPSPTPQIRSHSTSG 180

Query: 202 RRSRSSSRWDFFRLGLLRTPGMELHDLKTRTSTTATAAAAAPTAAQKTTGSFLGVVSCKK 261
           RRSRSSSRWDFFRLGLLRTPGMELHDLKTRT+  +++AAAA  AA  T GSFLGVVSCKK
Sbjct: 181 RRSRSSSRWDFFRLGLLRTPGMELHDLKTRTN--SSSAAAATPAAHTTAGSFLGVVSCKK 240

Query: 262 SVDTIPAPASTRKNRSNWSESIANKRSENGKGNGIREKVMGELKMMKRLQKQQKVLNSNN 321
           SVDT+ A                 KRSENGK     EK                      
Sbjct: 241 SVDTVAAAGK-------------KKRSENGK-----EK---------------------- 285

Query: 322 KNNNDDEISEKEKEKEKETRLSHRRTFEWVKQLSHASFAEEQ 355
                    EKEKEKE+ETR+SHRRTFEWVKQLSHAS  +EQ
Sbjct: 301 ---------EKEKEKERETRVSHRRTFEWVKQLSHASSGDEQ 285

BLAST of Tan0016255 vs. ExPASy TrEMBL
Match: A0A6J1F3H1 (probable membrane-associated kinase regulator 1 OS=Cucurbita moschata OX=3662 GN=LOC111439472 PE=4 SV=1)

HSP 1 Score: 346.7 bits (888), Expect = 1.2e-91
Identity = 212/342 (61.99%), Postives = 238/342 (69.59%), Query Frame = 0

Query: 22  MGRSCHGDERWEE-------DEEEALSFCDLPVKDQKQQLLIVNENPIEESSVALQTETE 81
           MGRSCHGDE WEE       ++EEALSFCDLP+K + Q  L +NE PI  S+     ++E
Sbjct: 1   MGRSCHGDETWEEQNQIQEQEDEEALSFCDLPLK-ETQPPLNLNETPIRSSAA---VQSE 60

Query: 82  DFDFNHWPPP-PPPMCAADDLFFQGHMLPLRLSFSSDNSHNH-FFSKNLSIRSESMDHNM 141
           DFDFNH PPP P PMCAAD++FFQGH+LPLR SFSS+NSHN+ FF +N S RSES D  M
Sbjct: 61  DFDFNHCPPPLPQPMCAADEVFFQGHILPLRHSFSSENSHNNPFFPRNSSTRSESSD--M 120

Query: 142 LRFRNGSSSSSSSRSYYSRSSSISNNSISIPTNSKPRTQKNVFHSHPSPTPQIRSFSTSG 201
           LRFRNGS+SSSSSRS+YSRSSS+SNNSISIPTNSKPR   NVFHSHPSPTPQIRS STSG
Sbjct: 121 LRFRNGSTSSSSSRSHYSRSSSLSNNSISIPTNSKPRPHNNVFHSHPSPTPQIRSHSTSG 180

Query: 202 RRSRSSSRWDFFRLGLLRTPGMELHDLKTRTSTTATAAAAAPTAAQKTTGSFLGVVSCKK 261
           RRSRSSSRWDFFRLGLLRTPGMELHDLKTRT+++++A AAA  AA  T GSFLGVVSCKK
Sbjct: 181 RRSRSSSRWDFFRLGLLRTPGMELHDLKTRTNSSSSATAAATPAAHTTAGSFLGVVSCKK 240

Query: 262 SVDTIPAPASTRKNRSNWSESIANKRSENGKGNGIREKVMGELKMMKRLQKQQKVLNSNN 321
           SVDT+ A                 KRSENG                              
Sbjct: 241 SVDTVAAAGK-------------KKRSENG------------------------------ 283

Query: 322 KNNNDDEISEKEKEKEKETRLSHRRTFEWVKQLSHASFAEEQ 355
                     KE EKE+ TR+SHRRTFEWVKQLSHAS  +EQ
Sbjct: 301 ----------KENEKERGTRVSHRRTFEWVKQLSHASSGDEQ 283

BLAST of Tan0016255 vs. ExPASy TrEMBL
Match: A0A0A0LPT6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G277620 PE=4 SV=1)

HSP 1 Score: 333.6 bits (854), Expect = 1.0e-87
Identity = 207/350 (59.14%), Postives = 238/350 (68.00%), Query Frame = 0

Query: 24  RSCHGDERW-----------------EEDEEEALSFCDLPVKDQKQQLLIVNENPIEESS 83
           RSCHGDER                  EE+EEEALS CDLPVK+++Q        P    S
Sbjct: 7   RSCHGDERCEEEDKEETQIDDDDDEEEEEEEEALSLCDLPVKEKQQ--------PTRSVS 66

Query: 84  VALQTETEDFDFNHWPPPPPPMCAADDLFFQGHMLPLRLSFSSDNSHNHFFSKNLSIRSE 143
             +    +DFDFNHW PPP PM  ADDLFFQGHMLPLRLSFSS+NS N+  + NL  RSE
Sbjct: 67  TTVVETDQDFDFNHWRPPPSPMLTADDLFFQGHMLPLRLSFSSENSQNN--NGNLWCRSE 126

Query: 144 SMD-HNMLRFRNGSSSSSSSRSYYSRSSSISNNSISIPTNSKPR-TQKNVFHSHPSPTPQ 203
           SMD +NMLRFRN S+SSSSSRS+YSRSSS+SNNSISIPTNSKPR +  NVFHSHPSPTPQ
Sbjct: 127 SMDGNNMLRFRNESTSSSSSRSHYSRSSSLSNNSISIPTNSKPRPSNNNVFHSHPSPTPQ 186

Query: 204 IRSFSTSGRRSRSSSRWDFFRLGLLRTPGMELHDLKTRTSTTATAAAAAPTAAQKTTGSF 263
           IRSFSTS  RSRSSSRW+FFRLGLLRTPGMELHDLKTRT+TT T      T A KTT S 
Sbjct: 187 IRSFSTSSHRSRSSSRWEFFRLGLLRTPGMELHDLKTRTTTTTTTTTTTST-AHKTTASI 246

Query: 264 LGVVSCKKSVDTIPAPASTRKNRSNWSESIANKRSENGKGNGIREKVMGELKMMKRLQKQ 323
           LGVVSCK+SV+T+P                    +  G  N IR               +
Sbjct: 247 LGVVSCKRSVETVP--------------------TTTGSKNRIR---------------R 306

Query: 324 QKVLNSNNKNNNDDEISEKEKEKEKETRLSHRRTFEWVKQLSHASFAEEQ 355
           + VL +N KNN+D+++  +EKEKEKE R+SHRRTFEW+KQLSHA+F EEQ
Sbjct: 307 ENVLENNKKNNDDNKVEIREKEKEKERRVSHRRTFEWLKQLSHATFGEEQ 310

BLAST of Tan0016255 vs. TAIR 10
Match: AT5G67350.1 (unknown protein; Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 67.8 bits (164), Expect = 2.0e-11
Identity = 94/324 (29.01%), Postives = 146/324 (45.06%), Query Frame = 0

Query: 29  DERWEEDEEEALSFCDLPVKDQKQQLLIVNENPIEESSVALQTETEDFDFNHWPPPPPPM 88
           D+  EE+EEEALS CDLP +  + + ++  E+   +S       +     +    P P M
Sbjct: 24  DDVEEEEEEEALSLCDLPNEKGELRSIVKEEDEEFDSGFEFGIGSSFRAGSDSCEPAPEM 83

Query: 89  CAADDLFFQGHMLPLRLSFSSDNSHNHFFSKNLSIRSESMDHNMLRFRNGSSSSSSSRSY 148
             AD+LFF+G +LPLR S S D   N      L  RSES++     FR      S  +  
Sbjct: 84  STADELFFKGRILPLRHSVSLDAGLNE--PTRLITRSESVE-----FRRTGIIRSDRK-- 143

Query: 149 YSRSSSISNNSISIPTNSKPRTQKNVFHSHPSPTPQIRSFST------SGRRSRSSSRWD 208
                 I NN I               +S PSP PQIR  S+      S R  +SSS WD
Sbjct: 144 ------IKNNFID--------------YSQPSPQPQIRRSSSMTARVNSIRNPKSSSIWD 203

Query: 209 FFRLGLLRTPGMELHDLKTRTSTTATAAAAAPTAAQKTTGSFLGVVSCKKSVDTIPAPAS 268
           F RLGL+RTP +EL           TA  A  + ++ ++ S     S  K + +  + + 
Sbjct: 204 FLRLGLVRTPEIELR---------TTAGNAKLSVSRNSSCSSTSTSSNSKKIGSGESRSR 263

Query: 269 TRKNRSNWSESIANKRSENGKGNGIREKV-MGELKMMKRLQKQQKVLNSNNKNNNDDEIS 328
            R+    +S+   +  +E  K   ++ KV  GE +  +R+                  + 
Sbjct: 264 NRRRSFMFSDCKCSVSTET-KMAPVKIKVSSGETEEKQRM------------------ME 290

Query: 329 EKEKEKEKETRLSHRRTFEWVKQL 346
           +K  +KE+++ ++ +RTFEW+ Q+
Sbjct: 324 KKTAKKEEKSAMARKRTFEWLSQV 290

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_022947585.12.3e-10568.33uncharacterized protein LOC111451408 [Cucurbita moschata][more]
KAG6605199.11.6e-10367.54hypothetical protein SDJN03_02516, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_023006931.11.2e-10167.45uncharacterized protein LOC111499574 [Cucurbita maxima][more]
XP_023534091.11.5e-10167.44uncharacterized protein LOC111795757 [Cucurbita pepo subsp. pepo][more]
XP_038902148.14.0e-9463.29uncharacterized protein LOC120088781 [Benincasa hispida][more]
Match NameE-valueIdentityDescription
A0A6J1G7B71.1e-10568.33uncharacterized protein LOC111451408 OS=Cucurbita moschata OX=3662 GN=LOC1114514... [more]
A0A6J1KZ545.6e-10267.45uncharacterized protein LOC111499574 OS=Cucurbita maxima OX=3661 GN=LOC111499574... [more]
A0A6J1JCH95.3e-9263.74probable membrane-associated kinase regulator 1 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A6J1F3H11.2e-9161.99probable membrane-associated kinase regulator 1 OS=Cucurbita moschata OX=3662 GN... [more]
A0A0A0LPT61.0e-8759.14Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G277620 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G67350.12.0e-1129.01unknown protein; Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0... [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 298..318
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 259..280
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 134..195
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 306..337
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 316..337
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 259..287
NoneNo IPR availablePANTHERPTHR33922OS01G0888066 PROTEIN-RELATEDcoord: 28..350
NoneNo IPR availablePANTHERPTHR33922:SF2OS01G0888066 PROTEINcoord: 28..350

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0016255.1Tan0016255.1mRNA