Tan0017511 (gene) Snake gourd v1

Overview
Name: Tan0017511
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Golgin family A protein
Location: LG05: 844128 .. 848066 (-)
RNA-Seq Expression: Tan0017511
Synteny: Tan0017511
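The location field packs the linkage group, the start and end coordinates, and the strand into one string. A minimal sketch of how it could be parsed is shown here; the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(text):
    """Parse a location string such as 'LG05: 844128 .. 848066 (-)'."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based, inclusive coordinates.
        "length": end - start + 1,
    }

loc = parse_location("LG05: 844128 .. 848066 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 3939 bp on the minus strand of LG05.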
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CCTCTTTCTTCCTTCCTTTAATTTTTTTTTCCTCTGTTTTTTCCTTACTGGGTTCGAGATCTCTTTGAGTCAAGACTGCCCAAAAATAGGATCTTTAATAATTTCTTTCTTTTTTTTTCTTCGCTGAATTCAAGGCTAACTGTTCGAAACTTGCCTACTATAAAGCTAAAACCATCTCACTCTCACATGAACTTCACCTCAATTTCTGCAACACCTACCGCCGAGTAAGTAAACTCTTTACAAATTTCTTCAATCTCTCGGAATTTCCTTTCTTTTTTTGTTTTTCGCATTTCGTAAGAGTTGTGTGTATTTTTTTTTTGGAACTTGGGTTTTTGAAGCATAAATAATGTCATGGCCAGAGCAGAAAACAGAGAAGAGATGTAAGATTAGAAAACGAGGGTGTTCATCGTCTTCTTCTTCTTCGACTTTGGTCTTTAAGTACAGATTAAAGAAGACCCCCACGTGGAAAATGAGCACACAATCTCATTCTTCCAAGTTATCCACCGGCGACCATCCCAACCGGTCACCGTCGTGGTCTCTCGACGGGGGCGGAAAAGGGAAGGAAGCCTCTGTTTCCGTTTCGGTTTCAGTCTCAGGCCGGAAAACGACGGCCAACAATAATTCCCAGAAGTTGAAGAACAATTCAGATGTTGTTGAAGATAAGAAAGAGGTGATGAAAACCCGAGATTTGGTTTCCCAGATCTCGCATTCGTGTTTATCAGATCCGGATCCGAGCTGCAACAACACGAATTCAGAGGTGACTTTTTGATTTCGATTACGAATCGATGAATAATATGCATTGGGAATTGTGTTAATTAGAAACTAGAATACGCTTTTTGGATTTGGGTATTGAAATCTGAGAAAAAGTGGGGTTTGATTGAATAAATAATTTTTTTTTTTTTTAAAAAATGCAGAAAGTTGAAGGTGGCAGAGTACACAGAAGAAGAAGGTCGGCATCTTCTTTGAGGATTGGGATAGGGGAAGTGGGTGGTTCGAATTTTCATGGCAATGACTGTTTAATGGAGGTACGATTCATCTATTATTGCTTCCTTTCCACGAAATCATAATTCTTCATATATTTTTTTAATATTTTATTTTTTTTTTTCTTTTTTCTTTTTCAATCCCATCAGCCATCACTTGTTTTGGTTGCTGTCATTTTCCACTTGTACTTCTGCACCAACCTTTTTTTCTTTCCTTTTCACATTCGAATCTTCAATTTTTAAAAAATATAATTCTAGAAAGAATCTAAATGAAGAATTAATTTTCACTGTAATTATGCCTACGGCTTACCATATATAGAAGAAATAAATAATAACTTCTTTTTTCGTTTGATTCCACAGTTTAATTTTAGTTTCTCTTTATAGTTTGATATACGTTTCAATTTAATCTCTATCCTTTTAAGAATTTCAATTGAATCCTATGTCAAATTAACACCTCATCTCAAAATAGTATTTCATTTAAAACTGCTCATTTTTTCTCTATATTCTATTCATCTTTAGAACTCGATCAAATAGAATAGGGTCGCTAACCCATAATGACAGAGATCATCGATGATGATTTGATTAAATCATGAACCAATATGAACTTTTAAGATAACAAACATATAGGGACCACGTTAAAAGTTTAAATCCAAATTCAGACTTTTTTATTTTGGTCCTCGTTGTAACTTGCAAGGAAGCACAAATTTCTAACTTGAAATCTCAGTGTAATAAATGTGCACAGAGGCACCGATATTGTGAGAGAGACCATAAAGCTATTGCCGTATTCAAACTCCATTATTTGAATCTGTAGCGGTCCATTTCATTAAATTTCTAAAATTTGTAATATACAGCTTGATTAATTTAATGATGACAAGTGAGTGGGGAGTCTATTTTTTTTAGAAGGAAATTGGACATTTCGATTTTTGTTTAGACAGTTTTGTTCGAAAAACTATTAAAAAATTAAATCGAATCGAAATGTATATATACCATAAAAAGTCCTAAATTGAATCCTAAATCATT
CTTGCACCGATTTATCCGTTTTCTTTCAATTAATTTTTAAGTATAAATTGAGATTGTTAGCTCTATGAATTCTCATTAAGGTGTTAGAGAAATTTTGAGGTAAAAAAAATACTAAAATAAGAAATGGAACTAAGAAATAAAATAGCATAAAATTTGTAAAAAAAAAGGAAAACTGAAAATGGAACATTGATTTGGTTCCTTTCTTACTAAAATCTAGATCAGTTTGGTTCTCGATTTGGTCTAAAATCAATGAATATCGAACTAATATCACTCTATTTTTTATATTTTCGAAGGGCTCTGATTTTGTAAGTGGTTTGTAATGGTATTAATAATGGAATGCAGATTGAAAATCCAAGTCATCAAGGAAAAACGACACGTCGAAAGACAAAATTTACGCTAAAAACACGTTTGAAGGAAGTGAGCAATTGCCTGACGACATCAAAGGAGCTTTTAAGAGTTCTAAACCATGTATGGGGTCATGAAGATCACGATCAACAGCGCCCATCTTCGACTTTATCTCTGATTACAGCTCTGAAGTCAGAGCTTGATCGAGCCAAGACCCGAGTTGATCATTTGATCAAAGACCAGAACTTTCATGGCGATGAAATCGAGCAACTAATGAAGCGATTTGCAGAGGAAAAGGCAGGTTGGAAGTACAGGGAGAGAGCAAGAGTTCGGAGCGCCATTACTTCAATGGCGGATGAAGTTGAGGTTGAGAAGAAACTTAGAAGACAAGCTGAGAGACTGAACAAGAGGATTGCCAAAGAACTTGCCGATGCTAAAGTTTCACTTTCGAAAGCAATGAAAGATCTTCAAAGGTAATAATTTCGGAGTTGTGTTATAGTTGTGGTAAAAATTTGAAGGTGGGTGATACATTATTGTTTGCAATTGCATTGAAGGGAGAGGAGAGCAAAGGAGATATTCGAGGAGATATGTGATGAATTAGCGAAAGGGATTGGAGAAGACAGAGCCCAATTCGAGGAGCTAAAGAAGGAATCAGCAAAAGTAAGAGAAGAAGTGGAAAAAGAAAGAGAAATGCTTCAATTAGCAGATGTGTTACGAGAGGAAAGAGTTCAAATGAAGCTTTCGGAGGCCAAATATCAGTTCGAAGAGAAAAACGCAGCAGTGGAAAAGCTGAAAGACGAGCTGGAAGCGTATTTGATAACCCAACAAGAAGATTACTGTTGCAACAAATTTGAAAAAATCAAAGAGCTGGAAGCATATTTGAAGAAAATAAATTTTGGGTCATGTCAAGAACAGAACATGGAAGTGGGAGAAGAAGATGATTGTTCAGAAGAGGACGACAGCGATCTCCATTCAATTGAACTCAACATGGACAACAACAACAAGAGTTACAGATGGAGTTTCGTCCATGGCTCTAAAAGTAAAAGGAACTCATTCGAAATTAAGGATCAAAACCAAAACCAAACCAATGCAAGAAAATCCCTTTCAGAAAAAATCCAATGGGGAAGTATTTGCTTGAATCGGAAATCATCCAACGAGTTTCTTGGTCGAAAGAGCCACGAAAATTCCGAGAGATTTGACTGGGAGAGATTTACAGAGCTTTTTACACAATCAACAACACACAAACAAGATCTTGATCTTGATCTTGATGTTGATGTTCATCAATTAAACGAGGGGGATAATGAAATTACACACAAGATCAATAATACTAAATCTGTGAAGTGCCTTAGGGATATTTTATTTCCAGGATCAGCCGAAGATCAAAATCAAGTTGCTAAAACGGAGGAGGATGGAATTGCTATGAATGTTCTCCAAATCGATGAAGAAGCTTCTTCAGTGGTGACGAAGGGATAAATTTTACGTTTAAAACCATTCTCTTAATTCATTTCTTTTGATTCAGAAGTGCTTTGCTTAACAATCTTTTCTTCTTCATATTATATTTAAATGCTTCTAATTTTGGCTAAAGACCCAATAC

mRNA sequence

CCTCTTTCTTCCTTCCTTTAATTTTTTTTTCCTCTGTTTTTTCCTTACTGGGTTCGAGATCTCTTTGAGTCAAGACTGCCCAAAAATAGGATCTTTAATAATTTCTTTCTTTTTTTTTCTTCGCTGAATTCAAGGCTAACTGTTCGAAACTTGCCTACTATAAAGCTAAAACCATCTCACTCTCACATGAACTTCACCTCAATTTCTGCAACACCTACCGCCGAAAAACAGAGAAGAGATGTAAGATTAGAAAACGAGGGTGTTCATCGTCTTCTTCTTCTTCGACTTTGGTCTTTAAGTACAGATTAAAGAAGACCCCCACGTGGAAAATGAGCACACAATCTCATTCTTCCAAGTTATCCACCGGCGACCATCCCAACCGGTCACCGTCGTGGTCTCTCGACGGGGGCGGAAAAGGGAAGGAAGCCTCTGTTTCCGTTTCGGTTTCAGTCTCAGGCCGGAAAACGACGGCCAACAATAATTCCCAGAAGTTGAAGAACAATTCAGATGTTGTTGAAGATAAGAAAGAGGTGATGAAAACCCGAGATTTGGTTTCCCAGATCTCGCATTCGTGTTTATCAGATCCGGATCCGAGCTGCAACAACACGAATTCAGAGAAAGTTGAAGGTGGCAGAGTACACAGAAGAAGAAGGTCGGCATCTTCTTTGAGGATTGGGATAGGGGAAGTGGGTGGTTCGAATTTTCATGGCAATGACTGTTTAATGGAGATTGAAAATCCAAGTCATCAAGGAAAAACGACACGTCGAAAGACAAAATTTACGCTAAAAACACGTTTGAAGGAAGTGAGCAATTGCCTGACGACATCAAAGGAGCTTTTAAGAGTTCTAAACCATGTATGGGGTCATGAAGATCACGATCAACAGCGCCCATCTTCGACTTTATCTCTGATTACAGCTCTGAAGTCAGAGCTTGATCGAGCCAAGACCCGAGTTGATCATTTGATCAAAGACCAGAACTTTCATGGCGATGAAATCGAGCAACTAATGAAGCGATTTGCAGAGGAAAAGGCAGGTTGGAAGTACAGGGAGAGAGCAAGAGTTCGGAGCGCCATTACTTCAATGGCGGATGAAGTTGAGGTTGAGAAGAAACTTAGAAGACAAGCTGAGAGACTGAACAAGAGGATTGCCAAAGAACTTGCCGATGCTAAAGTTTCACTTTCGAAAGCAATGAAAGATCTTCAAAGGGAGAGGAGAGCAAAGGAGATATTCGAGGAGATATGTGATGAATTAGCGAAAGGGATTGGAGAAGACAGAGCCCAATTCGAGGAGCTAAAGAAGGAATCAGCAAAAGTAAGAGAAGAAGTGGAAAAAGAAAGAGAAATGCTTCAATTAGCAGATGTGTTACGAGAGGAAAGAGTTCAAATGAAGCTTTCGGAGGCCAAATATCAGTTCGAAGAGAAAAACGCAGCAGTGGAAAAGCTGAAAGACGAGCTGGAAGCGTATTTGATAACCCAACAAGAAGATTACTGTTGCAACAAATTTGAAAAAATCAAAGAGCTGGAAGCATATTTGAAGAAAATAAATTTTGGGTCATGTCAAGAACAGAACATGGAAGTGGGAGAAGAAGATGATTGTTCAGAAGAGGACGACAGCGATCTCCATTCAATTGAACTCAACATGGACAACAACAACAAGAGTTACAGATGGAGTTTCGTCCATGGCTCTAAAAGTAAAAGGAACTCATTCGAAATTAAGGATCAAAACCAAAACCAAACCAATGCAAGAAAATCCCTTTCAGAAAAAATCCAATGGGGAAGTATTTGCTTGAATCGGAAATCATCCAACGAGTTTCTTGGTCGAAAGAGCCACGAAAATTCCGAGAGATTTGACTGGGAGAGATTTACAGAGCTTTTTACACAATCAACAACACACAAACAAGATCTTGATCTTGATCTTGATGTTGATGTTCATCAATTAAACGAGGGGGATAATGAAATTACACACAAGATCAATAATACTAAATCTGTGAAGTGCCTTAGG
GATATTTTATTTCCAGGATCAGCCGAAGATCAAAATCAAGTTGCTAAAACGGAGGAGGATGGAATTGCTATGAATGTTCTCCAAATCGATGAAGAAGCTTCTTCAGTGGTGACGAAGGGATAAATTTTACGTTTAAAACCATTCTCTTAATTCATTTCTTTTGATTCAGAAGTGCTTTGCTTAACAATCTTTTCTTCTTCATATTATATTTAAATGCTTCTAATTTTGGCTAAAGACCCAATAC

Coding sequence (CDS)

ATGAGCACACAATCTCATTCTTCCAAGTTATCCACCGGCGACCATCCCAACCGGTCACCGTCGTGGTCTCTCGACGGGGGCGGAAAAGGGAAGGAAGCCTCTGTTTCCGTTTCGGTTTCAGTCTCAGGCCGGAAAACGACGGCCAACAATAATTCCCAGAAGTTGAAGAACAATTCAGATGTTGTTGAAGATAAGAAAGAGGTGATGAAAACCCGAGATTTGGTTTCCCAGATCTCGCATTCGTGTTTATCAGATCCGGATCCGAGCTGCAACAACACGAATTCAGAGAAAGTTGAAGGTGGCAGAGTACACAGAAGAAGAAGGTCGGCATCTTCTTTGAGGATTGGGATAGGGGAAGTGGGTGGTTCGAATTTTCATGGCAATGACTGTTTAATGGAGATTGAAAATCCAAGTCATCAAGGAAAAACGACACGTCGAAAGACAAAATTTACGCTAAAAACACGTTTGAAGGAAGTGAGCAATTGCCTGACGACATCAAAGGAGCTTTTAAGAGTTCTAAACCATGTATGGGGTCATGAAGATCACGATCAACAGCGCCCATCTTCGACTTTATCTCTGATTACAGCTCTGAAGTCAGAGCTTGATCGAGCCAAGACCCGAGTTGATCATTTGATCAAAGACCAGAACTTTCATGGCGATGAAATCGAGCAACTAATGAAGCGATTTGCAGAGGAAAAGGCAGGTTGGAAGTACAGGGAGAGAGCAAGAGTTCGGAGCGCCATTACTTCAATGGCGGATGAAGTTGAGGTTGAGAAGAAACTTAGAAGACAAGCTGAGAGACTGAACAAGAGGATTGCCAAAGAACTTGCCGATGCTAAAGTTTCACTTTCGAAAGCAATGAAAGATCTTCAAAGGGAGAGGAGAGCAAAGGAGATATTCGAGGAGATATGTGATGAATTAGCGAAAGGGATTGGAGAAGACAGAGCCCAATTCGAGGAGCTAAAGAAGGAATCAGCAAAAGTAAGAGAAGAAGTGGAAAAAGAAAGAGAAATGCTTCAATTAGCAGATGTGTTACGAGAGGAAAGAGTTCAAATGAAGCTTTCGGAGGCCAAATATCAGTTCGAAGAGAAAAACGCAGCAGTGGAAAAGCTGAAAGACGAGCTGGAAGCGTATTTGATAACCCAACAAGAAGATTACTGTTGCAACAAATTTGAAAAAATCAAAGAGCTGGAAGCATATTTGAAGAAAATAAATTTTGGGTCATGTCAAGAACAGAACATGGAAGTGGGAGAAGAAGATGATTGTTCAGAAGAGGACGACAGCGATCTCCATTCAATTGAACTCAACATGGACAACAACAACAAGAGTTACAGATGGAGTTTCGTCCATGGCTCTAAAAGTAAAAGGAACTCATTCGAAATTAAGGATCAAAACCAAAACCAAACCAATGCAAGAAAATCCCTTTCAGAAAAAATCCAATGGGGAAGTATTTGCTTGAATCGGAAATCATCCAACGAGTTTCTTGGTCGAAAGAGCCACGAAAATTCCGAGAGATTTGACTGGGAGAGATTTACAGAGCTTTTTACACAATCAACAACACACAAACAAGATCTTGATCTTGATCTTGATGTTGATGTTCATCAATTAAACGAGGGGGATAATGAAATTACACACAAGATCAATAATACTAAATCTGTGAAGTGCCTTAGGGATATTTTATTTCCAGGATCAGCCGAAGATCAAAATCAAGTTGCTAAAACGGAGGAGGATGGAATTGCTATGAATGTTCTCCAAATCGATGAAGAAGCTTCTTCAGTGGTGACGAAGGGATAA

Protein sequence

MSTQSHSSKLSTGDHPNRSPSWSLDGGGKGKEASVSVSVSVSGRKTTANNNSQKLKNNSDVVEDKKEVMKTRDLVSQISHSCLSDPDPSCNNTNSEKVEGGRVHRRRRSASSLRIGIGEVGGSNFHGNDCLMEIENPSHQGKTTRRKTKFTLKTRLKEVSNCLTTSKELLRVLNHVWGHEDHDQQRPSSTLSLITALKSELDRAKTRVDHLIKDQNFHGDEIEQLMKRFAEEKAGWKYRERARVRSAITSMADEVEVEKKLRRQAERLNKRIAKELADAKVSLSKAMKDLQRERRAKEIFEEICDELAKGIGEDRAQFEELKKESAKVREEVEKEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVEKLKDELEAYLITQQEDYCCNKFEKIKELEAYLKKINFGSCQEQNMEVGEEDDCSEEDDSDLHSIELNMDNNNKSYRWSFVHGSKSKRNSFEIKDQNQNQTNARKSLSEKIQWGSICLNRKSSNEFLGRKSHENSERFDWERFTELFTQSTTHKQDLDLDLDVDVHQLNEGDNEITHKINNTKSVKCLRDILFPGSAEDQNQVAKTEEDGIAMNVLQIDEEASSVVTKG
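
The protein sequence corresponds to the CDS read codon by codon. A minimal sketch of that translation follows, assuming the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are illustrative and are not part of this database. The two inputs are the first eight codons and the last four codons of the CDS above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: do not emit a residue
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGAGCACACAATCTCATTCTTCC"))  # MSTQSHSS
print(translate("ACGAAGGGATAA"))              # TKG
```

Both outputs agree with the start (MSTQSHSS) and end (TKG) of the protein sequence listed above.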
Homology
BLAST of Tan0017511 vs. ExPASy Swiss-Prot
Match: Q66GQ2 (Uncharacterized protein At5g41620 OS=Arabidopsis thaliana OX=3702 GN=At5g41620 PE=2 SV=2)

HSP 1 Score: 103.6 bits (257), Expect = 7.9e-21
Identity = 81/268 (30.22%), Positives = 150/268 (55.97%), Query Frame = 0

Query: 163 LTTSKELLRVLNHVWGHEDHDQQRPSSTLSLITALKSELDRAKTRVDHLIKDQNFHGDEI 222
           L TS ELL+VLN +W  E    ++  S +SLI ALK+E+  ++ R+  L++ Q     E+
Sbjct: 193 LKTSTELLKVLNRIWSLE----EQHVSNISLIKALKTEVAHSRVRIKELLRYQQADRHEL 252

Query: 223 EQLMKRFAEEKAGWKYRERARVRSAITSMADEVEVEKKLRRQAERLNKRIAKELADAKVS 282
           + ++K+ AEEK   K +E  R+ SA+ S+   +E E+KLR+++E L++++A+EL++ K S
Sbjct: 253 DSVVKQLAEEKLLSKNKEVERMSSAVQSVRKALEDERKLRKRSESLHRKMARELSEVKSS 312

Query: 283 LSKAMKDLQRERRAKEIFEEICDELAKGIGEDRAQFEELKKESAKV--REEVEKEREMLQ 342
           LS  +K+L+R  ++ ++ E +CDE AKGI     +   LKK++           ++ +L 
Sbjct: 313 LSNCVKELERGSKSNKMMELLCDEFAKGIKSYEEEIHGLKKKNLDKDWAGRGGGDQLVLH 372

Query: 343 LADVLREERVQMKLSEAKYQFEEKNAAVEKLKDELEAYLITQQEDYCCNKFEKIKELEAY 402
           +A+   +ER+QM+L        +  + ++KL+ E+E +L  ++ +   N+          
Sbjct: 373 IAESWLDERMQMRLEGGDTLNGKNRSVLDKLEVEIETFLQEKRNEIPRNRRNS------- 432

Query: 403 LKKINFGSCQEQNMEVGEEDDCSEEDDS 429
           L+ + F +      +V  E+D    D +
Sbjct: 433 LESVPFNTLSAPPRDVDCEEDSGGSDSN 449
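
The identity and positives percentages in an HSP summary line are the matched counts divided by the alignment length. A small sketch of recomputing them from the raw counts follows; `parse_hsp_stats` is a hypothetical helper, and its pattern also accepts the misspelling "Postives" seen in some rendered reports.

```python
import re

def parse_hsp_stats(line):
    """Recompute identity/positives percentages from 'n/len' counts."""
    pattern = r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)"
    stats = {}
    for label, hits, length in re.findall(pattern, line):
        key = "identity" if label == "Identity" else "positives"
        stats[key] = round(100 * int(hits) / int(length), 2)
    return stats

line = "Identity = 81/268 (30.22%), Positives = 150/268 (55.97%), Query Frame = 0"
print(parse_hsp_stats(line))  # {'identity': 30.22, 'positives': 55.97}
```

For the Q66GQ2 hit, 81 identical and 150 positive positions over a 268-column alignment reproduce the reported 30.22% and 55.97%.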

BLAST of Tan0017511 vs. ExPASy Swiss-Prot
Match: F4I878 (Protein BRANCHLESS TRICHOME OS=Arabidopsis thaliana OX=3702 GN=BLT PE=1 SV=1)

HSP 1 Score: 54.3 bits (129), Expect = 5.5e-06
Identity = 41/131 (31.30%), Positives = 77/131 (58.78%), Query Frame = 0

Query: 240 ERARVRSAITSMADEVEVEKKLRRQAERLNKRIAKELADAKVSLSKAMKDLQRERRAKEI 299
           E  + +  I  +  E++ E+K RR+AE + K++AK++ + +  +++  +++Q +R  KE+
Sbjct: 76  ELGKAQDEIKELKAELDYERKARRRAELMIKKLAKDVEEER--MAREAEEMQNKRLFKEL 135

Query: 300 FEEICDELAKGIGEDRAQFEELKKESAKVREEVEKEREMLQLADVLREERVQMKLSEAKY 359
             E                   K E  +++ ++E+ER+M +LA+VLREERVQMKL +A+ 
Sbjct: 136 SSE-------------------KSEMVRMKRDLEEERQMHRLAEVLREERVQMKLMDARL 185

Query: 360 QFEEKNAAVEK 371
             EEK + +E+
Sbjct: 196 FLEEKLSELEE 185

BLAST of Tan0017511 vs. NCBI nr
Match: KAG6605972.1 (hypothetical protein SDJN03_03289, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 732.6 bits (1890), Expect = 2.6e-207
Identity = 432/603 (71.64%), Positives = 480/603 (79.60%), Query Frame = 0

Query: 1   MSTQSHSSKLSTGDHPNRSPSWSLDGGG-KGKEASVSVSVSVSGRKTTANNNSQKLKNNS 60
           MST+SHSS        NRSPS S+ GGG KGKEASVSVSVSVS R     N+SQKLKNN 
Sbjct: 1   MSTKSHSS--------NRSPSCSIAGGGSKGKEASVSVSVSVSAR-----NHSQKLKNNM 60

Query: 61  DVVEDKKEVMKTRDLVSQISHSCLSDPDPSCNNTNSEKVEGGRVHRRRRSASSLRIGIGE 120
           D++EDK+E+MKT+D VSQISHSCLSDPDP  N++NS+KVEG RVHRRR SASS+R+G GE
Sbjct: 61  DIIEDKQELMKTQDFVSQISHSCLSDPDPCFNDSNSKKVEGDRVHRRRTSASSMRLGTGE 120

Query: 121 VGGSNFHGNDCLMEIENPSHQGKTTRRKTKFTLKTRLKEVSNCLTTSKELLRVLNHVWGH 180
              +NFHGN CL+EIENPS+QG+T RRKTKF LKTRLKEV NCLTTSKEL+RVLNHV  H
Sbjct: 121 ---ANFHGNHCLIEIENPSNQGRTVRRKTKFMLKTRLKEVGNCLTTSKELIRVLNHVLAH 180

Query: 181 EDHDQQRPSSTLSLITALKSELDRAKTRVDHLIKDQNFHGDEIEQLMKRFAEEKAGWKYR 240
           ED+DQ RPSS   LITALKSE++RAK RVDHLIKDQ+ HGDEIE +MKRF EEK  WK R
Sbjct: 181 EDNDQHRPSSISPLITALKSEMERAKARVDHLIKDQSLHGDEIEIVMKRFTEEKTAWKNR 240

Query: 241 ERARVRSAITSMADEVEVEKKLRRQAERLNKRIAKELADAKVSLSKAMKDLQRERRAKEI 300
           ERARVRS+I SMADE+E+EKKLRRQAERLNK IAKELA+AK+SLSKAMKDLQRERRAKEI
Sbjct: 241 ERARVRSSIASMADEIEIEKKLRRQAERLNKTIAKELAEAKISLSKAMKDLQRERRAKEI 300

Query: 301 FEEICDELAKGIGEDRAQFEELKKESAKVREEVEKEREMLQLADVLREERVQMKLSEAKY 360
           FE+ICDELAKGIGEDRAQFEE KKESAKVREE+E+EREMLQLADVLREERVQMKLSEAKY
Sbjct: 301 FEQICDELAKGIGEDRAQFEEFKKESAKVREEIEQEREMLQLADVLREERVQMKLSEAKY 360

Query: 361 QFEEKNAAVEKLKDELEAYLITQ-------QEDYCCNKFEKIKELEAYLKKINFGSCQE- 420
           QFEEKNAAVE+LKDELEA+LITQ       +EDY      KIKELEAYLKKINFGS QE 
Sbjct: 361 QFEEKNAAVERLKDELEAFLITQFRHENREEEDYS----GKIKELEAYLKKINFGSVQEH 420

Query: 421 -QNMEVGEEDDCSEEDDSDLHSIELNMDNNNKSYRWSFVHGSKSKRNSFEIKDQNQNQTN 480
            +  E  EE +CSEEDDSDLHSIELNMDNNNKSYRWSFVHG  SKRNSFE     ++Q N
Sbjct: 421 LEGDEKIEEQECSEEDDSDLHSIELNMDNNNKSYRWSFVHGG-SKRNSFE-----KDQIN 480

Query: 481 ARKSLSEKIQWGSICLNRKSSN-----EFLGRKSHENSERFDWERFTELFTQSTTHKQDL 540
            RKS+SEKIQWGSICLNRK+SN     EF+GRKSHE+SER +WERFTE+F +        
Sbjct: 481 GRKSVSEKIQWGSICLNRKASNGSKNGEFVGRKSHESSERLEWERFTEVFEK-------- 540

Query: 541 DLDLDVDVHQLNEGDNEITHKINNTKSVKCLRDILFPGSAEDQNQVAKTEEDGIAMNVLQ 589
                       EGDN    K NNTKS KCLRDILFPG  E  + V      GIA NV +
Sbjct: 541 ------------EGDNGSAEK-NNTKSGKCLRDILFPGFVEPNDDV------GIAENV-E 549

BLAST of Tan0017511 vs. NCBI nr
Match: XP_022995046.1 (uncharacterized protein LOC111490718 isoform X1 [Cucurbita maxima] >XP_022995048.1 uncharacterized protein LOC111490718 isoform X1 [Cucurbita maxima])

HSP 1 Score: 731.5 bits (1887), Expect = 5.9e-207
Identity = 433/603 (71.81%), Positives = 480/603 (79.60%), Query Frame = 0

Query: 1   MSTQSHSSKLSTGDHPNRSPSWSLDGGG-KGKEASVSVSVSVSGRKTTANNNSQKLKNNS 60
           MST+SHSS        NRSPS S+ GGG KGKEASVSVSVSVS R     N+SQKLKNN 
Sbjct: 46  MSTKSHSS--------NRSPSCSVAGGGSKGKEASVSVSVSVSAR-----NHSQKLKNNM 105

Query: 61  DVVEDKKEVMKTRDLVSQISHSCLSDPDPSCNNTNSEKVEGGRVHRRRRSASSLRIGIGE 120
           D++EDK+E+MKT+D VSQISHSCLSDPDP  N++NS+KVEG RVHRRR SASSLRIG GE
Sbjct: 106 DIIEDKQELMKTQDFVSQISHSCLSDPDPCFNDSNSKKVEGDRVHRRRTSASSLRIGTGE 165

Query: 121 VGGSNFHGNDCLMEIENPSHQGKTTRRKTKFTLKTRLKEVSNCLTTSKELLRVLNHVWGH 180
              +NFHGN CL+EIENPS+QG+T RRKTKF LKTRLKEVSNCLTTSKEL+RVLNHV  H
Sbjct: 166 ---ANFHGNHCLIEIENPSNQGRTARRKTKFMLKTRLKEVSNCLTTSKELVRVLNHVLAH 225

Query: 181 EDHDQQRPSSTLSLITALKSELDRAKTRVDHLIKDQNFHGDEIEQLMKRFAEEKAGWKYR 240
           ED+DQ RPSS   LITALKSE++RAK RVDHLIKDQ+FHGDEIE +MKRF EEK  WK R
Sbjct: 226 EDNDQHRPSSISPLITALKSEMERAKARVDHLIKDQSFHGDEIEIVMKRFTEEKTAWKNR 285

Query: 241 ERARVRSAITSMADEVEVEKKLRRQAERLNKRIAKELADAKVSLSKAMKDLQRERRAKEI 300
           ERARVRS+I SMADE+E+EKKLR+QAERLNK IAKELA+AK+SLSKAMKDLQRERRAKEI
Sbjct: 286 ERARVRSSIASMADEIEIEKKLRKQAERLNKTIAKELAEAKISLSKAMKDLQRERRAKEI 345

Query: 301 FEEICDELAKGIGEDRAQFEELKKESAKVREEVEKEREMLQLADVLREERVQMKLSEAKY 360
           FE+ICDELAKGIGEDRAQFEE KKESAKVREE+E+EREMLQLADVLREERVQMKLSEAKY
Sbjct: 346 FEQICDELAKGIGEDRAQFEEFKKESAKVREEIEQEREMLQLADVLREERVQMKLSEAKY 405

Query: 361 QFEEKNAAVEKLKDELEAYLITQ-------QEDYCCNKFEKIKELEAYLKKINFGSCQEQ 420
           QFEEKNAAVE+LKDELEA+LITQ       +EDY      KIKELEAYLKKINFGS QE 
Sbjct: 406 QFEEKNAAVERLKDELEAFLITQFRHENREEEDYS----GKIKELEAYLKKINFGSVQEH 465

Query: 421 NMEVG--EEDDCSEEDDSDLHSIELNMDNNNKSYRWSFVHGSKSKRNSFEIKDQNQNQTN 480
               G  EE +CSEEDDSDLHSIELNMDNNNKSYRWSFVHG  SKRNSFE     ++Q N
Sbjct: 466 PDGDGKIEEQECSEEDDSDLHSIELNMDNNNKSYRWSFVHGG-SKRNSFE-----KDQIN 525

Query: 481 ARKSLSEKIQWGSICLNRKSSN-----EFLGRKSHENSERFDWERFTELFTQSTTHKQDL 540
            RKS+SEKIQWGSICLNRK+SN     +F+GRKSHE+SER +WERFTE+F +        
Sbjct: 526 GRKSVSEKIQWGSICLNRKASNGSKNGDFVGRKSHESSERLEWERFTEVFEK-------- 585

Query: 541 DLDLDVDVHQLNEGDNEITHKINNTKSVKCLRDILFPGSAEDQNQVAKTEEDGIAMNVLQ 589
                       EGDN    K  NTKS KCLRDILFPG  E  + V      GIA NV +
Sbjct: 586 ------------EGDNGSAEK-KNTKSGKCLRDILFPGFVEPNDDV------GIAGNV-E 594

BLAST of Tan0017511 vs. NCBI nr
Match: XP_022995049.1 (uncharacterized protein LOC111490718 isoform X2 [Cucurbita maxima] >XP_022995050.1 uncharacterized protein LOC111490718 isoform X2 [Cucurbita maxima])

HSP 1 Score: 731.5 bits (1887), Expect = 5.9e-207
Identity = 433/603 (71.81%), Positives = 480/603 (79.60%), Query Frame = 0

Query: 1   MSTQSHSSKLSTGDHPNRSPSWSLDGGG-KGKEASVSVSVSVSGRKTTANNNSQKLKNNS 60
           MST+SHSS        NRSPS S+ GGG KGKEASVSVSVSVS R     N+SQKLKNN 
Sbjct: 1   MSTKSHSS--------NRSPSCSVAGGGSKGKEASVSVSVSVSAR-----NHSQKLKNNM 60

Query: 61  DVVEDKKEVMKTRDLVSQISHSCLSDPDPSCNNTNSEKVEGGRVHRRRRSASSLRIGIGE 120
           D++EDK+E+MKT+D VSQISHSCLSDPDP  N++NS+KVEG RVHRRR SASSLRIG GE
Sbjct: 61  DIIEDKQELMKTQDFVSQISHSCLSDPDPCFNDSNSKKVEGDRVHRRRTSASSLRIGTGE 120

Query: 121 VGGSNFHGNDCLMEIENPSHQGKTTRRKTKFTLKTRLKEVSNCLTTSKELLRVLNHVWGH 180
              +NFHGN CL+EIENPS+QG+T RRKTKF LKTRLKEVSNCLTTSKEL+RVLNHV  H
Sbjct: 121 ---ANFHGNHCLIEIENPSNQGRTARRKTKFMLKTRLKEVSNCLTTSKELVRVLNHVLAH 180

Query: 181 EDHDQQRPSSTLSLITALKSELDRAKTRVDHLIKDQNFHGDEIEQLMKRFAEEKAGWKYR 240
           ED+DQ RPSS   LITALKSE++RAK RVDHLIKDQ+FHGDEIE +MKRF EEK  WK R
Sbjct: 181 EDNDQHRPSSISPLITALKSEMERAKARVDHLIKDQSFHGDEIEIVMKRFTEEKTAWKNR 240

Query: 241 ERARVRSAITSMADEVEVEKKLRRQAERLNKRIAKELADAKVSLSKAMKDLQRERRAKEI 300
           ERARVRS+I SMADE+E+EKKLR+QAERLNK IAKELA+AK+SLSKAMKDLQRERRAKEI
Sbjct: 241 ERARVRSSIASMADEIEIEKKLRKQAERLNKTIAKELAEAKISLSKAMKDLQRERRAKEI 300

Query: 301 FEEICDELAKGIGEDRAQFEELKKESAKVREEVEKEREMLQLADVLREERVQMKLSEAKY 360
           FE+ICDELAKGIGEDRAQFEE KKESAKVREE+E+EREMLQLADVLREERVQMKLSEAKY
Sbjct: 301 FEQICDELAKGIGEDRAQFEEFKKESAKVREEIEQEREMLQLADVLREERVQMKLSEAKY 360

Query: 361 QFEEKNAAVEKLKDELEAYLITQ-------QEDYCCNKFEKIKELEAYLKKINFGSCQEQ 420
           QFEEKNAAVE+LKDELEA+LITQ       +EDY      KIKELEAYLKKINFGS QE 
Sbjct: 361 QFEEKNAAVERLKDELEAFLITQFRHENREEEDYS----GKIKELEAYLKKINFGSVQEH 420

Query: 421 NMEVG--EEDDCSEEDDSDLHSIELNMDNNNKSYRWSFVHGSKSKRNSFEIKDQNQNQTN 480
               G  EE +CSEEDDSDLHSIELNMDNNNKSYRWSFVHG  SKRNSFE     ++Q N
Sbjct: 421 PDGDGKIEEQECSEEDDSDLHSIELNMDNNNKSYRWSFVHGG-SKRNSFE-----KDQIN 480

Query: 481 ARKSLSEKIQWGSICLNRKSSN-----EFLGRKSHENSERFDWERFTELFTQSTTHKQDL 540
            RKS+SEKIQWGSICLNRK+SN     +F+GRKSHE+SER +WERFTE+F +        
Sbjct: 481 GRKSVSEKIQWGSICLNRKASNGSKNGDFVGRKSHESSERLEWERFTEVFEK-------- 540

Query: 541 DLDLDVDVHQLNEGDNEITHKINNTKSVKCLRDILFPGSAEDQNQVAKTEEDGIAMNVLQ 589
                       EGDN    K  NTKS KCLRDILFPG  E  + V      GIA NV +
Sbjct: 541 ------------EGDNGSAEK-KNTKSGKCLRDILFPGFVEPNDDV------GIAGNV-E 549

BLAST of Tan0017511 vs. NCBI nr
Match: KAG7035923.1 (hypothetical protein SDJN02_02723 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 730.3 bits (1884), Expect = 1.3e-206
Identity = 431/603 (71.48%), Positives = 479/603 (79.44%), Query Frame = 0

Query: 1   MSTQSHSSKLSTGDHPNRSPSWSLDGGG-KGKEASVSVSVSVSGRKTTANNNSQKLKNNS 60
           MST+SHSS        NRSPS S+ GGG KGKEASVSVSVSVS R     N+SQKLKNN 
Sbjct: 44  MSTKSHSS--------NRSPSCSIAGGGSKGKEASVSVSVSVSAR-----NHSQKLKNNM 103

Query: 61  DVVEDKKEVMKTRDLVSQISHSCLSDPDPSCNNTNSEKVEGGRVHRRRRSASSLRIGIGE 120
           D++EDK+E+MKT+D VSQISHSCLSDPDP  N++NS+KVEG RVHRRR SASS+R+G GE
Sbjct: 104 DIIEDKQELMKTQDFVSQISHSCLSDPDPCFNDSNSKKVEGDRVHRRRTSASSMRLGTGE 163

Query: 121 VGGSNFHGNDCLMEIENPSHQGKTTRRKTKFTLKTRLKEVSNCLTTSKELLRVLNHVWGH 180
              +NFHGN CL+EIENPS+QG+T RRKTKF LKTRLKEV NCLTTSKEL+RVLNHV  H
Sbjct: 164 ---ANFHGNHCLIEIENPSNQGRTVRRKTKFMLKTRLKEVGNCLTTSKELIRVLNHVLAH 223

Query: 181 EDHDQQRPSSTLSLITALKSELDRAKTRVDHLIKDQNFHGDEIEQLMKRFAEEKAGWKYR 240
           ED+DQ RPSS   LITALKSE++RAK RVDHLIKDQ+ HGDEIE +MKRF EEK  WK R
Sbjct: 224 EDNDQHRPSSISPLITALKSEMERAKARVDHLIKDQSLHGDEIEIVMKRFTEEKTAWKNR 283

Query: 241 ERARVRSAITSMADEVEVEKKLRRQAERLNKRIAKELADAKVSLSKAMKDLQRERRAKEI 300
           ERARVRS+I SMADE+E+EKKLRRQAERLNK IAKELA+AK+SLSKAMKDLQRERRAKEI
Sbjct: 284 ERARVRSSIASMADEIEIEKKLRRQAERLNKTIAKELAEAKISLSKAMKDLQRERRAKEI 343

Query: 301 FEEICDELAKGIGEDRAQFEELKKESAKVREEVEKEREMLQLADVLREERVQMKLSEAKY 360
           FE+ICDELAKGIGEDRAQFEE KKESAKVREE+E+EREMLQLADVLREERVQMKLSEAKY
Sbjct: 344 FEQICDELAKGIGEDRAQFEEFKKESAKVREEIEQEREMLQLADVLREERVQMKLSEAKY 403

Query: 361 QFEEKNAAVEKLKDELEAYLITQ-------QEDYCCNKFEKIKELEAYLKKINFGSCQE- 420
           QFEEKNAAVE+LKDELEA+LITQ       +EDY      KIKELEAYLKKINFGS QE 
Sbjct: 404 QFEEKNAAVERLKDELEAFLITQFRHENREEEDYS----GKIKELEAYLKKINFGSVQEH 463

Query: 421 -QNMEVGEEDDCSEEDDSDLHSIELNMDNNNKSYRWSFVHGSKSKRNSFEIKDQNQNQTN 480
            +  E  EE +CSEEDDSDLHSIELNMDNNNKSYRWSFVHG  SKRNSFE     ++Q N
Sbjct: 464 LEGDEKIEEQECSEEDDSDLHSIELNMDNNNKSYRWSFVHGG-SKRNSFE-----KDQIN 523

Query: 481 ARKSLSEKIQWGSICLNRKSSN-----EFLGRKSHENSERFDWERFTELFTQSTTHKQDL 540
            RKS+SEKIQWGSICLNRK+SN     EF+GRKSHE+SER +WERFTE+F +        
Sbjct: 524 GRKSVSEKIQWGSICLNRKASNGSKNGEFVGRKSHESSERLEWERFTEVFEK-------- 583

Query: 541 DLDLDVDVHQLNEGDNEITHKINNTKSVKCLRDILFPGSAEDQNQVAKTEEDGIAMNVLQ 589
                       EGDN    K  NTKS KCLRDILFPG  E  + V      GIA NV +
Sbjct: 584 ------------EGDNGSAEK-KNTKSGKCLRDILFPGFVEPNDDV------GIAENV-E 592

BLAST of Tan0017511 vs. NCBI nr
Match: XP_023534124.1 (uncharacterized protein LOC111795778 isoform X2 [Cucurbita pepo subsp. pepo] >XP_023534128.1 uncharacterized protein LOC111795779 isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 729.6 bits (1882), Expect = 2.2e-206
Identity = 431/603 (71.48%), Positives = 478/603 (79.27%), Query Frame = 0

Query: 1   MSTQSHSSKLSTGDHPNRSPSWSLDGGG-KGKEASVSVSVSVSGRKTTANNNSQKLKNNS 60
           MST+SHSS        NRSPS S+ GGG KGKEASVSVSVSVS R     N+SQKLKNN 
Sbjct: 40  MSTKSHSS--------NRSPSCSVAGGGSKGKEASVSVSVSVSAR-----NHSQKLKNNM 99

Query: 61  DVVEDKKEVMKTRDLVSQISHSCLSDPDPSCNNTNSEKVEGGRVHRRRRSASSLRIGIGE 120
           D++EDK+E+MKT+D VSQISHSCLSDPDP  N++NS+KVEG RVHRRR SASS+R+G GE
Sbjct: 100 DIIEDKQELMKTQDFVSQISHSCLSDPDPCFNDSNSKKVEGDRVHRRRTSASSMRLGTGE 159

Query: 121 VGGSNFHGNDCLMEIENPSHQGKTTRRKTKFTLKTRLKEVSNCLTTSKELLRVLNHVWGH 180
              +NFHGN CL+EIENPS+QG+T RRKTKF LKTRLKEV NCLTTSKEL+RVLNHV  H
Sbjct: 160 ---ANFHGNHCLIEIENPSNQGRTARRKTKFMLKTRLKEVGNCLTTSKELIRVLNHVLAH 219

Query: 181 EDHDQQRPSSTLSLITALKSELDRAKTRVDHLIKDQNFHGDEIEQLMKRFAEEKAGWKYR 240
           ED+DQ RPSS   LITALK E++RAK RVDHLIKDQ+FHGDEIE +MKRF EEK  WK R
Sbjct: 220 EDNDQHRPSSISPLITALKLEMERAKARVDHLIKDQSFHGDEIEIVMKRFTEEKTAWKNR 279

Query: 241 ERARVRSAITSMADEVEVEKKLRRQAERLNKRIAKELADAKVSLSKAMKDLQRERRAKEI 300
           ERARVRS+I SMADE+E+EKKLRRQAERLNK IAKELA+AK+SLSKAMKDLQRERRAKEI
Sbjct: 280 ERARVRSSIASMADEIEIEKKLRRQAERLNKTIAKELAEAKISLSKAMKDLQRERRAKEI 339

Query: 301 FEEICDELAKGIGEDRAQFEELKKESAKVREEVEKEREMLQLADVLREERVQMKLSEAKY 360
           FE+ICDELAKGIGEDRAQFEE KKESAKVREE+E+EREMLQLADVLREERVQMKLSEAKY
Sbjct: 340 FEQICDELAKGIGEDRAQFEEFKKESAKVREEIEQEREMLQLADVLREERVQMKLSEAKY 399

Query: 361 QFEEKNAAVEKLKDELEAYLITQ-------QEDYCCNKFEKIKELEAYLKKINFGSCQEQ 420
           QFEEKNAAVE+LKDELEA+LITQ       +EDY      KIKELEAYLKKINFGS QE 
Sbjct: 400 QFEEKNAAVERLKDELEAFLITQFRHENREEEDYS----GKIKELEAYLKKINFGSVQEH 459

Query: 421 NMEVG--EEDDCSEEDDSDLHSIELNMDNNNKSYRWSFVHGSKSKRNSFEIKDQNQNQTN 480
               G  EE +CSEEDDSDLHSIELNMDNNNKSYRWSFVHG  SKRNSFE     ++Q N
Sbjct: 460 LEGDGKIEEQECSEEDDSDLHSIELNMDNNNKSYRWSFVHGG-SKRNSFE-----KDQIN 519

Query: 481 ARKSLSEKIQWGSICLNRKSSN-----EFLGRKSHENSERFDWERFTELFTQSTTHKQDL 540
            RKS+SEKIQWGSICLNRK+SN     EF+GRKSHE++ER +WERFTE+F          
Sbjct: 520 GRKSVSEKIQWGSICLNRKASNGSKNGEFVGRKSHESTERLEWERFTEVFE--------- 579

Query: 541 DLDLDVDVHQLNEGDNEITHKINNTKSVKCLRDILFPGSAEDQNQVAKTEEDGIAMNVLQ 589
                      NEGDN    K  NTKS KCLRDILFPG  E  + V      GIA NV +
Sbjct: 580 -----------NEGDNGSAEK-KNTKSGKCLRDILFPGFVEPNDDV------GIAGNV-E 588

BLAST of Tan0017511 vs. ExPASy TrEMBL
Match: A0A6J1K0X3 (uncharacterized protein LOC111490718 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111490718 PE=4 SV=1)

HSP 1 Score: 731.5 bits (1887), Expect = 2.9e-207
Identity = 433/603 (71.81%), Positives = 480/603 (79.60%), Query Frame = 0

Query: 1   MSTQSHSSKLSTGDHPNRSPSWSLDGGG-KGKEASVSVSVSVSGRKTTANNNSQKLKNNS 60
           MST+SHSS        NRSPS S+ GGG KGKEASVSVSVSVS R     N+SQKLKNN 
Sbjct: 46  MSTKSHSS--------NRSPSCSVAGGGSKGKEASVSVSVSVSAR-----NHSQKLKNNM 105

Query: 61  DVVEDKKEVMKTRDLVSQISHSCLSDPDPSCNNTNSEKVEGGRVHRRRRSASSLRIGIGE 120
           D++EDK+E+MKT+D VSQISHSCLSDPDP  N++NS+KVEG RVHRRR SASSLRIG GE
Sbjct: 106 DIIEDKQELMKTQDFVSQISHSCLSDPDPCFNDSNSKKVEGDRVHRRRTSASSLRIGTGE 165

Query: 121 VGGSNFHGNDCLMEIENPSHQGKTTRRKTKFTLKTRLKEVSNCLTTSKELLRVLNHVWGH 180
              +NFHGN CL+EIENPS+QG+T RRKTKF LKTRLKEVSNCLTTSKEL+RVLNHV  H
Sbjct: 166 ---ANFHGNHCLIEIENPSNQGRTARRKTKFMLKTRLKEVSNCLTTSKELVRVLNHVLAH 225

Query: 181 EDHDQQRPSSTLSLITALKSELDRAKTRVDHLIKDQNFHGDEIEQLMKRFAEEKAGWKYR 240
           ED+DQ RPSS   LITALKSE++RAK RVDHLIKDQ+FHGDEIE +MKRF EEK  WK R
Sbjct: 226 EDNDQHRPSSISPLITALKSEMERAKARVDHLIKDQSFHGDEIEIVMKRFTEEKTAWKNR 285

Query: 241 ERARVRSAITSMADEVEVEKKLRRQAERLNKRIAKELADAKVSLSKAMKDLQRERRAKEI 300
           ERARVRS+I SMADE+E+EKKLR+QAERLNK IAKELA+AK+SLSKAMKDLQRERRAKEI
Sbjct: 286 ERARVRSSIASMADEIEIEKKLRKQAERLNKTIAKELAEAKISLSKAMKDLQRERRAKEI 345

Query: 301 FEEICDELAKGIGEDRAQFEELKKESAKVREEVEKEREMLQLADVLREERVQMKLSEAKY 360
           FE+ICDELAKGIGEDRAQFEE KKESAKVREE+E+EREMLQLADVLREERVQMKLSEAKY
Sbjct: 346 FEQICDELAKGIGEDRAQFEEFKKESAKVREEIEQEREMLQLADVLREERVQMKLSEAKY 405

Query: 361 QFEEKNAAVEKLKDELEAYLITQ-------QEDYCCNKFEKIKELEAYLKKINFGSCQEQ 420
           QFEEKNAAVE+LKDELEA+LITQ       +EDY      KIKELEAYLKKINFGS QE 
Sbjct: 406 QFEEKNAAVERLKDELEAFLITQFRHENREEEDYS----GKIKELEAYLKKINFGSVQEH 465

Query: 421 NMEVG--EEDDCSEEDDSDLHSIELNMDNNNKSYRWSFVHGSKSKRNSFEIKDQNQNQTN 480
               G  EE +CSEEDDSDLHSIELNMDNNNKSYRWSFVHG  SKRNSFE     ++Q N
Sbjct: 466 PDGDGKIEEQECSEEDDSDLHSIELNMDNNNKSYRWSFVHGG-SKRNSFE-----KDQIN 525

Query: 481 ARKSLSEKIQWGSICLNRKSSN-----EFLGRKSHENSERFDWERFTELFTQSTTHKQDL 540
            RKS+SEKIQWGSICLNRK+SN     +F+GRKSHE+SER +WERFTE+F +        
Sbjct: 526 GRKSVSEKIQWGSICLNRKASNGSKNGDFVGRKSHESSERLEWERFTEVFEK-------- 585

Query: 541 DLDLDVDVHQLNEGDNEITHKINNTKSVKCLRDILFPGSAEDQNQVAKTEEDGIAMNVLQ 589
                       EGDN    K  NTKS KCLRDILFPG  E  + V      GIA NV +
Sbjct: 586 ------------EGDNGSAEK-KNTKSGKCLRDILFPGFVEPNDDV------GIAGNV-E 594

BLAST of Tan0017511 vs. ExPASy TrEMBL
Match: A0A6J1K315 (uncharacterized protein LOC111490718 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111490718 PE=4 SV=1)

HSP 1 Score: 731.5 bits (1887), Expect = 2.9e-207
Identity = 433/603 (71.81%), Positives = 480/603 (79.60%), Query Frame = 0

Query: 1   MSTQSHSSKLSTGDHPNRSPSWSLDGGG-KGKEASVSVSVSVSGRKTTANNNSQKLKNNS 60
           MST+SHSS        NRSPS S+ GGG KGKEASVSVSVSVS R     N+SQKLKNN 
Sbjct: 1   MSTKSHSS--------NRSPSCSVAGGGSKGKEASVSVSVSVSAR-----NHSQKLKNNM 60

Query: 61  DVVEDKKEVMKTRDLVSQISHSCLSDPDPSCNNTNSEKVEGGRVHRRRRSASSLRIGIGE 120
           D++EDK+E+MKT+D VSQISHSCLSDPDP  N++NS+KVEG RVHRRR SASSLRIG GE
Sbjct: 61  DIIEDKQELMKTQDFVSQISHSCLSDPDPCFNDSNSKKVEGDRVHRRRTSASSLRIGTGE 120

Query: 121 VGGSNFHGNDCLMEIENPSHQGKTTRRKTKFTLKTRLKEVSNCLTTSKELLRVLNHVWGH 180
              +NFHGN CL+EIENPS+QG+T RRKTKF LKTRLKEVSNCLTTSKEL+RVLNHV  H
Sbjct: 121 ---ANFHGNHCLIEIENPSNQGRTARRKTKFMLKTRLKEVSNCLTTSKELVRVLNHVLAH 180

Query: 181 EDHDQQRPSSTLSLITALKSELDRAKTRVDHLIKDQNFHGDEIEQLMKRFAEEKAGWKYR 240
           ED+DQ RPSS   LITALKSE++RAK RVDHLIKDQ+FHGDEIE +MKRF EEK  WK R
Sbjct: 181 EDNDQHRPSSISPLITALKSEMERAKARVDHLIKDQSFHGDEIEIVMKRFTEEKTAWKNR 240

Query: 241 ERARVRSAITSMADEVEVEKKLRRQAERLNKRIAKELADAKVSLSKAMKDLQRERRAKEI 300
           ERARVRS+I SMADE+E+EKKLR+QAERLNK IAKELA+AK+SLSKAMKDLQRERRAKEI
Sbjct: 241 ERARVRSSIASMADEIEIEKKLRKQAERLNKTIAKELAEAKISLSKAMKDLQRERRAKEI 300

Query: 301 FEEICDELAKGIGEDRAQFEELKKESAKVREEVEKEREMLQLADVLREERVQMKLSEAKY 360
           FE+ICDELAKGIGEDRAQFEE KKESAKVREE+E+EREMLQLADVLREERVQMKLSEAKY
Sbjct: 301 FEQICDELAKGIGEDRAQFEEFKKESAKVREEIEQEREMLQLADVLREERVQMKLSEAKY 360

Query: 361 QFEEKNAAVEKLKDELEAYLITQ-------QEDYCCNKFEKIKELEAYLKKINFGSCQEQ 420
           QFEEKNAAVE+LKDELEA+LITQ       +EDY      KIKELEAYLKKINFGS QE 
Sbjct: 361 QFEEKNAAVERLKDELEAFLITQFRHENREEEDYS----GKIKELEAYLKKINFGSVQEH 420

Query: 421 NMEVG--EEDDCSEEDDSDLHSIELNMDNNNKSYRWSFVHGSKSKRNSFEIKDQNQNQTN 480
               G  EE +CSEEDDSDLHSIELNMDNNNKSYRWSFVHG  SKRNSFE     ++Q N
Sbjct: 421 PDGDGKIEEQECSEEDDSDLHSIELNMDNNNKSYRWSFVHGG-SKRNSFE-----KDQIN 480

Query: 481 ARKSLSEKIQWGSICLNRKSSN-----EFLGRKSHENSERFDWERFTELFTQSTTHKQDL 540
            RKS+SEKIQWGSICLNRK+SN     +F+GRKSHE+SER +WERFTE+F +        
Sbjct: 481 GRKSVSEKIQWGSICLNRKASNGSKNGDFVGRKSHESSERLEWERFTEVFEK-------- 540

Query: 541 DLDLDVDVHQLNEGDNEITHKINNTKSVKCLRDILFPGSAEDQNQVAKTEEDGIAMNVLQ 589
                       EGDN    K  NTKS KCLRDILFPG  E  + V      GIA NV +
Sbjct: 541 ------------EGDNGSAEK-KNTKSGKCLRDILFPGFVEPNDDV------GIAGNV-E 549

BLAST of Tan0017511 vs. ExPASy TrEMBL
Match: A0A6J1H001 (uncharacterized protein LOC111459204 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111459204 PE=4 SV=1)

HSP 1 Score: 726.9 bits (1875), Expect = 7.0e-206
Identity = 429/603 (71.14%), Positives = 479/603 (79.44%), Query Frame = 0

Query: 1   MSTQSHSSKLSTGDHPNRSPSWSLDGGG-KGKEASVSVSVSVSGRKTTANNNSQKLKNNS 60
           MST+SHSS        NRSPS S+ GGG KGKEASVSVSVSVS R     N+SQKLKNN 
Sbjct: 1   MSTKSHSS--------NRSPSCSIAGGGSKGKEASVSVSVSVSAR-----NHSQKLKNNM 60

Query: 61  DVVEDKKEVMKTRDLVSQISHSCLSDPDPSCNNTNSEKVEGGRVHRRRRSASSLRIGIGE 120
           D++EDK+E+MKT+D VSQISHSCLSDPDP  N++NS+KVEG RVHRRR SASS+R+G GE
Sbjct: 61  DIIEDKQELMKTQDFVSQISHSCLSDPDPCFNDSNSKKVEGDRVHRRRTSASSMRLGTGE 120

Query: 121 VGGSNFHGNDCLMEIENPSHQGKTTRRKTKFTLKTRLKEVSNCLTTSKELLRVLNHVWGH 180
              +NFHG+ CL+EIENPS+QGKT RRKTKF LKTRLKEV NCLTTSKEL+RVLNHV  H
Sbjct: 121 ---ANFHGDHCLIEIENPSNQGKTARRKTKFMLKTRLKEVGNCLTTSKELIRVLNHVLAH 180

Query: 181 EDHDQQRPSSTLSLITALKSELDRAKTRVDHLIKDQNFHGDEIEQLMKRFAEEKAGWKYR 240
           ED+DQ RPSS   LITALKSE++RAK RVDHLIKDQ+ HGDEIE +MKRF EEK  WK R
Sbjct: 181 EDNDQHRPSSISPLITALKSEMERAKARVDHLIKDQSLHGDEIEIVMKRFTEEKTAWKNR 240

Query: 241 ERARVRSAITSMADEVEVEKKLRRQAERLNKRIAKELADAKVSLSKAMKDLQRERRAKEI 300
           ERARVRS+I SMADE+E+EKKLR+QAERLNK IAKELA+AK+SLSKAMKDLQRERRAKEI
Sbjct: 241 ERARVRSSIASMADEIEIEKKLRKQAERLNKTIAKELAEAKISLSKAMKDLQRERRAKEI 300

Query: 301 FEEICDELAKGIGEDRAQFEELKKESAKVREEVEKEREMLQLADVLREERVQMKLSEAKY 360
           FE+ICDELAKGIGEDRAQFEE KKESAKVREE+E+EREMLQLADVLREERVQMKLSEAKY
Sbjct: 301 FEQICDELAKGIGEDRAQFEEFKKESAKVREEIEQEREMLQLADVLREERVQMKLSEAKY 360

Query: 361 QFEEKNAAVEKLKDELEAYLITQ-------QEDYCCNKFEKIKELEAYLKKINFGSCQE- 420
           QFEEKNAAVE+LKDELEA+LITQ       +EDY      KIKELEAYLKKINFGS QE 
Sbjct: 361 QFEEKNAAVERLKDELEAFLITQFRHENREEEDYS----GKIKELEAYLKKINFGSVQEH 420

Query: 421 -QNMEVGEEDDCSEEDDSDLHSIELNMDNNNKSYRWSFVHGSKSKRNSFEIKDQNQNQTN 480
            +  E  EE +CSEEDDSDLHSIELNMDNNNKSYRWSFVHG  SKRNSFE     +++ N
Sbjct: 421 LEGDEKIEEQECSEEDDSDLHSIELNMDNNNKSYRWSFVHGG-SKRNSFE-----KDEIN 480

Query: 481 ARKSLSEKIQWGSICLNRKSSN-----EFLGRKSHENSERFDWERFTELFTQSTTHKQDL 540
            RKS+SEKIQWGSICLNRK+SN     EF+GRKSHE+SER +WERFTE+F +        
Sbjct: 481 GRKSVSEKIQWGSICLNRKASNGSKNGEFVGRKSHESSERLEWERFTEVFEK-------- 540

Query: 541 DLDLDVDVHQLNEGDNEITHKINNTKSVKCLRDILFPGSAEDQNQVAKTEEDGIAMNVLQ 589
                       EGDN    K  NTKS KCLRDILFPG  E  + V      GIA NV +
Sbjct: 541 ------------EGDNGSAEK-KNTKSGKCLRDILFPGFVEPNDDV------GIAGNV-E 549

BLAST of Tan0017511 vs. ExPASy TrEMBL
Match: A0A6J1H037 (uncharacterized protein LOC111459204 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111459204 PE=4 SV=1)

HSP 1 Score: 726.9 bits (1875), Expect = 7.0e-206
Identity = 429/603 (71.14%), Positives = 479/603 (79.44%), Query Frame = 0

Query: 1   MSTQSHSSKLSTGDHPNRSPSWSLDGGG-KGKEASVSVSVSVSGRKTTANNNSQKLKNNS 60
           MST+SHSS        NRSPS S+ GGG KGKEASVSVSVSVS R     N+SQKLKNN 
Sbjct: 44  MSTKSHSS--------NRSPSCSIAGGGSKGKEASVSVSVSVSAR-----NHSQKLKNNM 103

Query: 61  DVVEDKKEVMKTRDLVSQISHSCLSDPDPSCNNTNSEKVEGGRVHRRRRSASSLRIGIGE 120
           D++EDK+E+MKT+D VSQISHSCLSDPDP  N++NS+KVEG RVHRRR SASS+R+G GE
Sbjct: 104 DIIEDKQELMKTQDFVSQISHSCLSDPDPCFNDSNSKKVEGDRVHRRRTSASSMRLGTGE 163

Query: 121 VGGSNFHGNDCLMEIENPSHQGKTTRRKTKFTLKTRLKEVSNCLTTSKELLRVLNHVWGH 180
              +NFHG+ CL+EIENPS+QGKT RRKTKF LKTRLKEV NCLTTSKEL+RVLNHV  H
Sbjct: 164 ---ANFHGDHCLIEIENPSNQGKTARRKTKFMLKTRLKEVGNCLTTSKELIRVLNHVLAH 223

Query: 181 EDHDQQRPSSTLSLITALKSELDRAKTRVDHLIKDQNFHGDEIEQLMKRFAEEKAGWKYR 240
           ED+DQ RPSS   LITALKSE++RAK RVDHLIKDQ+ HGDEIE +MKRF EEK  WK R
Sbjct: 224 EDNDQHRPSSISPLITALKSEMERAKARVDHLIKDQSLHGDEIEIVMKRFTEEKTAWKNR 283

Query: 241 ERARVRSAITSMADEVEVEKKLRRQAERLNKRIAKELADAKVSLSKAMKDLQRERRAKEI 300
           ERARVRS+I SMADE+E+EKKLR+QAERLNK IAKELA+AK+SLSKAMKDLQRERRAKEI
Sbjct: 284 ERARVRSSIASMADEIEIEKKLRKQAERLNKTIAKELAEAKISLSKAMKDLQRERRAKEI 343

Query: 301 FEEICDELAKGIGEDRAQFEELKKESAKVREEVEKEREMLQLADVLREERVQMKLSEAKY 360
           FE+ICDELAKGIGEDRAQFEE KKESAKVREE+E+EREMLQLADVLREERVQMKLSEAKY
Sbjct: 344 FEQICDELAKGIGEDRAQFEEFKKESAKVREEIEQEREMLQLADVLREERVQMKLSEAKY 403

Query: 361 QFEEKNAAVEKLKDELEAYLITQ-------QEDYCCNKFEKIKELEAYLKKINFGSCQE- 420
           QFEEKNAAVE+LKDELEA+LITQ       +EDY      KIKELEAYLKKINFGS QE 
Sbjct: 404 QFEEKNAAVERLKDELEAFLITQFRHENREEEDYS----GKIKELEAYLKKINFGSVQEH 463

Query: 421 -QNMEVGEEDDCSEEDDSDLHSIELNMDNNNKSYRWSFVHGSKSKRNSFEIKDQNQNQTN 480
            +  E  EE +CSEEDDSDLHSIELNMDNNNKSYRWSFVHG  SKRNSFE     +++ N
Sbjct: 464 LEGDEKIEEQECSEEDDSDLHSIELNMDNNNKSYRWSFVHGG-SKRNSFE-----KDEIN 523

Query: 481 ARKSLSEKIQWGSICLNRKSSN-----EFLGRKSHENSERFDWERFTELFTQSTTHKQDL 540
            RKS+SEKIQWGSICLNRK+SN     EF+GRKSHE+SER +WERFTE+F +        
Sbjct: 524 GRKSVSEKIQWGSICLNRKASNGSKNGEFVGRKSHESSERLEWERFTEVFEK-------- 583

Query: 541 DLDLDVDVHQLNEGDNEITHKINNTKSVKCLRDILFPGSAEDQNQVAKTEEDGIAMNVLQ 589
                       EGDN    K  NTKS KCLRDILFPG  E  + V      GIA NV +
Sbjct: 584 ------------EGDNGSAEK-KNTKSGKCLRDILFPGFVEPNDDV------GIAGNV-E 592
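The percentages on each HSP summary line follow directly from the raw counts (matches divided by alignment length). A minimal Python sketch that recomputes them is given here as a sanity check; the function name `check_hsp_summary` and the regular expression are illustrative assumptions, not part of the database or of BLAST itself.

```python
import re

# Matches the identity/positives part of an HSP summary line. The second label
# is matched loosely (\w+) so the pattern does not depend on its exact spelling.
HSP_SUMMARY = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), \w+ = (\d+)/(\d+) \(([\d.]+)%\)"
)

def check_hsp_summary(line):
    """Return (identity_pct, positives_pct) recomputed from the raw counts."""
    m = HSP_SUMMARY.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    identity_pct = round(100 * int(ident) / int(ident_len), 2)
    positives_pct = round(100 * int(pos) / int(pos_len), 2)
    return identity_pct, positives_pct

# Summary line of the A0A6J1H037 hit (hypothetical usage).
summary = "Identity = 429/603 (71.14%), Positives = 479/603 (79.44%), Query Frame = 0"
print(check_hsp_summary(summary))  # (71.14, 79.44)
```

The recomputed values agree with the percentages printed in the record for this hit.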

BLAST of Tan0017511 vs. ExPASy TrEMBL
Match: A0A6J1DGK1 (uncharacterized protein LOC111020667 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111020667 PE=4 SV=1)

HSP 1 Score: 607.8 bits (1566), Expect = 4.8e-170
Identity = 376/574 (65.51%), Positives = 429/574 (74.74%), Query Frame = 0

Query: 38  SVSVSGRKTTANNNSQKLKNNSDVV--EDKKEVMKTRDLVSQISHSCLSDPDPSCNNTNS 97
           S S +G+     NN     NNSD V  EDKKE++KT ++VSQISHSCLSDPDP+ NNT S
Sbjct: 5   SHSSAGKGVKLKNN-----NNSDAVELEDKKELIKTGEMVSQISHSCLSDPDPNFNNTKS 64

Query: 98  EKVEG--GRVHRRRRSASSLRIGIG--EVGGSNFHGNDCLMEIENPSHQGKTTRRKTKFT 157
           EKVEG  GRV RRRRSA SLRIGIG  EVGGSNF GNDCLMEIEN S + KTTRRK KFT
Sbjct: 65  EKVEGGSGRVQRRRRSACSLRIGIGEIEVGGSNFRGNDCLMEIENRS-EVKTTRRKKKFT 124

Query: 158 LKTRLKEVSNCLTTSKELLRVLNHVWGHEDHDQQRPSSTLSLITALKSELDRAKTRVDHL 217
           +KTRLKEVSNCLTTSKEL+RVL HVWG   HD+++PSS  SL+ ALKSELDRAKTRV+HL
Sbjct: 125 VKTRLKEVSNCLTTSKELVRVLTHVWG--SHDEKQPSSASSLMAALKSELDRAKTRVEHL 184

Query: 218 IKDQN--FHGDEIEQLMKRFAEEKAGWKYRERARVRSAITSMADEVEVEKKLRRQAERLN 277
           ++D+   FHGDEIE L KRFAEEKA WKY+ERARV SAI+SMA+EV VE+KLRRQAERLN
Sbjct: 185 MRDEQRLFHGDEIEALRKRFAEEKAAWKYKERARVGSAISSMAEEVAVERKLRRQAERLN 244

Query: 278 KRIAKELADAKVSLSKAMKDLQRERRAKEIFEEICDELAKGIGEDRAQFEELKKESAKVR 337
           KRI KEL +A+V+++KAMKD+ RE+RAKEI EEIC+ELAKGIGEDRA+FEEL+KES KVR
Sbjct: 245 KRIGKELGEARVAVAKAMKDVDREKRAKEILEEICEELAKGIGEDRAEFEELRKESEKVR 304

Query: 338 EEVEKEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVEKLKDELEAYLITQQ---EDY 397
           EEVEKEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVE+LKD+LEAY +  +   +D 
Sbjct: 305 EEVEKEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDQLEAYFVENRDHSQDS 364

Query: 398 CCNKFEKIKELEAYLKKINFGSCQEQNMEVGEEDDCSEEDDSDLHSIELNMDNNNKSYRW 457
             NK +KIKELEAYLKKINFGS  +      EE+DC+ +++SDLHSIELNMDNNNKSYRW
Sbjct: 365 FNNKLDKIKELEAYLKKINFGSYNKNK----EEEDCNWDEESDLHSIELNMDNNNKSYRW 424

Query: 458 SFVHGS--KSKRNSFEIKDQNQNQTNARKSLSEKIQWGSICLNRKSSN-EFLGRKSHENS 517
           SFVHGS   SKRNSFE           RKSLSEKIQWGSIC N  S N EF G       
Sbjct: 425 SFVHGSHNASKRNSFE---------KERKSLSEKIQWGSICFNSSSKNGEFEG------- 484

Query: 518 ERFDWERFTELFTQSTTHKQDLDLDLDVDVHQLNEGDNEITHKINNTKSVKCLRDILFPG 577
              D ER                           +G+ +ITHK   +  V+CLRDILFP 
Sbjct: 485 ---DGER---------------------------DGEIQITHK-QKSGGVRCLRDILFPV 515

Query: 578 SAEDQNQVAKTEEDGIAMNVLQIDEEASSVVTKG 598
           S  ++N+V KTE+   AM  LQIDE  S VV KG
Sbjct: 545 SGVEENKVEKTED---AM-PLQIDEPCSVVVMKG 515

BLAST of Tan0017511 vs. TAIR 10
Match: AT3G11590.1 (unknown protein; LOCATED IN: plasma membrane; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G22310.1); Has 22320 Blast hits to 15179 proteins in 1213 species: Archae - 372; Bacteria - 2307; Metazoa - 10906; Fungi - 1700; Plants - 1146; Viruses - 65; Other Eukaryotes - 5824 (source: NCBI BLink). )

HSP 1 Score: 277.7 bits (709), Expect = 2.2e-74
Identity = 212/502 (42.23%), Positives = 304/502 (60.56%), Query Frame = 0

Query: 4   QSHSSKLSTGDHPNRSPSWSLDGGGKGKEASVSVSVSVSGRKTTANNNSQKLKNNSDVVE 63
           +S S + S   H   SPS S  G   GK   VS    VS RK  A         +  VVE
Sbjct: 63  RSPSPRASGALHAAASPS-SHCGSKTGK---VSAPAPVSARKLAATLWEMNEMPSPRVVE 122

Query: 64  D--------KKEVMKTRDLVSQISHSCLSDP---DPSCNNTNSEKVEGGRVHRRRRSASS 123
           +        +KE +          HS    P   DPS +  +      G   R+RR++S+
Sbjct: 123 EAAPMIRKSRKERIAPLPPPRSSVHSGSLPPHLSDPSHSPVSERMERSGTGSRQRRASST 182

Query: 124 ---LRIGIGEVGGSNFHGNDCLMEIENPSHQGKTTRRKTKFTLKTRLKEVSNCLTTSKEL 183
              LR+G   VG  +   +   M+IE  S     T   +   +KTRLK+ SN LTTSKEL
Sbjct: 183 VQKLRLGDCNVGARDPINSGSFMDIETRSR--VETPTGSTVGVKTRLKDCSNALTTSKEL 242

Query: 184 LRVLNHVWGHEDHDQQRPSSTLSLITALKSELDRAKTRVDHLIKDQNFHGDEIEQLMKRF 243
           L+++N +WG +D    RPSS++SL++AL SEL+RA+ +V+ LI +     ++I  LMKRF
Sbjct: 243 LKIINRMWGQDD----RPSSSMSLVSALHSELERARLQVNQLIHEHKPENNDISYLMKRF 302

Query: 244 AEEKAGWKYRERARVRSAITSMADEVEVEKKLRRQAERLNKRIAKELADAKVSLSKAMKD 303
           AEEKA WK  E+  V +AI S+A E+EVE+KLRR+ E LNK++ KELA+ K +L KA+K+
Sbjct: 303 AEEKAVWKSNEQEVVEAAIESVAGELEVERKLRRRFESLNKKLGKELAETKSALMKAVKE 362

Query: 304 LQRERRAKEIFEEICDELAKGIGEDRAQFEELKKESAKVREEVEKEREMLQLADVLREER 363
           ++ E+RA+ + E++CDELA+ I ED+A+ EELK+ES KV+EEVEKEREMLQLAD LREER
Sbjct: 363 IENEKRARVMVEKVCDELARDISEDKAEVEELKRESFKVKEEVEKEREMLQLADALREER 422

Query: 364 VQMKLSEAKYQFEEKNAAVEKLKDELEAYLITQQEDYCCNKFEKIKELEAYLKK------ 423
           VQMKLSEAK+Q EEKNAAV+KL+++L+ YL  ++   C  K  +  + + + ++      
Sbjct: 423 VQMKLSEAKHQLEEKNAAVDKLRNQLQTYLKAKR---CKEKTREPPQTQLHNEEAGDYLN 482

Query: 424 --INFGSCQEQNMEVGEEDDCSEE--DDSDLHSIELNMDNNNKSYRWSFVHGSKSKRNSF 482
             I+FGS    N+E GE ++ +EE   +SDLHSIELN+D  NKSY+W +   ++ +    
Sbjct: 483 HHISFGS---YNIEDGEVENGNEEGSGESDLHSIELNID--NKSYKWPYGEENRGR---- 540

BLAST of Tan0017511 vs. TAIR 10
Match: AT1G50660.1 (unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G20350.1); Has 21445 Blast hits to 15134 proteins in 1325 species: Archae - 461; Bacteria - 2309; Metazoa - 11052; Fungi - 1737; Plants - 1035; Viruses - 42; Other Eukaryotes - 4809 (source: NCBI BLink). )

HSP 1 Score: 162.2 bits (409), Expect = 1.3e-39
Identity = 109/311 (35.05%), Positives = 188/311 (60.45%), Query Frame = 0

Query: 162 CLTTSKELLRVLNHVWGHEDHDQQRPSSTLSLITALKSELDRAKTRVDHLIKDQNFHGDE 221
           CL T +E+ ++ +++   +  DQQ   + +SL+++L++EL+ A  R++ L  ++  H  +
Sbjct: 212 CLDTMEEVHQIYSNM---KRIDQQ--VNAVSLVSSLEAELEEAHARIEDLESEKRSHKKK 271

Query: 222 IEQLMKRFAEEKAGWKYRERARVRSAITSMADEVEVEKKLRRQAERLNKRIAKELADAKV 281
           +EQ +++ +EE+A W+ RE  +VR+ I  M  ++  EKK R++ E +N ++  ELAD+K+
Sbjct: 272 LEQFLRKVSEERAAWRSREHEKVRAIIDDMKTDMNREKKTRQRLEIVNHKLVNELADSKL 331

Query: 282 SLSKAMKDLQRERRAKEIFEEICDELAKGIGEDRAQFEELKKESAKVREEVEKEREMLQL 341
           ++ + M+D ++ER+A+E+ EE+CDELAK IGED+A+ E LK+ES  +REEV+ ER MLQ+
Sbjct: 332 AVKRYMQDYEKERKARELIEEVCDELAKEIGEDKAEIEALKRESMSLREEVDDERRMLQM 391

Query: 342 ADVLREERVQMKLSEAKYQFEEKNAAVEKLKDELEAYLITQ---------------QEDY 401
           A+V REERVQMKL +AK   EE+ + + KL  +LE++L ++               +E  
Sbjct: 392 AEVWREERVQMKLIDAKVALEERYSQMNKLVGDLESFLRSRDIVTDVKEVREAELLRETA 451

Query: 402 CCNKFEKIKE----------LEAYLKKINFGSCQEQNMEVGEEDDCSEEDDSDLHSIELN 448
                ++IKE          + A  +++N G   ++ ME           DS +H++ L+
Sbjct: 452 ASVNIQEIKEFTYVPANPDDIYAVFEEMNLGEAHDREMEKSVAYS-PISHDSKVHTVSLD 511

BLAST of Tan0017511 vs. TAIR 10
Match: AT5G22310.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G11590.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 161.8 bits (408), Expect = 1.7e-39
Identity = 126/306 (41.18%), Positives = 184/306 (60.13%), Query Frame = 0

Query: 151 TLKTRLKEVSNCLTTSKELLRVLNHVWGHEDHDQQRPSSTLSLITALKSELDRAKTRVDH 210
           ++KTR K VS+ LTTSKEL++VL  + G    D +  S+   LI+AL  ELDRA++ + H
Sbjct: 174 SVKTRFKNVSDGLTTSKELVKVLKRI-GELGDDHKTASN--RLISALLCELDRARSSLKH 233

Query: 211 LIKDQNFHGDEIEQLMKRFAEEKAGWKYRERARVRSAITSMADEVEVEKKLRRQAERLNK 270
           L+ +     DE E       EEK           R  I S+ +E  VE+KLRR+ E++N+
Sbjct: 234 LMSEL----DEEE-------EEK-----------RRLIESLQEEAMVERKLRRRTEKMNR 293

Query: 271 RIAKELADAKVSLSKAMKDLQRERRAKEIFEEICDELAKGIGEDRAQFEELKKESAKVRE 330
           R+ +EL +AK +  K  ++++RE+RAK++ EE+CDEL KGIG+D              ++
Sbjct: 294 RLGRELTEAKETERKMKEEMKREKRAKDVLEEVCDELTKGIGDD--------------KK 353

Query: 331 EVEKEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVEKLKDELEAYLITQQEDYCCNK 390
           E+EKEREM+ +ADVLREERVQMKL+EAK++FE+K AAVE+LK EL   L  ++       
Sbjct: 354 EMEKEREMMHIADVLREERVQMKLTEAKFEFEDKYAAVERLKKELRRVLDGEEG------ 413

Query: 391 FEKIKELEAYLKKINFGSCQEQNMEVGEEDDCSEEDDSDLHSIELNMDNNNKSYRWSFVH 450
            +   E+   L+ I+ GS        G +DD    ++SDL SIELNM++ +K   W +V 
Sbjct: 414 -KGSSEIRRILEVID-GS--------GSDDD----EESDLKSIELNMESGSK---WGYVD 417

Query: 451 GSKSKR 457
             K +R
Sbjct: 474 SLKDRR 417

BLAST of Tan0017511 vs. TAIR 10
Match: AT3G20350.1 (unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: cotyledon; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G50660.1); Has 15095 Blast hits to 11224 proteins in 1051 species: Archae - 223; Bacteria - 1586; Metazoa - 7000; Fungi - 1255; Plants - 746; Viruses - 40; Other Eukaryotes - 4245 (source: NCBI BLink). )

HSP 1 Score: 151.0 bits (380), Expect = 3.1e-36
Identity = 88/219 (40.18%), Positives = 144/219 (65.75%), Query Frame = 0

Query: 162 CLTTSKELLRVLNHV-WGHEDHDQQRPSSTLSLITALKSELDRAKTRVDHLIKDQNFHGD 221
           CL T  ++ ++  +V W ++        + +SL ++++ +L  A+  +  L  ++     
Sbjct: 189 CLDTRDDVHQIYTNVKWNNQQ------VNDVSLASSIELKLQEARACIKDLESEKRSQKK 248

Query: 222 EIEQLMKRFAEEKAGWKYRERARVRSAITSMADEVEVEKKLRRQAERLNKRIAKELADAK 281
           ++EQ +K+ +EE+A W+ RE  +VR+ I  M  ++  EKK R++ E +N ++  ELAD+K
Sbjct: 249 KLEQFLKKVSEERAAWRSREHEKVRAIIDDMKADMNQEKKTRQRLEIVNSKLVNELADSK 308

Query: 282 VSLSKAMKDLQRERRAKEIFEEICDELAKGIGEDRAQFEELKKESAKVREEVEKEREMLQ 341
           +++ + M D Q+ER+A+E+ EE+CDELAK I ED+A+ E LK ES  +REEV+ ER MLQ
Sbjct: 309 LAVKRYMHDYQQERKARELIEEVCDELAKEIEEDKAEIEALKSESMNLREEVDDERRMLQ 368

Query: 342 LADVLREERVQMKLSEAKYQFEEKNAAVEKLKDELEAYL 380
           +A+V REERVQMKL +AK   EEK + + KL  ++EA+L
Sbjct: 369 MAEVWREERVQMKLIDAKVTLEEKYSQMNKLVGDMEAFL 401

BLAST of Tan0017511 vs. TAIR 10
Match: AT5G41620.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast, plasma membrane; EXPRESSED IN: 9 plant structures; EXPRESSED DURING: 6 growth stages; BEST Arabidopsis thaliana protein match is: intracellular protein transport protein USO1-related (TAIR:AT1G64180.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 103.6 bits (257), Expect = 5.6e-22
Identity = 81/268 (30.22%), Positives = 150/268 (55.97%), Query Frame = 0

Query: 163 LTTSKELLRVLNHVWGHEDHDQQRPSSTLSLITALKSELDRAKTRVDHLIKDQNFHGDEI 222
           L TS ELL+VLN +W  E    ++  S +SLI ALK+E+  ++ R+  L++ Q     E+
Sbjct: 193 LKTSTELLKVLNRIWSLE----EQHVSNISLIKALKTEVAHSRVRIKELLRYQQADRHEL 252

Query: 223 EQLMKRFAEEKAGWKYRERARVRSAITSMADEVEVEKKLRRQAERLNKRIAKELADAKVS 282
           + ++K+ AEEK   K +E  R+ SA+ S+   +E E+KLR+++E L++++A+EL++ K S
Sbjct: 253 DSVVKQLAEEKLLSKNKEVERMSSAVQSVRKALEDERKLRKRSESLHRKMARELSEVKSS 312

Query: 283 LSKAMKDLQRERRAKEIFEEICDELAKGIGEDRAQFEELKKESAKV--REEVEKEREMLQ 342
           LS  +K+L+R  ++ ++ E +CDE AKGI     +   LKK++           ++ +L 
Sbjct: 313 LSNCVKELERGSKSNKMMELLCDEFAKGIKSYEEEIHGLKKKNLDKDWAGRGGGDQLVLH 372

Query: 343 LADVLREERVQMKLSEAKYQFEEKNAAVEKLKDELEAYLITQQEDYCCNKFEKIKELEAY 402
           +A+   +ER+QM+L        +  + ++KL+ E+E +L  ++ +   N+          
Sbjct: 373 IAESWLDERMQMRLEGGDTLNGKNRSVLDKLEVEIETFLQEKRNEIPRNRRNS------- 432

Query: 403 LKKINFGSCQEQNMEVGEEDDCSEEDDS 429
           L+ + F +      +V  E+D    D +
Sbjct: 433 LESVPFNTLSAPPRDVDCEEDSGGSDSN 449

The following BLAST results are available for this feature:
Match Name  E-value  Identity (%)  Description
Q66GQ2      7.9e-21  30.22         Uncharacterized protein At5g41620 OS=Arabidopsis thaliana OX=3702 GN=At5g41620 P...
F4I878      5.5e-06  31.30         Protein BRANCHLESS TRICHOME OS=Arabidopsis thaliana OX=3702 GN=BLT PE=1 SV=1

Match Name      E-value   Identity (%)  Description
KAG6605972.1    2.6e-207  71.64         hypothetical protein SDJN03_03289, partial [Cucurbita argyrosperma subsp. sorori...
XP_022995046.1  5.9e-207  71.81         uncharacterized protein LOC111490718 isoform X1 [Cucurbita maxima] >XP_022995048...
XP_022995049.1  5.9e-207  71.81         uncharacterized protein LOC111490718 isoform X2 [Cucurbita maxima] >XP_022995050...
KAG7035923.1    1.3e-206  71.48         hypothetical protein SDJN02_02723 [Cucurbita argyrosperma subsp. argyrosperma]
XP_023534124.1  2.2e-206  71.48         uncharacterized protein LOC111795778 isoform X2 [Cucurbita pepo subsp. pepo] >XP...

Match Name  E-value   Identity (%)  Description
A0A6J1K0X3  2.9e-207  71.81         uncharacterized protein LOC111490718 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1K315  2.9e-207  71.81         uncharacterized protein LOC111490718 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1H001  7.0e-206  71.14         uncharacterized protein LOC111459204 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1H037  7.0e-206  71.14         uncharacterized protein LOC111459204 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1DGK1  4.8e-170  65.51         uncharacterized protein LOC111020667 isoform X2 OS=Momordica charantia OX=3673 G...

Match Name   E-value  Identity (%)  Description
AT3G11590.1  2.2e-74  42.23         unknown protein; LOCATED IN: plasma membrane; EXPRESSED IN: 22 plant structures;...
AT1G50660.1  1.3e-39  35.05         unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplas...
AT5G22310.1  1.7e-39  41.18         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT3G20350.1  3.1e-36  40.18         unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: plasma mem...
AT5G41620.1  5.6e-22  30.22         FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                   Source       Source Term     Source Description    Alignment
None       No IPR available                  COILS        Coil            Coil                  coord: 251..293
None       No IPR available                  COILS        Coil            Coil                  coord: 311..338
None       No IPR available                  COILS        Coil            Coil                  coord: 194..214
None       No IPR available                  COILS        Coil            Coil                  coord: 354..381
None       No IPR available                  MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 1..19
None       No IPR available                  MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 1..60
None       No IPR available                  MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 35..57
None       No IPR available                  MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 80..110
None       No IPR available                  MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 80..97
None       No IPR available                  PANTHER      PTHR31071:SF16  OS04G0382800 PROTEIN  coord: 56..564
IPR043424  Protein BRANCHLESS TRICHOME-like  PANTHER      PTHR31071       GB|AAF24581.1         coord: 56..564
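The alignment column gives each predicted feature as a start..end range on the protein. As a rough illustration, the Python sketch below turns such ranges into segment lengths, assuming the coordinates are 1-based and inclusive (an assumption, since the record does not state the convention). The helper name `segment_lengths` is hypothetical.

```python
import re

# Extracts the start and end positions from a "coord: start..end" entry.
COORD = re.compile(r"coord: (\d+)\.\.(\d+)")

def segment_lengths(entries):
    """Map each 'coord: start..end' string to its residue count.

    Assumes 1-based, inclusive coordinates, so length = end - start + 1.
    """
    lengths = []
    for entry in entries:
        m = COORD.search(entry)
        if m is None:
            raise ValueError(f"no coordinate range in {entry!r}")
        start, end = int(m.group(1)), int(m.group(2))
        lengths.append(end - start + 1)
    return lengths

# The four COILS predictions listed for this protein.
coils = ["coord: 251..293", "coord: 311..338", "coord: 194..214", "coord: 354..381"]
print(segment_lengths(coils))       # [43, 28, 21, 28]
print(sum(segment_lengths(coils)))  # 120
```

Under that assumption the four coiled-coil predictions cover 120 residues in total, all within the PANTHER match at 56..564.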

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0017511.1  Tan0017511.1  mRNA