Tan0002169 (gene) Snake gourd v1

Overview
NameTan0002169
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionSNF2 domain protein
LocationLG09: 63988122 .. 63991529 (-)
RNA-Seq ExpressionTan0002169
SyntenyTan0002169
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TGAATTCATACAGAAAAAACTTTGCTTTCTTCGAGTCCCAATTCTTTTCATAGTAATTGATTCAATTCGATAAGTTTCAGCGAGTCTCATGGCTTTGGCTCTTGTTGAATCGATGGATTCTATGAACACTTCAAACCAGAATCCTTTTCTTGGAGAAAATTATGAGTGTACTCTTGAGCAATCAATCCAGAACGTTTTAGCTGAAATTCGCAAAGGAAATCTTGGTTCTTCTCATTTGACGGAACGATTCTATGAGTTGATTCAAGCTAGAGCTGACCCACCAATGGAAGCGATCTGGTTCTACTCCGCATTAACGTTTCGTAGCCGTAGCTCCACTACTAAGGGCGACTTTTTGGACCGAGTGGCAGCCATGAAAGTCTTGTTTCAGTTGGTGTGTTCTTGTTCGGCTCCTTGTGATTCTTCGAAGACCATTGCGTTGCTTGCTCCAGTGGTTTTTGAGGTGTATAAGTTGATTGGTGACATGCTAAAAAAGGATTTGGCCTTGAAAAGGGAGAAGAAAGCGATGAGAGAGGTTAAATCTTTAGTTGAAGTGATTATTGGCTTTATAAATCTGAGTTCTTGCAAGGATTCGGACCAGAATTGTGAATCTCTTGATTTCAATTTGATTACTCCTTTTGTGGATTTAATTAGTATTTGGACGCACCCAAATGAGGGATTGGATCAGTTCTTACCGCTCGTGAACAGTGAGGTTTGTGGAGAGTTTAGTTCAGGCGTCTGTGTTGTTCGTCGCTTGGCTGGAGTTGTAATTGCTGAGGCATTTCTGATGAAACTGTGCTTGGACTTCAACAGTGGGCGTTCGAGGCAAGGTTTGGAGAAAGAGCTAAGGATATGGACTGTTGGTTCTATAACTCGGATTAGGAACTTCTACTTTTTTGGTTAGTGATACAAATTCTCTTCCGATTATTGTTTTGATTCTGTATAGTGTTGCAGTATTTATCAGATTATATTAGATAGTTTATGCTTCTTATGATATATCTCTTTATCTTACTGATGTTGGATGCAAGAACATCTTATATGCTGCCAGCTTGTTTGTTATACTATATTTTATGAAGTTCCGAGGGGTTTCTTATCGAGTTCTCTTTGGCATTTATTAACCAAATGAATAGCTGCGTACTTCTTGCACTAGCTGGGCTTAAGTGTGAAGCATTTAAAAAAAAGCGAGGGCTTTTTTAGGGAAGCACCTGATAGGGGAAAAATATTATTATGAGGAATGAGATAAATGAGAAATGATGTAGATTTTAGTAAGAGATTACAGATATAGACCTTTTATGCAATATATTTGTTGTTTCTTACAAAAAACGAAAAGGGAAAAGGAAACGAATGTTGGAAAAAAGACCAAATGTAAGCCTCATCGAGGGCGCCTTGCCTTAGGTATCCCCTAAGCTTACTTTGAAAACAATGATCATCCATTGCAAATATTGCTGCTTGTCCTTTAGTATGTTTGTTTCATGTATGTAGTTTTTTTAATTTAGATTCCAGTAATGTGCTTACATCTTGTTCTATCTTTGTCTCAACAGAAACTCTTGTAAGATTCCTGCTGGAGGCGACTTTACCTGTAATGTCTCTGTTGGTAAGTTTTATTTTTATCAGTGCTTATGGGTGTGTGCCTGGTCTTGGTTTGCATGACACCTCTATATGATATGCATACAAGAGTTCTCTAACTTTCATCTTTGACGGATTTCCTGATAAACCACATTTTTGTGTGTGTTAAATACAGGATAATCTTGTCTATCCTATTCACATTATTAGAGCAATGATTTCATGCCTGAGGAAAACACAGGTGTCATGTAACGTTTCAATTTTTCCGTGCTTGGAATTGAGAAAATCATTTGGAAAAACGTCTGGTTTTCTTGGTTTTTTCTGCATGCCCTAAGATTGTGGTGTACATTATTAATATGCTTCAGATACATGGATACCTATAATTGATGAAAATAGATAGTGAGATGAAGTCTTTCTGATTGGAAAATGAATTGATTGTTACCAATGTGTGTATGCACATGGTCATCTGCTTGTGATCATTTGATGTGATTGTAAACCTTTTACAGCAGAGCACTGAAGATGAAGCTCTGTTAAGGAAGGTTCTATATGATGCTCTAATATTGGTTGATTATTCATTTTTGGATCCTGAGAAAGCCATTAACTTACCTGCCGAACATGTGGCATTGCTGGCTGTTAAGAGATTGATTCTTACTCATGAGGCCATAGAGTTTTACAGGTATGTAAAGAACTGAACAGATCTGGAACTTTAATCGAACCTTCCTCCAAAGTCTTATTGTTTATTTGTTATATTTAGGGAGCATGGAGATCAGAGCAGAGCCGTCTCTTATCTAAATGCCTTCTCAAGTTCTCTTGTATCTTCTCAAATTATTAGATGGGTCAGAAGCCAAATTCTTAGCAATGGAAATGTAAATCGACCCAATGGATCGTCGCCTAAAATACTTCTCAGTAAGTTGATAATAATTTTTATGTAAAAGTATACAAATAAACGGTATAGTTAATGATCTAATTTTCGGTTCATTTTCTTCCAGAGTGGCTTCTCAAGACTGAAGATCAAGGTGTGAGAGTATTTGACAATACCATTTCCGATCGTCGAACCAAATTAGTTCTTGATGTTTCTAAATCAGTCTCGGGGCATCCCACATTGGAGGGAGATAAAGTAGATGACGATCTTTTGTTTTACATTGACAAGCAAGGGGAAAATGAAAATGGAAGTGAGGAGGGCAAAACGATGGATGAATCAGTAAATGAAGCGTTTGTAACCGTTGCTCGTACCATGTCAACGACAGAAAACGCTTCAGGAAAGCAGAAGCGACGGAGAAAGGCTGAAAGAAAGAATAAGAAGATCAAATTTATAAAGTACGATCTCATCCCGAACTCTGATGCTGCCCAATTGAGGTCAGCTGTTGATAATAATGACTCGAACAGCGAGGGCGAAGTTCATAATCCACACCTGGACGAAGATTCTGAGATGGAAGAGTAATATGATCTAAAGAGAAAGGTCGTATTTGGTTGTCACAACACAACTTCAGATTCAGTTTGTTATCTGTTGATGACCTTTCTGGAGGAGCTGGCCAAGATACAAGTGCGAGCTCGTGAGGTTCGAGTAGGCAGGTCGATATGCCAAGAATGTTGGATTTTCAACACAAGTGCAACAATTCAACCAAGGTTGCAAATGTAAGTTGATCTATGATCAAATCAAAGTAAGTGAAAAAAGGAAAAGACATTTTTGGTACATTATGATACTGCTGTTTTTCATGAAGATGAATCAAATATTTAACTGCACTTGATGCGCTGTATTTTATTATTAATGCTTATTTTGACAACTTGACAAAACCATTGTTCAAACTTGTTCTCAACATGAGGAAAAACTAAAAGGAAAAAAACGAC

mRNA sequence

TGAATTCATACAGAAAAAACTTTGCTTTCTTCGAGTCCCAATTCTTTTCATAGTAATTGATTCAATTCGATAAGTTTCAGCGAGTCTCATGGCTTTGGCTCTTGTTGAATCGATGGATTCTATGAACACTTCAAACCAGAATCCTTTTCTTGGAGAAAATTATGAGTGTACTCTTGAGCAATCAATCCAGAACGTTTTAGCTGAAATTCGCAAAGGAAATCTTGGTTCTTCTCATTTGACGGAACGATTCTATGAGTTGATTCAAGCTAGAGCTGACCCACCAATGGAAGCGATCTGGTTCTACTCCGCATTAACGTTTCGTAGCCGTAGCTCCACTACTAAGGGCGACTTTTTGGACCGAGTGGCAGCCATGAAAGTCTTGTTTCAGTTGGTGTGTTCTTGTTCGGCTCCTTGTGATTCTTCGAAGACCATTGCGTTGCTTGCTCCAGTGGTTTTTGAGGTGTATAAGTTGATTGGTGACATGCTAAAAAAGGATTTGGCCTTGAAAAGGGAGAAGAAAGCGATGAGAGAGGTTAAATCTTTAGTTGAAGTGATTATTGGCTTTATAAATCTGAGTTCTTGCAAGGATTCGGACCAGAATTGTGAATCTCTTGATTTCAATTTGATTACTCCTTTTGTGGATTTAATTAGTATTTGGACGCACCCAAATGAGGGATTGGATCAGTTCTTACCGCTCGTGAACAGTGAGGTTTGTGGAGAGTTTAGTTCAGGCGTCTGTGTTGTTCGTCGCTTGGCTGGAGTTGTAATTGCTGAGGCATTTCTGATGAAACTGTGCTTGGACTTCAACAGTGGGCGTTCGAGGCAAGGTTTGGAGAAAGAGCTAAGGATATGGACTGTTGGTTCTATAACTCGGATTAGGAACTTCTACTTTTTTGAAACTCTTGTAAGATTCCTGCTGGAGGCGACTTTACCTGTAATGTCTCTGTTGCAGAGCACTGAAGATGAAGCTCTGTTAAGGAAGGTTCTATATGATGCTCTAATATTGGTTGATTATTCATTTTTGGATCCTGAGAAAGCCATTAACTTACCTGCCGAACATGTGGCATTGCTGGCTGTTAAGAGATTGATTCTTACTCATGAGGCCATAGAGTTTTACAGGGAGCATGGAGATCAGAGCAGAGCCGTCTCTTATCTAAATGCCTTCTCAAGTTCTCTTGTATCTTCTCAAATTATTAGATGGGTCAGAAGCCAAATTCTTAGCAATGGAAATGTAAATCGACCCAATGGATCGTCGCCTAAAATACTTCTCAAGTGGCTTCTCAAGACTGAAGATCAAGGTGTGAGAGTATTTGACAATACCATTTCCGATCGTCGAACCAAATTAGTTCTTGATGTTTCTAAATCAGTCTCGGGGCATCCCACATTGGAGGGAGATAAAGTAGATGACGATCTTTTGTTTTACATTGACAAGCAAGGGGAAAATGAAAATGGAAGTGAGGAGGGCAAAACGATGGATGAATCAGTAAATGAAGCGTTTGTAACCGTTGCTCGTACCATGTCAACGACAGAAAACGCTTCAGGAAAGCAGAAGCGACGGAGAAAGGCTGAAAGAAAGAATAAGAAGATCAAATTTATAAAGTACGATCTCATCCCGAACTCTGATGCTGCCCAATTGAGGTCAGCTGTTGATAATAATGACTCGAACAGCGAGGGCGAAGTTCATAATCCACACCTGGACGAAGATTCTGAGATGGAAGAGTAATATGATCTAAAGAGAAAGGTCGTATTTGGTTGTCACAACACAACTTCAGATTCAGTTTGTTATCTGTTGATGACCTTTCTGGAGGAGCTGGCCAAGATACAAGTGCGAGCTCGTGAGGTTCGAGTAGGCAGGTCGATATGCCAAGAATGTTGGATTTTCAACACAAGTGCAACAATTCAACCAAGGTTGCAAATGTAAGTTGATCTATGATCAAATCAAAGTAAGTGAAAAAAGGAAAAGACATTTTTGGTACATTATGATACTGCTGTTTTTCATGAAGATGAATCAAATATTTAACTGCACTTGATGCGCTGTATTTTATTATTAATGCTTATTTTGACAACTTGACAAAACCATTGTTCAAACTTGTTCTCAACATGAGGAAAAACTAAAAGGAAAAAAACGAC

Coding sequence (CDS)

ATGGCTTTGGCTCTTGTTGAATCGATGGATTCTATGAACACTTCAAACCAGAATCCTTTTCTTGGAGAAAATTATGAGTGTACTCTTGAGCAATCAATCCAGAACGTTTTAGCTGAAATTCGCAAAGGAAATCTTGGTTCTTCTCATTTGACGGAACGATTCTATGAGTTGATTCAAGCTAGAGCTGACCCACCAATGGAAGCGATCTGGTTCTACTCCGCATTAACGTTTCGTAGCCGTAGCTCCACTACTAAGGGCGACTTTTTGGACCGAGTGGCAGCCATGAAAGTCTTGTTTCAGTTGGTGTGTTCTTGTTCGGCTCCTTGTGATTCTTCGAAGACCATTGCGTTGCTTGCTCCAGTGGTTTTTGAGGTGTATAAGTTGATTGGTGACATGCTAAAAAAGGATTTGGCCTTGAAAAGGGAGAAGAAAGCGATGAGAGAGGTTAAATCTTTAGTTGAAGTGATTATTGGCTTTATAAATCTGAGTTCTTGCAAGGATTCGGACCAGAATTGTGAATCTCTTGATTTCAATTTGATTACTCCTTTTGTGGATTTAATTAGTATTTGGACGCACCCAAATGAGGGATTGGATCAGTTCTTACCGCTCGTGAACAGTGAGGTTTGTGGAGAGTTTAGTTCAGGCGTCTGTGTTGTTCGTCGCTTGGCTGGAGTTGTAATTGCTGAGGCATTTCTGATGAAACTGTGCTTGGACTTCAACAGTGGGCGTTCGAGGCAAGGTTTGGAGAAAGAGCTAAGGATATGGACTGTTGGTTCTATAACTCGGATTAGGAACTTCTACTTTTTTGAAACTCTTGTAAGATTCCTGCTGGAGGCGACTTTACCTGTAATGTCTCTGTTGCAGAGCACTGAAGATGAAGCTCTGTTAAGGAAGGTTCTATATGATGCTCTAATATTGGTTGATTATTCATTTTTGGATCCTGAGAAAGCCATTAACTTACCTGCCGAACATGTGGCATTGCTGGCTGTTAAGAGATTGATTCTTACTCATGAGGCCATAGAGTTTTACAGGGAGCATGGAGATCAGAGCAGAGCCGTCTCTTATCTAAATGCCTTCTCAAGTTCTCTTGTATCTTCTCAAATTATTAGATGGGTCAGAAGCCAAATTCTTAGCAATGGAAATGTAAATCGACCCAATGGATCGTCGCCTAAAATACTTCTCAAGTGGCTTCTCAAGACTGAAGATCAAGGTGTGAGAGTATTTGACAATACCATTTCCGATCGTCGAACCAAATTAGTTCTTGATGTTTCTAAATCAGTCTCGGGGCATCCCACATTGGAGGGAGATAAAGTAGATGACGATCTTTTGTTTTACATTGACAAGCAAGGGGAAAATGAAAATGGAAGTGAGGAGGGCAAAACGATGGATGAATCAGTAAATGAAGCGTTTGTAACCGTTGCTCGTACCATGTCAACGACAGAAAACGCTTCAGGAAAGCAGAAGCGACGGAGAAAGGCTGAAAGAAAGAATAAGAAGATCAAATTTATAAAGTACGATCTCATCCCGAACTCTGATGCTGCCCAATTGAGGTCAGCTGTTGATAATAATGACTCGAACAGCGAGGGCGAAGTTCATAATCCACACCTGGACGAAGATTCTGAGATGGAAGAGTAA

Protein sequence

MALALVESMDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQARADPPMEAIWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQLVCSCSAPCDSSKTIALLAPVVFEVYKLIGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLITPFVDLISIWTHPNEGLDQFLPLVNSEVCGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFNSGRSRQGLEKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVLYDALILVDYSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFSSSLVSSQIIRWVRSQILSNGNVNRPNGSSPKILLKWLLKTEDQGVRVFDNTISDRRTKLVLDVSKSVSGHPTLEGDKVDDDLLFYIDKQGENENGSEEGKTMDESVNEAFVTVARTMSTTENASGKQKRRRKAERKNKKIKFIKYDLIPNSDAAQLRSAVDNNDSNSEGEVHNPHLDEDSEMEE
Homology
BLAST of Tan0002169 vs. NCBI nr
Match: XP_022147658.1 (uncharacterized protein LOC111016527 [Momordica charantia])

HSP 1 Score: 850.9 bits (2197), Expect = 6.1e-243
Identity = 449/544 (82.54%), Postives = 480/544 (88.24%), Query Frame = 0

Query: 1   MALALVESMDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQA 60
           MALALVESMDSMN  NQNPFLGENYE TL+QSI+NVLAEIR+GNLG  H TE FY+L+QA
Sbjct: 1   MALALVESMDSMNPPNQNPFLGENYELTLKQSIKNVLAEIREGNLGFCHFTEDFYKLMQA 60

Query: 61  RADPPMEAIWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQLVCSCSAPCDSSKTIALLAP 120
           R DPPME+IWFYSAL FRS SS  KGDFLDR+AAMKVLFQLVCSCSAPC SSKT+A LAP
Sbjct: 61  RVDPPMESIWFYSALMFRSHSS-AKGDFLDRLAAMKVLFQLVCSCSAPCGSSKTVASLAP 120

Query: 121 VVFEVYKLIGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLI 180
           VVFEVYKLI DML KDLA KREKKAMREVK+LVE I+GFINLSSCK SDQN E LDFNLI
Sbjct: 121 VVFEVYKLIADMLGKDLASKREKKAMREVKALVEAILGFINLSSCKVSDQNVEQLDFNLI 180

Query: 181 TPFVDLISIWTHPNEGLDQFLPLVNSEVCGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFN 240
           TPF+DLISIWTHPNEGLDQFLPLV+SEV G F SGVC VR LAGVVIAEAFLMKLCLDF+
Sbjct: 181 TPFMDLISIWTHPNEGLDQFLPLVSSEVRGGFCSGVCDVRHLAGVVIAEAFLMKLCLDFH 240

Query: 241 SGRSRQGLEKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVL 300
           SGRSRQ LEK+LR+W VGSIT IRN Y FETL+RFLL  TLPVMSLL STEDE LLRKVL
Sbjct: 241 SGRSRQELEKDLRLWAVGSITGIRNCYLFETLIRFLLGVTLPVMSLL-STEDELLLRKVL 300

Query: 301 YDALILVDYSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFS 360
           YDALILVDYSFL+P KAI+L AEHVA LAVKRLILTH+AIEF+REHGDQSRA+SYLNAFS
Sbjct: 301 YDALILVDYSFLNPVKAIDLHAEHVAFLAVKRLILTHDAIEFFREHGDQSRAISYLNAFS 360

Query: 361 SSLVSSQIIRWVRSQILSNGNVNRPNGSSPKILLKWLLKTEDQGVRVFDNTISDRRTKLV 420
           SS V SQ+IRWVRSQI SN NVNRPNGSSPKILL+WL K EDQGVRVFDNTISD R KLV
Sbjct: 361 SSPVPSQMIRWVRSQIPSNENVNRPNGSSPKILLEWLFKAEDQGVRVFDNTISDHRAKLV 420

Query: 421 LDVSKSVSGHPTLEGDKVDDDLLFYIDKQGENENGSEEGKTMDESVNEAFVTVARTMSTT 480
           LD+SKS S HP LEG+KVDD LLFY+DKQGE EN SEE K MDESVN A VTVARTMS  
Sbjct: 421 LDISKSDSRHPKLEGNKVDDGLLFYVDKQGEKENESEEDKAMDESVNAALVTVARTMSMA 480

Query: 481 ENASGKQKRRRKAERKNKKIKFIKYDLIPNSDAAQLRSAVDNNDSNSEGEVHNPHLDEDS 540
           EN SGK+KR+RK+ERKN KIKF+KYDL PN DAAQLRSAVDNND NSEGEVHNPH DEDS
Sbjct: 481 ENGSGKKKRQRKSERKN-KIKFVKYDLFPNPDAAQLRSAVDNNDPNSEGEVHNPHKDEDS 540

Query: 541 EMEE 545
           +MEE
Sbjct: 541 DMEE 541

BLAST of Tan0002169 vs. NCBI nr
Match: KAG7023645.1 (hypothetical protein SDJN02_14671, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 840.5 bits (2170), Expect = 8.2e-240
Identity = 436/540 (80.74%), Postives = 481/540 (89.07%), Query Frame = 0

Query: 6   VESMDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQARADPP 65
           VESM+SMN+S Q+PFLGENYE TLEQSIQNVLAEIR+GNLG S   E FYELIQAR DPP
Sbjct: 55  VESMESMNSSKQSPFLGENYEFTLEQSIQNVLAEIREGNLGFSQFMEGFYELIQARDDPP 114

Query: 66  MEAIWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQLVCSCSAPCDSSKTIALLAPVVFEV 125
           +E+IWFYSALTFRSR ST  GDFLDRVA MK+LFQ  CSCSAPC SSKTIALL+PVV+EV
Sbjct: 115 LESIWFYSALTFRSRISTMNGDFLDRVATMKILFQTTCSCSAPCGSSKTIALLSPVVYEV 174

Query: 126 YKLIGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLITPFVD 185
           YKLI DML KDL+ KREKKAMREVKSLVE ++GFINLSSCKDSDQN ESLDFNL+TPFVD
Sbjct: 175 YKLISDMLGKDLSSKREKKAMREVKSLVETMLGFINLSSCKDSDQNGESLDFNLVTPFVD 234

Query: 186 LISIWTHPNEGLDQFLPLVNSEVCGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFNSGRSR 245
           LISIW + NEGLDQFLPLV+SEV GEFSSGVC +RRLAGVVIAE FLMKLCLD NSGRSR
Sbjct: 235 LISIWANSNEGLDQFLPLVSSEVRGEFSSGVCDIRRLAGVVIAETFLMKLCLDINSGRSR 294

Query: 246 QGLEKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVLYDALI 305
           Q LE +LRIW VGSITRI+NFYFFETLVRFLLEATLPVMSLL STEDEALLRK+LYDALI
Sbjct: 295 QDLENDLRIWAVGSITRIKNFYFFETLVRFLLEATLPVMSLL-STEDEALLRKILYDALI 354

Query: 306 LVDYSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFSSSLVS 365
           LVDYSFL+ EKAINLPA+HVA LAVKRLILTHEAIEFYREHGDQ+RA+SYLNAFS+SLVS
Sbjct: 355 LVDYSFLNDEKAINLPADHVAFLAVKRLILTHEAIEFYREHGDQNRAISYLNAFSTSLVS 414

Query: 366 SQIIRWVRSQILSNGNVNRPNGSSPKILLKWLLKTEDQGVRVFDNTISDRRTKLVLDVSK 425
           SQIIRWV+SQI SN N N P GSSPKI L+WLLK ED GVRVFD+TIS+RR KLVLD SK
Sbjct: 415 SQIIRWVKSQIPSNENFNHPKGSSPKIFLEWLLKAEDHGVRVFDSTISNRRAKLVLDTSK 474

Query: 426 SVSGHPTLEGDKVDDDLLFYIDKQGENENGS-EEGKTMDESVNEAFVTVARTMSTTENAS 485
           SVSGHPT EG+ VDD+LLFYIDKQGENENGS EE + MDESVN A V+ A TMSTT+N S
Sbjct: 475 SVSGHPTSEGNSVDDELLFYIDKQGENENGSEEEDRVMDESVNAALVSAAHTMSTTQNGS 534

Query: 486 GKQKRRRKAERKNKKIKFIKYDLIPNSDAAQLRSAVDNNDSNSEGEVHNPHLDEDSEMEE 545
           GK+KR+R A +K KKIKF+KYDL+PNSD  +LRSAV++ND++SEGEVHNPH DEDS+ +E
Sbjct: 535 GKKKRQRMA-KKQKKIKFMKYDLVPNSDVTELRSAVEDNDTDSEGEVHNPHSDEDSDTKE 592

BLAST of Tan0002169 vs. NCBI nr
Match: XP_038880003.1 (uncharacterized protein LOC120071696 [Benincasa hispida])

HSP 1 Score: 836.6 bits (2160), Expect = 1.2e-238
Identity = 447/545 (82.02%), Postives = 485/545 (88.99%), Query Frame = 0

Query: 1   MALALVESMDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQA 60
           MALALVESMDSMN   +NPFLGENYE TL QSIQNV+AEIRKGN G S  TE FYELIQA
Sbjct: 1   MALALVESMDSMNPLKKNPFLGENYEFTLAQSIQNVIAEIRKGNSGFSQFTEGFYELIQA 60

Query: 61  RADPPMEAIWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQLVCSCSAPCDSSKTIALLAP 120
           RADPP+E+IWFYSALTFRSR    KGDFL+RVAAMKVLFQLV SCSAPC SSKTI LL+P
Sbjct: 61  RADPPLESIWFYSALTFRSRGLNIKGDFLERVAAMKVLFQLVSSCSAPCGSSKTIPLLSP 120

Query: 121 VVFEVYKLIGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQN-CESLDFNL 180
           VV EVYKLI DML KDLA KREKKAMREVKSLVE I+GFINLSSCKDSD+N  ESLDFNL
Sbjct: 121 VVSEVYKLIVDMLGKDLASKREKKAMREVKSLVEAILGFINLSSCKDSDKNDDESLDFNL 180

Query: 181 ITPFVDLISIWTHPNEGLDQFLPLVNSEVCGEFSSGVCVVRRLAGVVIAEAFLMKLCLDF 240
           ITPFVDLIS+WTHPNEGLDQFLPLV+SEV GEFSSGVC VRRLAGVVIAE FLMKLCLDF
Sbjct: 181 ITPFVDLISVWTHPNEGLDQFLPLVSSEVRGEFSSGVCDVRRLAGVVIAETFLMKLCLDF 240

Query: 241 NSGRSRQGLEKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKV 300
           N+G SRQ LEK+LRIWTVGSITRIRNFYFFETLVRFLLEATLPV SLL STEDEALLRKV
Sbjct: 241 NTGHSRQDLEKDLRIWTVGSITRIRNFYFFETLVRFLLEATLPVTSLL-STEDEALLRKV 300

Query: 301 LYDALILVDYSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAF 360
           LYD+LILV+YSFL PEKAI+LPAEHVA LAVKRLILTHEAIEFYREHGDQSRA+SYLNAF
Sbjct: 301 LYDSLILVEYSFLKPEKAIDLPAEHVASLAVKRLILTHEAIEFYREHGDQSRAISYLNAF 360

Query: 361 SSSLVSSQIIRWVRSQILSNGNVNRPNGSSPKILLKWLLKTEDQGVRVFDNTISDRRTKL 420
           SSS VSSQIIRWV+SQ+ SN NV RPNGSSPKI+L+WLL+ EDQGVRVFD TIS+R  KL
Sbjct: 361 SSSFVSSQIIRWVKSQMPSNENVKRPNGSSPKIVLEWLLEAEDQGVRVFDKTISNRCAKL 420

Query: 421 VLDVSKSVSGHPTLEGDKVDDDLLFYIDKQGENENGSEEGKTMDESVNEAFVTVARTMST 480
           VLD SKSVS    LEGDKVDDDLLFYIDKQGE+ENGSE+  TMDESVN A V+VARTMST
Sbjct: 421 VLDTSKSVS----LEGDKVDDDLLFYIDKQGESENGSED-TTMDESVNAALVSVARTMST 480

Query: 481 TENASGKQKRRRKAERKNKKIKFIKYDLIPNSDAAQLRSAVDNNDSNSEGEVHNPHLDED 540
           TEN SGK KR+R  +RKN+KIKF+KYDL+P+SD  Q RS  DNND++SEG+VHNPH D+D
Sbjct: 481 TENGSGK-KRQRMVKRKNEKIKFVKYDLVPSSDTTQSRSPFDNNDTDSEGKVHNPHSDDD 538

Query: 541 SEMEE 545
           S+++E
Sbjct: 541 SDIKE 538

BLAST of Tan0002169 vs. NCBI nr
Match: KAG6589981.1 (hypothetical protein SDJN03_15404, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 833.6 bits (2152), Expect = 1.0e-237
Identity = 433/537 (80.63%), Postives = 477/537 (88.83%), Query Frame = 0

Query: 9   MDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQARADPPMEA 68
           M+SMN+S Q+PFLGENYE TLEQSIQNVLAEIR+GNLG S   E FYELIQAR DPP+E+
Sbjct: 1   MESMNSSKQSPFLGENYEFTLEQSIQNVLAEIREGNLGFSQFMEGFYELIQARDDPPLES 60

Query: 69  IWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQLVCSCSAPCDSSKTIALLAPVVFEVYKL 128
           IWFYSALTFRSR ST  GDFLDRVA MK+LFQ  CSCSAPC SSKTIALL+PVV+EVYKL
Sbjct: 61  IWFYSALTFRSRISTMNGDFLDRVATMKILFQTTCSCSAPCGSSKTIALLSPVVYEVYKL 120

Query: 129 IGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLITPFVDLIS 188
           I DML KDL+ KREKKAMREVKSLVE ++GFINLSSCKDSDQN ESLDFNL+TPFVDLIS
Sbjct: 121 ISDMLGKDLSSKREKKAMREVKSLVETMLGFINLSSCKDSDQNGESLDFNLVTPFVDLIS 180

Query: 189 IWTHPNEGLDQFLPLVNSEVCGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFNSGRSRQGL 248
           IW + NEGLDQFLPLV+SEV GEFSSGVC +RRLAGVVIAE FLMKLCLD NSGRSRQ L
Sbjct: 181 IWANSNEGLDQFLPLVSSEVRGEFSSGVCDIRRLAGVVIAETFLMKLCLDINSGRSRQDL 240

Query: 249 EKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVLYDALILVD 308
           E +LRIW VGSITRI+NFYFFETLVRFLLEATLPVMSLL STEDEALLRK+LYDALILVD
Sbjct: 241 ENDLRIWAVGSITRIKNFYFFETLVRFLLEATLPVMSLL-STEDEALLRKILYDALILVD 300

Query: 309 YSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFSSSLVSSQI 368
           YSFL+ EKAINLPA+HVA LAVKRLILTHEAIEFYREHGDQ+RA+SYLNAFS+SLVSSQI
Sbjct: 301 YSFLNDEKAINLPADHVAFLAVKRLILTHEAIEFYREHGDQNRAISYLNAFSTSLVSSQI 360

Query: 369 IRWVRSQILSNGNVNRPNGSSPKILLKWLLKTEDQGVRVFDNTISDRRTKLVLDVSKSVS 428
           IRWV+SQI SN NVN P GSSPKI L+WLLK ED GVRVFD+TIS+RR KLVLD SKSVS
Sbjct: 361 IRWVKSQIPSNENVNHPKGSSPKIFLEWLLKAEDHGVRVFDSTISNRRAKLVLDTSKSVS 420

Query: 429 GHPTLEGDKVDDDLLFYIDKQGENENGS-EEGKTMDESVNEAFVTVARTMSTTENASGKQ 488
           GHPT EG+ VDD+LLFYIDKQGENENGS EE + MDESVN A V+ A TMSTT+N S K+
Sbjct: 421 GHPTSEGNSVDDELLFYIDKQGENENGSEEEDRVMDESVNAALVSAAHTMSTTQNGSAKK 480

Query: 489 KRRRKAERKNKKIKFIKYDLIPNSDAAQLRSAVDNNDSNSEGEVHNPHLDEDSEMEE 545
           KR+R A +K KKIKF KYDL+PNSD  +LRSAV++ND++SEGEVHNPH DEDS+ +E
Sbjct: 481 KRQRMA-KKQKKIKFTKYDLVPNSDLTELRSAVEDNDTDSEGEVHNPHSDEDSDTKE 535

BLAST of Tan0002169 vs. NCBI nr
Match: XP_022987671.1 (uncharacterized protein LOC111485155 [Cucurbita maxima])

HSP 1 Score: 832.8 bits (2150), Expect = 1.7e-237
Identity = 434/537 (80.82%), Postives = 477/537 (88.83%), Query Frame = 0

Query: 9   MDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQARADPPMEA 68
           M+SMN+S Q+PFLGENYE TLEQSIQNVLAEIR+GNL  S   E FYELIQARADPP+E+
Sbjct: 1   MESMNSSKQSPFLGENYEFTLEQSIQNVLAEIREGNLVFSQFMEGFYELIQARADPPLES 60

Query: 69  IWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQLVCSCSAPCDSSKTIALLAPVVFEVYKL 128
           IWFYSALTFRSR ST  GDFLDRVA MK+LFQ  CSCSAPC SSKTIALLAPVV+EVYKL
Sbjct: 61  IWFYSALTFRSRISTMNGDFLDRVATMKILFQTTCSCSAPCGSSKTIALLAPVVYEVYKL 120

Query: 129 IGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLITPFVDLIS 188
           I DML KDL  KREKKAMREVKSLVE I+GFINLSSCKDSDQN ESLDFNL+TPFVDLIS
Sbjct: 121 ISDMLGKDLFSKREKKAMREVKSLVETILGFINLSSCKDSDQNGESLDFNLVTPFVDLIS 180

Query: 189 IWTHPNEGLDQFLPLVNSEVCGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFNSGRSRQGL 248
           IWT+ NEGLDQFLPLV+SEV GEFSSGVC +RRLAGVVIAE FL+KLCLD NSGRSRQ L
Sbjct: 181 IWTNSNEGLDQFLPLVSSEVRGEFSSGVCDIRRLAGVVIAETFLLKLCLDINSGRSRQDL 240

Query: 249 EKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVLYDALILVD 308
           E +LRIW VGSITRI+NFYFFETLVRFLLEATLPVMSLL STEDEALLRK+LYDALILVD
Sbjct: 241 ENDLRIWAVGSITRIKNFYFFETLVRFLLEATLPVMSLL-STEDEALLRKILYDALILVD 300

Query: 309 YSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFSSSLVSSQI 368
           YSFL+ EKAINLPA+HVA LAVKRLILTHEAIEFYREHGDQ+RA+SYLNAFS+SLVSSQI
Sbjct: 301 YSFLNDEKAINLPADHVAFLAVKRLILTHEAIEFYREHGDQNRAISYLNAFSTSLVSSQI 360

Query: 369 IRWVRSQILSNGNVNRPNGSSPKILLKWLLKTEDQGVRVFDNTISDRRTKLVLDVSKSVS 428
           IRWV+SQI S+ NVN P GSSPKI L+WL K ED GVRVFD+TIS+RR KLVLD SKSVS
Sbjct: 361 IRWVKSQIPSHENVNHPKGSSPKIFLEWLFKAEDHGVRVFDSTISNRRAKLVLDTSKSVS 420

Query: 429 GHPTLEGDKVDDDLLFYIDKQGENENGS-EEGKTMDESVNEAFVTVARTMSTTENASGKQ 488
           GHPT EG+ VDD+LLFYIDKQGENENGS EE + MDE+VN A V+ A TMSTT+N   K+
Sbjct: 421 GHPTSEGNSVDDELLFYIDKQGENENGSEEEDRVMDETVNAALVSAAHTMSTTQNGLEKK 480

Query: 489 KRRRKAERKNKKIKFIKYDLIPNSDAAQLRSAVDNNDSNSEGEVHNPHLDEDSEMEE 545
           KRRR A +K KKIKF KYDL+PNSDA +LRSAVD+ND++S+ EVHNPHLDEDS+M+E
Sbjct: 481 KRRRMA-KKQKKIKFTKYDLVPNSDATELRSAVDDNDTDSDSEVHNPHLDEDSDMKE 535

BLAST of Tan0002169 vs. ExPASy TrEMBL
Match: A0A6J1D1X1 (uncharacterized protein LOC111016527 OS=Momordica charantia OX=3673 GN=LOC111016527 PE=4 SV=1)

HSP 1 Score: 850.9 bits (2197), Expect = 2.9e-243
Identity = 449/544 (82.54%), Postives = 480/544 (88.24%), Query Frame = 0

Query: 1   MALALVESMDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQA 60
           MALALVESMDSMN  NQNPFLGENYE TL+QSI+NVLAEIR+GNLG  H TE FY+L+QA
Sbjct: 1   MALALVESMDSMNPPNQNPFLGENYELTLKQSIKNVLAEIREGNLGFCHFTEDFYKLMQA 60

Query: 61  RADPPMEAIWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQLVCSCSAPCDSSKTIALLAP 120
           R DPPME+IWFYSAL FRS SS  KGDFLDR+AAMKVLFQLVCSCSAPC SSKT+A LAP
Sbjct: 61  RVDPPMESIWFYSALMFRSHSS-AKGDFLDRLAAMKVLFQLVCSCSAPCGSSKTVASLAP 120

Query: 121 VVFEVYKLIGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLI 180
           VVFEVYKLI DML KDLA KREKKAMREVK+LVE I+GFINLSSCK SDQN E LDFNLI
Sbjct: 121 VVFEVYKLIADMLGKDLASKREKKAMREVKALVEAILGFINLSSCKVSDQNVEQLDFNLI 180

Query: 181 TPFVDLISIWTHPNEGLDQFLPLVNSEVCGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFN 240
           TPF+DLISIWTHPNEGLDQFLPLV+SEV G F SGVC VR LAGVVIAEAFLMKLCLDF+
Sbjct: 181 TPFMDLISIWTHPNEGLDQFLPLVSSEVRGGFCSGVCDVRHLAGVVIAEAFLMKLCLDFH 240

Query: 241 SGRSRQGLEKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVL 300
           SGRSRQ LEK+LR+W VGSIT IRN Y FETL+RFLL  TLPVMSLL STEDE LLRKVL
Sbjct: 241 SGRSRQELEKDLRLWAVGSITGIRNCYLFETLIRFLLGVTLPVMSLL-STEDELLLRKVL 300

Query: 301 YDALILVDYSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFS 360
           YDALILVDYSFL+P KAI+L AEHVA LAVKRLILTH+AIEF+REHGDQSRA+SYLNAFS
Sbjct: 301 YDALILVDYSFLNPVKAIDLHAEHVAFLAVKRLILTHDAIEFFREHGDQSRAISYLNAFS 360

Query: 361 SSLVSSQIIRWVRSQILSNGNVNRPNGSSPKILLKWLLKTEDQGVRVFDNTISDRRTKLV 420
           SS V SQ+IRWVRSQI SN NVNRPNGSSPKILL+WL K EDQGVRVFDNTISD R KLV
Sbjct: 361 SSPVPSQMIRWVRSQIPSNENVNRPNGSSPKILLEWLFKAEDQGVRVFDNTISDHRAKLV 420

Query: 421 LDVSKSVSGHPTLEGDKVDDDLLFYIDKQGENENGSEEGKTMDESVNEAFVTVARTMSTT 480
           LD+SKS S HP LEG+KVDD LLFY+DKQGE EN SEE K MDESVN A VTVARTMS  
Sbjct: 421 LDISKSDSRHPKLEGNKVDDGLLFYVDKQGEKENESEEDKAMDESVNAALVTVARTMSMA 480

Query: 481 ENASGKQKRRRKAERKNKKIKFIKYDLIPNSDAAQLRSAVDNNDSNSEGEVHNPHLDEDS 540
           EN SGK+KR+RK+ERKN KIKF+KYDL PN DAAQLRSAVDNND NSEGEVHNPH DEDS
Sbjct: 481 ENGSGKKKRQRKSERKN-KIKFVKYDLFPNPDAAQLRSAVDNNDPNSEGEVHNPHKDEDS 540

Query: 541 EMEE 545
           +MEE
Sbjct: 541 DMEE 541

BLAST of Tan0002169 vs. ExPASy TrEMBL
Match: A0A6J1JEZ4 (uncharacterized protein LOC111485155 OS=Cucurbita maxima OX=3661 GN=LOC111485155 PE=4 SV=1)

HSP 1 Score: 832.8 bits (2150), Expect = 8.3e-238
Identity = 434/537 (80.82%), Postives = 477/537 (88.83%), Query Frame = 0

Query: 9   MDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQARADPPMEA 68
           M+SMN+S Q+PFLGENYE TLEQSIQNVLAEIR+GNL  S   E FYELIQARADPP+E+
Sbjct: 1   MESMNSSKQSPFLGENYEFTLEQSIQNVLAEIREGNLVFSQFMEGFYELIQARADPPLES 60

Query: 69  IWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQLVCSCSAPCDSSKTIALLAPVVFEVYKL 128
           IWFYSALTFRSR ST  GDFLDRVA MK+LFQ  CSCSAPC SSKTIALLAPVV+EVYKL
Sbjct: 61  IWFYSALTFRSRISTMNGDFLDRVATMKILFQTTCSCSAPCGSSKTIALLAPVVYEVYKL 120

Query: 129 IGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLITPFVDLIS 188
           I DML KDL  KREKKAMREVKSLVE I+GFINLSSCKDSDQN ESLDFNL+TPFVDLIS
Sbjct: 121 ISDMLGKDLFSKREKKAMREVKSLVETILGFINLSSCKDSDQNGESLDFNLVTPFVDLIS 180

Query: 189 IWTHPNEGLDQFLPLVNSEVCGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFNSGRSRQGL 248
           IWT+ NEGLDQFLPLV+SEV GEFSSGVC +RRLAGVVIAE FL+KLCLD NSGRSRQ L
Sbjct: 181 IWTNSNEGLDQFLPLVSSEVRGEFSSGVCDIRRLAGVVIAETFLLKLCLDINSGRSRQDL 240

Query: 249 EKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVLYDALILVD 308
           E +LRIW VGSITRI+NFYFFETLVRFLLEATLPVMSLL STEDEALLRK+LYDALILVD
Sbjct: 241 ENDLRIWAVGSITRIKNFYFFETLVRFLLEATLPVMSLL-STEDEALLRKILYDALILVD 300

Query: 309 YSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFSSSLVSSQI 368
           YSFL+ EKAINLPA+HVA LAVKRLILTHEAIEFYREHGDQ+RA+SYLNAFS+SLVSSQI
Sbjct: 301 YSFLNDEKAINLPADHVAFLAVKRLILTHEAIEFYREHGDQNRAISYLNAFSTSLVSSQI 360

Query: 369 IRWVRSQILSNGNVNRPNGSSPKILLKWLLKTEDQGVRVFDNTISDRRTKLVLDVSKSVS 428
           IRWV+SQI S+ NVN P GSSPKI L+WL K ED GVRVFD+TIS+RR KLVLD SKSVS
Sbjct: 361 IRWVKSQIPSHENVNHPKGSSPKIFLEWLFKAEDHGVRVFDSTISNRRAKLVLDTSKSVS 420

Query: 429 GHPTLEGDKVDDDLLFYIDKQGENENGS-EEGKTMDESVNEAFVTVARTMSTTENASGKQ 488
           GHPT EG+ VDD+LLFYIDKQGENENGS EE + MDE+VN A V+ A TMSTT+N   K+
Sbjct: 421 GHPTSEGNSVDDELLFYIDKQGENENGSEEEDRVMDETVNAALVSAAHTMSTTQNGLEKK 480

Query: 489 KRRRKAERKNKKIKFIKYDLIPNSDAAQLRSAVDNNDSNSEGEVHNPHLDEDSEMEE 545
           KRRR A +K KKIKF KYDL+PNSDA +LRSAVD+ND++S+ EVHNPHLDEDS+M+E
Sbjct: 481 KRRRMA-KKQKKIKFTKYDLVPNSDATELRSAVDDNDTDSDSEVHNPHLDEDSDMKE 535

BLAST of Tan0002169 vs. ExPASy TrEMBL
Match: A0A6J1H8Q6 (uncharacterized protein LOC111461526 OS=Cucurbita moschata OX=3662 GN=LOC111461526 PE=4 SV=1)

HSP 1 Score: 830.5 bits (2144), Expect = 4.1e-237
Identity = 432/537 (80.45%), Postives = 476/537 (88.64%), Query Frame = 0

Query: 9   MDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQARADPPMEA 68
           M+SMN+S Q+PFLGENYE TLEQSIQNVLAEIR+GNLG S   E FYELIQAR DPP+E+
Sbjct: 1   MESMNSSKQSPFLGENYEFTLEQSIQNVLAEIREGNLGFSQFMEGFYELIQARDDPPLES 60

Query: 69  IWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQLVCSCSAPCDSSKTIALLAPVVFEVYKL 128
           IWFYSALTFRSR ST  GDFLDRVA MK+LFQ  CSCSAPC SSKTIALL+PVV+EVYKL
Sbjct: 61  IWFYSALTFRSRISTMNGDFLDRVATMKILFQTTCSCSAPCGSSKTIALLSPVVYEVYKL 120

Query: 129 IGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLITPFVDLIS 188
           I DML KDL+ KREKKAMREVKSLVE ++GFINLSSCKDSDQN ESLDFNL+TPFVDLIS
Sbjct: 121 ISDMLGKDLSSKREKKAMREVKSLVETMLGFINLSSCKDSDQNGESLDFNLVTPFVDLIS 180

Query: 189 IWTHPNEGLDQFLPLVNSEVCGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFNSGRSRQGL 248
           IW + NEGLDQFLPLV+SEV GEFSSGVC +RRLAGVVIAE FLMKLCLD NSGRSRQ L
Sbjct: 181 IWANSNEGLDQFLPLVSSEVRGEFSSGVCDIRRLAGVVIAETFLMKLCLDINSGRSRQDL 240

Query: 249 EKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVLYDALILVD 308
           E +LRIW VGSITRI+NFYFFETLVRFLLEATLPVMSLL STEDEALLRK+LYDALILVD
Sbjct: 241 ENDLRIWAVGSITRIKNFYFFETLVRFLLEATLPVMSLL-STEDEALLRKILYDALILVD 300

Query: 309 YSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFSSSLVSSQI 368
           YSFL+ EKAINLPA+HVA LAVKRLILTHEAIEFYREHGDQ+RA+SYLNAFS+SLVSSQI
Sbjct: 301 YSFLNDEKAINLPADHVAFLAVKRLILTHEAIEFYREHGDQNRAISYLNAFSTSLVSSQI 360

Query: 369 IRWVRSQILSNGNVNRPNGSSPKILLKWLLKTEDQGVRVFDNTISDRRTKLVLDVSKSVS 428
           IRWV+SQI SN N N P GSSPKI L+WLLK ED GVRVFD+TIS+RR KLVLD SKSVS
Sbjct: 361 IRWVKSQIPSNENFNHPKGSSPKIFLEWLLKAEDHGVRVFDSTISNRRAKLVLDTSKSVS 420

Query: 429 GHPTLEGDKVDDDLLFYIDKQGENENGS-EEGKTMDESVNEAFVTVARTMSTTENASGKQ 488
           GHPT EG+ VDD+LLFYIDKQGENENGS EE + MDESVN A V+ A TMSTT+N SGK+
Sbjct: 421 GHPTSEGNSVDDELLFYIDKQGENENGSEEEDRVMDESVNAALVSAAHTMSTTQNGSGKK 480

Query: 489 KRRRKAERKNKKIKFIKYDLIPNSDAAQLRSAVDNNDSNSEGEVHNPHLDEDSEMEE 545
           KR+R A +K KKIKF KYDL+ NSD  +LRSAV++ND++SEGEVHNPH DEDS+ +E
Sbjct: 481 KRQRMA-KKQKKIKFTKYDLVLNSDVTELRSAVEDNDTDSEGEVHNPHSDEDSDTKE 535

BLAST of Tan0002169 vs. ExPASy TrEMBL
Match: A0A0A0M1W0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G586780 PE=4 SV=1)

HSP 1 Score: 758.8 bits (1958), Expect = 1.5e-215
Identity = 423/614 (68.89%), Postives = 470/614 (76.55%), Query Frame = 0

Query: 1   MALALVESMDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQA 60
           MAL LVESM+S+N   QNPFLGENYE TL QSIQNVLAEIRKGN+  S  T+RFY+LIQA
Sbjct: 1   MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQA 60

Query: 61  RADPPMEAIWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQLVCSCSAPCDSSKTIALLAP 120
           RADPP+E+IWFYSAL FRS S   KGDFL+RVAAMKVLFQLVCSCSAPC SSKTI LL+P
Sbjct: 61  RADPPLESIWFYSALKFRS-SFNPKGDFLERVAAMKVLFQLVCSCSAPCGSSKTITLLSP 120

Query: 121 VVFEVYKLIGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLI 180
           VV EVYKL+ DM  KDL   REKKAMREVKSLVE I+GF+NLSS +DSD+N +SLDF+LI
Sbjct: 121 VVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLI 180

Query: 181 TPFVDLISIWTHPNEGLDQFLPLVNSEVCGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFN 240
           TPF+DLISIWT PNEGLDQFLPLV SEV  EFSSG C VRRLAGVVIAE FLMKLCLDFN
Sbjct: 181 TPFMDLISIWTQPNEGLDQFLPLVCSEVREEFSSGECDVRRLAGVVIAEIFLMKLCLDFN 240

Query: 241 SGRSRQGLEKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVL 300
            GRSRQ LEK+L  W VGSIT+IRNFY FETLVR LLEATLPV SLL ST++EALLRKVL
Sbjct: 241 YGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLL-STDNEALLRKVL 300

Query: 301 YDALILVDYSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFS 360
           YDALILVDYSFL PE AINLPAEHVA LAVKRLILT+EAIEFYREHGDQ+RA+SYLNAFS
Sbjct: 301 YDALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFS 360

Query: 361 SSLVSSQIIRWVRSQILSNGNVNRPNGSSPKILLKWLLKTEDQGVRVFDNTISDRRTKLV 420
           SSLVSSQIIRW++SQ+ SN N+N PNG SPK+ L+WLLK EDQGVRVFDNTIS+RR+KLV
Sbjct: 361 SSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAEDQGVRVFDNTISNRRSKLV 420

Query: 421 LDVSKSVSGHPTLEGDKVDDDLLFYIDKQGENENGSEEGKTMDESVNEAFVTVARTMSTT 480
           LD SKSVS     EGDKVDDDLLFYIDKQG N NGSEE  TMDESVN A  + A TMSTT
Sbjct: 421 LDTSKSVS----FEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTT 480

Query: 481 ENASGKQKRR-------------------------------------------------- 540
           EN+S K+  R                                                  
Sbjct: 481 ENSSVKKLSRKAKKRNKKLKLLSQLKSAVEGDLLFCINKQGENENGNEEDTTMNEPVNEA 540

Query: 541 --------------------RKAERKNKKIKFIKYDLIPNSDAAQLRSAVDNNDSNSEGE 545
                               RKA+RKNKK K +KYDL+PN+DA QL+SAV+NND++SEGE
Sbjct: 541 LVSAAPTMSTTENSSVKSLKRKAKRKNKKNKLVKYDLVPNTDATQLKSAVENNDTHSEGE 600

BLAST of Tan0002169 vs. ExPASy TrEMBL
Match: A0A5D3CFU4 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold16G002540 PE=4 SV=1)

HSP 1 Score: 744.2 bits (1920), Expect = 3.9e-211
Identity = 408/544 (75.00%), Postives = 452/544 (83.09%), Query Frame = 0

Query: 1   MALALVESMDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQA 60
           MAL LVESM+S+N   +N FLGENYE TL QSIQNVLAEIRKGN+  S  TE FY+LIQA
Sbjct: 1   MALGLVESMESINPLKKNTFLGENYEFTLAQSIQNVLAEIRKGNVVFSRFTEGFYKLIQA 60

Query: 61  RADPPMEAIWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQLVCSCSAPCDSSKTIALLAP 120
           RADPP+E+IWFYSALTFRS S   KGDFL+RVAAMKVLFQLVCSCSAPC SSKTI LL+P
Sbjct: 61  RADPPLESIWFYSALTFRS-SFNPKGDFLERVAAMKVLFQLVCSCSAPCGSSKTITLLSP 120

Query: 121 VVFEVYKLIGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLI 180
           VV EVYKL+ DM  KDL  KREKKAMREVKSLVE I+G  NLSSC+DS++N +SLDFN I
Sbjct: 121 VVSEVYKLVIDMRGKDLNSKREKKAMREVKSLVEAILGLTNLSSCEDSNKNDKSLDFNFI 180

Query: 181 TPFVDLISIWTHPNEGLDQFLPLVNSEVCGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFN 240
           TPFVDLISIWTHPNEGLDQFLPLV SEV  EFSSG C VRRLAGVVIAE FL+KLCLDFN
Sbjct: 181 TPFVDLISIWTHPNEGLDQFLPLVCSEVREEFSSGECDVRRLAGVVIAETFLVKLCLDFN 240

Query: 241 SGRSRQGLEKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVL 300
            G SRQ LE++LR WTVGSITRIRNFYFFETLVR LLEATLPV SLL ST+DEALLRKVL
Sbjct: 241 CGHSRQALEEDLRNWTVGSITRIRNFYFFETLVRLLLEATLPVTSLL-STDDEALLRKVL 300

Query: 301 YDALILVDYSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFS 360
            DALILVDYSFL PEKAINLPAEH A LAVKRLILT+EA EFYR+HGDQ+RA+SYLNAFS
Sbjct: 301 SDALILVDYSFLKPEKAINLPAEHTAFLAVKRLILTYEATEFYRKHGDQNRAISYLNAFS 360

Query: 361 SSLVSSQIIRWVRSQILSNGNVNRPNGSSPKILLKWLLKTEDQGVRVFDNTISDRRTKLV 420
           SSLVSSQIIRWV+SQ+ SN N+N  NGSSPK+ L+WLLK EDQGVRVFDNTIS+ R K+V
Sbjct: 361 SSLVSSQIIRWVKSQMPSNENLNHLNGSSPKVFLEWLLKAEDQGVRVFDNTISNHRAKIV 420

Query: 421 LDVSKSVSGHPTLEGDKVDDDLLFYIDKQGENENGSEEGKTMDESVNEAFVTVARTMSTT 480
           LD SKSV      EGDKVDDDLLFYIDKQGENENG EE KTMD+SVN A V+VA TMSTT
Sbjct: 421 LDTSKSV----LFEGDKVDDDLLFYIDKQGENENGREEDKTMDKSVNAALVSVAHTMSTT 480

Query: 481 ENASGKQKRRRKAERKNKKIKFIKYDLIPNSDAAQLRSAVDNNDSNSEGEVHNPHLDEDS 540
           EN+S K KR RKA+++NKK          N+D +QL+SAV+NND+N +         ED+
Sbjct: 481 ENSSVK-KRSRKAKKRNKK---------KNADTSQLKSAVENNDTNGK---------EDT 519

Query: 541 EMEE 545
            M+E
Sbjct: 541 TMDE 519

BLAST of Tan0002169 vs. TAIR 10
Match: AT5G11780.1 (unknown protein; Has 37 Blast hits to 37 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 3; Plants - 34; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 157.5 bits (397), Expect = 3.0e-38
Identity = 136/494 (27.53%), Postives = 230/494 (46.56%), Query Frame = 0

Query: 29  LEQSIQNVLAEIRKGNLGSSHLTERFYELIQARAD-PPMEAIWFYSALTFRSRSSTTKGD 88
           L  SI+ +L + R G    S     F  ++    + PP+E +WFYSA+ F S     +  
Sbjct: 21  LNDSIKQLLLQYRGGRTNFSDFDSIFTRILNDLPEPPPLELVWFYSAIRFYSSKLAFRD- 80

Query: 89  FLDRVAAMKVLFQLVCSCSAPCDSSKTIALLAPVVFEVYKLIGDMLKKDLALKREKKAMR 148
             D V      FQL+ S S      K ++LL+PVV+++ +L+         + R + A+ 
Sbjct: 81  --DSVRLTSCFFQLIVSFSDSFSGVKKVSLLSPVVYQLSRLV---------ISRRRDAL- 140

Query: 149 EVKSLVEVIIGFINLSSCKDSDQNCESLDFNLIT--PFVDLISIWT--------HPNEGL 208
              SL+E I+ +I++  C D   N E  D  +++   F DL  +W            + L
Sbjct: 141 ---SLLEGIVSYISM-YCVDEPGN-EDDDVLMVSGFSFADLSRVWVVDEVEDNCRVEDCL 200

Query: 209 DQFLPLVNSEVCGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFNSGRSRQGLEKELRIWTV 268
           + F+P  +  +  E  S  C V  LAG+V ++ FL+ LC  F+    R  L+K+L+   +
Sbjct: 201 EVFMPFASEILRKEIDSESCGVGYLAGIVASQVFLLSLCSRFDLDLGRSELDKDLQESVL 260

Query: 269 GSITRIRNFYFFETLVRFLLEATLPVMSLLQST-EDEALLRKVLYDALI-LVDYSFLDPE 328
             I+   + +FF+ +++ LLE  L + SL+    EDEA L +++ +A+I  V+  FL+P 
Sbjct: 261 QMISGFHSCFFFDVILKMLLEPYLHLTSLMGVVPEDEAFLTEIITEAVIKSVEKLFLNPG 320

Query: 329 KAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFSSSLVSSQIIRWVRSQ 388
              +  + H+  +A+  L L  + +   R + DQ +   Y N FS+SL+   +I WV SQ
Sbjct: 321 NGTSQRSLHLKNIAINWLFLFDKTMASLRRNKDQEKISMYTNMFSNSLIPYHLINWVISQ 380

Query: 389 ILSNGNVNRPNGSSPKILLKWLLKTEDQGVRVFDNTISDRRTKLVLDVSKSVSGHPTLEG 448
                + +     +P   ++WL+  E+QG RVF+   S    K V+  S+          
Sbjct: 381 GEVIRDADTLRNLTPASFIEWLVSLEEQGPRVFNCDHSKNYAKSVIHRSR---------- 440

Query: 449 DKVDDDLLFYIDKQGENENGSEEGKTMDESVNEAFVTVARTMSTTENASGKQKRRRKAER 506
                DL      Q + E   ++    DE  N + +++            +  R+RK ER
Sbjct: 441 ----PDLSIGTTLQKQEEEFDQDTDMADEQ-NVSSISIL----------SRNTRKRKEER 471

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_022147658.16.1e-24382.54uncharacterized protein LOC111016527 [Momordica charantia][more]
KAG7023645.18.2e-24080.74hypothetical protein SDJN02_14671, partial [Cucurbita argyrosperma subsp. argyro... [more]
XP_038880003.11.2e-23882.02uncharacterized protein LOC120071696 [Benincasa hispida][more]
KAG6589981.11.0e-23780.63hypothetical protein SDJN03_15404, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_022987671.11.7e-23780.82uncharacterized protein LOC111485155 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
A0A6J1D1X12.9e-24382.54uncharacterized protein LOC111016527 OS=Momordica charantia OX=3673 GN=LOC111016... [more]
A0A6J1JEZ48.3e-23880.82uncharacterized protein LOC111485155 OS=Cucurbita maxima OX=3661 GN=LOC111485155... [more]
A0A6J1H8Q64.1e-23780.45uncharacterized protein LOC111461526 OS=Cucurbita moschata OX=3662 GN=LOC1114615... [more]
A0A0A0M1W01.5e-21568.89Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G586780 PE=4 SV=1[more]
A0A5D3CFU43.9e-21175.00Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
Match NameE-valueIdentityDescription
AT5G11780.13.0e-3827.53unknown protein; Has 37 Blast hits to 37 proteins in 12 species: Archae - 0; Bac... [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 514..544
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 528..544
NoneNo IPR availablePANTHERPTHR35505:SF1OS01G0600300 PROTEINcoord: 1..539
NoneNo IPR availablePANTHERPTHR35505OS01G0600300 PROTEINcoord: 1..539

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0002169.1Tan0002169.1mRNA