Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TGAATTCATACAGAAAAAACTTTGCTTTCTTCGAGTCCCAATTCTTTTCATAGTAATTGATTCAATTCGATAAGTTTCAGCGAGTCTCATGGCTTTGGCTCTTGTTGAATCGATGGATTCTATGAACACTTCAAACCAGAATCCTTTTCTTGGAGAAAATTATGAGTGTACTCTTGAGCAATCAATCCAGAACGTTTTAGCTGAAATTCGCAAAGGAAATCTTGGTTCTTCTCATTTGACGGAACGATTCTATGAGTTGATTCAAGCTAGAGCTGACCCACCAATGGAAGCGATCTGGTTCTACTCCGCATTAACGTTTCGTAGCCGTAGCTCCACTACTAAGGGCGACTTTTTGGACCGAGTGGCAGCCATGAAAGTCTTGTTTCAGTTGGTGTGTTCTTGTTCGGCTCCTTGTGATTCTTCGAAGACCATTGCGTTGCTTGCTCCAGTGGTTTTTGAGGTGTATAAGTTGATTGGTGACATGCTAAAAAAGGATTTGGCCTTGAAAAGGGAGAAGAAAGCGATGAGAGAGGTTAAATCTTTAGTTGAAGTGATTATTGGCTTTATAAATCTGAGTTCTTGCAAGGATTCGGACCAGAATTGTGAATCTCTTGATTTCAATTTGATTACTCCTTTTGTGGATTTAATTAGTATTTGGACGCACCCAAATGAGGGATTGGATCAGTTCTTACCGCTCGTGAACAGTGAGGTTTGTGGAGAGTTTAGTTCAGGCGTCTGTGTTGTTCGTCGCTTGGCTGGAGTTGTAATTGCTGAGGCATTTCTGATGAAACTGTGCTTGGACTTCAACAGTGGGCGTTCGAGGCAAGGTTTGGAGAAAGAGCTAAGGATATGGACTGTTGGTTCTATAACTCGGATTAGGAACTTCTACTTTTTTGGTTAGTGATACAAATTCTCTTCCGATTATTGTTTTGATTCTGTATAGTGTTGCAGTATTTATCAGATTATATTAGATAGTTTATGCTTCTTATGATATATCTCTTTATCTTACTGATGTTGGATGCAAGAACATCTTATATGCTGCCAGCTTGTTTGTTATACTATATTTTATGAAGTTCCGAGGGGTTTCTTATCGAGTTCTCTTTGGCATTTATTAACCAAATGAATAGCTGCGTACTTCTTGCACTAGCTGGGCTTAAGTGTGAAGCATTTAAAAAAAAGCGAGGGCTTTTTTAGGGAAGCACCTGATAGGGGAAAAATATTATTATGAGGAATGAGATAAATGAGAAATGATGTAGATTTTAGTAAGAGATTACAGATATAGACCTTTTATGCAATATATTTGTTGTTTCTTACAAAAAACGAAAAGGGAAAAGGAAACGAATGTTGGAAAAAAGACCAAATGTAAGCCTCATCGAGGGCGCCTTGCCTTAGGTATCCCCTAAGCTTACTTTGAAAACAATGATCATCCATTGCAAATATTGCTGCTTGTCCTTTAGTATGTTTGTTTCATGTATGTAGTTTTTTTAATTTAGATTCCAGTAATGTGCTTACATCTTGTTCTATCTTTGTCTCAACAGAAACTCTTGTAAGATTCCTGCTGGAGGCGACTTTACCTGTAATGTCTCTGTTGGTAAGTTTTATTTTTATCAGTGCTTATGGGTGTGTGCCTGGTCTTGGTTTGCATGACACCTCTATATGATATGCATACAAGAGTTCTCTAACTTTCATCTTTGACGGATTTCCTGATAAACCACATTTTTGTGTGTGTTAAATACAGGATAATCTTGTCTATCCTATTCACATTATTAGAGCAATGATTTCATGCCTGAGGAAAACACAGGTGTCATGTAACGTTTCAATTTTTCCGTGCTTGGAATTGAGAAAATCATTTGGAAAAACGTCTGGTTTTCTTGGTTTTTTCTGCATGCCCTAAGATTGTGGTGTACATTATTAATATGCTTCAGATACATGGATACCTATAATTGATGAAAATAGATAGTGAGATGAAGTCTTTCTGATTGGAAAATGAATTGATTGTTACCAATGTGTGTATGCACATGGTCATCTGCTTGTGATCATTTGATGTGATTGTAAACCTTTTACAGCAGAGCACTGAAGATGAAGCTCTGTTAAGGAAGGTTCTATATGATGCTCTAATATTGGTTGATTATTCATTTTTGGATCCTGAGAAAGCCATTAACTTACCTGCCGAACATGTGGCATTGCTGGCTGTTAAGAGATTGATTCTTACTCATGAGGCCATAGAGTTTTACAGGTATGTAAAGAACTGAACAGATCTGGAACTTTAATCGAACCTTCCTCCAAAGTCTTATTGTTTATTTGTTATATTTAGGGAGCATGGAGATCAGAGCAGAGCCGTCTCTTATCTAAATGCCTTCTCAAGTTCTCTTGTATCTTCTCAAATTATTAGATGGGTCAGAAGCCAAATTCTTAGCAATGGAAATGTAAATCGACCCAATGGATCGTCGCCTAAAATACTTCTCAGTAAGTTGATAATAATTTTTATGTAAAAGTATACAAATAAACGGTATAGTTAATGATCTAATTTTCGGTTCATTTTCTTCCAGAGTGGCTTCTCAAGACTGAAGATCAAGGTGTGAGAGTATTTGACAATACCATTTCCGATCGTCGAACCAAATTAGTTCTTGATGTTTCTAAATCAGTCTCGGGGCATCCCACATTGGAGGGAGATAAAGTAGATGACGATCTTTTGTTTTACATTGACAAGCAAGGGGAAAATGAAAATGGAAGTGAGGAGGGCAAAACGATGGATGAATCAGTAAATGAAGCGTTTGTAACCGTTGCTCGTACCATGTCAACGACAGAAAACGCTTCAGGAAAGCAGAAGCGACGGAGAAAGGCTGAAAGAAAGAATAAGAAGATCAAATTTATAAAGTACGATCTCATCCCGAACTCTGATGCTGCCCAATTGAGGTCAGCTGTTGATAATAATGACTCGAACAGCGAGGGCGAAGTTCATAATCCACACCTGGACGAAGATTCTGAGATGGAAGAGTAATATGATCTAAAGAGAAAGGTCGTATTTGGTTGTCACAACACAACTTCAGATTCAGTTTGTTATCTGTTGATGACCTTTCTGGAGGAGCTGGCCAAGATACAAGTGCGAGCTCGTGAGGTTCGAGTAGGCAGGTCGATATGCCAAGAATGTTGGATTTTCAACACAAGTGCAACAATTCAACCAAGGTTGCAAATGTAAGTTGATCTATGATCAAATCAAAGTAAGTGAAAAAAGGAAAAGACATTTTTGGTACATTATGATACTGCTGTTTTTCATGAAGATGAATCAAATATTTAACTGCACTTGATGCGCTGTATTTTATTATTAATGCTTATTTTGACAACTTGACAAAACCATTGTTCAAACTTGTTCTCAACATGAGGAAAAACTAAAAGGAAAAAAACGAC
mRNA sequence
TGAATTCATACAGAAAAAACTTTGCTTTCTTCGAGTCCCAATTCTTTTCATAGTAATTGATTCAATTCGATAAGTTTCAGCGAGTCTCATGGCTTTGGCTCTTGTTGAATCGATGGATTCTATGAACACTTCAAACCAGAATCCTTTTCTTGGAGAAAATTATGAGTGTACTCTTGAGCAATCAATCCAGAACGTTTTAGCTGAAATTCGCAAAGGAAATCTTGGTTCTTCTCATTTGACGGAACGATTCTATGAGTTGATTCAAGCTAGAGCTGACCCACCAATGGAAGCGATCTGGTTCTACTCCGCATTAACGTTTCGTAGCCGTAGCTCCACTACTAAGGGCGACTTTTTGGACCGAGTGGCAGCCATGAAAGTCTTGTTTCAGTTGGTGTGTTCTTGTTCGGCTCCTTGTGATTCTTCGAAGACCATTGCGTTGCTTGCTCCAGTGGTTTTTGAGGTGTATAAGTTGATTGGTGACATGCTAAAAAAGGATTTGGCCTTGAAAAGGGAGAAGAAAGCGATGAGAGAGGTTAAATCTTTAGTTGAAGTGATTATTGGCTTTATAAATCTGAGTTCTTGCAAGGATTCGGACCAGAATTGTGAATCTCTTGATTTCAATTTGATTACTCCTTTTGTGGATTTAATTAGTATTTGGACGCACCCAAATGAGGGATTGGATCAGTTCTTACCGCTCGTGAACAGTGAGGTTTGTGGAGAGTTTAGTTCAGGCGTCTGTGTTGTTCGTCGCTTGGCTGGAGTTGTAATTGCTGAGGCATTTCTGATGAAACTGTGCTTGGACTTCAACAGTGGGCGTTCGAGGCAAGGTTTGGAGAAAGAGCTAAGGATATGGACTGTTGGTTCTATAACTCGGATTAGGAACTTCTACTTTTTTGAAACTCTTGTAAGATTCCTGCTGGAGGCGACTTTACCTGTAATGTCTCTGTTGCAGAGCACTGAAGATGAAGCTCTGTTAAGGAAGGTTCTATATGATGCTCTAATATTGGTTGATTATTCATTTTTGGATCCTGAGAAAGCCATTAACTTACCTGCCGAACATGTGGCATTGCTGGCTGTTAAGAGATTGATTCTTACTCATGAGGCCATAGAGTTTTACAGGGAGCATGGAGATCAGAGCAGAGCCGTCTCTTATCTAAATGCCTTCTCAAGTTCTCTTGTATCTTCTCAAATTATTAGATGGGTCAGAAGCCAAATTCTTAGCAATGGAAATGTAAATCGACCCAATGGATCGTCGCCTAAAATACTTCTCAAGTGGCTTCTCAAGACTGAAGATCAAGGTGTGAGAGTATTTGACAATACCATTTCCGATCGTCGAACCAAATTAGTTCTTGATGTTTCTAAATCAGTCTCGGGGCATCCCACATTGGAGGGAGATAAAGTAGATGACGATCTTTTGTTTTACATTGACAAGCAAGGGGAAAATGAAAATGGAAGTGAGGAGGGCAAAACGATGGATGAATCAGTAAATGAAGCGTTTGTAACCGTTGCTCGTACCATGTCAACGACAGAAAACGCTTCAGGAAAGCAGAAGCGACGGAGAAAGGCTGAAAGAAAGAATAAGAAGATCAAATTTATAAAGTACGATCTCATCCCGAACTCTGATGCTGCCCAATTGAGGTCAGCTGTTGATAATAATGACTCGAACAGCGAGGGCGAAGTTCATAATCCACACCTGGACGAAGATTCTGAGATGGAAGAGTAATATGATCTAAAGAGAAAGGTCGTATTTGGTTGTCACAACACAACTTCAGATTCAGTTTGTTATCTGTTGATGACCTTTCTGGAGGAGCTGGCCAAGATACAAGTGCGAGCTCGTGAGGTTCGAGTAGGCAGGTCGATATGCCAAGAATGTTGGATTTTCAACACAAGTGCAACAATTCAACCAAGGTTGCAAATGTAAGTTGATCTATGATCAAATCAAAGTAAGTGAAAAAAGGAAAAGACATTTTTGGTACATTATGATACTGCTGTTTTTCATGAAGATGAATCAAATATTTAACTGCACTTGATGCGCTGTATTTTATTATTAATGCTTATTTTGACAACTTGACAAAACCATTGTTCAAACTTGTTCTCAACATGAGGAAAAACTAAAAGGAAAAAAACGAC
Coding sequence (CDS)
ATGGCTTTGGCTCTTGTTGAATCGATGGATTCTATGAACACTTCAAACCAGAATCCTTTTCTTGGAGAAAATTATGAGTGTACTCTTGAGCAATCAATCCAGAACGTTTTAGCTGAAATTCGCAAAGGAAATCTTGGTTCTTCTCATTTGACGGAACGATTCTATGAGTTGATTCAAGCTAGAGCTGACCCACCAATGGAAGCGATCTGGTTCTACTCCGCATTAACGTTTCGTAGCCGTAGCTCCACTACTAAGGGCGACTTTTTGGACCGAGTGGCAGCCATGAAAGTCTTGTTTCAGTTGGTGTGTTCTTGTTCGGCTCCTTGTGATTCTTCGAAGACCATTGCGTTGCTTGCTCCAGTGGTTTTTGAGGTGTATAAGTTGATTGGTGACATGCTAAAAAAGGATTTGGCCTTGAAAAGGGAGAAGAAAGCGATGAGAGAGGTTAAATCTTTAGTTGAAGTGATTATTGGCTTTATAAATCTGAGTTCTTGCAAGGATTCGGACCAGAATTGTGAATCTCTTGATTTCAATTTGATTACTCCTTTTGTGGATTTAATTAGTATTTGGACGCACCCAAATGAGGGATTGGATCAGTTCTTACCGCTCGTGAACAGTGAGGTTTGTGGAGAGTTTAGTTCAGGCGTCTGTGTTGTTCGTCGCTTGGCTGGAGTTGTAATTGCTGAGGCATTTCTGATGAAACTGTGCTTGGACTTCAACAGTGGGCGTTCGAGGCAAGGTTTGGAGAAAGAGCTAAGGATATGGACTGTTGGTTCTATAACTCGGATTAGGAACTTCTACTTTTTTGAAACTCTTGTAAGATTCCTGCTGGAGGCGACTTTACCTGTAATGTCTCTGTTGCAGAGCACTGAAGATGAAGCTCTGTTAAGGAAGGTTCTATATGATGCTCTAATATTGGTTGATTATTCATTTTTGGATCCTGAGAAAGCCATTAACTTACCTGCCGAACATGTGGCATTGCTGGCTGTTAAGAGATTGATTCTTACTCATGAGGCCATAGAGTTTTACAGGGAGCATGGAGATCAGAGCAGAGCCGTCTCTTATCTAAATGCCTTCTCAAGTTCTCTTGTATCTTCTCAAATTATTAGATGGGTCAGAAGCCAAATTCTTAGCAATGGAAATGTAAATCGACCCAATGGATCGTCGCCTAAAATACTTCTCAAGTGGCTTCTCAAGACTGAAGATCAAGGTGTGAGAGTATTTGACAATACCATTTCCGATCGTCGAACCAAATTAGTTCTTGATGTTTCTAAATCAGTCTCGGGGCATCCCACATTGGAGGGAGATAAAGTAGATGACGATCTTTTGTTTTACATTGACAAGCAAGGGGAAAATGAAAATGGAAGTGAGGAGGGCAAAACGATGGATGAATCAGTAAATGAAGCGTTTGTAACCGTTGCTCGTACCATGTCAACGACAGAAAACGCTTCAGGAAAGCAGAAGCGACGGAGAAAGGCTGAAAGAAAGAATAAGAAGATCAAATTTATAAAGTACGATCTCATCCCGAACTCTGATGCTGCCCAATTGAGGTCAGCTGTTGATAATAATGACTCGAACAGCGAGGGCGAAGTTCATAATCCACACCTGGACGAAGATTCTGAGATGGAAGAGTAA
Protein sequence
MALALVESMDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQARADPPMEAIWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQLVCSCSAPCDSSKTIALLAPVVFEVYKLIGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLITPFVDLISIWTHPNEGLDQFLPLVNSEVCGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFNSGRSRQGLEKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVLYDALILVDYSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFSSSLVSSQIIRWVRSQILSNGNVNRPNGSSPKILLKWLLKTEDQGVRVFDNTISDRRTKLVLDVSKSVSGHPTLEGDKVDDDLLFYIDKQGENENGSEEGKTMDESVNEAFVTVARTMSTTENASGKQKRRRKAERKNKKIKFIKYDLIPNSDAAQLRSAVDNNDSNSEGEVHNPHLDEDSEMEE
Homology
BLAST of Tan0002169 vs. NCBI nr
Match:
XP_022147658.1 (uncharacterized protein LOC111016527 [Momordica charantia])
HSP 1 Score: 850.9 bits (2197), Expect = 6.1e-243
Identity = 449/544 (82.54%), Postives = 480/544 (88.24%), Query Frame = 0
Query: 1 MALALVESMDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQA 60
MALALVESMDSMN NQNPFLGENYE TL+QSI+NVLAEIR+GNLG H TE FY+L+QA
Sbjct: 1 MALALVESMDSMNPPNQNPFLGENYELTLKQSIKNVLAEIREGNLGFCHFTEDFYKLMQA 60
Query: 61 RADPPMEAIWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQLVCSCSAPCDSSKTIALLAP 120
R DPPME+IWFYSAL FRS SS KGDFLDR+AAMKVLFQLVCSCSAPC SSKT+A LAP
Sbjct: 61 RVDPPMESIWFYSALMFRSHSS-AKGDFLDRLAAMKVLFQLVCSCSAPCGSSKTVASLAP 120
Query: 121 VVFEVYKLIGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLI 180
VVFEVYKLI DML KDLA KREKKAMREVK+LVE I+GFINLSSCK SDQN E LDFNLI
Sbjct: 121 VVFEVYKLIADMLGKDLASKREKKAMREVKALVEAILGFINLSSCKVSDQNVEQLDFNLI 180
Query: 181 TPFVDLISIWTHPNEGLDQFLPLVNSEVCGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFN 240
TPF+DLISIWTHPNEGLDQFLPLV+SEV G F SGVC VR LAGVVIAEAFLMKLCLDF+
Sbjct: 181 TPFMDLISIWTHPNEGLDQFLPLVSSEVRGGFCSGVCDVRHLAGVVIAEAFLMKLCLDFH 240
Query: 241 SGRSRQGLEKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVL 300
SGRSRQ LEK+LR+W VGSIT IRN Y FETL+RFLL TLPVMSLL STEDE LLRKVL
Sbjct: 241 SGRSRQELEKDLRLWAVGSITGIRNCYLFETLIRFLLGVTLPVMSLL-STEDELLLRKVL 300
Query: 301 YDALILVDYSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFS 360
YDALILVDYSFL+P KAI+L AEHVA LAVKRLILTH+AIEF+REHGDQSRA+SYLNAFS
Sbjct: 301 YDALILVDYSFLNPVKAIDLHAEHVAFLAVKRLILTHDAIEFFREHGDQSRAISYLNAFS 360
Query: 361 SSLVSSQIIRWVRSQILSNGNVNRPNGSSPKILLKWLLKTEDQGVRVFDNTISDRRTKLV 420
SS V SQ+IRWVRSQI SN NVNRPNGSSPKILL+WL K EDQGVRVFDNTISD R KLV
Sbjct: 361 SSPVPSQMIRWVRSQIPSNENVNRPNGSSPKILLEWLFKAEDQGVRVFDNTISDHRAKLV 420
Query: 421 LDVSKSVSGHPTLEGDKVDDDLLFYIDKQGENENGSEEGKTMDESVNEAFVTVARTMSTT 480
LD+SKS S HP LEG+KVDD LLFY+DKQGE EN SEE K MDESVN A VTVARTMS
Sbjct: 421 LDISKSDSRHPKLEGNKVDDGLLFYVDKQGEKENESEEDKAMDESVNAALVTVARTMSMA 480
Query: 481 ENASGKQKRRRKAERKNKKIKFIKYDLIPNSDAAQLRSAVDNNDSNSEGEVHNPHLDEDS 540
EN SGK+KR+RK+ERKN KIKF+KYDL PN DAAQLRSAVDNND NSEGEVHNPH DEDS
Sbjct: 481 ENGSGKKKRQRKSERKN-KIKFVKYDLFPNPDAAQLRSAVDNNDPNSEGEVHNPHKDEDS 540
Query: 541 EMEE 545
+MEE
Sbjct: 541 DMEE 541
BLAST of Tan0002169 vs. NCBI nr
Match:
KAG7023645.1 (hypothetical protein SDJN02_14671, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 840.5 bits (2170), Expect = 8.2e-240
Identity = 436/540 (80.74%), Postives = 481/540 (89.07%), Query Frame = 0
Query: 6 VESMDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQARADPP 65
VESM+SMN+S Q+PFLGENYE TLEQSIQNVLAEIR+GNLG S E FYELIQAR DPP
Sbjct: 55 VESMESMNSSKQSPFLGENYEFTLEQSIQNVLAEIREGNLGFSQFMEGFYELIQARDDPP 114
Query: 66 MEAIWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQLVCSCSAPCDSSKTIALLAPVVFEV 125
+E+IWFYSALTFRSR ST GDFLDRVA MK+LFQ CSCSAPC SSKTIALL+PVV+EV
Sbjct: 115 LESIWFYSALTFRSRISTMNGDFLDRVATMKILFQTTCSCSAPCGSSKTIALLSPVVYEV 174
Query: 126 YKLIGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLITPFVD 185
YKLI DML KDL+ KREKKAMREVKSLVE ++GFINLSSCKDSDQN ESLDFNL+TPFVD
Sbjct: 175 YKLISDMLGKDLSSKREKKAMREVKSLVETMLGFINLSSCKDSDQNGESLDFNLVTPFVD 234
Query: 186 LISIWTHPNEGLDQFLPLVNSEVCGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFNSGRSR 245
LISIW + NEGLDQFLPLV+SEV GEFSSGVC +RRLAGVVIAE FLMKLCLD NSGRSR
Sbjct: 235 LISIWANSNEGLDQFLPLVSSEVRGEFSSGVCDIRRLAGVVIAETFLMKLCLDINSGRSR 294
Query: 246 QGLEKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVLYDALI 305
Q LE +LRIW VGSITRI+NFYFFETLVRFLLEATLPVMSLL STEDEALLRK+LYDALI
Sbjct: 295 QDLENDLRIWAVGSITRIKNFYFFETLVRFLLEATLPVMSLL-STEDEALLRKILYDALI 354
Query: 306 LVDYSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFSSSLVS 365
LVDYSFL+ EKAINLPA+HVA LAVKRLILTHEAIEFYREHGDQ+RA+SYLNAFS+SLVS
Sbjct: 355 LVDYSFLNDEKAINLPADHVAFLAVKRLILTHEAIEFYREHGDQNRAISYLNAFSTSLVS 414
Query: 366 SQIIRWVRSQILSNGNVNRPNGSSPKILLKWLLKTEDQGVRVFDNTISDRRTKLVLDVSK 425
SQIIRWV+SQI SN N N P GSSPKI L+WLLK ED GVRVFD+TIS+RR KLVLD SK
Sbjct: 415 SQIIRWVKSQIPSNENFNHPKGSSPKIFLEWLLKAEDHGVRVFDSTISNRRAKLVLDTSK 474
Query: 426 SVSGHPTLEGDKVDDDLLFYIDKQGENENGS-EEGKTMDESVNEAFVTVARTMSTTENAS 485
SVSGHPT EG+ VDD+LLFYIDKQGENENGS EE + MDESVN A V+ A TMSTT+N S
Sbjct: 475 SVSGHPTSEGNSVDDELLFYIDKQGENENGSEEEDRVMDESVNAALVSAAHTMSTTQNGS 534
Query: 486 GKQKRRRKAERKNKKIKFIKYDLIPNSDAAQLRSAVDNNDSNSEGEVHNPHLDEDSEMEE 545
GK+KR+R A +K KKIKF+KYDL+PNSD +LRSAV++ND++SEGEVHNPH DEDS+ +E
Sbjct: 535 GKKKRQRMA-KKQKKIKFMKYDLVPNSDVTELRSAVEDNDTDSEGEVHNPHSDEDSDTKE 592
BLAST of Tan0002169 vs. NCBI nr
Match:
XP_038880003.1 (uncharacterized protein LOC120071696 [Benincasa hispida])
HSP 1 Score: 836.6 bits (2160), Expect = 1.2e-238
Identity = 447/545 (82.02%), Postives = 485/545 (88.99%), Query Frame = 0
Query: 1 MALALVESMDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQA 60
MALALVESMDSMN +NPFLGENYE TL QSIQNV+AEIRKGN G S TE FYELIQA
Sbjct: 1 MALALVESMDSMNPLKKNPFLGENYEFTLAQSIQNVIAEIRKGNSGFSQFTEGFYELIQA 60
Query: 61 RADPPMEAIWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQLVCSCSAPCDSSKTIALLAP 120
RADPP+E+IWFYSALTFRSR KGDFL+RVAAMKVLFQLV SCSAPC SSKTI LL+P
Sbjct: 61 RADPPLESIWFYSALTFRSRGLNIKGDFLERVAAMKVLFQLVSSCSAPCGSSKTIPLLSP 120
Query: 121 VVFEVYKLIGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQN-CESLDFNL 180
VV EVYKLI DML KDLA KREKKAMREVKSLVE I+GFINLSSCKDSD+N ESLDFNL
Sbjct: 121 VVSEVYKLIVDMLGKDLASKREKKAMREVKSLVEAILGFINLSSCKDSDKNDDESLDFNL 180
Query: 181 ITPFVDLISIWTHPNEGLDQFLPLVNSEVCGEFSSGVCVVRRLAGVVIAEAFLMKLCLDF 240
ITPFVDLIS+WTHPNEGLDQFLPLV+SEV GEFSSGVC VRRLAGVVIAE FLMKLCLDF
Sbjct: 181 ITPFVDLISVWTHPNEGLDQFLPLVSSEVRGEFSSGVCDVRRLAGVVIAETFLMKLCLDF 240
Query: 241 NSGRSRQGLEKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKV 300
N+G SRQ LEK+LRIWTVGSITRIRNFYFFETLVRFLLEATLPV SLL STEDEALLRKV
Sbjct: 241 NTGHSRQDLEKDLRIWTVGSITRIRNFYFFETLVRFLLEATLPVTSLL-STEDEALLRKV 300
Query: 301 LYDALILVDYSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAF 360
LYD+LILV+YSFL PEKAI+LPAEHVA LAVKRLILTHEAIEFYREHGDQSRA+SYLNAF
Sbjct: 301 LYDSLILVEYSFLKPEKAIDLPAEHVASLAVKRLILTHEAIEFYREHGDQSRAISYLNAF 360
Query: 361 SSSLVSSQIIRWVRSQILSNGNVNRPNGSSPKILLKWLLKTEDQGVRVFDNTISDRRTKL 420
SSS VSSQIIRWV+SQ+ SN NV RPNGSSPKI+L+WLL+ EDQGVRVFD TIS+R KL
Sbjct: 361 SSSFVSSQIIRWVKSQMPSNENVKRPNGSSPKIVLEWLLEAEDQGVRVFDKTISNRCAKL 420
Query: 421 VLDVSKSVSGHPTLEGDKVDDDLLFYIDKQGENENGSEEGKTMDESVNEAFVTVARTMST 480
VLD SKSVS LEGDKVDDDLLFYIDKQGE+ENGSE+ TMDESVN A V+VARTMST
Sbjct: 421 VLDTSKSVS----LEGDKVDDDLLFYIDKQGESENGSED-TTMDESVNAALVSVARTMST 480
Query: 481 TENASGKQKRRRKAERKNKKIKFIKYDLIPNSDAAQLRSAVDNNDSNSEGEVHNPHLDED 540
TEN SGK KR+R +RKN+KIKF+KYDL+P+SD Q RS DNND++SEG+VHNPH D+D
Sbjct: 481 TENGSGK-KRQRMVKRKNEKIKFVKYDLVPSSDTTQSRSPFDNNDTDSEGKVHNPHSDDD 538
Query: 541 SEMEE 545
S+++E
Sbjct: 541 SDIKE 538
BLAST of Tan0002169 vs. NCBI nr
Match:
KAG6589981.1 (hypothetical protein SDJN03_15404, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 833.6 bits (2152), Expect = 1.0e-237
Identity = 433/537 (80.63%), Postives = 477/537 (88.83%), Query Frame = 0
Query: 9 MDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQARADPPMEA 68
M+SMN+S Q+PFLGENYE TLEQSIQNVLAEIR+GNLG S E FYELIQAR DPP+E+
Sbjct: 1 MESMNSSKQSPFLGENYEFTLEQSIQNVLAEIREGNLGFSQFMEGFYELIQARDDPPLES 60
Query: 69 IWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQLVCSCSAPCDSSKTIALLAPVVFEVYKL 128
IWFYSALTFRSR ST GDFLDRVA MK+LFQ CSCSAPC SSKTIALL+PVV+EVYKL
Sbjct: 61 IWFYSALTFRSRISTMNGDFLDRVATMKILFQTTCSCSAPCGSSKTIALLSPVVYEVYKL 120
Query: 129 IGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLITPFVDLIS 188
I DML KDL+ KREKKAMREVKSLVE ++GFINLSSCKDSDQN ESLDFNL+TPFVDLIS
Sbjct: 121 ISDMLGKDLSSKREKKAMREVKSLVETMLGFINLSSCKDSDQNGESLDFNLVTPFVDLIS 180
Query: 189 IWTHPNEGLDQFLPLVNSEVCGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFNSGRSRQGL 248
IW + NEGLDQFLPLV+SEV GEFSSGVC +RRLAGVVIAE FLMKLCLD NSGRSRQ L
Sbjct: 181 IWANSNEGLDQFLPLVSSEVRGEFSSGVCDIRRLAGVVIAETFLMKLCLDINSGRSRQDL 240
Query: 249 EKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVLYDALILVD 308
E +LRIW VGSITRI+NFYFFETLVRFLLEATLPVMSLL STEDEALLRK+LYDALILVD
Sbjct: 241 ENDLRIWAVGSITRIKNFYFFETLVRFLLEATLPVMSLL-STEDEALLRKILYDALILVD 300
Query: 309 YSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFSSSLVSSQI 368
YSFL+ EKAINLPA+HVA LAVKRLILTHEAIEFYREHGDQ+RA+SYLNAFS+SLVSSQI
Sbjct: 301 YSFLNDEKAINLPADHVAFLAVKRLILTHEAIEFYREHGDQNRAISYLNAFSTSLVSSQI 360
Query: 369 IRWVRSQILSNGNVNRPNGSSPKILLKWLLKTEDQGVRVFDNTISDRRTKLVLDVSKSVS 428
IRWV+SQI SN NVN P GSSPKI L+WLLK ED GVRVFD+TIS+RR KLVLD SKSVS
Sbjct: 361 IRWVKSQIPSNENVNHPKGSSPKIFLEWLLKAEDHGVRVFDSTISNRRAKLVLDTSKSVS 420
Query: 429 GHPTLEGDKVDDDLLFYIDKQGENENGS-EEGKTMDESVNEAFVTVARTMSTTENASGKQ 488
GHPT EG+ VDD+LLFYIDKQGENENGS EE + MDESVN A V+ A TMSTT+N S K+
Sbjct: 421 GHPTSEGNSVDDELLFYIDKQGENENGSEEEDRVMDESVNAALVSAAHTMSTTQNGSAKK 480
Query: 489 KRRRKAERKNKKIKFIKYDLIPNSDAAQLRSAVDNNDSNSEGEVHNPHLDEDSEMEE 545
KR+R A +K KKIKF KYDL+PNSD +LRSAV++ND++SEGEVHNPH DEDS+ +E
Sbjct: 481 KRQRMA-KKQKKIKFTKYDLVPNSDLTELRSAVEDNDTDSEGEVHNPHSDEDSDTKE 535
BLAST of Tan0002169 vs. NCBI nr
Match:
XP_022987671.1 (uncharacterized protein LOC111485155 [Cucurbita maxima])
HSP 1 Score: 832.8 bits (2150), Expect = 1.7e-237
Identity = 434/537 (80.82%), Postives = 477/537 (88.83%), Query Frame = 0
Query: 9 MDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQARADPPMEA 68
M+SMN+S Q+PFLGENYE TLEQSIQNVLAEIR+GNL S E FYELIQARADPP+E+
Sbjct: 1 MESMNSSKQSPFLGENYEFTLEQSIQNVLAEIREGNLVFSQFMEGFYELIQARADPPLES 60
Query: 69 IWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQLVCSCSAPCDSSKTIALLAPVVFEVYKL 128
IWFYSALTFRSR ST GDFLDRVA MK+LFQ CSCSAPC SSKTIALLAPVV+EVYKL
Sbjct: 61 IWFYSALTFRSRISTMNGDFLDRVATMKILFQTTCSCSAPCGSSKTIALLAPVVYEVYKL 120
Query: 129 IGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLITPFVDLIS 188
I DML KDL KREKKAMREVKSLVE I+GFINLSSCKDSDQN ESLDFNL+TPFVDLIS
Sbjct: 121 ISDMLGKDLFSKREKKAMREVKSLVETILGFINLSSCKDSDQNGESLDFNLVTPFVDLIS 180
Query: 189 IWTHPNEGLDQFLPLVNSEVCGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFNSGRSRQGL 248
IWT+ NEGLDQFLPLV+SEV GEFSSGVC +RRLAGVVIAE FL+KLCLD NSGRSRQ L
Sbjct: 181 IWTNSNEGLDQFLPLVSSEVRGEFSSGVCDIRRLAGVVIAETFLLKLCLDINSGRSRQDL 240
Query: 249 EKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVLYDALILVD 308
E +LRIW VGSITRI+NFYFFETLVRFLLEATLPVMSLL STEDEALLRK+LYDALILVD
Sbjct: 241 ENDLRIWAVGSITRIKNFYFFETLVRFLLEATLPVMSLL-STEDEALLRKILYDALILVD 300
Query: 309 YSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFSSSLVSSQI 368
YSFL+ EKAINLPA+HVA LAVKRLILTHEAIEFYREHGDQ+RA+SYLNAFS+SLVSSQI
Sbjct: 301 YSFLNDEKAINLPADHVAFLAVKRLILTHEAIEFYREHGDQNRAISYLNAFSTSLVSSQI 360
Query: 369 IRWVRSQILSNGNVNRPNGSSPKILLKWLLKTEDQGVRVFDNTISDRRTKLVLDVSKSVS 428
IRWV+SQI S+ NVN P GSSPKI L+WL K ED GVRVFD+TIS+RR KLVLD SKSVS
Sbjct: 361 IRWVKSQIPSHENVNHPKGSSPKIFLEWLFKAEDHGVRVFDSTISNRRAKLVLDTSKSVS 420
Query: 429 GHPTLEGDKVDDDLLFYIDKQGENENGS-EEGKTMDESVNEAFVTVARTMSTTENASGKQ 488
GHPT EG+ VDD+LLFYIDKQGENENGS EE + MDE+VN A V+ A TMSTT+N K+
Sbjct: 421 GHPTSEGNSVDDELLFYIDKQGENENGSEEEDRVMDETVNAALVSAAHTMSTTQNGLEKK 480
Query: 489 KRRRKAERKNKKIKFIKYDLIPNSDAAQLRSAVDNNDSNSEGEVHNPHLDEDSEMEE 545
KRRR A +K KKIKF KYDL+PNSDA +LRSAVD+ND++S+ EVHNPHLDEDS+M+E
Sbjct: 481 KRRRMA-KKQKKIKFTKYDLVPNSDATELRSAVDDNDTDSDSEVHNPHLDEDSDMKE 535
BLAST of Tan0002169 vs. ExPASy TrEMBL
Match:
A0A6J1D1X1 (uncharacterized protein LOC111016527 OS=Momordica charantia OX=3673 GN=LOC111016527 PE=4 SV=1)
HSP 1 Score: 850.9 bits (2197), Expect = 2.9e-243
Identity = 449/544 (82.54%), Postives = 480/544 (88.24%), Query Frame = 0
Query: 1 MALALVESMDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQA 60
MALALVESMDSMN NQNPFLGENYE TL+QSI+NVLAEIR+GNLG H TE FY+L+QA
Sbjct: 1 MALALVESMDSMNPPNQNPFLGENYELTLKQSIKNVLAEIREGNLGFCHFTEDFYKLMQA 60
Query: 61 RADPPMEAIWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQLVCSCSAPCDSSKTIALLAP 120
R DPPME+IWFYSAL FRS SS KGDFLDR+AAMKVLFQLVCSCSAPC SSKT+A LAP
Sbjct: 61 RVDPPMESIWFYSALMFRSHSS-AKGDFLDRLAAMKVLFQLVCSCSAPCGSSKTVASLAP 120
Query: 121 VVFEVYKLIGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLI 180
VVFEVYKLI DML KDLA KREKKAMREVK+LVE I+GFINLSSCK SDQN E LDFNLI
Sbjct: 121 VVFEVYKLIADMLGKDLASKREKKAMREVKALVEAILGFINLSSCKVSDQNVEQLDFNLI 180
Query: 181 TPFVDLISIWTHPNEGLDQFLPLVNSEVCGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFN 240
TPF+DLISIWTHPNEGLDQFLPLV+SEV G F SGVC VR LAGVVIAEAFLMKLCLDF+
Sbjct: 181 TPFMDLISIWTHPNEGLDQFLPLVSSEVRGGFCSGVCDVRHLAGVVIAEAFLMKLCLDFH 240
Query: 241 SGRSRQGLEKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVL 300
SGRSRQ LEK+LR+W VGSIT IRN Y FETL+RFLL TLPVMSLL STEDE LLRKVL
Sbjct: 241 SGRSRQELEKDLRLWAVGSITGIRNCYLFETLIRFLLGVTLPVMSLL-STEDELLLRKVL 300
Query: 301 YDALILVDYSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFS 360
YDALILVDYSFL+P KAI+L AEHVA LAVKRLILTH+AIEF+REHGDQSRA+SYLNAFS
Sbjct: 301 YDALILVDYSFLNPVKAIDLHAEHVAFLAVKRLILTHDAIEFFREHGDQSRAISYLNAFS 360
Query: 361 SSLVSSQIIRWVRSQILSNGNVNRPNGSSPKILLKWLLKTEDQGVRVFDNTISDRRTKLV 420
SS V SQ+IRWVRSQI SN NVNRPNGSSPKILL+WL K EDQGVRVFDNTISD R KLV
Sbjct: 361 SSPVPSQMIRWVRSQIPSNENVNRPNGSSPKILLEWLFKAEDQGVRVFDNTISDHRAKLV 420
Query: 421 LDVSKSVSGHPTLEGDKVDDDLLFYIDKQGENENGSEEGKTMDESVNEAFVTVARTMSTT 480
LD+SKS S HP LEG+KVDD LLFY+DKQGE EN SEE K MDESVN A VTVARTMS
Sbjct: 421 LDISKSDSRHPKLEGNKVDDGLLFYVDKQGEKENESEEDKAMDESVNAALVTVARTMSMA 480
Query: 481 ENASGKQKRRRKAERKNKKIKFIKYDLIPNSDAAQLRSAVDNNDSNSEGEVHNPHLDEDS 540
EN SGK+KR+RK+ERKN KIKF+KYDL PN DAAQLRSAVDNND NSEGEVHNPH DEDS
Sbjct: 481 ENGSGKKKRQRKSERKN-KIKFVKYDLFPNPDAAQLRSAVDNNDPNSEGEVHNPHKDEDS 540
Query: 541 EMEE 545
+MEE
Sbjct: 541 DMEE 541
BLAST of Tan0002169 vs. ExPASy TrEMBL
Match:
A0A6J1JEZ4 (uncharacterized protein LOC111485155 OS=Cucurbita maxima OX=3661 GN=LOC111485155 PE=4 SV=1)
HSP 1 Score: 832.8 bits (2150), Expect = 8.3e-238
Identity = 434/537 (80.82%), Postives = 477/537 (88.83%), Query Frame = 0
Query: 9 MDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQARADPPMEA 68
M+SMN+S Q+PFLGENYE TLEQSIQNVLAEIR+GNL S E FYELIQARADPP+E+
Sbjct: 1 MESMNSSKQSPFLGENYEFTLEQSIQNVLAEIREGNLVFSQFMEGFYELIQARADPPLES 60
Query: 69 IWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQLVCSCSAPCDSSKTIALLAPVVFEVYKL 128
IWFYSALTFRSR ST GDFLDRVA MK+LFQ CSCSAPC SSKTIALLAPVV+EVYKL
Sbjct: 61 IWFYSALTFRSRISTMNGDFLDRVATMKILFQTTCSCSAPCGSSKTIALLAPVVYEVYKL 120
Query: 129 IGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLITPFVDLIS 188
I DML KDL KREKKAMREVKSLVE I+GFINLSSCKDSDQN ESLDFNL+TPFVDLIS
Sbjct: 121 ISDMLGKDLFSKREKKAMREVKSLVETILGFINLSSCKDSDQNGESLDFNLVTPFVDLIS 180
Query: 189 IWTHPNEGLDQFLPLVNSEVCGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFNSGRSRQGL 248
IWT+ NEGLDQFLPLV+SEV GEFSSGVC +RRLAGVVIAE FL+KLCLD NSGRSRQ L
Sbjct: 181 IWTNSNEGLDQFLPLVSSEVRGEFSSGVCDIRRLAGVVIAETFLLKLCLDINSGRSRQDL 240
Query: 249 EKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVLYDALILVD 308
E +LRIW VGSITRI+NFYFFETLVRFLLEATLPVMSLL STEDEALLRK+LYDALILVD
Sbjct: 241 ENDLRIWAVGSITRIKNFYFFETLVRFLLEATLPVMSLL-STEDEALLRKILYDALILVD 300
Query: 309 YSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFSSSLVSSQI 368
YSFL+ EKAINLPA+HVA LAVKRLILTHEAIEFYREHGDQ+RA+SYLNAFS+SLVSSQI
Sbjct: 301 YSFLNDEKAINLPADHVAFLAVKRLILTHEAIEFYREHGDQNRAISYLNAFSTSLVSSQI 360
Query: 369 IRWVRSQILSNGNVNRPNGSSPKILLKWLLKTEDQGVRVFDNTISDRRTKLVLDVSKSVS 428
IRWV+SQI S+ NVN P GSSPKI L+WL K ED GVRVFD+TIS+RR KLVLD SKSVS
Sbjct: 361 IRWVKSQIPSHENVNHPKGSSPKIFLEWLFKAEDHGVRVFDSTISNRRAKLVLDTSKSVS 420
Query: 429 GHPTLEGDKVDDDLLFYIDKQGENENGS-EEGKTMDESVNEAFVTVARTMSTTENASGKQ 488
GHPT EG+ VDD+LLFYIDKQGENENGS EE + MDE+VN A V+ A TMSTT+N K+
Sbjct: 421 GHPTSEGNSVDDELLFYIDKQGENENGSEEEDRVMDETVNAALVSAAHTMSTTQNGLEKK 480
Query: 489 KRRRKAERKNKKIKFIKYDLIPNSDAAQLRSAVDNNDSNSEGEVHNPHLDEDSEMEE 545
KRRR A +K KKIKF KYDL+PNSDA +LRSAVD+ND++S+ EVHNPHLDEDS+M+E
Sbjct: 481 KRRRMA-KKQKKIKFTKYDLVPNSDATELRSAVDDNDTDSDSEVHNPHLDEDSDMKE 535
BLAST of Tan0002169 vs. ExPASy TrEMBL
Match:
A0A6J1H8Q6 (uncharacterized protein LOC111461526 OS=Cucurbita moschata OX=3662 GN=LOC111461526 PE=4 SV=1)
HSP 1 Score: 830.5 bits (2144), Expect = 4.1e-237
Identity = 432/537 (80.45%), Postives = 476/537 (88.64%), Query Frame = 0
Query: 9 MDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQARADPPMEA 68
M+SMN+S Q+PFLGENYE TLEQSIQNVLAEIR+GNLG S E FYELIQAR DPP+E+
Sbjct: 1 MESMNSSKQSPFLGENYEFTLEQSIQNVLAEIREGNLGFSQFMEGFYELIQARDDPPLES 60
Query: 69 IWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQLVCSCSAPCDSSKTIALLAPVVFEVYKL 128
IWFYSALTFRSR ST GDFLDRVA MK+LFQ CSCSAPC SSKTIALL+PVV+EVYKL
Sbjct: 61 IWFYSALTFRSRISTMNGDFLDRVATMKILFQTTCSCSAPCGSSKTIALLSPVVYEVYKL 120
Query: 129 IGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLITPFVDLIS 188
I DML KDL+ KREKKAMREVKSLVE ++GFINLSSCKDSDQN ESLDFNL+TPFVDLIS
Sbjct: 121 ISDMLGKDLSSKREKKAMREVKSLVETMLGFINLSSCKDSDQNGESLDFNLVTPFVDLIS 180
Query: 189 IWTHPNEGLDQFLPLVNSEVCGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFNSGRSRQGL 248
IW + NEGLDQFLPLV+SEV GEFSSGVC +RRLAGVVIAE FLMKLCLD NSGRSRQ L
Sbjct: 181 IWANSNEGLDQFLPLVSSEVRGEFSSGVCDIRRLAGVVIAETFLMKLCLDINSGRSRQDL 240
Query: 249 EKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVLYDALILVD 308
E +LRIW VGSITRI+NFYFFETLVRFLLEATLPVMSLL STEDEALLRK+LYDALILVD
Sbjct: 241 ENDLRIWAVGSITRIKNFYFFETLVRFLLEATLPVMSLL-STEDEALLRKILYDALILVD 300
Query: 309 YSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFSSSLVSSQI 368
YSFL+ EKAINLPA+HVA LAVKRLILTHEAIEFYREHGDQ+RA+SYLNAFS+SLVSSQI
Sbjct: 301 YSFLNDEKAINLPADHVAFLAVKRLILTHEAIEFYREHGDQNRAISYLNAFSTSLVSSQI 360
Query: 369 IRWVRSQILSNGNVNRPNGSSPKILLKWLLKTEDQGVRVFDNTISDRRTKLVLDVSKSVS 428
IRWV+SQI SN N N P GSSPKI L+WLLK ED GVRVFD+TIS+RR KLVLD SKSVS
Sbjct: 361 IRWVKSQIPSNENFNHPKGSSPKIFLEWLLKAEDHGVRVFDSTISNRRAKLVLDTSKSVS 420
Query: 429 GHPTLEGDKVDDDLLFYIDKQGENENGS-EEGKTMDESVNEAFVTVARTMSTTENASGKQ 488
GHPT EG+ VDD+LLFYIDKQGENENGS EE + MDESVN A V+ A TMSTT+N SGK+
Sbjct: 421 GHPTSEGNSVDDELLFYIDKQGENENGSEEEDRVMDESVNAALVSAAHTMSTTQNGSGKK 480
Query: 489 KRRRKAERKNKKIKFIKYDLIPNSDAAQLRSAVDNNDSNSEGEVHNPHLDEDSEMEE 545
KR+R A +K KKIKF KYDL+ NSD +LRSAV++ND++SEGEVHNPH DEDS+ +E
Sbjct: 481 KRQRMA-KKQKKIKFTKYDLVLNSDVTELRSAVEDNDTDSEGEVHNPHSDEDSDTKE 535
BLAST of Tan0002169 vs. ExPASy TrEMBL
Match:
A0A0A0M1W0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G586780 PE=4 SV=1)
HSP 1 Score: 758.8 bits (1958), Expect = 1.5e-215
Identity = 423/614 (68.89%), Postives = 470/614 (76.55%), Query Frame = 0
Query: 1 MALALVESMDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQA 60
MAL LVESM+S+N QNPFLGENYE TL QSIQNVLAEIRKGN+ S T+RFY+LIQA
Sbjct: 1 MALGLVESMESINPLKQNPFLGENYEFTLAQSIQNVLAEIRKGNVVFSQFTKRFYKLIQA 60
Query: 61 RADPPMEAIWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQLVCSCSAPCDSSKTIALLAP 120
RADPP+E+IWFYSAL FRS S KGDFL+RVAAMKVLFQLVCSCSAPC SSKTI LL+P
Sbjct: 61 RADPPLESIWFYSALKFRS-SFNPKGDFLERVAAMKVLFQLVCSCSAPCGSSKTITLLSP 120
Query: 121 VVFEVYKLIGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLI 180
VV EVYKL+ DM KDL REKKAMREVKSLVE I+GF+NLSS +DSD+N +SLDF+LI
Sbjct: 121 VVSEVYKLVIDMRGKDLNSTREKKAMREVKSLVEAILGFMNLSSREDSDKNDKSLDFSLI 180
Query: 181 TPFVDLISIWTHPNEGLDQFLPLVNSEVCGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFN 240
TPF+DLISIWT PNEGLDQFLPLV SEV EFSSG C VRRLAGVVIAE FLMKLCLDFN
Sbjct: 181 TPFMDLISIWTQPNEGLDQFLPLVCSEVREEFSSGECDVRRLAGVVIAEIFLMKLCLDFN 240
Query: 241 SGRSRQGLEKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVL 300
GRSRQ LEK+L W VGSIT+IRNFY FETLVR LLEATLPV SLL ST++EALLRKVL
Sbjct: 241 YGRSRQDLEKDLITWAVGSITQIRNFYSFETLVRVLLEATLPVTSLL-STDNEALLRKVL 300
Query: 301 YDALILVDYSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFS 360
YDALILVDYSFL PE AINLPAEHVA LAVKRLILT+EAIEFYREHGDQ+RA+SYLNAFS
Sbjct: 301 YDALILVDYSFLKPEIAINLPAEHVAFLAVKRLILTYEAIEFYREHGDQNRAISYLNAFS 360
Query: 361 SSLVSSQIIRWVRSQILSNGNVNRPNGSSPKILLKWLLKTEDQGVRVFDNTISDRRTKLV 420
SSLVSSQIIRW++SQ+ SN N+N PNG SPK+ L+WLLK EDQGVRVFDNTIS+RR+KLV
Sbjct: 361 SSLVSSQIIRWIKSQMPSNENLNCPNGLSPKVFLEWLLKAEDQGVRVFDNTISNRRSKLV 420
Query: 421 LDVSKSVSGHPTLEGDKVDDDLLFYIDKQGENENGSEEGKTMDESVNEAFVTVARTMSTT 480
LD SKSVS EGDKVDDDLLFYIDKQG N NGSEE TMDESVN A + A TMSTT
Sbjct: 421 LDTSKSVS----FEGDKVDDDLLFYIDKQGGNVNGSEEDTTMDESVNAALASAAPTMSTT 480
Query: 481 ENASGKQKRR-------------------------------------------------- 540
EN+S K+ R
Sbjct: 481 ENSSVKKLSRKAKKRNKKLKLLSQLKSAVEGDLLFCINKQGENENGNEEDTTMNEPVNEA 540
Query: 541 --------------------RKAERKNKKIKFIKYDLIPNSDAAQLRSAVDNNDSNSEGE 545
RKA+RKNKK K +KYDL+PN+DA QL+SAV+NND++SEGE
Sbjct: 541 LVSAAPTMSTTENSSVKSLKRKAKRKNKKNKLVKYDLVPNTDATQLKSAVENNDTHSEGE 600
BLAST of Tan0002169 vs. ExPASy TrEMBL
Match:
A0A5D3CFU4 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold16G002540 PE=4 SV=1)
HSP 1 Score: 744.2 bits (1920), Expect = 3.9e-211
Identity = 408/544 (75.00%), Postives = 452/544 (83.09%), Query Frame = 0
Query: 1 MALALVESMDSMNTSNQNPFLGENYECTLEQSIQNVLAEIRKGNLGSSHLTERFYELIQA 60
MAL LVESM+S+N +N FLGENYE TL QSIQNVLAEIRKGN+ S TE FY+LIQA
Sbjct: 1 MALGLVESMESINPLKKNTFLGENYEFTLAQSIQNVLAEIRKGNVVFSRFTEGFYKLIQA 60
Query: 61 RADPPMEAIWFYSALTFRSRSSTTKGDFLDRVAAMKVLFQLVCSCSAPCDSSKTIALLAP 120
RADPP+E+IWFYSALTFRS S KGDFL+RVAAMKVLFQLVCSCSAPC SSKTI LL+P
Sbjct: 61 RADPPLESIWFYSALTFRS-SFNPKGDFLERVAAMKVLFQLVCSCSAPCGSSKTITLLSP 120
Query: 121 VVFEVYKLIGDMLKKDLALKREKKAMREVKSLVEVIIGFINLSSCKDSDQNCESLDFNLI 180
VV EVYKL+ DM KDL KREKKAMREVKSLVE I+G NLSSC+DS++N +SLDFN I
Sbjct: 121 VVSEVYKLVIDMRGKDLNSKREKKAMREVKSLVEAILGLTNLSSCEDSNKNDKSLDFNFI 180
Query: 181 TPFVDLISIWTHPNEGLDQFLPLVNSEVCGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFN 240
TPFVDLISIWTHPNEGLDQFLPLV SEV EFSSG C VRRLAGVVIAE FL+KLCLDFN
Sbjct: 181 TPFVDLISIWTHPNEGLDQFLPLVCSEVREEFSSGECDVRRLAGVVIAETFLVKLCLDFN 240
Query: 241 SGRSRQGLEKELRIWTVGSITRIRNFYFFETLVRFLLEATLPVMSLLQSTEDEALLRKVL 300
G SRQ LE++LR WTVGSITRIRNFYFFETLVR LLEATLPV SLL ST+DEALLRKVL
Sbjct: 241 CGHSRQALEEDLRNWTVGSITRIRNFYFFETLVRLLLEATLPVTSLL-STDDEALLRKVL 300
Query: 301 YDALILVDYSFLDPEKAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFS 360
DALILVDYSFL PEKAINLPAEH A LAVKRLILT+EA EFYR+HGDQ+RA+SYLNAFS
Sbjct: 301 SDALILVDYSFLKPEKAINLPAEHTAFLAVKRLILTYEATEFYRKHGDQNRAISYLNAFS 360
Query: 361 SSLVSSQIIRWVRSQILSNGNVNRPNGSSPKILLKWLLKTEDQGVRVFDNTISDRRTKLV 420
SSLVSSQIIRWV+SQ+ SN N+N NGSSPK+ L+WLLK EDQGVRVFDNTIS+ R K+V
Sbjct: 361 SSLVSSQIIRWVKSQMPSNENLNHLNGSSPKVFLEWLLKAEDQGVRVFDNTISNHRAKIV 420
Query: 421 LDVSKSVSGHPTLEGDKVDDDLLFYIDKQGENENGSEEGKTMDESVNEAFVTVARTMSTT 480
LD SKSV EGDKVDDDLLFYIDKQGENENG EE KTMD+SVN A V+VA TMSTT
Sbjct: 421 LDTSKSV----LFEGDKVDDDLLFYIDKQGENENGREEDKTMDKSVNAALVSVAHTMSTT 480
Query: 481 ENASGKQKRRRKAERKNKKIKFIKYDLIPNSDAAQLRSAVDNNDSNSEGEVHNPHLDEDS 540
EN+S K KR RKA+++NKK N+D +QL+SAV+NND+N + ED+
Sbjct: 481 ENSSVK-KRSRKAKKRNKK---------KNADTSQLKSAVENNDTNGK---------EDT 519
Query: 541 EMEE 545
M+E
Sbjct: 541 TMDE 519
BLAST of Tan0002169 vs. TAIR 10
Match:
AT5G11780.1 (unknown protein; Has 37 Blast hits to 37 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 3; Plants - 34; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 157.5 bits (397), Expect = 3.0e-38
Identity = 136/494 (27.53%), Postives = 230/494 (46.56%), Query Frame = 0
Query: 29 LEQSIQNVLAEIRKGNLGSSHLTERFYELIQARAD-PPMEAIWFYSALTFRSRSSTTKGD 88
L SI+ +L + R G S F ++ + PP+E +WFYSA+ F S +
Sbjct: 21 LNDSIKQLLLQYRGGRTNFSDFDSIFTRILNDLPEPPPLELVWFYSAIRFYSSKLAFRD- 80
Query: 89 FLDRVAAMKVLFQLVCSCSAPCDSSKTIALLAPVVFEVYKLIGDMLKKDLALKREKKAMR 148
D V FQL+ S S K ++LL+PVV+++ +L+ + R + A+
Sbjct: 81 --DSVRLTSCFFQLIVSFSDSFSGVKKVSLLSPVVYQLSRLV---------ISRRRDAL- 140
Query: 149 EVKSLVEVIIGFINLSSCKDSDQNCESLDFNLIT--PFVDLISIWT--------HPNEGL 208
SL+E I+ +I++ C D N E D +++ F DL +W + L
Sbjct: 141 ---SLLEGIVSYISM-YCVDEPGN-EDDDVLMVSGFSFADLSRVWVVDEVEDNCRVEDCL 200
Query: 209 DQFLPLVNSEVCGEFSSGVCVVRRLAGVVIAEAFLMKLCLDFNSGRSRQGLEKELRIWTV 268
+ F+P + + E S C V LAG+V ++ FL+ LC F+ R L+K+L+ +
Sbjct: 201 EVFMPFASEILRKEIDSESCGVGYLAGIVASQVFLLSLCSRFDLDLGRSELDKDLQESVL 260
Query: 269 GSITRIRNFYFFETLVRFLLEATLPVMSLLQST-EDEALLRKVLYDALI-LVDYSFLDPE 328
I+ + +FF+ +++ LLE L + SL+ EDEA L +++ +A+I V+ FL+P
Sbjct: 261 QMISGFHSCFFFDVILKMLLEPYLHLTSLMGVVPEDEAFLTEIITEAVIKSVEKLFLNPG 320
Query: 329 KAINLPAEHVALLAVKRLILTHEAIEFYREHGDQSRAVSYLNAFSSSLVSSQIIRWVRSQ 388
+ + H+ +A+ L L + + R + DQ + Y N FS+SL+ +I WV SQ
Sbjct: 321 NGTSQRSLHLKNIAINWLFLFDKTMASLRRNKDQEKISMYTNMFSNSLIPYHLINWVISQ 380
Query: 389 ILSNGNVNRPNGSSPKILLKWLLKTEDQGVRVFDNTISDRRTKLVLDVSKSVSGHPTLEG 448
+ + +P ++WL+ E+QG RVF+ S K V+ S+
Sbjct: 381 GEVIRDADTLRNLTPASFIEWLVSLEEQGPRVFNCDHSKNYAKSVIHRSR---------- 440
Query: 449 DKVDDDLLFYIDKQGENENGSEEGKTMDESVNEAFVTVARTMSTTENASGKQKRRRKAER 506
DL Q + E ++ DE N + +++ + R+RK ER
Sbjct: 441 ----PDLSIGTTLQKQEEEFDQDTDMADEQ-NVSSISIL----------SRNTRKRKEER 471
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
XP_022147658.1 | 6.1e-243 | 82.54 | uncharacterized protein LOC111016527 [Momordica charantia] | [more] |
KAG7023645.1 | 8.2e-240 | 80.74 | hypothetical protein SDJN02_14671, partial [Cucurbita argyrosperma subsp. argyro... | [more] |
XP_038880003.1 | 1.2e-238 | 82.02 | uncharacterized protein LOC120071696 [Benincasa hispida] | [more] |
KAG6589981.1 | 1.0e-237 | 80.63 | hypothetical protein SDJN03_15404, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
XP_022987671.1 | 1.7e-237 | 80.82 | uncharacterized protein LOC111485155 [Cucurbita maxima] | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1D1X1 | 2.9e-243 | 82.54 | uncharacterized protein LOC111016527 OS=Momordica charantia OX=3673 GN=LOC111016... | [more] |
A0A6J1JEZ4 | 8.3e-238 | 80.82 | uncharacterized protein LOC111485155 OS=Cucurbita maxima OX=3661 GN=LOC111485155... | [more] |
A0A6J1H8Q6 | 4.1e-237 | 80.45 | uncharacterized protein LOC111461526 OS=Cucurbita moschata OX=3662 GN=LOC1114615... | [more] |
A0A0A0M1W0 | 1.5e-215 | 68.89 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G586780 PE=4 SV=1 | [more] |
A0A5D3CFU4 | 3.9e-211 | 75.00 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... | [more] |
Match Name | E-value | Identity | Description | |
AT5G11780.1 | 3.0e-38 | 27.53 | unknown protein; Has 37 Blast hits to 37 proteins in 12 species: Archae - 0; Bac... | [more] |