Cp4.1LG20g05980 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG20g05980
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Mitochondrial transcription termination factor family protein, putative
Location: Cp4.1LG20 : 3653355 .. 3655094 (-)
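
As an illustration of how the location field above can be read, the following sketch splits it into sequence name, start, end, and strand, and derives the span length. It assumes the coordinates are 1-based and inclusive, which this record does not state explicitly; `parse_location` is a hypothetical helper written for this example, not part of any database tool.

```python
import re

# Pattern for a location written as "<seqid> : <start> .. <end> (<strand>)".
LOCATION_PATTERN = re.compile(
    r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
)

def parse_location(text):
    """Split a location string into its parts and compute the span length.

    Assumption: start and end are 1-based and inclusive, so the length
    is end - start + 1.
    """
    match = LOCATION_PATTERN.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

location = parse_location("Cp4.1LG20 : 3653355 .. 3655094 (-)")
print(location["strand"], location["length"])
```

Under that assumption the feature spans 1740 bases on the minus strand of Cp4.1LG20.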
The following sequences are available for this feature:

Gene sequence (with intron)

ATGATTACCCAGCTCAATCACTTGCTGTTCTTTGCGCCTGTTCTATCTGAAAAAACTATTACTCAAGCTCATAATCCGTCTTTTGTTAGCTTTAAATCTCGATTGTTTTGCAATTATGGATTTGATATTCATCCCAGAGCGTCCCTTATAGAATCACCGCAATCGCCCACTACAGGTGGCCGAGTGTCTCGGTATGCAAGAACAGAGGCCCAGAAAGTGTTATTTGACTATCTGCATTGTACTAGGAGCCTCGGCTTTGCAGATGCTGAACATATAAGCAAGAACTCGCCTCATTTTCTTCAAAATTTGATCATGAAACTTGACAGCGAAAAAGATGTGGCTAGGTCTCTCAGAAAGTATCTTAGATACAATCCCATCAATGAATTCGAGCCATTCTTTGAAAGCCTGGGACTACCCCCATCAGAGCTCCCCTTGTTTCTTCCACGACGCTTGATGTTTTTGAGTGATGATCATCTTATGCTTGAAAACTTCCATGTTTTGTGCAATTATGGTATCCCGCGTAGCAAGATGGGCAAAATGTATAAGGAAGCAAGAGAAATATTTGCTTATGATTATGGTCTATTAGCCTCAAAACTTAGAGCTTATGAGGATTTAGGTCTTGGAATGGGCACACTTATTAAGCTTGTTAGTTGCTGCCCTTCACTTTTGATTGGTCAGATCAAAATGGAGTTCATCAAAGTTCTGGAGAAGCTAAGTAAATTAGGTATTGAAGAAGATTGGATTGGAGGATATATATCTCATAAAAGTACCTATAATTGGAATAGACTGGTTGACACTCTGGACTTTCTAGCTAAAGTAGGTTATACAGAGATACAGATGCACGATCTGTTTAAATCAAATCCCTCATTGCTGCTTGAAGATTCTGGGAAGAAAGTGTATGTATTATTCGTTCGATTAGTCAAGTTGGGTCTTAAGATGGACGAAGCTTATTCGATTTATAAACAAAACCCTGTAATTTTGTCTGGGAAGTATGTTAAAAACATTCTGAGAGCAGTAGATTTTCTCTTTGATATCGGGTTGGGAACAGAGGACATTGCAGGTATAGTATCTCATCAAATTCTGTTACTTGGTTCGTGTACTCTGAAAGGGCCGAAAACTGTCTGTAAAGAACTAAAAGTCGGAAAAGAAGGTTTATGTCTGATCATTCGAGACGACCCGTCCAAGCTGTTTACCTTGGCTTCCAAATCAAAGCTAAAAAGCAGTGAACAGGCTTCTTGCCAAAACCCCGCCAAAGAGATGGAGAAGACTACATTCCTGCTGAAGTTGGGATACGTCGAAAACTCAGATGAGTTGGCAAAGGCGTCGAAACAGTTTCGGGGTCGGGGAGATCAATTACAGGAGAGATTTGATTGCCTGGTAAATGCTGGTTTGGACTGTCATGTGGTGACAAATATAGTCAGACATGCGCCCATGGTTCTAAACCAGAGCAAAGATGTAATCCAAGAGAAGATTGATTGCTTAAGAAACTGTTTAGGTTACCCCTTGCATACAATAGCGGCATTCCCAGTTTATTTATGTTACAACATGGAGAGAATAAACACAAGATTTTCAATGTATAGATGGTTAAGGGATAAGGGTGCTGCAAAACCCAACTTATCATTGAGCACTGTCTTGGCTTGTTCTGATGCAAGATTTGTAAAATATTTTGTGGATGTTCATCCAGAAGGCCCCTCCATGTGGGAAAGTTGTAAAAAACATGGTCTCCATAATCATAGTTAA

mRNA sequence

ATGATTACCCAGCTCAATCACTTGCTGTTCTTTGCGCCTGTTCTATCTGAAAAAACTATTACTCAAGCTCATAATCCGTCTTTTGTTAGCTTTAAATCTCGATTGTTTTGCAATTATGGATTTGATATTCATCCCAGAGCGTCCCTTATAGAATCACCGCAATCGCCCACTACAGGTGGCCGAGTGTCTCGGTATGCAAGAACAGAGGCCCAGAAAGTGTTATTTGACTATCTGCATTGTACTAGGAGCCTCGGCTTTGCAGATGCTGAACATATAAGCAAGAACTCGCCTCATTTTCTTCAAAATTTGATCATGAAACTTGACAGCGAAAAAGATGTGGCTAGGTCTCTCAGAAAGTATCTTAGATACAATCCCATCAATGAATTCGAGCCATTCTTTGAAAGCCTGGGACTACCCCCATCAGAGCTCCCCTTGTTTCTTCCACGACGCTTGATGTTTTTGAGTGATGATCATCTTATGCTTGAAAACTTCCATGTTTTGTGCAATTATGGTATCCCGCGTAGCAAGATGGGCAAAATGTATAAGGAAGCAAGAGAAATATTTGCTTATGATTATGGTCTATTAGCCTCAAAACTTAGAGCTTATGAGGATTTAGGTCTTGGAATGGGCACACTTATTAAGCTTGTTAGTTGCTGCCCTTCACTTTTGATTGGTCAGATCAAAATGGAGTTCATCAAAGTTCTGGAGAAGCTAAGTAAATTAGGTATTGAAGAAGATTGGATTGGAGGATATATATCTCATAAAAGTACCTATAATTGGAATAGACTGGTTGACACTCTGGACTTTCTAGCTAAAGTAGGTTATACAGAGATACAGATGCACGATCTGTTTAAATCAAATCCCTCATTGCTGCTTGAAGATTCTGGGAAGAAAGTGTATGTATTATTCGTTCGATTAGTCAAGTTGGGTCTTAAGATGGACGAAGCTTATTCGATTTATAAACAAAACCCTGTAATTTTGTCTGGGAAGTATGTTAAAAACATTCTGAGAGCAGTAGATTTTCTCTTTGATATCGGGTTGGGAACAGAGGACATTGCAGGTATAGTATCTCATCAAATTCTGTTACTTGGTTCGTGTACTCTGAAAGGGCCGAAAACTGTCTGTAAAGAACTAAAAGTCGGAAAAGAAGGTTTATGTCTGATCATTCGAGACGACCCGTCCAAGCTGTTTACCTTGGCTTCCAAATCAAAGCTAAAAAGCAGTGAACAGGCTTCTTGCCAAAACCCCGCCAAAGAGATGGAGAAGACTACATTCCTGCTGAAGTTGGGATACGTCGAAAACTCAGATGAGTTGGCAAAGGCGTCGAAACAGTTTCGGGGTCGGGGAGATCAATTACAGGAGAGATTTGATTGCCTGGTAAATGCTGGTTTGGACTGTCATGTGGTGACAAATATAGTCAGACATGCGCCCATGGTTCTAAACCAGAGCAAAGATGTAATCCAAGAGAAGATTGATTGCTTAAGAAACTGTTTAGGTTACCCCTTGCATACAATAGCGGCATTCCCAGTTTATTTATGTTACAACATGGAGAGAATAAACACAAGATTTTCAATGTATAGATGGTTAAGGGATAAGGGTGCTGCAAAACCCAACTTATCATTGAGCACTGTCTTGGCTTGTTCTGATGCAAGATTTGTAAAATATTTTGTGGATGTTCATCCAGAAGGCCCCTCCATGTGGGAAAGTTGTAAAAAACATGGTCTCCATAATCATAGTTAA

Coding sequence (CDS)

ATGATTACCCAGCTCAATCACTTGCTGTTCTTTGCGCCTGTTCTATCTGAAAAAACTATTACTCAAGCTCATAATCCGTCTTTTGTTAGCTTTAAATCTCGATTGTTTTGCAATTATGGATTTGATATTCATCCCAGAGCGTCCCTTATAGAATCACCGCAATCGCCCACTACAGGTGGCCGAGTGTCTCGGTATGCAAGAACAGAGGCCCAGAAAGTGTTATTTGACTATCTGCATTGTACTAGGAGCCTCGGCTTTGCAGATGCTGAACATATAAGCAAGAACTCGCCTCATTTTCTTCAAAATTTGATCATGAAACTTGACAGCGAAAAAGATGTGGCTAGGTCTCTCAGAAAGTATCTTAGATACAATCCCATCAATGAATTCGAGCCATTCTTTGAAAGCCTGGGACTACCCCCATCAGAGCTCCCCTTGTTTCTTCCACGACGCTTGATGTTTTTGAGTGATGATCATCTTATGCTTGAAAACTTCCATGTTTTGTGCAATTATGGTATCCCGCGTAGCAAGATGGGCAAAATGTATAAGGAAGCAAGAGAAATATTTGCTTATGATTATGGTCTATTAGCCTCAAAACTTAGAGCTTATGAGGATTTAGGTCTTGGAATGGGCACACTTATTAAGCTTGTTAGTTGCTGCCCTTCACTTTTGATTGGTCAGATCAAAATGGAGTTCATCAAAGTTCTGGAGAAGCTAAGTAAATTAGGTATTGAAGAAGATTGGATTGGAGGATATATATCTCATAAAAGTACCTATAATTGGAATAGACTGGTTGACACTCTGGACTTTCTAGCTAAAGTAGGTTATACAGAGATACAGATGCACGATCTGTTTAAATCAAATCCCTCATTGCTGCTTGAAGATTCTGGGAAGAAAGTGTATGTATTATTCGTTCGATTAGTCAAGTTGGGTCTTAAGATGGACGAAGCTTATTCGATTTATAAACAAAACCCTGTAATTTTGTCTGGGAAGTATGTTAAAAACATTCTGAGAGCAGTAGATTTTCTCTTTGATATCGGGTTGGGAACAGAGGACATTGCAGGTATAGTATCTCATCAAATTCTGTTACTTGGTTCGTGTACTCTGAAAGGGCCGAAAACTGTCTGTAAAGAACTAAAAGTCGGAAAAGAAGGTTTATGTCTGATCATTCGAGACGACCCGTCCAAGCTGTTTACCTTGGCTTCCAAATCAAAGCTAAAAAGCAGTGAACAGGCTTCTTGCCAAAACCCCGCCAAAGAGATGGAGAAGACTACATTCCTGCTGAAGTTGGGATACGTCGAAAACTCAGATGAGTTGGCAAAGGCGTCGAAACAGTTTCGGGGTCGGGGAGATCAATTACAGGAGAGATTTGATTGCCTGGTAAATGCTGGTTTGGACTGTCATGTGGTGACAAATATAGTCAGACATGCGCCCATGGTTCTAAACCAGAGCAAAGATGTAATCCAAGAGAAGATTGATTGCTTAAGAAACTGTTTAGGTTACCCCTTGCATACAATAGCGGCATTCCCAGTTTATTTATGTTACAACATGGAGAGAATAAACACAAGATTTTCAATGTATAGATGGTTAAGGGATAAGGGTGCTGCAAAACCCAACTTATCATTGAGCACTGTCTTGGCTTGTTCTGATGCAAGATTTGTAAAATATTTTGTGGATGTTCATCCAGAAGGCCCCTCCATGTGGGAAAGTTGTAAAAAACATGGTCTCCATAATCATAGTTAA

Protein sequence

MITQLNHLLFFAPVLSEKTITQAHNPSFVSFKSRLFCNYGFDIHPRASLIESPQSPTTGGRVSRYARTEAQKVLFDYLHCTRSLGFADAEHISKNSPHFLQNLIMKLDSEKDVARSLRKYLRYNPINEFEPFFESLGLPPSELPLFLPRRLMFLSDDHLMLENFHVLCNYGIPRSKMGKMYKEAREIFAYDYGLLASKLRAYEDLGLGMGTLIKLVSCCPSLLIGQIKMEFIKVLEKLSKLGIEEDWIGGYISHKSTYNWNRLVDTLDFLAKVGYTEIQMHDLFKSNPSLLLEDSGKKVYVLFVRLVKLGLKMDEAYSIYKQNPVILSGKYVKNILRAVDFLFDIGLGTEDIAGIVSHQILLLGSCTLKGPKTVCKELKVGKEGLCLIIRDDPSKLFTLASKSKLKSSEQASCQNPAKEMEKTTFLLKLGYVENSDELAKASKQFRGRGDQLQERFDCLVNAGLDCHVVTNIVRHAPMVLNQSKDVIQEKIDCLRNCLGYPLHTIAAFPVYLCYNMERINTRFSMYRWLRDKGAAKPNLSLSTVLACSDARFVKYFVDVHPEGPSMWESCKKHGLHNHS
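
The relationship between the coding sequence and the protein sequence above can be checked with a small translation sketch. It assumes the standard genetic code (NCBI translation table 1), which this record does not name; `translate` is a hypothetical helper for illustration, and only the first and last few codons of the CDS are used as input.

```python
# Standard genetic code (NCBI translation table 1), with codons ordered
# by first, second, and third base over the alphabet T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)

# First seven codons and last five codons of the CDS listed above.
print(translate("ATGATTACCCAGCTCAATCAC"))
print(translate("CATAATCATAGTTAA"))
```

The two fragments translate to the first seven and last four residues of the protein sequence shown above; the final codon, TAA, is a stop codon and contributes no residue.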
BLAST of Cp4.1LG20g05980 vs. Swiss-Prot
Match: MTEFH_ARATH (Transcription termination factor MTEF18, mitochondrial OS=Arabidopsis thaliana GN=MTERF18 PE=1 SV=1)

HSP 1 Score: 233.8 bits (595), Expect = 4.7e-60
Identity = 151/515 (29.32%), Positives = 262/515 (50.87%), Query Frame = 1

Query: 69  EAQKVLFDYLHCTRSLGFADAEHISKNSPHFLQNLIMKLD-SEKDVARSLRKYLRYNPIN 128
           +AQ+ + DYLH TRSL +  AE I+ N+   ++NLI+KLD S    ++SLRK+L Y+PIN
Sbjct: 36  KAQQAITDYLHTTRSLSYTHAEQIASNASVSIRNLILKLDFSVPTFSKSLRKHLSYHPIN 95

Query: 129 EFEPFFESLGLPPSELPLFLPRRLMFLSDDHLMLENFHVLCNYGIPRSKMGKMYKEAREI 188
           EFE FFES+G+  SE+  FLP +  F S+D  +L+    L  +G P +K+GK+YKE R +
Sbjct: 96  EFEFFFESIGIDYSEVSEFLPEKKFFFSEDRTVLDAAFALSGFGFPWNKLGKLYKEERLV 155

Query: 189 FAYDYGLLASKLRAYEDLGLGMGTLIKLVSCCPSLLI--GQIKMEFIKVLEKLSKLGIEE 248
           F    G + S+L  ++D+G     +I      P  L   G++  E   +  KL +L  E 
Sbjct: 156 FVQRPGEIESRLLKFKDIGFSTVAVIGTCLAIPRTLCGGGELGSEIRCLFVKLKRLFDEF 215

Query: 249 DWIGGYISHKSTYNWNRLVDTLDFLAKVGYTEIQMHDLFKSNPSLLLEDSGKKVYVLFVR 308
           D    ++  ++  +W  +   +     +G    +M +L   N SL LE S + +      
Sbjct: 216 D--SHHLFEENVDSWLAVSRKIRIFYDLGCENEEMWELMCRNKSLFLEYSEEALMNKAGY 275

Query: 309 LVKLGLKMDEAYSIYKQNPVILSGKYVKNILRAVDFLFDIGLGTEDIAGIVSHQILLLGS 368
             + G+  ++A  +  +NP I++    K ++     L   GL  +++  +      + G 
Sbjct: 276 FCRFGVSKEDAALLILRNPAIMNFDLEKPVISVTGMLKHFGLRQDEVDAVAQKYPYVFGR 335

Query: 369 CTLKGPKTVCKELKVGKEGLCLIIRDDPSKLFTLASKSKLKSSEQ------------ASC 428
             LK    V + + +  E +  I+++    L  LAS + +   E              + 
Sbjct: 336 NQLKNLPYVLRAIDL-HERIFDILKNGNHHL--LASYTLMDPDEDLEREYQEGLEELQNS 395

Query: 429 QNPAKEMEKTTFLLKLGYVENSDELAKASKQFRGRGDQLQERFDCLVNAGLDCHVVTNIV 488
           +     ++K  FL ++G+ EN   + K  +   G   +L +RF  L+N+G+    +  ++
Sbjct: 396 RTKRHNIQKLDFLHEIGFGENGITM-KVLQHVHGTAVELHDRFQILLNSGIIFSKICMLI 455

Query: 489 RHAPMVLNQSKDVIQEKIDCLRNCLGYPLHTIAAFPVYLCYNME-RINTRFSMYRWLRDK 548
           R AP +LNQ    IQ+K+  L   +G  L  +  FP YLC+++E RI+ RF  ++WL +K
Sbjct: 456 RSAPKILNQKPHSIQDKLRFLCGEMGDSLDYLEVFPAYLCFDLENRISPRFRFHKWLVEK 515

Query: 549 GAAKPNLSLSTVLACSDARFVKYFVDVHPEGPSMW 568
           G ++ + S+++++A S+  F+     +HP  P  W
Sbjct: 516 GFSEKSYSIASIVATSEKAFIARLYGIHPAIPKHW 544
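
The percentages in an HSP summary line are the reported counts divided by the alignment length. The sketch below recomputes them for the hit above; `check_hsp_summary` is a hypothetical helper, and its pattern matches the label after the comma loosely so that spelling variants of it are accepted.

```python
import re

# Matches "Identity = a/b (x%), <Pos...> = c/d (y%)" in an HSP summary line.
HSP_PATTERN = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Pos\w+ = (\d+)/(\d+) \(([\d.]+)%\)"
)

def check_hsp_summary(line):
    """Recompute identity and positives percentages from the raw counts."""
    match = HSP_PATTERN.search(line)
    if match is None:
        raise ValueError(f"no HSP summary found in: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }

summary = check_hsp_summary(
    "Identity = 151/515 (29.32%), Positives = 262/515 (50.87%), Query Frame = 1"
)
print(summary)
```

For this hit, 151 identical and 262 positive positions over 515 aligned columns reproduce the reported 29.32% and 50.87%.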

BLAST of Cp4.1LG20g05980 vs. Swiss-Prot
Match: MTEFE_ARATH (Transcription termination factor MTERF15, mitochondrial OS=Arabidopsis thaliana GN=MTERF15 PE=2 SV=1)

HSP 1 Score: 55.5 bits (132), Expect = 2.3e-06
Identity = 33/115 (28.70%), Positives = 61/115 (53.04%), Query Frame = 1

Query: 449 GDQLQERFDCLVNAGLDCHVVTNIVRHAPMVLNQSKDVIQEKIDCLRNCLGYPLHTIAAF 508
           G +++ R DCL   GL       +V   P V+    + I++KI+ L N +G+ ++ +A  
Sbjct: 282 GFEVKLRVDCLCKYGLIRRDAFKVVWKEPRVILYEIEDIEKKIEFLTNRMGFHINCLADV 341

Query: 509 PVYLCYNMER-INTRFSMYRWLRDKGAAKPNLSLSTVLACSDARFVKYFVDVHPE 563
           P YL  N+++ I  R+++  +L+ KG    ++ L  ++  S  RF   +V  +PE
Sbjct: 342 PEYLGVNLQKQIVPRYNVIDYLKLKGGLGCDIGLKGLIKPSMKRFYNLYVMPYPE 396

BLAST of Cp4.1LG20g05980 vs. Swiss-Prot
Match: MTEF8_ARATH (Transcription termination factor MTERF8, chloroplastic OS=Arabidopsis thaliana GN=MTERF8 PE=1 SV=1)

HSP 1 Score: 55.1 bits (131), Expect = 3.0e-06
Identity = 40/135 (29.63%), Positives = 65/135 (48.15%), Query Frame = 1

Query: 422 KTTFLLKLGYVENSDELAKASKQF-RGRGDQLQERFDCLVNAGLDCHVVTNIVRHAPMVL 481
           K  FL+K+GY   + ELA A     R   D +Q      ++ GL    +  +    P VL
Sbjct: 361 KLGFLVKIGYKHRTKELAFAMGAVTRTSSDNMQRVIGLYLSYGLSFEDILAMSTKHPQVL 420

Query: 482 NQSKDVIQEKIDCLRNCLGYPLHTIAAFPVYLCYNME-RINTRFSMYRWLRDKGAAKPNL 541
             +   ++EK++ L   +G  +  + AFP +L Y ++ RI  R+     L+ +G    N+
Sbjct: 421 QYNYTSLEEKLEYLIEYMGREVEELLAFPAFLGYKLDSRIKHRYE--EKLKSRG---ENM 480

Query: 542 SLSTVLACSDARFVK 555
           SL+ +L  S  RF K
Sbjct: 481 SLNKLLTVSAERFSK 490

BLAST of Cp4.1LG20g05980 vs. TrEMBL
Match: V4U7M9_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10014682mg PE=4 SV=1)

HSP 1 Score: 779.2 bits (2011), Expect = 3.4e-222
Identity = 381/575 (66.26%), Positives = 461/575 (80.17%), Query Frame = 1

Query: 1   MITQLNHLLFFAPVLSEKT-ITQAHNPSFVSFKSRLFCNYGFDIHPRASLIESPQSPTT- 60
           MITQ+NH L F PV +EKT I Q  N  F+  + R FC+      P+ SL ES Q P + 
Sbjct: 16  MITQINHFLVFFPVSNEKTSIMQKQNAFFIPVRVRSFCSSRLTHQPKVSLAESTQPPASL 75

Query: 61  -GGRVSRYARTEAQKVLFDYLHCTRSLGFADAEHISKNSPHFLQNLIMKLDSEKDVARSL 120
              RVSR ARTEAQ+VLFDYLH TRSLG+ DAEHISKNSP F+ NL+ K+DS KDV RSL
Sbjct: 76  VASRVSRLARTEAQEVLFDYLHSTRSLGYMDAEHISKNSPDFVLNLLSKIDSGKDVTRSL 135

Query: 121 RKYLRYNPINEFEPFFESLGLPPSELPLFLPRRLMFLSDDHLMLENFHVLCNYGIPRSKM 180
            ++LRYNPINEFEPFFESLGL  SEL   LPR LMFLSDD ++L+NFHVLC+YGIPRSKM
Sbjct: 136 TRFLRYNPINEFEPFFESLGLSQSELSPLLPRHLMFLSDDEVLLDNFHVLCDYGIPRSKM 195

Query: 181 GKMYKEAREIFAYDYGLLASKLRAYEDLGLGMGTLIKLVSCCPSLLIGQIKMEFIKVLEK 240
           GKMY EA EIF +D G+LASKL AYE+LGL   T+IKLVSCCPSLLIG +   F+KVLEK
Sbjct: 196 GKMYVEATEIFRHDRGVLASKLWAYENLGLSKNTVIKLVSCCPSLLIGGVDSRFVKVLEK 255

Query: 241 LSKLGIEEDWIGGYISHKSTYNWNRLVDTLDFLAKVGYTEIQMHDLFKSNPSLLLEDSGK 300
           L +LG + DWIG Y+S K +YNW+++ +TLDFL K+GY E+Q+ +LFK+NP+L+ E SG+
Sbjct: 256 LKELGFKNDWIGRYLSGKGSYNWDQVSETLDFLYKIGYNEVQLLNLFKTNPALVFEGSGQ 315

Query: 301 KVYVLFVRLVKLGLKMDEAYSIYKQNPVILSGKYVKNILRAVDFLFDIGLGTEDIAGIVS 360
           KVYVLF RL+KLGLKM+E YS++ QNP ILS K+VKN+L+AV FL +IG+G +DI+ +V 
Sbjct: 316 KVYVLFGRLLKLGLKMNEVYSLFSQNPQILSSKFVKNLLQAVGFLIEIGMGMKDISNMVL 375

Query: 361 HQILLLGSCTLKGPKTVCKELKVGKEGLCLIIRDDPSKLFTLASKSKLKSSEQASCQNPA 420
               L+GSC+LKGPKTVC +LKVG+E LC II+DDP KLF LASK+++K  EQ  CQNP+
Sbjct: 376 MHAELMGSCSLKGPKTVCSKLKVGRESLCQIIKDDPLKLFHLASKTEVKIDEQVDCQNPS 435

Query: 421 KEMEKTTFLLKLGYVENSDELAKASKQFRGRGDQLQERFDCLVNAGLDCHVVTNIVRHAP 480
           K++EKT FLL+LGYVENS+E+ KA KQFRGRGDQLQERFDCLV AGLD +VV NIV+ AP
Sbjct: 436 KDVEKTEFLLRLGYVENSEEVTKALKQFRGRGDQLQERFDCLVQAGLDSNVVRNIVKRAP 495

Query: 481 MVLNQSKDVIQEKIDCLRNCLGYPLHTIAAFPVYLCYNMERINTRFSMYRWLRDKGAAKP 540
           MVLNQSKDV+++KID L+N L YPL ++ AFP YLCY+M RIN R  MY WLR++G AKP
Sbjct: 496 MVLNQSKDVLEKKIDYLKNYLCYPLESVVAFPAYLCYDMGRINHRCKMYVWLRERGVAKP 555

Query: 541 NLSLSTVLACSDARFVKYFVDVHPEGPSMWESCKK 573
            LSLST+LACSDA+F KYFVDVHPEGP+MWES KK
Sbjct: 556 TLSLSTILACSDAKFEKYFVDVHPEGPAMWESLKK 590

BLAST of Cp4.1LG20g05980 vs. TrEMBL
Match: A0A067GCM4_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g007621mg PE=4 SV=1)

HSP 1 Score: 777.3 bits (2006), Expect = 1.3e-221
Identity = 380/575 (66.09%), Positives = 460/575 (80.00%), Query Frame = 1

Query: 1   MITQLNHLLFFAPVLSEKT-ITQAHNPSFVSFKSRLFCNYGFDIHPRASLIESPQSPTT- 60
           MITQ+NH L F PV +EKT I Q  N  F+  + R FC+      P+ SL ES Q P + 
Sbjct: 16  MITQINHFLVFFPVSNEKTSIMQKQNAFFIPVRVRSFCSSRLTHQPKVSLAESTQPPASL 75

Query: 61  -GGRVSRYARTEAQKVLFDYLHCTRSLGFADAEHISKNSPHFLQNLIMKLDSEKDVARSL 120
              RVSR ARTEAQ+VLFDYLH TRSLG+ DAEHISKNSP F+ NL+ K+DS KDV RSL
Sbjct: 76  VASRVSRLARTEAQEVLFDYLHSTRSLGYMDAEHISKNSPDFVLNLLSKIDSGKDVTRSL 135

Query: 121 RKYLRYNPINEFEPFFESLGLPPSELPLFLPRRLMFLSDDHLMLENFHVLCNYGIPRSKM 180
            ++LRYNPINEFEPFFESLGL  SEL   LPR LMFLSDD ++L+NFHVLC+YGIPRSKM
Sbjct: 136 MRFLRYNPINEFEPFFESLGLSQSELSPLLPRHLMFLSDDEVLLDNFHVLCDYGIPRSKM 195

Query: 181 GKMYKEAREIFAYDYGLLASKLRAYEDLGLGMGTLIKLVSCCPSLLIGQIKMEFIKVLEK 240
           GKMY EA EIF +D G+LASKL AYE+LGL   T+IKLVSCCPSLLIG +   F+KVLEK
Sbjct: 196 GKMYVEATEIFRHDRGVLASKLWAYENLGLSKNTVIKLVSCCPSLLIGGVDSRFVKVLEK 255

Query: 241 LSKLGIEEDWIGGYISHKSTYNWNRLVDTLDFLAKVGYTEIQMHDLFKSNPSLLLEDSGK 300
           L +LG + DWIG Y+  K +YNW+++ +TLDFL K+GY E+Q+ +LFK+NP+L+ E SG+
Sbjct: 256 LKELGFKNDWIGRYLPGKGSYNWDQVSETLDFLYKIGYNEVQLLNLFKTNPALVFEGSGQ 315

Query: 301 KVYVLFVRLVKLGLKMDEAYSIYKQNPVILSGKYVKNILRAVDFLFDIGLGTEDIAGIVS 360
           KVYVLF RL+KLGLKM+E YS++ QNP ILS K+VKN+L+AV FL +IG+G +DI+ +V 
Sbjct: 316 KVYVLFGRLLKLGLKMNEVYSLFSQNPQILSSKFVKNLLQAVGFLIEIGMGMKDISNMVL 375

Query: 361 HQILLLGSCTLKGPKTVCKELKVGKEGLCLIIRDDPSKLFTLASKSKLKSSEQASCQNPA 420
               L+GSC+LKGPKTVC +LKVG+E LC II+DDP KLF LASK+++K  EQ  CQNP+
Sbjct: 376 MHAELMGSCSLKGPKTVCSKLKVGRESLCQIIKDDPLKLFHLASKTEVKIDEQVDCQNPS 435

Query: 421 KEMEKTTFLLKLGYVENSDELAKASKQFRGRGDQLQERFDCLVNAGLDCHVVTNIVRHAP 480
           K++EKT FLL+LGYVENS+E+ KA KQFRGRGDQLQERFDCLV AGLD +VV NIV+ AP
Sbjct: 436 KDVEKTEFLLRLGYVENSEEVTKALKQFRGRGDQLQERFDCLVQAGLDSNVVRNIVKRAP 495

Query: 481 MVLNQSKDVIQEKIDCLRNCLGYPLHTIAAFPVYLCYNMERINTRFSMYRWLRDKGAAKP 540
           MVLNQSKDV+++KID L+N L YPL ++ AFP YLCY+M RIN R  MY WLR++G AKP
Sbjct: 496 MVLNQSKDVLEKKIDYLKNYLCYPLESVVAFPAYLCYDMGRINHRCKMYVWLRERGVAKP 555

Query: 541 NLSLSTVLACSDARFVKYFVDVHPEGPSMWESCKK 573
            LSLST+LACSDA+F KYFVDVHPEGP+MWES KK
Sbjct: 556 TLSLSTILACSDAKFEKYFVDVHPEGPAMWESLKK 590

BLAST of Cp4.1LG20g05980 vs. TrEMBL
Match: M5WPG4_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003449mg PE=4 SV=1)

HSP 1 Score: 775.4 bits (2001), Expect = 4.9e-221
Identity = 381/573 (66.49%), Positives = 459/573 (80.10%), Query Frame = 1

Query: 1   MITQLNHLLFFAPVLSEKTITQAHNPSFVSFKSRLFCNYGFDIHPRASLIESPQSPTT-- 60
           MI+QLNHL+ F+PV    T  Q  NPS VS K + FC+     H + S+IES QSP +  
Sbjct: 1   MISQLNHLVLFSPVFERTTFVQ--NPSSVSLKFQCFCSSRLTHHSKVSVIESAQSPNSPF 60

Query: 61  GGRVSRYARTEAQKVLFDYLHCTRSLGFADAEHISKNSPHFLQNLIMKLDSEKDVARSLR 120
             RVSR ARTEAQ  LFDYLHCTRS  F DAEHISKNSP FLQNL+  +DSEKDVARSL 
Sbjct: 61  ANRVSRNARTEAQATLFDYLHCTRSFSFTDAEHISKNSPIFLQNLLSNIDSEKDVARSLT 120

Query: 121 KYLRYNPINEFEPFFESLGLPPSELPLFLPRRLMFLSDDHLMLENFHVLCNYGIPRSKMG 180
           ++LRYNPINEFEPFFESLGL PSEL  FLPR LM+LSDD ++ +N H LCNYGIPRS +G
Sbjct: 121 RFLRYNPINEFEPFFESLGLSPSELLSFLPRHLMYLSDDCVLTDNVHALCNYGIPRSNIG 180

Query: 181 KMYKEAREIFAYDYGLLASKLRAYEDLGLGMGTLIKLVSCCPSLLIGQIKMEFIKVLEKL 240
           KMYKEA+EIF YDYG+LA KL+AYE+LG+   T+IKLVSCCP LL+G +  +F++V EKL
Sbjct: 181 KMYKEAKEIFGYDYGVLALKLQAYENLGISKATVIKLVSCCPLLLVGGVNSDFVRVHEKL 240

Query: 241 SKLGIEEDWIGGYISHKSTYNWNRLVDTLDFLAKVGYTEIQMHDLFKSNPSLLLEDSGKK 300
            +LG+  DWIGGY S  STYNW+R+ DT+DFL KVGYTE QM  LF+ NP+LLLE SGK 
Sbjct: 241 KRLGLGMDWIGGYASGNSTYNWDRMFDTMDFLDKVGYTEEQMCVLFELNPALLLEGSGKN 300

Query: 301 VYVLFVRLVKLGLKMDEAYSIYKQNPVILSGKYVKNILRAVDFLFDIGLGTEDIAGIVSH 360
           VYVLF RL+KLGL+M+E YS++ QNP +LS K +KN+L AVDFLF+IG+GTE++A IV++
Sbjct: 301 VYVLFGRLLKLGLEMNEVYSLFMQNPQVLSVKCMKNLLLAVDFLFEIGMGTEEMADIVAN 360

Query: 361 QILLLGSCTLKGPKTVCKELKVGKEGLCLIIRDDPSKLFTLASKSKLKSSEQASCQNPAK 420
            +  L S + K PKTVCK+LKV ++GL  +I++DP K+ TLASKSK K+  Q     P+K
Sbjct: 361 DVEFLSSSSFKRPKTVCKDLKVKRDGLLQMIKEDPHKVLTLASKSKGKN--QLISPLPSK 420

Query: 421 EMEKTTFLLKLGYVENSDELAKASKQFRGRGDQLQERFDCLVNAGLDCHVVTNIVRHAPM 480
            MEKT+FL++LGY+ENSDE+ KA K+FRGRGDQLQERFDCLV AGLDC+VV+NIV+ AP 
Sbjct: 421 HMEKTSFLVRLGYIENSDEMMKALKKFRGRGDQLQERFDCLVQAGLDCNVVSNIVKQAPH 480

Query: 481 VLNQSKDVIQEKIDCLRNCLGYPLHTIAAFPVYLCYNMERINTRFSMYRWLRDKGAAKPN 540
           VLNQSKDVI+ KI CL NCL YPL ++ AFP YLCY+M+RIN RFSMY WLR+KGAAKP 
Sbjct: 481 VLNQSKDVIEMKISCLTNCLRYPLDSVVAFPAYLCYDMDRINLRFSMYAWLREKGAAKPM 540

Query: 541 LSLSTVLACSDARFVKYFVDVHPEGPSMWESCK 572
           LSLST+LACSDARFVKY+VDVHPEGP+MWES K
Sbjct: 541 LSLSTLLACSDARFVKYYVDVHPEGPAMWESFK 569

BLAST of Cp4.1LG20g05980 vs. TrEMBL
Match: A0A061GDK1_THECC (Mitochondrial transcription termination factor family protein, putative OS=Theobroma cacao GN=TCM_029329 PE=4 SV=1)

HSP 1 Score: 770.4 bits (1988), Expect = 1.6e-219
Identity = 381/579 (65.80%), Positives = 467/579 (80.66%), Query Frame = 1

Query: 1   MITQLNHLLFFAPVLSEKTITQAHNPSFVSFKSRLFCNYGFDIHPRASLIESPQSPTTG- 60
           MIT L+  +  +P++ EK+    HN   VS + R F +      P+ASL +S +S ++G 
Sbjct: 16  MITHLDKFVVLSPIVYEKSDV-VHNLCSVSLRVRYFRSSRLVFRPKASLADSIRSSSSGF 75

Query: 61  -GRVSRYARTEAQKVLFDYLHCTRSLGFADAEHISKNSPHFLQNLIMKLDSEKDVARSLR 120
             R+SR A+TEAQ VLFDYLH TRS  F DAEHISKNS HFLQNL+ K+D EKDVA+SL 
Sbjct: 76  ASRISRAAKTEAQVVLFDYLHSTRSFRFMDAEHISKNSHHFLQNLLSKIDPEKDVAKSLT 135

Query: 121 KYLRYNPINEFEPFFESLGLPPSELPLFLPRRLMFLSDDHLMLENFHVLCNYGIPRSKMG 180
           K+LR+NP+NEFEPFFESLGL PSE+   +P+RLMFL DD +ML+NFHVLC+YGIPRSKMG
Sbjct: 136 KFLRFNPVNEFEPFFESLGLSPSEVSTLVPQRLMFLRDDSVMLDNFHVLCDYGIPRSKMG 195

Query: 181 KMYKEAREIFAYDYGLLASKLRAYEDLGLGMGTLIKLVSCCPSLLIGQIKMEFIKVLEKL 240
           KMYK AREIF YDYG+LA KL+AYE+LGL   T+IKLVSCCPSLL+G +  EF   LE+L
Sbjct: 196 KMYKVAREIFGYDYGVLALKLQAYENLGLSKPTVIKLVSCCPSLLVGGVDAEFAGALERL 255

Query: 241 SKLGIEEDWIGGYISHKSTYNWNRLVDTLDFLAKVGYTEIQMHDLFKSNPSLLLEDSGKK 300
             LGI+ D IGGY+S K  Y+W R++D L+FL +VGY E Q+ +LFK+NP+LL E SGKK
Sbjct: 256 KVLGIKNDDIGGYLSGKGMYDWGRMLDMLNFLDRVGYNEEQLGNLFKTNPALLFEGSGKK 315

Query: 301 VYVLFVRLVKLGLKMDEAYSIYKQNPVILSGKYVKNILRAVDFLFDIGLGTEDIAGIVSH 360
           VYVLF RL+KLGL+M+E +S++ QNP ILS K  KN+ +A+DFLFDI + TEDIA IVS 
Sbjct: 316 VYVLFGRLIKLGLRMNEVHSLFMQNPHILSVKCTKNLFKALDFLFDIAMDTEDIAHIVSR 375

Query: 361 QILLLGSCTLKGPKTVCKELKVGKEGLCLIIRDDPSKLFTLASKSKLKSSEQASCQNPAK 420
            + L+GSC+LKGPKTVC+EL V KE LCLII++DP K F+LASKSK+ SS Q + ++ +K
Sbjct: 376 HVELMGSCSLKGPKTVCRELNVEKEELCLIIKEDPLKWFSLASKSKVLSSGQVASKDTSK 435

Query: 421 EMEKTTFLLKLGYVENSDELAKASKQFRGRGDQLQERFDCLVNAGLDCHVVTNIVRHAPM 480
            +EKTTFLL+LGY+ENSDE+ KA KQFRGRGDQLQERFDCLV AGLDC+VV N++RHAPM
Sbjct: 436 YLEKTTFLLRLGYLENSDEMLKALKQFRGRGDQLQERFDCLVCAGLDCNVVKNLIRHAPM 495

Query: 481 VLNQSKDVIQEKIDCLRNCLGYPLHTIAAFPVYLCYNMERINTRFSMYRWLRDKGAAKPN 540
           VLNQSKDVI++KIDCL+N LGYPL ++ AFP YLCY+MERI+ RFSMY WLR++GAAKP 
Sbjct: 496 VLNQSKDVIEKKIDCLKNWLGYPLESVVAFPAYLCYDMERISRRFSMYVWLRERGAAKPM 555

Query: 541 LSLSTVLACSDARFVKYFVDVHPEGPSMWESCKKHGLHN 578
           LSLSTVLACSDARFVKYFVDVHPEGP+ WE+ KK  LH+
Sbjct: 556 LSLSTVLACSDARFVKYFVDVHPEGPAKWETLKK-SLHS 592

BLAST of Cp4.1LG20g05980 vs. TrEMBL
Match: W9QRR9_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_006241 PE=4 SV=1)

HSP 1 Score: 729.9 bits (1883), Expect = 2.3e-207
Identity = 361/571 (63.22%), Positives = 443/571 (77.58%), Query Frame = 1

Query: 1   MITQLNHLLFFAPVLSEKTITQAHNPSFVSFKSRLFCNYGFDIHPRASLIESPQSPTTGG 60
           MI+QLN +  F+PVL E+  +   NPSF+S +        F     +SL  S        
Sbjct: 1   MISQLNQIPLFSPVLYERA-SYIQNPSFISLR--------FLSVRSSSLAHSADV----S 60

Query: 61  RVSRYARTEAQKVLFDYLHCTRSLGFADAEHISKNSPHFLQNLIMKLDSEKDVARSLRKY 120
           RVSR  RTEAQ+ LFDYLHCTR+  F DAEHISKN P+F+QNL+ ++D+EKD+ R L ++
Sbjct: 61  RVSRVTRTEAQEALFDYLHCTRNFNFMDAEHISKNCPYFVQNLLSEIDTEKDIPRELTRF 120

Query: 121 LRYNPINEFEPFFESLGLPPSELPLFLPRRLMFLSDDHLMLENFHVLCNYGIPRSKMGKM 180
             Y+PINEFEPFFESLGL PSELPL LPR  MFLSD+  ML+NFHVLC+YGIP SK+G+M
Sbjct: 121 FHYHPINEFEPFFESLGLRPSELPLLLPRDSMFLSDNSSMLQNFHVLCDYGIPHSKIGRM 180

Query: 181 YKEAREIFAYDYGLLASKLRAYEDLGLGMGTLIKLVSCCPSLLIGQIKMEFIKVLEKLSK 240
           Y EA+EIF YD G+++ KLRAYE LGL   T+IKLVSC P LL+G +  EF+KVL+KL +
Sbjct: 181 YLEAKEIFGYDKGVMSLKLRAYEKLGLSRPTVIKLVSCYPLLLVGGVNSEFVKVLQKLRE 240

Query: 241 LGIEEDWIGGYISHKSTYNWNRLVDTLDFLAKVGYTEIQMHDLFKSNPSLLLEDSGKKVY 300
           LGIE DW  GYI+  +T NW R+ DT+DFL  VG+ E QM  L K++PSLLLE SGK+VY
Sbjct: 241 LGIENDWFRGYITSSNTCNWKRMTDTMDFLQDVGFREEQMRSLLKTSPSLLLEGSGKRVY 300

Query: 301 VLFVRLVKLGLKMDEAYSIYKQNPVILSGKYVKNILRAVDFLFDIGLGTEDIAGIVSHQI 360
            LF RL+KLGL+M+E   ++KQNP ILS K+ +N+L+AVDFLF IG+  EDIA IVS  I
Sbjct: 301 ALFGRLLKLGLEMNEICFMFKQNPKILSRKFSQNLLQAVDFLFGIGMPIEDIADIVSKHI 360

Query: 361 LLLGSCTLKGPKTVCKELKVGKEGLCLIIRDDPSKLFTLASKSKLKSSEQASCQNPAKEM 420
             LGS TLKGPKTVCKELKV ++ LC II++DP  +  LASK K KSSEQ SC +P+K +
Sbjct: 361 EFLGSSTLKGPKTVCKELKVRRDHLCQIIKEDPLGVLWLASKLKNKSSEQISCPSPSKHL 420

Query: 421 EKTTFLLKLGYVENSDELAKASKQFRGRGDQLQERFDCLVNAGLDCHVVTNIVRHAPMVL 480
           EK++FL++LGY ENSDE+ KA K+FRGRGDQLQERFDCLV AGLDC+VV +I++ APMVL
Sbjct: 421 EKSSFLVRLGYAENSDEMTKALKKFRGRGDQLQERFDCLVQAGLDCNVVADIIKRAPMVL 480

Query: 481 NQSKDVIQEKIDCLRNCLGYPLHTIAAFPVYLCYNMERINTRFSMYRWLRDKGAAKPNLS 540
           NQSKDVI++KIDCL N LGYPL ++ AFP YLCY+MERIN RFSMY WLR+KGAAKP L 
Sbjct: 481 NQSKDVIEKKIDCLINYLGYPLESVVAFPTYLCYDMERINLRFSMYAWLREKGAAKPMLK 540

Query: 541 LSTVLACSDARFVKYFVDVHPEGPSMWESCK 572
           LST+LACSD+RF+KYFVDVHPEGP+MWE+ K
Sbjct: 541 LSTLLACSDSRFLKYFVDVHPEGPAMWETLK 558

BLAST of Cp4.1LG20g05980 vs. TAIR10
Match: AT4G19650.1 (AT4G19650.1 Mitochondrial transcription termination factor family protein)

HSP 1 Score: 577.8 bits (1488), Expect = 7.5e-165
Identity = 280/481 (58.21%), Positives = 363/481 (75.47%), Query Frame = 1

Query: 90  EHISKNSPHFLQNLIMKLD-SEKDVARSLRKYLRYNPINEFEPFFESLGLPPSELPLFLP 149
           EHISKNSP F+  L+ K+D ++KDV++ L K+LRYNPINEFEPFFESLGL P E   FLP
Sbjct: 90  EHISKNSPCFMSTLLSKIDDNQKDVSKGLTKFLRYNPINEFEPFFESLGLCPYEFETFLP 149

Query: 150 RRLMFLSDDHLMLENFHVLCNYGIPRSKMGKMYKEAREIFAYDYGLLASKLRAYEDLGLG 209
           R+LMFLSDD +M ENFH LCNYGIPR K+G+MYKEAREIF Y+ G+LA KLR YE+LGL 
Sbjct: 150 RKLMFLSDDGIMFENFHALCNYGIPRGKIGRMYKEAREIFRYESGMLAMKLRGYENLGLS 209

Query: 210 MGTLIKLVSCCPSLLIGQIKMEFIKVLEKLSKLGIEEDWIGGYISHKSTYNWNRLVDTLD 269
             T+IKLV+ CP LL+G I  EF  V++KL  L +  DW+G Y+S + TY+W R+++T++
Sbjct: 210 KATVIKLVTSCPLLLVGGIDAEFSSVVDKLKGLQVGCDWLGRYLSDRKTYSWRRILETIE 269

Query: 270 FLAKVGYTEIQMHDLFKSNPSLLLEDSGKKVYVLFVRLVKLGLKMDEAYSIYKQNPVILS 329
           FL KVG  E ++  L K+ P+L++E SGKK YVLF RL K GL+++E Y ++  NP +LS
Sbjct: 270 FLDKVGCKEEKLSSLLKTYPALVIEGSGKKFYVLFGRLFKAGLQVNEIYRLFIDNPEMLS 329

Query: 330 GKYVKNILRAVDFLFDIGLGTEDIAGIVSHQILLLGSCTLKGPKTVCKELKVGKEGLCLI 389
            K VKNI + +DFL  I + T+ I  I+   + L+GSC+L  P+T C  L V ++ LC I
Sbjct: 330 DKCVKNIQKTLDFLIAIRMETQFITKILLSHMELIGSCSLPAPRTACLSLNVKQDELCKI 389

Query: 390 IRDDPSKLFTLASKSKLKSSEQASCQNPAKEMEKTTFLLKLGYVENSDELAKASKQFRGR 449
           ++ +P +LF   S +K + S+  S ++  K +EKT FLL+LGYVENSDE+ KA KQFRGR
Sbjct: 390 LKKEPLRLFCFVSTTKKRKSKPLS-EDSRKYLEKTEFLLRLGYVENSDEMVKALKQFRGR 449

Query: 450 GDQLQERFDCLVNAGLDCHVVTNIVRHAPMVLNQSKDVIQEKIDCLRNCLGYPLHTIAAF 509
           GDQLQERFDCLV AGL+ +VVT I+RHAPM+LN SKDVI++KI  L   LGYP+ ++  F
Sbjct: 450 GDQLQERFDCLVKAGLNYNVVTEIIRHAPMILNLSKDVIEKKIHSLTELLGYPIESLVRF 509

Query: 510 PVYLCYNMERINTRFSMYRWLRDKGAAKPNLSLSTVLACSDARFVKYFVDVHPEGPSMWE 569
           P YLCY+M+RI+ RFSMY WLR++ AAKP LS ST+L C DARFVKYFV+VHPEGP++WE
Sbjct: 510 PAYLCYDMQRIHHRFSMYLWLRERDAAKPMLSPSTILTCGDARFVKYFVNVHPEGPAIWE 569

BLAST of Cp4.1LG20g05980 vs. TAIR10
Match: AT5G45113.1 (AT5G45113.1 mitochondrial transcription termination factor-related / mTERF-related)

HSP 1 Score: 399.1 bits (1024), Expect = 4.8e-111
Identity = 197/413 (47.70%), Positives = 281/413 (68.04%), Query Frame = 1

Query: 160 MLENFHVLCNYGIPRSKMGKMYKEAREIFAYDYGLLASKLRAYEDLGLGMGTLIKLVSCC 219
           M ENFHVLC YGIPR K+G++YKEAREIF Y+ G+LASKL  YE L L    +IKLV+CC
Sbjct: 1   MFENFHVLCYYGIPRDKIGRLYKEAREIFVYENGVLASKLEPYEILVLRKAIVIKLVTCC 60

Query: 220 PSLLIGQIKMEFIKVLEKLSKLGIEEDWIGGYISHKSTYNWNRLVDTLDFLAKVGYTEIQ 279
           P LL+G I  EF+ V+ KL  L +  DW+  Y+S + TYNW R+++T++ L KVG+ E +
Sbjct: 61  PLLLVGGIDCEFVSVVNKLKGLNLGCDWLARYLSVRKTYNWRRILETMELLEKVGFKEKK 120

Query: 280 MHDLFKSNPSLLLEDSGKKVYVLFVRLVKLGLKMDEAYSIYKQNPVILSGKYVKNILRAV 339
           + +L K+ P L+ E SG K Y++F +  K+GL+M+E   +   N  +L  K VK IL A+
Sbjct: 121 LSNLLKAYPDLVGETSGNKAYIMFEKFHKVGLQMNEIDKLLIDNSEMLLEKSVKRILEAL 180

Query: 340 DFLFDIGLGTEDIAGIVSHQILLLGSCTLKGPKTVCKELKVGKEGLCLIIRDDPSKLFTL 399
            FL  I +  + +   +   +  + S +L  P+ V   LK+ ++ LC II+++P +LF++
Sbjct: 181 KFLKCIRIEKQFVVRFLQCHMKHICSSSLLVPRAVWNRLKIRRDELCQIIKEEPLRLFSI 240

Query: 400 ASKSKLKSSEQASCQNPAKEMEKTTFLLKLGYVENSDELAKASKQFRGRGDQLQERFDCL 459
           ASK+     E  S    ++  EKTTFLLKLGYVENSDE+ +A K+F+GRGD+LQERFDC 
Sbjct: 241 ASKTNKGRIELDSLD--SRNAEKTTFLLKLGYVENSDEMVRALKKFQGRGDELQERFDCF 300

Query: 460 VNAGLDCHVVTNIVRHAPMVLNQSKDVIQEKIDCLRNCLGYPLHTIAAFPVYLCYNMERI 519
           V AGLD +VV+ +V+ AP +LN+ KD+I++KI  L + L YP+ ++   P YLCY+M+RI
Sbjct: 301 VKAGLDYNVVSQLVKRAPHILNRPKDIIEKKIIMLIDYLVYPIESVIESPTYLCYSMKRI 360

Query: 520 NTRFSMYRWLRDKGAAKPNLSLSTVLACSDARFVKYFVDVHPEGPSMWESCKK 573
           + RF+MY WLR++ A  P L+L TV+  S+   V YFV+ HPEGP+ WE+ KK
Sbjct: 361 HQRFTMYIWLRERDAVIPRLTLGTVVGISNTLIVPYFVNTHPEGPATWENIKK 411

BLAST of Cp4.1LG20g05980 vs. TAIR10
Match: AT5G06810.1 (AT5G06810.1 Mitochondrial transcription termination factor family protein)

HSP 1 Score: 396.0 bits (1016), Expect = 4.1e-110
Identity = 217/550 (39.45%), Positives = 319/550 (58.00%), Query Frame = 1

Query: 32   KSRLFCNYGFDIHPRASLIESPQSPTTGGRVSRYARTEAQKVLFDYLHCTRSLGFADAEH 91
            K +L  N  F    RA +         G R     R  AQ  +FDY + TR L F  AE 
Sbjct: 588  KPQLSRNPRFFATQRALVDAEVSGEKWGLRTRNEIRKVAQVAMFDYFYQTRGLQFLVAES 647

Query: 92   ISKNSPHFLQNLIMKL-------DSEKDVARSLRKYLRYNPINEFEPFFESLGLPPSELP 151
            +SKN+P F  NL+ KL       D + D+ +++ ++L ++P+NEFEPF ESLGL PSE  
Sbjct: 648  MSKNAPVFNDNLLKKLNGCDVDVDDDDDIVKAITRFLWFHPVNEFEPFLESLGLKPSEFS 707

Query: 152  LFLPRRLMFLSDDHLMLENFHVLCNYGIPRSKMGKMYKEAREIFAYDYGLLASKLRAYED 211
              +P   MFL++D  +LEN+HV  NYGI R KMGK++KEARE+F Y+ G+LASK+++YED
Sbjct: 708  HLIPCDKMFLNEDAFLLENYHVFWNYGIGREKMGKIFKEAREVFGYETGVLASKIKSYED 767

Query: 212  LGLGMGTLIKLVSCCPSLLIGQIKMEFIKVLEKLSKLGIEEDWIGGYISHKSTYNWNRLV 271
            LG     L KL+ C PS+LIG + +   KV+E L  +G   DW+   +S + +Y+W+ + 
Sbjct: 768  LGFSKLFLSKLIVCSPSILIGDMNVGLAKVMEMLKAIGFGVDWVTENLSEEVSYDWSSMH 827

Query: 272  DTLDFLAKVGYTEIQMHDLFKSNPSLLLEDSGKKVYVLFVRLVKLGLKMDEAYSIYKQNP 331
              L FL  +   E ++ +L +  P L+ EDSG+   +L     KLG    E  S++++ P
Sbjct: 828  RCLSFLRDLYVDENELCELIRKMPRLIFEDSGEWTLILAGFEAKLGSSRSELSSLFQKFP 887

Query: 332  VILS-GKYVKNILRAVDFLFDIGLGTEDIAGIVSHQILLLGSCTLKGPKTVCKELKVGKE 391
               S GK+V N+     FL DI +  ++I  I     L +G   LK   T+   LK GK 
Sbjct: 888  QCQSLGKFVLNLRHCFLFLKDIEMDDDEIGKIFRLHSLWIGVSRLKQTSTLLINLKGGKG 947

Query: 392  GLCLIIRDDPSKLFTLASKSKLKSSEQASCQ-NPAKEMEKTTFLLKLGYVENSDELAKAS 451
             LC +I+++P ++       +++       + N   +  KT FLL LGY ENS+E+ +A 
Sbjct: 948  RLCQVIQENPEEMKKWIMGLRVQPLPATGYKVNTKSKTMKTQFLLDLGYKENSEEMERAL 1007

Query: 452  KQFRGRGDQLQERFDCLVNAGLDCHVVTNIVRHAPMVLNQSKDVIQEKIDCLRNCLGYPL 511
            K FRG+G +L+ERF+ LV+ GL    V ++V+  P +L Q+ D+++ K++ L   LGYPL
Sbjct: 1008 KNFRGKGSELRERFNVLVSFGLTEKDVKDMVKACPSILTQACDILESKVNYLVKELGYPL 1067

Query: 512  HTIAAFPVYLCYNMERINTRFSMYRWLRDKGAAKPNLSLSTVLACSDARFVKYFVDVHPE 571
             T+  FP  L Y ++R+  RFSM+ WL+D+G A P L +ST+L CSD  F   FV+ HP+
Sbjct: 1068 STLVTFPTCLKYTLQRMKLRFSMFSWLQDRGKADPKLQVSTILVCSDKFFATRFVNRHPD 1127

Query: 572  GPSMWESCKK 573
            GP   E  KK
Sbjct: 1128 GPKHLEDLKK 1137

BLAST of Cp4.1LG20g05980 vs. TAIR10
Match: AT3G60400.1 (AT3G60400.1 Mitochondrial transcription termination factor family protein)

HSP 1 Score: 233.8 bits (595), Expect = 2.7e-61
Identity = 151/515 (29.32%), Positives = 262/515 (50.87%), Query Frame = 1

Query: 69  EAQKVLFDYLHCTRSLGFADAEHISKNSPHFLQNLIMKLD-SEKDVARSLRKYLRYNPIN 128
           +AQ+ + DYLH TRSL +  AE I+ N+   ++NLI+KLD S    ++SLRK+L Y+PIN
Sbjct: 36  KAQQAITDYLHTTRSLSYTHAEQIASNASVSIRNLILKLDFSVPTFSKSLRKHLSYHPIN 95

Query: 129 EFEPFFESLGLPPSELPLFLPRRLMFLSDDHLMLENFHVLCNYGIPRSKMGKMYKEAREI 188
           EFE FFES+G+  SE+  FLP +  F S+D  +L+    L  +G P +K+GK+YKE R +
Sbjct: 96  EFEFFFESIGIDYSEVSEFLPEKKFFFSEDRTVLDAAFALSGFGFPWNKLGKLYKEERLV 155

Query: 189 FAYDYGLLASKLRAYEDLGLGMGTLIKLVSCCPSLLI--GQIKMEFIKVLEKLSKLGIEE 248
           F    G + S+L  ++D+G     +I      P  L   G++  E   +  KL +L  E 
Sbjct: 156 FVQRPGEIESRLLKFKDIGFSTVAVIGTCLAIPRTLCGGGELGSEIRCLFVKLKRLFDEF 215

Query: 249 DWIGGYISHKSTYNWNRLVDTLDFLAKVGYTEIQMHDLFKSNPSLLLEDSGKKVYVLFVR 308
           D    ++  ++  +W  +   +     +G    +M +L   N SL LE S + +      
Sbjct: 216 D--SHHLFEENVDSWLAVSRKIRIFYDLGCENEEMWELMCRNKSLFLEYSEEALMNKAGY 275

Query: 309 LVKLGLKMDEAYSIYKQNPVILSGKYVKNILRAVDFLFDIGLGTEDIAGIVSHQILLLGS 368
             + G+  ++A  +  +NP I++    K ++     L   GL  +++  +      + G 
Sbjct: 276 FCRFGVSKEDAALLILRNPAIMNFDLEKPVISVTGMLKHFGLRQDEVDAVAQKYPYVFGR 335

Query: 369 CTLKGPKTVCKELKVGKEGLCLIIRDDPSKLFTLASKSKLKSSEQ------------ASC 428
             LK    V + + +  E +  I+++    L  LAS + +   E              + 
Sbjct: 336 NQLKNLPYVLRAIDL-HERIFDILKNGNHHL--LASYTLMDPDEDLEREYQEGLEELQNS 395

Query: 429 QNPAKEMEKTTFLLKLGYVENSDELAKASKQFRGRGDQLQERFDCLVNAGLDCHVVTNIV 488
           +     ++K  FL ++G+ EN   + K  +   G   +L +RF  L+N+G+    +  ++
Sbjct: 396 RTKRHNIQKLDFLHEIGFGENGITM-KVLQHVHGTAVELHDRFQILLNSGIIFSKICMLI 455

Query: 489 RHAPMVLNQSKDVIQEKIDCLRNCLGYPLHTIAAFPVYLCYNME-RINTRFSMYRWLRDK 548
           R AP +LNQ    IQ+K+  L   +G  L  +  FP YLC+++E RI+ RF  ++WL +K
Sbjct: 456 RSAPKILNQKPHSIQDKLRFLCGEMGDSLDYLEVFPAYLCFDLENRISPRFRFHKWLVEK 515

Query: 549 GAAKPNLSLSTVLACSDARFVKYFVDVHPEGPSMW 568
           G ++ + S+++++A S+  F+     +HP  P  W
Sbjct: 516 GFSEKSYSIASIVATSEKAFIARLYGIHPAIPKHW 544

BLAST of Cp4.1LG20g05980 vs. TAIR10
Match: AT1G74120.1 (AT1G74120.1 Mitochondrial transcription termination factor family protein)

HSP 1 Score: 55.5 bits (132), Expect = 1.3e-07
Identity = 33/115 (28.70%), Positives = 61/115 (53.04%), Query Frame = 1

Query: 449 GDQLQERFDCLVNAGLDCHVVTNIVRHAPMVLNQSKDVIQEKIDCLRNCLGYPLHTIAAF 508
           G +++ R DCL   GL       +V   P V+    + I++KI+ L N +G+ ++ +A  
Sbjct: 282 GFEVKLRVDCLCKYGLIRRDAFKVVWKEPRVILYEIEDIEKKIEFLTNRMGFHINCLADV 341

Query: 509 PVYLCYNMER-INTRFSMYRWLRDKGAAKPNLSLSTVLACSDARFVKYFVDVHPE 563
           P YL  N+++ I  R+++  +L+ KG    ++ L  ++  S  RF   +V  +PE
Sbjct: 342 PEYLGVNLQKQIVPRYNVIDYLKLKGGLGCDIGLKGLIKPSMKRFYNLYVMPYPE 396

BLAST of Cp4.1LG20g05980 vs. NCBI nr
Match: gi|567912857|ref|XP_006448742.1| (hypothetical protein CICLE_v10014682mg [Citrus clementina])

HSP 1 Score: 779.2 bits (2011), Expect = 4.8e-222
Identity = 381/575 (66.26%), Positives = 461/575 (80.17%), Query Frame = 1

Query: 1   MITQLNHLLFFAPVLSEKT-ITQAHNPSFVSFKSRLFCNYGFDIHPRASLIESPQSPTT- 60
           MITQ+NH L F PV +EKT I Q  N  F+  + R FC+      P+ SL ES Q P + 
Sbjct: 16  MITQINHFLVFFPVSNEKTSIMQKQNAFFIPVRVRSFCSSRLTHQPKVSLAESTQPPASL 75

Query: 61  -GGRVSRYARTEAQKVLFDYLHCTRSLGFADAEHISKNSPHFLQNLIMKLDSEKDVARSL 120
              RVSR ARTEAQ+VLFDYLH TRSLG+ DAEHISKNSP F+ NL+ K+DS KDV RSL
Sbjct: 76  VASRVSRLARTEAQEVLFDYLHSTRSLGYMDAEHISKNSPDFVLNLLSKIDSGKDVTRSL 135

Query: 121 RKYLRYNPINEFEPFFESLGLPPSELPLFLPRRLMFLSDDHLMLENFHVLCNYGIPRSKM 180
            ++LRYNPINEFEPFFESLGL  SEL   LPR LMFLSDD ++L+NFHVLC+YGIPRSKM
Sbjct: 136 TRFLRYNPINEFEPFFESLGLSQSELSPLLPRHLMFLSDDEVLLDNFHVLCDYGIPRSKM 195

Query: 181 GKMYKEAREIFAYDYGLLASKLRAYEDLGLGMGTLIKLVSCCPSLLIGQIKMEFIKVLEK 240
           GKMY EA EIF +D G+LASKL AYE+LGL   T+IKLVSCCPSLLIG +   F+KVLEK
Sbjct: 196 GKMYVEATEIFRHDRGVLASKLWAYENLGLSKNTVIKLVSCCPSLLIGGVDSRFVKVLEK 255

Query: 241 LSKLGIEEDWIGGYISHKSTYNWNRLVDTLDFLAKVGYTEIQMHDLFKSNPSLLLEDSGK 300
           L +LG + DWIG Y+S K +YNW+++ +TLDFL K+GY E+Q+ +LFK+NP+L+ E SG+
Sbjct: 256 LKELGFKNDWIGRYLSGKGSYNWDQVSETLDFLYKIGYNEVQLLNLFKTNPALVFEGSGQ 315

Query: 301 KVYVLFVRLVKLGLKMDEAYSIYKQNPVILSGKYVKNILRAVDFLFDIGLGTEDIAGIVS 360
           KVYVLF RL+KLGLKM+E YS++ QNP ILS K+VKN+L+AV FL +IG+G +DI+ +V 
Sbjct: 316 KVYVLFGRLLKLGLKMNEVYSLFSQNPQILSSKFVKNLLQAVGFLIEIGMGMKDISNMVL 375

Query: 361 HQILLLGSCTLKGPKTVCKELKVGKEGLCLIIRDDPSKLFTLASKSKLKSSEQASCQNPA 420
               L+GSC+LKGPKTVC +LKVG+E LC II+DDP KLF LASK+++K  EQ  CQNP+
Sbjct: 376 MHAELMGSCSLKGPKTVCSKLKVGRESLCQIIKDDPLKLFHLASKTEVKIDEQVDCQNPS 435

Query: 421 KEMEKTTFLLKLGYVENSDELAKASKQFRGRGDQLQERFDCLVNAGLDCHVVTNIVRHAP 480
           K++EKT FLL+LGYVENS+E+ KA KQFRGRGDQLQERFDCLV AGLD +VV NIV+ AP
Sbjct: 436 KDVEKTEFLLRLGYVENSEEVTKALKQFRGRGDQLQERFDCLVQAGLDSNVVRNIVKRAP 495

Query: 481 MVLNQSKDVIQEKIDCLRNCLGYPLHTIAAFPVYLCYNMERINTRFSMYRWLRDKGAAKP 540
           MVLNQSKDV+++KID L+N L YPL ++ AFP YLCY+M RIN R  MY WLR++G AKP
Sbjct: 496 MVLNQSKDVLEKKIDYLKNYLCYPLESVVAFPAYLCYDMGRINHRCKMYVWLRERGVAKP 555

Query: 541 NLSLSTVLACSDARFVKYFVDVHPEGPSMWESCKK 573
            LSLST+LACSDA+F KYFVDVHPEGP+MWES KK
Sbjct: 556 TLSLSTILACSDAKFEKYFVDVHPEGPAMWESLKK 590

BLAST of Cp4.1LG20g05980 vs. NCBI nr
Match: gi|641858721|gb|KDO77443.1| (hypothetical protein CISIN_1g007621mg [Citrus sinensis])

HSP 1 Score: 777.3 bits (2006), Expect = 1.8e-221
Identity = 380/575 (66.09%), Positives = 460/575 (80.00%), Query Frame = 1

Query: 1   MITQLNHLLFFAPVLSEKT-ITQAHNPSFVSFKSRLFCNYGFDIHPRASLIESPQSPTT- 60
           MITQ+NH L F PV +EKT I Q  N  F+  + R FC+      P+ SL ES Q P + 
Sbjct: 16  MITQINHFLVFFPVSNEKTSIMQKQNAFFIPVRVRSFCSSRLTHQPKVSLAESTQPPASL 75

Query: 61  -GGRVSRYARTEAQKVLFDYLHCTRSLGFADAEHISKNSPHFLQNLIMKLDSEKDVARSL 120
              RVSR ARTEAQ+VLFDYLH TRSLG+ DAEHISKNSP F+ NL+ K+DS KDV RSL
Sbjct: 76  VASRVSRLARTEAQEVLFDYLHSTRSLGYMDAEHISKNSPDFVLNLLSKIDSGKDVTRSL 135

Query: 121 RKYLRYNPINEFEPFFESLGLPPSELPLFLPRRLMFLSDDHLMLENFHVLCNYGIPRSKM 180
            ++LRYNPINEFEPFFESLGL  SEL   LPR LMFLSDD ++L+NFHVLC+YGIPRSKM
Sbjct: 136 MRFLRYNPINEFEPFFESLGLSQSELSPLLPRHLMFLSDDEVLLDNFHVLCDYGIPRSKM 195

Query: 181 GKMYKEAREIFAYDYGLLASKLRAYEDLGLGMGTLIKLVSCCPSLLIGQIKMEFIKVLEK 240
           GKMY EA EIF +D G+LASKL AYE+LGL   T+IKLVSCCPSLLIG +   F+KVLEK
Sbjct: 196 GKMYVEATEIFRHDRGVLASKLWAYENLGLSKNTVIKLVSCCPSLLIGGVDSRFVKVLEK 255

Query: 241 LSKLGIEEDWIGGYISHKSTYNWNRLVDTLDFLAKVGYTEIQMHDLFKSNPSLLLEDSGK 300
           L +LG + DWIG Y+  K +YNW+++ +TLDFL K+GY E+Q+ +LFK+NP+L+ E SG+
Sbjct: 256 LKELGFKNDWIGRYLPGKGSYNWDQVSETLDFLYKIGYNEVQLLNLFKTNPALVFEGSGQ 315

Query: 301 KVYVLFVRLVKLGLKMDEAYSIYKQNPVILSGKYVKNILRAVDFLFDIGLGTEDIAGIVS 360
           KVYVLF RL+KLGLKM+E YS++ QNP ILS K+VKN+L+AV FL +IG+G +DI+ +V 
Sbjct: 316 KVYVLFGRLLKLGLKMNEVYSLFSQNPQILSSKFVKNLLQAVGFLIEIGMGMKDISNMVL 375

Query: 361 HQILLLGSCTLKGPKTVCKELKVGKEGLCLIIRDDPSKLFTLASKSKLKSSEQASCQNPA 420
               L+GSC+LKGPKTVC +LKVG+E LC II+DDP KLF LASK+++K  EQ  CQNP+
Sbjct: 376 MHAELMGSCSLKGPKTVCSKLKVGRESLCQIIKDDPLKLFHLASKTEVKIDEQVDCQNPS 435

Query: 421 KEMEKTTFLLKLGYVENSDELAKASKQFRGRGDQLQERFDCLVNAGLDCHVVTNIVRHAP 480
           K++EKT FLL+LGYVENS+E+ KA KQFRGRGDQLQERFDCLV AGLD +VV NIV+ AP
Sbjct: 436 KDVEKTEFLLRLGYVENSEEVTKALKQFRGRGDQLQERFDCLVQAGLDSNVVRNIVKRAP 495

Query: 481 MVLNQSKDVIQEKIDCLRNCLGYPLHTIAAFPVYLCYNMERINTRFSMYRWLRDKGAAKP 540
           MVLNQSKDV+++KID L+N L YPL ++ AFP YLCY+M RIN R  MY WLR++G AKP
Sbjct: 496 MVLNQSKDVLEKKIDYLKNYLCYPLESVVAFPAYLCYDMGRINHRCKMYVWLRERGVAKP 555

Query: 541 NLSLSTVLACSDARFVKYFVDVHPEGPSMWESCKK 573
            LSLST+LACSDA+F KYFVDVHPEGP+MWES KK
Sbjct: 556 TLSLSTILACSDAKFEKYFVDVHPEGPAMWESLKK 590

BLAST of Cp4.1LG20g05980 vs. NCBI nr
Match: gi|568828220|ref|XP_006468442.1| (PREDICTED: uncharacterized protein LOC102621440 [Citrus sinensis])

HSP 1 Score: 777.3 bits (2006), Expect = 1.8e-221
Identity = 380/575 (66.09%), Positives = 460/575 (80.00%), Query Frame = 1

Query: 1   MITQLNHLLFFAPVLSEKT-ITQAHNPSFVSFKSRLFCNYGFDIHPRASLIESPQSPTT- 60
           MITQ+NH L F PV +EKT I Q  N  F+  + R FC+      P+ SL ES Q P + 
Sbjct: 16  MITQINHFLVFFPVSNEKTSIMQKQNAFFIPVRVRSFCSSRLTHQPKVSLAESTQPPASL 75

Query: 61  -GGRVSRYARTEAQKVLFDYLHCTRSLGFADAEHISKNSPHFLQNLIMKLDSEKDVARSL 120
              RVSR ARTEAQ+VLFDYLH TRSLG+ DAEHISKNSP F+ NL+ K+DS KDV RSL
Sbjct: 76  VASRVSRLARTEAQEVLFDYLHSTRSLGYMDAEHISKNSPDFVLNLLSKIDSGKDVTRSL 135

Query: 121 RKYLRYNPINEFEPFFESLGLPPSELPLFLPRRLMFLSDDHLMLENFHVLCNYGIPRSKM 180
            ++LRYNPINEFEPFFESLGL  SEL   LPR LMFLSDD ++L+NFHVLC+YGIPRSKM
Sbjct: 136 TRFLRYNPINEFEPFFESLGLSQSELSPLLPRHLMFLSDDEVLLDNFHVLCDYGIPRSKM 195

Query: 181 GKMYKEAREIFAYDYGLLASKLRAYEDLGLGMGTLIKLVSCCPSLLIGQIKMEFIKVLEK 240
           GKMY EA EIF +D G+LASKL AYE+LGL   T+IKLVSCCPSLLIG +   F+KVLEK
Sbjct: 196 GKMYVEATEIFRHDRGVLASKLWAYENLGLSKNTVIKLVSCCPSLLIGGVDSRFVKVLEK 255

Query: 241 LSKLGIEEDWIGGYISHKSTYNWNRLVDTLDFLAKVGYTEIQMHDLFKSNPSLLLEDSGK 300
           L +LG + DWIG Y+  K +YNW+++ +TLDFL K+GY E+Q+ +LFK+NP+L+ E SG+
Sbjct: 256 LKELGFKNDWIGRYLPGKGSYNWDQVSETLDFLYKIGYNEVQLLNLFKTNPALVFEGSGQ 315

Query: 301 KVYVLFVRLVKLGLKMDEAYSIYKQNPVILSGKYVKNILRAVDFLFDIGLGTEDIAGIVS 360
           KVYVLF RL+KLGLKM+E YS++ QNP ILS K+VKN+L+AV FL +IG+G +DI+ +V 
Sbjct: 316 KVYVLFGRLLKLGLKMNEVYSLFSQNPQILSSKFVKNLLQAVGFLIEIGMGMKDISNMVL 375

Query: 361 HQILLLGSCTLKGPKTVCKELKVGKEGLCLIIRDDPSKLFTLASKSKLKSSEQASCQNPA 420
               L+GSC+LKGPKTVC +LKVG+E LC II+DDP KLF LASK+++K  EQ  CQNP+
Sbjct: 376 MHAELMGSCSLKGPKTVCSKLKVGRESLCQIIKDDPLKLFHLASKTEVKIDEQVDCQNPS 435

Query: 421 KEMEKTTFLLKLGYVENSDELAKASKQFRGRGDQLQERFDCLVNAGLDCHVVTNIVRHAP 480
           K++EKT FLL+LGYVENS+E+ KA KQFRGRGDQLQERFDCLV AGLD +VV NIV+ AP
Sbjct: 436 KDVEKTEFLLRLGYVENSEEVTKALKQFRGRGDQLQERFDCLVQAGLDSNVVRNIVKRAP 495

Query: 481 MVLNQSKDVIQEKIDCLRNCLGYPLHTIAAFPVYLCYNMERINTRFSMYRWLRDKGAAKP 540
           MVLNQSKDV+++KID L+N L YPL ++ AFP YLCY+M RIN R  MY WLR++G AKP
Sbjct: 496 MVLNQSKDVLEKKIDYLKNYLCYPLESVVAFPAYLCYDMGRINHRCKMYVWLRERGVAKP 555

Query: 541 NLSLSTVLACSDARFVKYFVDVHPEGPSMWESCKK 573
            LSLST+LACSDA+F KYFVDVHPEGP+MWES KK
Sbjct: 556 TLSLSTILACSDAKFEKYFVDVHPEGPAMWESLKK 590

BLAST of Cp4.1LG20g05980 vs. NCBI nr
Match: gi|595902982|ref|XP_007213937.1| (hypothetical protein PRUPE_ppa003449mg [Prunus persica])

HSP 1 Score: 775.4 bits (2001), Expect = 7.0e-221
Identity = 381/573 (66.49%), Positives = 459/573 (80.10%), Query Frame = 1

Query: 1   MITQLNHLLFFAPVLSEKTITQAHNPSFVSFKSRLFCNYGFDIHPRASLIESPQSPTT-- 60
           MI+QLNHL+ F+PV    T  Q  NPS VS K + FC+     H + S+IES QSP +  
Sbjct: 1   MISQLNHLVLFSPVFERTTFVQ--NPSSVSLKFQCFCSSRLTHHSKVSVIESAQSPNSPF 60

Query: 61  GGRVSRYARTEAQKVLFDYLHCTRSLGFADAEHISKNSPHFLQNLIMKLDSEKDVARSLR 120
             RVSR ARTEAQ  LFDYLHCTRS  F DAEHISKNSP FLQNL+  +DSEKDVARSL 
Sbjct: 61  ANRVSRNARTEAQATLFDYLHCTRSFSFTDAEHISKNSPIFLQNLLSNIDSEKDVARSLT 120

Query: 121 KYLRYNPINEFEPFFESLGLPPSELPLFLPRRLMFLSDDHLMLENFHVLCNYGIPRSKMG 180
           ++LRYNPINEFEPFFESLGL PSEL  FLPR LM+LSDD ++ +N H LCNYGIPRS +G
Sbjct: 121 RFLRYNPINEFEPFFESLGLSPSELLSFLPRHLMYLSDDCVLTDNVHALCNYGIPRSNIG 180

Query: 181 KMYKEAREIFAYDYGLLASKLRAYEDLGLGMGTLIKLVSCCPSLLIGQIKMEFIKVLEKL 240
           KMYKEA+EIF YDYG+LA KL+AYE+LG+   T+IKLVSCCP LL+G +  +F++V EKL
Sbjct: 181 KMYKEAKEIFGYDYGVLALKLQAYENLGISKATVIKLVSCCPLLLVGGVNSDFVRVHEKL 240

Query: 241 SKLGIEEDWIGGYISHKSTYNWNRLVDTLDFLAKVGYTEIQMHDLFKSNPSLLLEDSGKK 300
            +LG+  DWIGGY S  STYNW+R+ DT+DFL KVGYTE QM  LF+ NP+LLLE SGK 
Sbjct: 241 KRLGLGMDWIGGYASGNSTYNWDRMFDTMDFLDKVGYTEEQMCVLFELNPALLLEGSGKN 300

Query: 301 VYVLFVRLVKLGLKMDEAYSIYKQNPVILSGKYVKNILRAVDFLFDIGLGTEDIAGIVSH 360
           VYVLF RL+KLGL+M+E YS++ QNP +LS K +KN+L AVDFLF+IG+GTE++A IV++
Sbjct: 301 VYVLFGRLLKLGLEMNEVYSLFMQNPQVLSVKCMKNLLLAVDFLFEIGMGTEEMADIVAN 360

Query: 361 QILLLGSCTLKGPKTVCKELKVGKEGLCLIIRDDPSKLFTLASKSKLKSSEQASCQNPAK 420
            +  L S + K PKTVCK+LKV ++GL  +I++DP K+ TLASKSK K+  Q     P+K
Sbjct: 361 DVEFLSSSSFKRPKTVCKDLKVKRDGLLQMIKEDPHKVLTLASKSKGKN--QLISPLPSK 420

Query: 421 EMEKTTFLLKLGYVENSDELAKASKQFRGRGDQLQERFDCLVNAGLDCHVVTNIVRHAPM 480
            MEKT+FL++LGY+ENSDE+ KA K+FRGRGDQLQERFDCLV AGLDC+VV+NIV+ AP 
Sbjct: 421 HMEKTSFLVRLGYIENSDEMMKALKKFRGRGDQLQERFDCLVQAGLDCNVVSNIVKQAPH 480

Query: 481 VLNQSKDVIQEKIDCLRNCLGYPLHTIAAFPVYLCYNMERINTRFSMYRWLRDKGAAKPN 540
           VLNQSKDVI+ KI CL NCL YPL ++ AFP YLCY+M+RIN RFSMY WLR+KGAAKP 
Sbjct: 481 VLNQSKDVIEMKISCLTNCLRYPLDSVVAFPAYLCYDMDRINLRFSMYAWLREKGAAKPM 540

Query: 541 LSLSTVLACSDARFVKYFVDVHPEGPSMWESCK 572
           LSLST+LACSDARFVKY+VDVHPEGP+MWES K
Sbjct: 541 LSLSTLLACSDARFVKYYVDVHPEGPAMWESFK 569

BLAST of Cp4.1LG20g05980 vs. NCBI nr
Match: gi|590621796|ref|XP_007024871.1| (Mitochondrial transcription termination factor family protein, putative [Theobroma cacao])

HSP 1 Score: 770.4 bits (1988), Expect = 2.2e-219
Identity = 381/579 (65.80%), Positives = 467/579 (80.66%), Query Frame = 1

Query: 1   MITQLNHLLFFAPVLSEKTITQAHNPSFVSFKSRLFCNYGFDIHPRASLIESPQSPTTG- 60
           MIT L+  +  +P++ EK+    HN   VS + R F +      P+ASL +S +S ++G 
Sbjct: 16  MITHLDKFVVLSPIVYEKSDV-VHNLCSVSLRVRYFRSSRLVFRPKASLADSIRSSSSGF 75

Query: 61  -GRVSRYARTEAQKVLFDYLHCTRSLGFADAEHISKNSPHFLQNLIMKLDSEKDVARSLR 120
             R+SR A+TEAQ VLFDYLH TRS  F DAEHISKNS HFLQNL+ K+D EKDVA+SL 
Sbjct: 76  ASRISRAAKTEAQVVLFDYLHSTRSFRFMDAEHISKNSHHFLQNLLSKIDPEKDVAKSLT 135

Query: 121 KYLRYNPINEFEPFFESLGLPPSELPLFLPRRLMFLSDDHLMLENFHVLCNYGIPRSKMG 180
           K+LR+NP+NEFEPFFESLGL PSE+   +P+RLMFL DD +ML+NFHVLC+YGIPRSKMG
Sbjct: 136 KFLRFNPVNEFEPFFESLGLSPSEVSTLVPQRLMFLRDDSVMLDNFHVLCDYGIPRSKMG 195

Query: 181 KMYKEAREIFAYDYGLLASKLRAYEDLGLGMGTLIKLVSCCPSLLIGQIKMEFIKVLEKL 240
           KMYK AREIF YDYG+LA KL+AYE+LGL   T+IKLVSCCPSLL+G +  EF   LE+L
Sbjct: 196 KMYKVAREIFGYDYGVLALKLQAYENLGLSKPTVIKLVSCCPSLLVGGVDAEFAGALERL 255

Query: 241 SKLGIEEDWIGGYISHKSTYNWNRLVDTLDFLAKVGYTEIQMHDLFKSNPSLLLEDSGKK 300
             LGI+ D IGGY+S K  Y+W R++D L+FL +VGY E Q+ +LFK+NP+LL E SGKK
Sbjct: 256 KVLGIKNDDIGGYLSGKGMYDWGRMLDMLNFLDRVGYNEEQLGNLFKTNPALLFEGSGKK 315

Query: 301 VYVLFVRLVKLGLKMDEAYSIYKQNPVILSGKYVKNILRAVDFLFDIGLGTEDIAGIVSH 360
           VYVLF RL+KLGL+M+E +S++ QNP ILS K  KN+ +A+DFLFDI + TEDIA IVS 
Sbjct: 316 VYVLFGRLIKLGLRMNEVHSLFMQNPHILSVKCTKNLFKALDFLFDIAMDTEDIAHIVSR 375

Query: 361 QILLLGSCTLKGPKTVCKELKVGKEGLCLIIRDDPSKLFTLASKSKLKSSEQASCQNPAK 420
            + L+GSC+LKGPKTVC+EL V KE LCLII++DP K F+LASKSK+ SS Q + ++ +K
Sbjct: 376 HVELMGSCSLKGPKTVCRELNVEKEELCLIIKEDPLKWFSLASKSKVLSSGQVASKDTSK 435

Query: 421 EMEKTTFLLKLGYVENSDELAKASKQFRGRGDQLQERFDCLVNAGLDCHVVTNIVRHAPM 480
            +EKTTFLL+LGY+ENSDE+ KA KQFRGRGDQLQERFDCLV AGLDC+VV N++RHAPM
Sbjct: 436 YLEKTTFLLRLGYLENSDEMLKALKQFRGRGDQLQERFDCLVCAGLDCNVVKNLIRHAPM 495

Query: 481 VLNQSKDVIQEKIDCLRNCLGYPLHTIAAFPVYLCYNMERINTRFSMYRWLRDKGAAKPN 540
           VLNQSKDVI++KIDCL+N LGYPL ++ AFP YLCY+MERI+ RFSMY WLR++GAAKP 
Sbjct: 496 VLNQSKDVIEKKIDCLKNWLGYPLESVVAFPAYLCYDMERISRRFSMYVWLRERGAAKPM 555

Query: 541 LSLSTVLACSDARFVKYFVDVHPEGPSMWESCKKHGLHN 578
           LSLSTVLACSDARFVKYFVDVHPEGP+ WE+ KK  LH+
Sbjct: 556 LSLSTVLACSDARFVKYFVDVHPEGPAKWETLKK-SLHS 592

The following BLAST results are available for this feature:

Match Name   E-value  Identity (%)  Description
MTEFH_ARATH  4.7e-60  29.32         Transcription termination factor MTEF18, mitochondrial OS=Arabidopsis thaliana G...
MTEFE_ARATH  2.3e-06  28.70         Transcription termination factor MTERF15, mitochondrial OS=Arabidopsis thaliana ...
MTEF8_ARATH  3.0e-06  29.63         Transcription termination factor MTERF8, chloroplastic OS=Arabidopsis thaliana G...

Match Name        E-value   Identity (%)  Description
V4U7M9_9ROSI      3.4e-222  66.26         Uncharacterized protein OS=Citrus clementina GN=CICLE_v10014682mg PE=4 SV=1
A0A067GCM4_CITSI  1.3e-221  66.09         Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g007621mg PE=4 SV=1
M5WPG4_PRUPE      4.9e-221  66.49         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003449mg PE=4 SV=1
A0A061GDK1_THECC  1.6e-219  65.80         Mitochondrial transcription termination factor family protein, putative OS=Theob...
W9QRR9_9ROSA      2.3e-207  63.22         Uncharacterized protein OS=Morus notabilis GN=L484_006241 PE=4 SV=1

Match Name   E-value   Identity (%)  Description
AT4G19650.1  7.5e-165  58.21         Mitochondrial transcription termination factor family protein
AT5G45113.1  4.8e-111  47.70         mitochondrial transcription termination factor-related / mTERF-relat...
AT5G06810.1  4.1e-110  39.45         Mitochondrial transcription termination factor family protein
AT3G60400.1  2.7e-61   29.32         Mitochondrial transcription termination factor family protein
AT1G74120.1  1.3e-07   28.70         Mitochondrial transcription termination factor family protein

Match Name                        E-value   Identity (%)  Description
gi|567912857|ref|XP_006448742.1|  4.8e-222  66.26         hypothetical protein CICLE_v10014682mg [Citrus clementina]
gi|641858721|gb|KDO77443.1|       1.8e-221  66.09         hypothetical protein CISIN_1g007621mg [Citrus sinensis]
gi|568828220|ref|XP_006468442.1|  1.8e-221  66.09         PREDICTED: uncharacterized protein LOC102621440 [Citrus sinensis]
gi|595902982|ref|XP_007213937.1|  7.0e-221  66.49         hypothetical protein PRUPE_ppa003449mg [Prunus persica]
gi|590621796|ref|XP_007024871.1|  2.2e-219  65.80         Mitochondrial transcription termination factor family protein, putative [Theobro...

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0003690  double-stranded DNA binding
Vocabulary: Biological Process
Term        Definition
GO:0006355  regulation of transcription, DNA-templated
Vocabulary: INTERPRO
Term        Definition
IPR003690   MTERF

GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006355 regulation of transcription, DNA-templated
cellular_component GO:0005739 mitochondrion
cellular_component GO:0005575 cellular_component
molecular_function GO:0003690 double-stranded DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG20g05980.1  Cp4.1LG20g05980.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02

IPR Term: IPR003690
IPR Description: Transcription termination factor, mitochondrial/chloroplastic
Source: PFAM
Source Term: PF02536
Source Description: mTERF
Alignment:
  coord: 261..358  score: 3.2E-8
  coord: 267..405  score: 2.4E-8
  coord: 450..552  score: 3.0

IPR Term: IPR003690
IPR Description: Transcription termination factor, mitochondrial/chloroplastic
Source: SMART
Source Term: SM00733
Source Description: mt_12
Alignment:
  coord: 470..501  score: 23.0
  coord: 502..535  score: 210.0
  coord: 213..244  score: 110.0
  coord: 317..348  score: 1600.0
  coord: 178..208  score: 3

IPR Term: None
IPR Description: No IPR available
Source: PANTHER
Source Term: PTHR13068
Source Description: CGI-12 PROTEIN-RELATED
Alignment:
  coord: 413..572  score: 1.7E-271
  coord: 61..396   score: 1.7E

IPR Term: None
IPR Description: No IPR available
Source: PANTHER
Source Term: PTHR13068:SF7
Source Description: MITOCHONDRIAL TRANSCRIPTION TERMINATION FACTOR FAMILY PROTEIN-RELATED
Alignment:
  coord: 413..572  score: 1.7E-271
  coord: 61..396   score: 1.7E
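
Two of the PFAM alignments listed above overlap (261..358 and 267..405), so summing their lengths would overstate how much of the protein the mTERF hits cover. One way to get the non-redundant span is to merge overlapping coordinate ranges first. The Python sketch below does this; `merge_intervals` is a hypothetical helper, the coordinates are assumed to be 1-based and inclusive, and directly adjacent ranges are treated as contiguous.

```python
def merge_intervals(intervals):
    """Merge overlapping or adjacent inclusive (start, end) ranges."""
    merged = []
    for start, end in sorted(intervals):
        # Extend the previous range if this one overlaps or touches it.
        if merged and start <= merged[-1][1] + 1:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# PFAM PF02536 alignment coordinates taken from the table above.
pfam_hits = [(261, 358), (267, 405), (450, 552)]

merged = merge_intervals(pfam_hits)
# Inclusive coordinates, so each range contributes end - start + 1 residues.
covered = sum(end - start + 1 for start, end in merged)
print(merged, covered)
```

The same helper can be applied to the SMART or PANTHER coordinates by swapping in the corresponding list of ranges.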

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene             Organism                    Block
Cp4.1LG20g05980  Cucurbita pepo (Zucchini)   cpecpeB048
Cp4.1LG20g05980  Cucumber (Gy14) v1          cgycpeB0625
Cp4.1LG20g05980  Cucurbita maxima (Rimu)     cmacpeB438
Cp4.1LG20g05980  Cucurbita moschata (Rifu)   cmocpeB403
Cp4.1LG20g05980  Wild cucumber (PI 183967)   cpecpiB520
Cp4.1LG20g05980  Cucumber (Chinese Long) v2  cpecuB518
Cp4.1LG20g05980  Melon (DHL92) v3.5.1        cpemeB477
Cp4.1LG20g05980  Cucumber (Gy14) v2          cgybcpeB083
Cp4.1LG20g05980  Melon (DHL92) v3.6.1        cpemedB565
Cp4.1LG20g05980  Silver-seed gourd           carcpeB0630
Cp4.1LG20g05980  Cucumber (Chinese Long) v3  cpecucB0637