American Journal of Bioinformatics Research

p-ISSN: 2167-6992    e-ISSN: 2167-6976

2013;  3(3): 35-41

doi:10.5923/j.bioinformatics.20130303.01

Structural Characterization of the NodD Transcription Factor

Natalia V. Kostiuk1, Maya B. Belyakova2, Dzhulianna V. Leshchenko2, Mikhail V. Miniaev3, Margarita B. Petrova1, Elena A. Kharitonova1

1Biology Department, Tver State Medical Academy, Tver, Russia

2Chemistry and Biochemistry Department, Tver State Medical Academy, Tver, Russia

3Research Center, Tver State Medical Academy, Tver, Russia

Correspondence to: Maya B. Belyakova, Chemistry and Biochemistry Department, Tver State Medical Academy, Tver, Russia.

Email:

Copyright © 2012 Scientific & Academic Publishing. All Rights Reserved.

Abstract

Using computer analysis and modeling we studied NodD transcription factor which controls expression of nodulation genes of symbiotic nitrogen-fixing bacteria. On the basis of the amino acid sequences it was predicted that in the secondary structure of NodD protein it must be present ten alpha helices and three beta sheets which form two domains after folding. Since NodD acts in oligomeric mode, it was analyzed all four known for other similar factors of transcription ways of assembly into dimers. Using molecular docking the probable mechanism of NodD activation by binding of low molecular weight inducer (on the model of luteolin) was clarified.

Keywords: Molecular Modeling, Molecular Docking, NodD-protein, Nodulation, Rhizobium

Cite this paper: Natalia V. Kostiuk, Maya B. Belyakova, Dzhulianna V. Leshchenko, Mikhail V. Miniaev, Margarita B. Petrova, Elena A. Kharitonova, Structural Characterization of the NodD Transcription Factor, American Journal of Bioinformatics Research, Vol. 3 No. 3, 2013, pp. 35-41. doi: 10.5923/j.bioinformatics.20130303.01.

1. Introduction

The formation of nitrogen-fixing nodules is a remarkable example of the symbiotic relationship between the bacteria and leguminous plants. These interactions are highly species-specific: each type of rhizobia complies with a limited host range[1]. At the molecular level, the selectivity is provided by numerous specific determinants. Some of them are specifically secreted by cells, such as plant flavonoids, bacterial lipooligosaccharides (NodD-factor) and proteins. Other specific determinants are integrated into cell membranes[1, 2]. Insight into the molecular mechanisms of symbiotic specificity makes it possible to develop new crop varieties and bacterial strains that could improve the agronomic potential of symbiotic nitrogen fixation[3].
Important role in the development of nodules belongs to nod-genes of rhizobia, which are organized in several operons and located either in the chromosome or inSym-plasmids[4]. The expression of many genes is controlled by the NodD transcription factor of nodulation[5]. Its activation occurs in response to the appearance of a plant flavonoid in bacterial cell[3]. Among others, complex flavonoid-protein binds to a promoter region of nodABCIJ operon. As a result of its activation a lipooligosaccharide is synthesized, which is a signal for cell division and the formation of the root nodule tissue[4].
NodD protein is a representative of a numerous family of LysR-type transcriptional regulators (LTTRs)[25].Molecules of these proteins include the two domains: highly conservative DNA binding domain and a variable regulatory domain. LTTRs can form homodimers, homotetramers, and homooctamers[7, 8]. Functionally active tetramer is a result of convergence of two dimers associated with DNA. Usually, the binding sites are located at a distance of 50-60 bp. Joining of LTTR leads to bending of the polypeptide chain which facilitates connection of RNA polymerase and initiation of transcription[7, 9, 10].
At present three-dimensional structure of NodD protein remains undeciphered. However, evidence was provided that NodD binds to target DNA through anchoring the two half-sites of the nod box as a tetramer[11, 12]. By experimental and computer methods the complexes of NodD with nodulation genes were studied[13-16]. An imperfect inverted repeat AT-N10-GAT was found in each half-site and is critical for NodD binding[12].
DNA binding ability of NodD depends on a number of plant flavonoids such as luteolin, naringenin, eriodictyol, daidzein, and other[3, 17]. However, at the present day the experimental data concerning the localization, structure, and functioning of the binding sites of flavonoids are absent. Also unclear mechanisms of NodD activation under the influence of a low-molecular inductor.
The aim of this study was the modeling of spatial assembling of NodD protein and the analysis of structure of binding site for low molecular weight inducer.

2. Materials and Methods

2.1. Analysis of the Amino Acid Sequence Homology

The amino acid sequence of NodD was found in the database National Center for Biotechnology Information (NCBI). Evaluation of homology as well as identification of other closely related proteins were carried out using software system BLAST[18]. All subsequent studies were performed on the basis of typical sequence of NodD-protein - AAA85282 owned by Rhizobium sp.

2.2. Secondary Structure Prediction

For the prediction of the secondary structure of the NodD- protein we used several programs based on different algorithms:
- PSSFinder[19] – prediction using Markov chains;
- PSIPRED[20, 21] - method of neural networks using data obtained from PSI-BLAST
- GOR5[22, 23] – Garnier-Osguthorpe-Robson method combines information theory and Bayesian statistics
- CDM[24] – combined method based on the method of GOR and the database of protein fragments FDM.

2.3. Modeling of the Tertiary Structure

Spatial structure was modeled by method of homology (program 3DJIGSAW[25-27], SWISS-MODEL[28-30]) and by ‘threading’ method (the program I-TASSER[31, 32]). In all cases, subunits of 2esn transcription factor were used as template. Additional energy minimization was performed using the Swiss-PDB-Viewer with potential forces of Gromos 96. The residue profiles of three-dimensional models were checked by Verify 3D. The presence of steric and conformational difficulties were assessed using Ramachandran plot and server ProCheck[33]. Residues located in the prohibited regions of Ramachandran plot were not found. Calculation of intramolecular interactions was made by Protein Interactions Calculator[34].

2.4. Docking with a Low Molecular Weight Ligands

Domain organization of the NodD-protein was studied involving database of Pfam[35]. Search for amino acid residues participated in the formation of flavonoid binding site was implemented by the program I-TASSER and COACH[36-37].
For NodD, flavonoids of various structures are low molecular weight ligand[3, 17]. As a model of the ligand we used luteolin. Docking of the ligand to the monomeric and oligomeric proteins was performed by servers SwissDoc[38, 39], PatchDoc[40, 41]. Docked structures with the best estimate were optimized using FineDoc[42, 43].

2.5. Modeling of the Oligomeric Structure

Obtained in the previous stage model of spatial packing of polypeptides were used to create oligomer. Formation of dimeric structures was performed by servers SymmDock[41, 44], ClusPro[45-47]. Additional energy minimization was carried out using the Swiss-PDB-Viewer. For identifying of steric hindrance, Ramachandran plot and ProCheck were used. The residue profiles were checked by Verify3D.

3. Results and Discussion

3.1. Analysis of the Amino Acid Sequences of NodD

Currently most sequenced NodD proteins aresingle-stranded molecules of 302-337 amino acids in length. The share of identical amino acids in the orthologs accounts for at least 52%.
In order to identify closely related proteins it was performed paired and multiple alignment of studied sequence AAA85282 with other members of the LysR subfamily. 21 proteins were compared: AmpR, CatM, CatR, CynR, CysB, GltC, IciA, IlvY, IrgB, LysR, MetR, NahR, NhaR, NolR, OxyR, PssR, RbcR, SyrM, TcbR, TfdS, TrpI. Multiple alignments did not find a significant similarity within the group. Paired alignments resulted in identification of six proteins with homology with the NodD over 20% (Table 1). In other cases, similar were only short fragments scattered throughout the chain.
Table 1. Proteins of LysR Subfamily Homologous to the NodD
     

3.2. The Prediction of Secondary Structure Elements in the NodD Protein

With a high probability, ten alpha helices extending up to 2.5 turns, and three beta sheets are found in the molecule. The existence of seven other secondary structure elements and their length cannot be determined unambiguously (Fig. 1).

3.3. Modeling of the Structure of Monomeric Protein NodD

Models of the spatial structure of the monomer NodD obtained by different methods show similar folding motifs but differ in details of structure in irregular regions (Fig. 2). Studied protein is L-shaped with distinct structural domains (Fig. 3). Small N-terminal domain (residues 1-74, the linear size 40 Å × 28 Å × 25 Å) is constructed mainly of spiral elements. In large C-terminal domain (residues 90-308, the linear size 64 Å × 37 Å × 26 Å) the two isolated beta sheets are closed by alpha helices. Domains are connected by a linker region (residues 75-89). In general, the distribution of secondary structure elements matched with predicted by analysis of the amino acid sequence. Calculation of intramolecular interactions showed that the tertiary structure of the protein is supported only by hydrogen, ionic bonds and hydrophobic interactions. Disulfide bonds were not found in the molecule.
Figure 1. The prediction of secondary structure elements of NodD
Figure 2. Superimposition of NodD models: red line - homology modeling, blue line – protein threading

3.4. Docking of NodD Protein with Inducer

Additional analysis of protein domain organization involving Pfam database showed a correspondence between structural and functional domains. The N-terminus of the molecule contains hth-domain (‘helix-turn-helix’),responsible for binding with DNA. LysR-substrate domain which provides inductor binding localizes on C-terminal. According to the estimates, the formation of flavonoid binding site may involve amino acid residues 104, 106, 107, 110, 111, 151, 166, 168, 246, 273 (BS-score 1,05). Thus inducer binding site must be located in a recess between two sheets of C-domain (Fig. 3).
Figure 3. Probable spatial structure of the NodD protein, presented in different ways: A - ball model B - ribbon model
Figure 4. Location of flavonoid in the binding center
Docking of NodD monomer and flavonoid showed that it is theoretically possible to bind the ligand in 6 different positions. However, as predicted by other methods, the insertion of the planar flavonoid molecule into the hydrophobic pocket between beta sheets is preferred. Assessment of contact surfaces has shown their good coincidence. The specificity of the interaction between the protein and ligand can be provided by two hydrogen bonds. In this case, the hydroxyl groups of luteolin act as hydrogen donors and the carboxyl groups of asp-135 and asp-153 serve as acceptors (Fig. 4). It is possible that in interactions with other flavonoids the hydrogen bond may also occur due to ser-104.

3.5. Prediction of Oligomeric Structures

It is known that bacterial transcription factors LTTF are present in the cell in a variety of oligomeric structures. In a free state homodimers are commonly found. They are responsible for binding with low molecular weight inducer [7]. Functionally active homotetramers arise upon binding to DNA[9,10]. Role of homooligomers is not fully understood [8].
Although the domain organization of the LTTF monomers shows a sufficiently high conservatism, formed by them oligomers vary greatly due to structural flexibility. Among dimers there were found V-shaped molecules formed on the basis of contacts between NN and CC domains[48], the molecules with tightly closed subunits connected by principles head-to-head[49] or head-to-tail[50]. Open and closed tetrameric forms were described[51].
We analyzed all four variants of dimeric structures known from the literature (Fig. 5, Table 2). The formation of both types of symmetrical V-shaped structures with the N or C domains in the base is energetically allowed (Fig. 5, A-D). In analogy with the transcription factor ArgP[48] we can assume that the first type of dimer interacts with the promoter region of the gene and the second type illustrates possible contact between the C-domain during the functioning of the tetramer. Characteristically, the angle between the subunits may differ noticeably. In the extreme case the complete closure of the subunits is observed (Fig. 5, E). Probably, such flexibility of the structure is required for the functioning of the protein. In the extreme case observed complete closure of the subunits (Fig. 5, E). The binding of low-molecular inductor changes the relative position of the subunits in the dimer. As a result, its attachment to DNA is facilitated. Arising in this case conformational changes make it possible to form a tetramer. The double helix of DNA is bent to facilitate the binding of RNA polymerase. Permissibility of formation of tightly closed head-to-tail dimer draws attention (Fig. 5, F). However, in this case mechanism of its functioning remains unclear.
Table 2. The Energy of Formation of Dimeric Structures
     
Figure 5. Dimers of NodD: V-shaped structures with the N-domaisn in the base (A, B); V-shaped structures with the C-domains in the base (C, D); molecules with a tight closing of subunits according to the principle head-to-head (E) and head-to-tail (F)

4. Conclusions

The work performed an attempt to clarify the features of binding of the NodD transcription factor, involved in the regulation of nodulation, with model inducer. Since the spatial structure of the NodD protein still is not solved, we assessed probable elements of protein secondary structure and modeled its tertiary structure. It has shown the possibility of formation of four types of dimers which differ in the type of interaction between the subunits. Probably, such a structural flexibility of NodD is essential to its functioning. Docking of model flavonoid luteolin allowed to determine the position of the binding site of the ligand. Results of the study are important for understanding of the NodD structure. The created model provides a rational basis for the planning of experiments to determine the contribution of different amino acid residues in the formation of the NodD - flavonoid complex. It may be interesting from the point of view of management of NodD activity as well as change of specificity of interaction. The high specificity of such interaction is a serious obstacle in overcoming of species barrier in a symbiotic relationships ‘bacteria-plants’.

References

[1]  Wang, D., Yang, S., Tang, F., Zhu, H, “Symbiosis specificity in the legume: rhizobial mutualism,” Cell Microbiol, 14(3).334-42.Mar.2012
[2]  Fauvart, M., Michiels, J, “Rhizobial secreted proteins as determinants of host specificity in the rhizobium-legume symbiosis,“ FEMS Microbiol Lett, 285 (1).1-9. Aug. 2008
[3]  Hassan, S., Mathesius, U, “The role of flavonoids in root-rhizosphere signalling: opportunities and challenges for improving plant-microbe interactions,” J Exp Bot, 63 (9).3429-44. May.2012
[4]  Masson-Boivin, C., Giraud, E., Perret, X., Batut, J, “Establishing nitrogen-fixing symbiosis with legumes: how many rhizobium recipes?” Trends Microbiol, 17 (10).458-66. Oct.2009
[5]  Gyögypal, Z., Kiss., G.B.,Kondorosi., F, “Transduction of plant signal molecules by the Rhizobium NodD proteins,” Bioessays, 13 (11).575-81.1991
[6]  Schell, M.A, “Molecular biology of the LysR family of transcriptional regulators,” Annu. Rev.Microbio,47.597–626.1993
[7]  Maddocks, S.E., Oyston, P.S, “Structure and function of the LysR-type transcriptional regulator (LTTR) family proteins,” Microbiology, 154 (Pt 12).3609-23.2008
[8]  Sainsbury, S., Lane, L.A., Ren, J., Gilbert, R.S., Saunders, N.J., Robinson, C.V., Stuart, D.I. and Owens, R.J, “The structure of CrgA from Neisseria meningitides reveals a new octameric assembly state for LysR transcriptional regulators,” Nucleic Acids Res, 37.45-4558.2009
[9]  Muraoka, S., Okumura, R., Ogawa, N., Nonaka, T., Miyashita, K. and Senda, T, “Crystal structure of a full-length LysR-type transcriptional regulator, CbnR: unusual combination of two subunit forms and molecular bases for causing and changing DNA bend,” J Mol Biol, 328 (3).555-66. 2003
[10]  Porrúa, O., García-Jaramillo, M., Santero, E. and Govantes, F, “The LysR-type regulator AtzR binding site: DNA sequences involved in activation, repression and cyanuricacid-dependent repositioning,” Mol Microbiol, 66. 410–427.2007
[11]  Fisher, R.F., Long, S.R, “Interactions of NodD at the nod Box: NodD binds to two distinct sites on the same face of the helix and induces a bend in the DNA,” J Mol Biol, 233(3).336-48.1993
[12]  Feng, J., Li, O., Hu, H-L., Chen, X-C. and Hong G-F, “Inactivation of the nod box distal half-site allows tetrameric NodD to activatenodA transcription in aninducer-independent manner,” Nucleic Acids Res, 31 (12).3143–3156.2003
[13]  Khan, F., Agarwal, S., Mishra, B.N, “Genome wide identification of DNA binding motifs of NodD-factor in Sinorhizobium meliloti and Mesorhizobium loti,” J Bioinform Comput Biol, 3 (4).773-801.2005
[14]  Chen, X-C., Feng, J., Hou, B-H., Li, F-O., Li, Q. and Hong G-F, “Modulating DNA bending affects NodD-mediated transcriptional control in Rhizobium leguminosarum,” Nucleic Acids Res, 33 (8).2540–2548.2005
[15]  Hou, B., Li, F., Yang, X., Hong, G, “The properties of NodD were affected by mere variation in length within its hinge region,” Acta Biochim Biophys Sin (Shanghai), 41(11).963-71.2009
[16]  Hou, B., Li, F., Yang, X., Hong, G, “A small functional intramolecular region of NodD was identified by mutation,” Acta Biochim Biophys Sin (Shanghai), 41(10):822-30.2009
[17]  Peck, M.C., Fisher, R.F., Long, S.R, “Diverse flavonoids stimulate NodD1 binding to nod gene promoters in Sinorhizobium meliloti,” J Bacteriol. 188 (15).5417-27.2006
[18]  Altschul, S. F., Gisli, W., Miller, W., Mayer, E. W. and Lipman, D. J, “Gapped BLAST and PSI-BLAST: A New Gen-eration of Protein Database Search Programs,” Nucleic Acids Research, 25 (17).3389-3402. 1997
[19]  Salamov, A.a., Solovyev, V.V., Prediction of proteinsecondary structure by combining nearest-neighbor algorithms and multiply sequence alignments,” J Mol Biol, 247(1).11-15.1995
[20]  Buchan, D.W., Ward, S.M., Lobley, A.E., Nugent, T.C., Bryson, K. and Jones, D.T, “Protein annotation and modelling servers at University College London,” Nucl. Acids Res, 38.563-568. 2010
[21]  Jones, D.T, “Protein secondary structure prediction based on position-specific scoring matrices,” J Mol Biol,292.195-202.1999
[22]  Kloczkowski, A., Ting, K.-L, Jernigan, R.L., Garnier, J., “Combining the GOR V algorithm with evolutionary information for protein secondary structure prediction from amino acid sequence,” Proteins, 49.154-166.2002
[23]  Sen, T.Z., Jernigan, R.L., Garnier, J., Kloczkowski, A, “GOR V server for protein secondary structure prediction,” Bioinformatics, 21 (11).2787-2788.2005
[24]  Sen, T.Z., Cheng, H., Kloczkowski, A., Jernigan, R.L, “A Consensus Data Mining secondary structure prediction by combining GOR V and Fragment Database Mining,” Protein Sc,. 15 (11).2499-2506.2006
[25]  Bates, P.A., Kelley, L.A., MacCallum, R.M. and Sternberg, M.J.E, “Enhancement of Protein Modelling by Human Intervention in Applying the Automatic Programs3D-JIGSAW and 3D-PSSM,” Proteins: Structure, Function and Genetics, 5.39-46.2001
[26]  Bates, P.A, Sternberg, M.J.E, “Model Building by Comparison at CASP3: Using Expert Knowledge and Computer Automation,” Proteins: Structure, Function and Genetics, 3.47-54.1999
[27]  Contreras-Moreira, B., Bates, P.A, “Domain Fishing: a first step in protein comparative modeling,” Bioinformatics, 18.1141-1142.2002
[28]  Arnold, K., Bordoli, L., Kopp, J., and Schwede, T, “The SWISS-MODEL Workspace: A web-based environment for protein structure homology modeling,” Bioinformatics, 22.195-201.2006
[29]  Kiefer, F, Arnold, K, Künzli, M, Bordoli, L, Schwede, T, “The SWISS-MODEL Repository and associated resources,” Nucleic Acids Research, 37.387-392.2009
[30]  Peitsch, M. C, “Protein modeling by E-mail,” Nature Biotechnology, 13.658-660.1995
[31]  Zhang, Y, “I-TASSER server for protein 3D structure prediction,” BMC Bioinformatics, 19.40.2008
[32]  Roy, A., Kucukural, A., Zhang, Y, , “I-TASSER: a unified platform for automated protein structure and function prediction,” Nature Protocols, 5.725-738.2010
[33]  Laskowski, R. A., MacArthur, M. W., Moss, D. S. and Thornton, J. M, “PROCHECK: A Program to Check the Stereochemistry of Protein Structures,” Journal of Applied Crystallography, 26.283-291. 1993
[34]  Tina, K. G., Bhadra, R. and Srinivasan, N, “PIC: Protein Interactions Calculator,” Nucleic Acids Res,35.473–476.Jul.2007
[35]  Finn, R.D., Mistry, J., Tate, J., Coggill, P., Heger, A., Pollington, J.E., Gavin, O.L., Gunesekaran, P., Ceric, G., Forslund, K., Holm, L., Sonnhammer, E.L., Eddy, S.R. and Bateman A, “The Pfam protein families database,” Nucleic Acids Research, 38.211-222.2010
[36]  Roy, A., Yang, J. and Zhang, Y, “COFACTOR: An accurate comparative algorithm for structure-based protein function annotation,” Nucleic Acids Res, 40.471-477.2012
[37]  Yang, J., Roy, A. and Zhang, Y, “BioLiP: a semi-manually curated database for biologically relevant ligand-protein interactions,” Nucleic Acids Res, 41.1096-1103.2013
[38]  Grosdidier, A., Zoete, V., Michielin, O, “SwissDock, a protein-small molecule docking web service based on EADock DSS,” Nucleic Acids Res, 39.270-7.2011
[39]  Grosdidier, A., Zoete, V., Michielin, O, “Fast docking using the CHARMM force field with EADock DSS,” J Comput Chem. doi: 10.1002/jcc.21797[Epub ahead ofprint].May.2011
[40]  Duhovny, D., Nussinov, R., Wolfson, H.J, “Efficient Unbound Docking of Rigid Molecules”, in 2'nd Workshop on Algorithms in Bioinformatics(WABI), Rome, Italy, Lecture Notes in Computer Science 2452, 185-200. 2002
[41]  Schneidman-Duhovny, D., Inbar, Y., Nussinov, R., Wolfson, H.J, “PatchDock and SymmDock: servers for rigid and symmetric docking,” NAR, 33. 363-367. 2005
[42]  Andrusier, N., Nussinov, R. and Wolfson H. J, “FireDock: Fast Interaction Refinement in Molecular Docking,” Proteins, 69 (1).139-159.2007
[43]  Mashiach, E., Schneidman-Duhovny, D., Andrusier, N., Nussinov, R. and Wolfson H. J, “FireDock: a web server for fast interaction refinement in molecular docking,” Nucleic Acids Res, 36.229-232.2008
[44]  Schneidman-Duhovny, D., Inbar ,Y., Nussinov, R., Wolfson, H.J, “Geometry based flexible and symmetric protein docking,” Proteins, 60.224-231.2005
[45]  Kozakov, D., Hall, D. R., Beglov, D., Brenke, R., Comeau, S. R., Shen, Y., Li, K., Zheng, J., Vakili, P., Paschalidis, I. C. and Vajda, S, “Achieving reliability and high accuracy in automated protein docking: Cluspro, PIPER, SDU, and stability analysis in CAPRI rounds 13–19,” Proteins: Structure, Function, and Bioinformatics, 78.3124–3130.2010
[46]  Kozakov, D., Brenke, R., Comeau, S.R., Vajda, S, “PIPER: An FFT-based protein docking program with pairwise potentials,” Proteins: Structure, Function,and Bioinformatics, 65 (2). 392-406.2006
[47]  Comeau, S.R., Gatchell, D.W., Vajda, S., Camacho, C.J, “ClusPro: a fully automated algorithm for protein-protein docking,” Nucleic Acids Research, 32.96-99.2004
[48]  Zhou, X., Lou, Z., Fu, S., Yang, A., Shen, H., Li, Z., Feng, Y., Bartlam, M., Wang, H., Rao, Z., “Crystal structure of ArgP from Mycobacterium tuberculosis confirms two distinct conformations of full-length LysR transcriptional regulators and reveals its function in DNA binding and transcriptional regulation,” J Mol Biol. 396(4).1012-24.2010
[49]  Zaim, J., Kierzek, AM., “The structure of full-length LysR-type transcriptional regulators. Modeling of the full-length OxyR transcription factor dimmer,” Nucleic Acids Res,. 31(5): 1444–1454. 2003
[50]  Monferrer, D,. Tralau, T., Kertesz, MA,, Dix, I., Solà, M., Usón, I., “Structural studies on the full-length LysR-type regulator TsaR from Comamonas testosteroni T-2 reveal a novel open conformation of the tetrameric LTTR fold”, Mol Microbiol. 75(5):1199-214. 2010.
[51]  Muraoka, S., Okumura, R., Ogawa, N., Nonaka, T., Miyashita, K., Senda, T. “Crystal structure of a full-length LysR-type transcriptional regulator, CbnR: unusual combination oftwo subunit forms and molecular bases for causing and changing DNA bend,” J. Mol. Biol. 328: 555-556.2003.