Saturday, January 30, 2010

antiviral protein

Researchers have shown how an antiviral protein produced by the immune system, dubbed tetherin, tames HIV and other viruses by literally putting them on a leash, to prevent their escape from infected cells. The insights, reported in the October 30th issue of the journal Cell, a Cell Press publication, allowed the research team to design a completely artificial protein -- one that did not resemble native tetherin in its sequence at all -- that could nonetheless put a similar stop to the virus.









"Tetherin is essentially a rod with anchors at either end that are critical for

its function," says Paul Bieniasz of Howard Hughes Medical Institute and the

Aaron Diamond AIDS Research Center at The Rockefeller University. Either one of

those anchors gets incorporated into the envelope surrounding HIV or other

viruses as they bud through the plasma membrane of an infected cell. "



One anchor gets into the virus and the other in the cell membrane to inevitably form a tether.







"We showed we could design a completely different protein with the same

configuration -- a rod with lipid anchors at either end -- and it worked very

well,"



he continued. The finding helped to confirm that tetherin is capable of acting all on its own, he added.



They also explain tetherin's broad specificity to protect against many viruses. "It is just targeting lipids," Bieniasz said. "It's not about viral proteins." That's conceptually important, he continued, because there is no specific interaction between tetherin and any viral protein, which makes it a more difficult problem for viruses to evolve resistance. Rather than tweaking an existing protein-coding gene, "the virus has to make the more difficult adjustment of acquiring a new gene antagonist [of tetherin]."



Unfortunately, many viruses have managed to do just that. In the case of HIV, a protein called Vpu counteracts tetherin. They now show it does so by sequestering the host protein, which prevents its incorporation into the virus. The new insight into tetherin's and Vpu's modes of action, however, may lead to the development of Vpu blockers that could free up the innate host defense and inhibit HIV's spread, Bieniasz suggests.



Bieniasz said there is some possibility that tetherin exists in different forms that might explain differences among people in the progression of HIV or other viral infections. However, the only common variation they've seen in the tetherin gene so far does not appear to affect its function. The tetherin sequence does vary quite a lot from one species to the next, he added, as is often the case due to strong selection when host defense genes meet viral inhibitors.



To place the findings in context, Bieniasz says it is worth noting that tetherin is encoded by just one of more than 900 genes that get switched "on" in response to interferon, a cell signaling protein of the immune system.







"There are hundreds of interferon-induced genes," he said. "The functions are

known for only a very small number -- less than a dozen. There are potentially a

large number of antiviral mechanisms we still know nothing about."







Going forward, his team intends to look more closely at many of those others, and Bieniasz suspects more surprising mechanisms will be in store.

Saturday, January 23, 2010

Inefficient Selection: New Evolutionary Mechanism Accounts For Some Of Human Biological Complexity


A careful analysis of thousands of genes and the proteins they encode show that humans are biologically complex, at least in part, by the way humans evolved to deal with redundancies resulting from the duplicate genes.



"We found a specific evolutionary mechanism to account for a portion of the
intricate biological complexity of our species, "said Ariel Fernandez, professor of
of bioengineering at Rice University. "It's a defense mechanism, a process that
allows us to address the fitness consequences of inefficient selection. It
enables some of our proteins to be more specialized in time, and in turn
makes us more complex. "



Fernandez is the author of a paper scheduled to appear in the December issue of the journal Genome Research. The line of research is available.

Fernandez said the study draws from past performance by its own research group and from the seminal work of Michael Lynch, distinguished professor of biology at Indiana University and recently elected to the National Academy of Sciences. Lynch's work has shown that natural selection is less effective in humans, compared with simple creatures such as bacteria. This "inefficiency of selection" is derived from the smaller population size of humans compared with unicellular organisms.



"In all organisms, genes are duplicated from time to time, for reasons not
fully understand, "said Fernandez." When working efficiently, natural selection
eliminates many of these copies, which are called 'paralogs. In our previous
work, we saw an unusual number of gene copies had survived in the human
genome, which makes sense given the inefficiency of selection in humans. "

In previous research on protein structure, the team of Fernandez noted that some proteins are packaged worse than others. Moreover, they found that the least efficient Packed proteins are structurally stable only when coupled with associated proteins to form complexes.



Proteins "These poorly packed are potential troublemakers when gene duplication
happens, "said Fernandez." The more copies of paralogs encodes the protein that
the body needs. This is called an "imbalance of doses, and that can make us sick.
For example, the imbalance in the doses have been implicated in Alzheimer's disease and other
diseases.

Given the inefficiency of selection, Fernandez knew that badly packed paralogs encoding proteins could remain in the human genome for a while. So he and graduate student decided to examine whether Jianpeng Chen had kept duplicates of genes in the genome of time for random genetic mutations affect the difference paralogs. Fernandez and Chen, now a senior researcher in Beijing, China, Cross-examined databases on genomics, protein structure, regulation and protein expression in these paralogs microRNA problematic.



"The longer these duplicated genes become inefficient due to the selection, the
more likely to undergo a random mutation, "said Fernandez." Parties
gene acts to regulate the expression of proteins - by binding with microRNAs, for
example. We found numerous cases where the random mutations had caused paralogs
as opposed to express, so that eliminated the harmful dose
imbalances ".

Lynch said that one aspect of research that is potentially revolutionary Fernandez is the trend of proteins to develop a more open structure in complex organisms.



"This observation is consistent with the general theory that large agencies with
relatively small size of the population - compared with microbes - are subject to
vagaries of random genetic drift and therefore the accumulation of very slightly
deleterious mutations,

Lynch said.

At first, he said, the accumulation of these mutations may promote stability slight breakdown of proteins. This in turn opens the door to interactions with other proteins that can restore a measure of that lost stability.



"These are the roots potential for the emergence of novel protein-protein
interactions, which are the hallmark of evolution in the complex, multicellular
species, "Lynch said." In other words, the origins of some key aspects of the
evolution of complexity can be caused completely adaptive
processes.

Fernandez said the research reveals how increasingly specialized proteins can evolve. He drew an analogy to a company hired two delivery drivers initially covering the same parts of the city, but eventually specialize in offering only specific neighborhoods.



"Over time, even if times become tough, you can not dismiss any of them
because each of them became so specialized that your company needs both, "

He said.

The simplest is a creature, less specialized proteins it has. Humans and other mammals in order many specialized proteins needed to build the specialized tissues of the skin, skeleton and organs. Even more specialized proteins are required to maintain and regulate them. This complexity requires making a duplicate original of all trades of genes retained, but this will not happen unless the selection is inefficient. This is often a point of controversy between supporters of evolution and intelligent design.

Fernandez and Chen looked duplicated genes in the human genome and found that the poorer focused on a protein was most likely that distinguish paralogs through specialization.



"This supports the case of evolution, because it shows that you can drive
complexity, with random mutations in the duplicated genes, "said Fernandez." But this
also means that random drift must prevail over Darwinian selection. In other
words, if Darwinian selection has been ruthlessly efficient in humans - as it is in
bacteria and unicellular eukaryotes - then our level of complexity not
possible ".

The research is supported by the National Institutes of Health.

Phylogenetic tree


A phylogenetic tree or evolutionary tree is a tree showing the evolutionary relationships among various biological species or other entities that are believe to have a common ancestor. In a phylogenetic tree, each node with descendants represents the most recent common ancestor of the descendants, and The Edge lengths in some trees correspond to time estimates. Each node is called a taxonomic unit. Internal nodes are generally called hypothetical taxonomic units (HTUs) as they can not be directly observed.

A rooted phylogenetic tree is a directed tree with a unique node Corresponding to the (usually imputed) most recent common ancestor of all the entities at the leaves of the tree. The most common method for rooting trees is the use of an uncontroversial outgroup - close enough to allow inference from sequence or trait data, but far enough to be a clear outgroup.

Unrooted trees illustrates the relatedness of leaf nodes without making assumptions about common ancestry. While unrooted trees can always be generated from rooted ones by simply omitting the root, root can not be inferred from an unrooted tree without some means of Identifying ancestry, this is normally done by including an outgroup in the input data or introducing additional assumptions about the relative rates of evolution on each branch, such as an application of the molecular clock hypothesis. Figure 1 depicts an unrooted phylogenetic tree for myosin, a superfamily of proteins.

Both rooted and unrooted phylogenetic trees can be either bifurcating or multifurcating, and either labeled or unlabeled. A bifurcating tree has exactly two descendants Arising from each internal node, while a tree multifurcating may have more than two. A labeled tree has specific values assigned to its leaves, while an unlabeled tree, sometimes called a tree shape, only defines a topology. The number of possible trees for a given number of leaf nodes depends on the specific type of tree, but there are always more multifurcating than bifurcating trees, more labeled than unlabeled trees, and more rooted than unrooted trees. The last distinction is the most biologically relevant; it Arises Because there are many places on an unrooted tree to put the root. Among labeled bifurcating trees, the number of unrooted trees with n leaves is equal to the number of rooted trees with n - 1 leaves.

A dendrogram is a broad term for the Diagrammatic representation of a phylogenetic tree.

A cladogram is a tree formed using cladistic methods. This type of tree only represents a branching pattern, ie, its branch lengths do not represent time.

A phylogram is a phylogenetic tree that explicitly represents number of character changes through its branch lengths.

An ultrametric tree or chronogram is a phylogenetic tree that explicitly represents evolutionary time through its branch lengths.

Phylogenetic trees among a nontrivial number of input sequences are constructed using computational phylogenetics methods. Distance-matrix methods such as neighbor-joining or UPGMA, Which calculate genetic distance from multiple sequence alignments, are simplest to implement, but do not invoke an evolutionary model. Many sequence alignment methods such as ClustalW also create trees by using the simpler algorithms (ie those based on distance) of tree construction. Maximum parsimony is another simple method of estimating phylogenetic trees, but implies an implicit model of evolution (ie parsimony). More advanced methods use the optimality criterion of maximum likelihood, often within a Bayesian Framework, and apply an explicit model of evolution to phylogenetic tree estimation. Identifying the optimal tree using many of these techniques is NP-hard, so heuristic search and optimization methods are used in combination with tree-scoring functions to identify a reasonably good tree that fits the data.

Tree-building methods can be assessed on the basis of several criteria:

* Efficiency (how long does it take to compute the answer, how much memory does it need?)
* Power (does it make good use of the data, or is information being wasted?)
* Consistency (will it converge on the same answer Repeatedly, if each time given different data for the same model problem?)
* Robustness (does it cope well with Violations of the assumptions of the underlying model?)
* Falsifiability (does it alert us when it is not good to use, ie when assumptions are violated?)

Tree-building techniques have also gained the attention of mathematicians. Trees can also be built using T-theory.

Tree View X


TreeView X is an open source program to display phylogenetic trees on Linux, Unix, Mac OS X and Windows. You can read and display NEXUS and Newick format tree files (such as output by PAUP *, ClustalX, TREE-PUZZLE, and other programs). It offers a subset of the functionality of the TreeView version available for Mac and Windows Classic (which is roughly equivalent to version 0.95 of TreeView).
Download
Linux

To run the TreeView X you have to download and install wxWidgets. To recommend wxGTK Linux. Please use the stable version, not the development version. The X TreeView was then built with wxGTK 2.4.2. I placed a copy on this server.
Executable RPM (based on Red Hat 8) TV-0.5-1.i386.rpm
Note that the RPM does not support SVG images. To obtain this support will have to compile the source code. The reason for this is that SVG support is a "contribution" to wxWidgets and is not packaged in the RPM of wxGTK.
Source (tar.gz) TV-0.5.1.tar.gz
wxWidgets 2.4.2 for GTK (see wxWidgets.org for the latest version). You can get the RPM for wxGTK 2.4.2 here.
Unix

TreeView X should not rely on any Unix that supports wxWidgets. I've built for Solaris 8 wxWidgets 2.x running Motif or X11.
Source (tar.gz) TV-0.5.1.tar.gz
Windows

Installer executable (setup.exe)
Source (TV-0.5.zip)
Mac OS X 10.3 (Panther)

Unlike the Linux version, the version of Mac OS X works best with the edge of the "bleeding", the development version of wxWidgets. The next version was built with wxWidgets 2.5.3.
Executables (dmg) TreeView0.5.0.dmg
Source (tar.gz) TV-0.5.tar.gz
Previous versions

Platform release Filetype file
Linux binary RPM 0.4-0.4-1.i386.rpm TV
Linux source RPM 0.4-0.4-1.src.rpm TV
Linux 0.4 Source (gzipped tar file) TV-0.4.tar.gz
Windows binary installer 0.4 tv-0.4.zip
TV 0.4 Mac binary-0.4.zip

Sequence Extractor

Extract the sequence of clicks generate restriction map and Recovery preliminary map DNA sequences. Translated protein and intron / Exon borders as shown. Extractor use DNA sequences to build structures in silicon.

Features:
The adoption of template sequences in GenBank, EMBL, FASTA, or raw form.
Shows the reverse and forward translations included in the EMBL gene bank or Log In.
Using protein sequence included in the EMBL gene bank and enter the computing new translations (and avoid the need to obtain the correct genetic code).
For FASTA and the transfer of RAW files can show a framework for reading, and reading frames, or one of three all six reading frames.
The identification of sites can be demonstrated, and a summary of the positions of the restriction site is available.
Preliminary locations have steel in order to show forward and backward directions the template, and a summary of the preliminary sites is available.
Sites can be pre-minus can be found.
HTML in the entry on the JavaScript code. This code allows you to point to restrict the location of primer, or a translation (of the EMBL gene bank or translation) to find out more information about the location and characteristics.
You can click the two labels restrict the site to rise to the sequence between them to appear in a separate window. Use this function to obtain a series of restriction fragment Abstract.
You can double the figure for the paint (one to alleviate the tendency to forward one to alleviate the opposite direction) to produce a product recovery is expected to appear in a separate window. Sequence of primers is included in the product.
If the template is a Gene Bank and EMBL sequence record you can download the highest coding sequences, coding sequences marked with the letters, and the translation of the figure by using the links supplied

NTSYSpc, Numerical Taxonomy System


Version 2.2 for Windows NT/2000/XP & Vista
NTSYSpc can be used to detect the pattern and structure of multivariate data. For example, you may want to verify that a sample of data points indicates that the samples can come from two or more special populations or to calculate phylogenetically tree using neighbor joining or UPGMA methods to build dendrograms With equal interest is the discovery that a change in some subsets of variables are interrelated (recovered). Origin as NTSYS program in 1960, but over the years has been completely redesigned and greatly expanded use a computer.

The data may be descriptive information about collections of objects or directly measured similarities or differences between all pairs of objects. Species descriptions and used items depends on demand abundances and morphological characters, species presence and absence of property, etc. NTSYSpc can transform data, evaluation of DUI similarities between objects, and prepare a summary analysis using group relations, coordination and analysis several factors. Many of the results can be shown by numerical and graphical. The software is designed for classroom and research.

Version 2.2 for Windows is easy to use, but still has speed and functionality than previous versions. There is an interactive way (with "fill in blank" forms of entry) and the batch mode with a simple command language (useful for simulation and analysis of multiple data sets). The program makes use of the Windows environment and allows long file names and processing of large datasets. Land options window allows you to customize plots (specify titles, fonts, sizes, colors, sizes, line widths, background colors, borders, and many other aspects of what is plotted). There is also a print preview mode. NTS data files are ASCII files that can be shared with other programs. Long lines of data are maintained. A spreadsheet-like editor is included information that makes it easy to create and edit data files. It can also be used as a ASCII text editor for very large files. Matrices can be read by Excel, CSV and XLS files, and the tree can be read from one type of file. Another alternative is provided for the production of MATLAB files M.

Some features include NTSYSpc are listed below.
Similarity and difference: the correlation distance, association coefficients, 34 and 11 genetic distance coefficients.
Clustering: UPGMA and SAHN methods other hierarchical (allows connections). Proximity switch method (including new version of unweighted). Some types of consensus trees.
Graph theoretic methods: the minimum time, which includes trees. Graphics (ineradicable trees) from the neighbor to switch method.
Coordination: The main components and basic coordinates analysis, correspondence analysis, metric and non-metric multidimensional scale analysis, singular value-decompositions, projections on the axes and the method of Burnaby. Canonical variates analysis. Several programs for analysis, common principal components analysis, partial least squares, multiple correlation and canonical correlation are also included.
Interactive graphics: phenograms, phylogenetically trees, 2D scatter plots, compared to DUI / similarity matrices, the outlines of the Fourier plots, plots Procrustes and the prospect of 3-D plots.
Multivariate Tests: canonical variates analysis, tests for homogeneity of Covariance matrices, tests for the number of dimensions, generalized multivariate multiple regression analysis. There are also provisions for the Bootstrap, jackknife and simulation experiments.
Geometric Morphometrics: includes specialized modules for Procrustes analysis configuration lay important plot the results of the Procrustes analysis, Fourier analysis (including 2D and 3D ellipsoid) shapes of the contour, plot outlines and Fourier coefficients, and calculation of 2D and 3D partial results and evaluations of all stays uniform.
Other: includes comparison of matrices of cophenetic correlation, Mantel test, 3-way Mantel test, data standardization and transformations of the matrix (simple functions, delete, and move now matrix). Matrices can be split or combined.

Requirements: Windows NT/2000/ME/XP/Vista. Works with about 9 MB of disk space after installation is complete.

DNA replication


DNA is the same from uncoiling spirals, break Strand separation of hydrogen bonds between complementary strands, as the synthesis of two new strands by additional base pairing. Origin of replication begins in a special place in DNA is called replication. DNA replication origin of bidirectional replication. To begin replication of DNA unwinding enzymes called DNA helicases causes two strands of DNA to relax and parents separated from each other at the beginning of replication to form two Y "-shaped" forks repetition. These replication forks are the right place to DNA copying. Helix destabilizing protein related to a stranded regions, so that two strands are not refundable. Topoisimerases produce enzymes called DNA breaks in, and then rejoin them in order to ease tension in the helical molecule during replication. As topics continue to gain momentum and some in both directions across the DNA molecule, connecting with hydrogen-free DNA nucleotides with those produced by each parent Strand add new strands. Since the new composition of nucleotides in each of the parents Strand opposite the connection cost hydrogen, enzymes called DNA polymerase to join nucleotides by phosphodiester bonds. Fact, nucleotides are organized in a base pairing triphosphates other deoxynucleoside, composed of nitrogenous bases, deoxyribose, and three phosphates. As the link between the forms of Phosphodiester group 5 'nucleotide phosphate of new and OH 3' nucleotide of the last in a chain of DNA, two phosphates are removed providing energy for communication. Finally, any parent Strand serve as a template for synthesis of additional copies, which leads to the formation of two identical DNA molecules.

DNA transcription


The genetic code is often called the "plan" because it contains the instructions for the cell to maintain its existence. Now we know that there are more than these simple instructions in the sequence of letters in the code of nucleotides, however. For example, a large amount of evidence indicates that this code is the basis for the production of various molecules as RNA and proteins. Studies have also shown that the instructions are stored in the DNA "to read" in two stages: transcription and translation. In the transcript of the double-stranded DNA template results in the single-stranded RNA molecules. In some cases, the RNA molecule itself is a "finished product", which performs an important function in the cell. Often, however, transcription of the RNA molecule followed by a translation step, which ultimately leads to the production of the protein molecule.

Vizualizing Transcript

Transcription of the process can be captured with an electron microscope, in fact, first observed this method in 1970. In electron microscopy, these principles, the DNA molecules act as "logs" with multiple RNA "branches" that goes beyond them. If DNase and RNase (enzymes that cause degradation of DNA and RNA, respectively) were added to the molecule, the use of DNase eliminated the stem structures, whereas the use of RNase branches disappear.

DsDNA, but only one strand serves as a template for transcription in a given time, the other is called non-coding Strand chain. In most organisms, DNA strand, which serves as a model for coding genes can be no other genes coding for the same chromosome.

Transcription process

Transcription begins when the enzyme RNA polymerase (RNA POL) is associated with the template DNA strand and begins to stimulate the production of complementary RNA. Big polymerase enzymes consist of about a dozen units, and when he is active in DNA, as a whole set of other factors. In many cases, these factors signal, which is the transcription of genes.

Three different types of RNA polymerases exist in eukaryotes, while bacteria have only one.Eukaryotic RNA-Paul and restores the genes encoding most of the ribosomal RNA (rRNA) and RNA-POL III resumes rRNA genes of one small addition to the transfer RNA that play a key role in the process of translation and other small regulatory RNA molecules. Thus, an RNA-POL II that restores RNA, which serve as templates for the production of protein molecules.

Transcription initiation

Picture 2The first step in transcription initiation of RNA-POL binds to DNA up (5 ') genes in a sequence called the promoter specialist. In bacteria, promoters often consist of three elements of the sequence, while the eukaryotes, there are up to seven points.

In prokaryotes, most genes is a sequence of TATAAT Pribnow box consensus sequence located at a distance of about ten base pairs of the object, which serves as the initiation of transcription. Not all Pribnow boxes, it is an exact sequence of nucleotides, these nucleotides are simply the most commonly found at each site. Despite the substitutions made every frame of this consensus, however, seems fairly closely. Many genes have TTGCCA consensus sequence at position 35 bases upstream from the starting point, and some of them, called up the item that the AT-rich region from 40 to 60 nucleotides upstream, which improves the rate of transcription (Fig. 2 ). In any case, binding, RNA-POL "core enzyme" binds to another subunit called sigma subunit to form holoezyme possible to relax the DNA double helix, in order to facilitate access to genetics. Sigma-subunit conveys specificity of a promoter of RNA polymerase, ie, responsible for talking RNA polymerase, where they join. There are a number of different parts of Sigma, which connect the various promoters and thereby contribute to the transformation of genes and outside conditions change.

Eukaryotic promoters are more complex than their prokaryotic counterparts, partly because eukaryotes, these three classes of RNA polymerase to write different sets of genes. Many of eukaryotic genes as Enhancer, which is located at a considerable distance from the genes they affect. Enhancer sequences of the leaders of the activation of genes by binding to activator protein and change the 3-D structure of DNA, to help "draw" RNA-POL II, thus regulating transcription. Since eukaryotic DNA is packed tightly as chromatin, transcription also requires a number of specialized proteins that help to make available the chain coding.

In eukaryotes, the "core" promoter of the gene transcribed by POL II is found most often immediately before (5 '), the initiation site of the gene. Most POL II genes have a box Dad (consensus sequence TATTAA) from 25 to 35 bases upstream from the start site, which influences the transcription rate and determines the location of the starting point. Eukaryotic RNA polymerase using a number of key factors (called general transcription factors), and one of them, TFIID recognizes the TATA box and ensures that you are using the correct starting point.Other cofactor TFIIB, recognizes different sequences is a general consensus G / CG / CG / CGCCC, about 38 to 32 bases upstream.

Dates are "strong" and "weak" is often used to describe the promoters and accessories, in accordance with its effects on transcription rates and, thus, in gene expression. Changing promoter strength may have harmful effects on cells, which often leads to disease. For example, some tumors contribute to the virus to transform cells through the inclusion of healthy strong promoters in close proximity to stimulate growth genes, and translocations in cancer cells, and some genes are "Disabled" in the proximity of strong promoters or amplifiers.

Enhancer sequences to do what its name implies, measures to improve the speed with which the transcription of genes and their consequences can be very powerful. Accessories may be thousands of nucleotides of the promoters with whom they interact, they put them in the immediate vicinity of the DNA cycle. This relationship is the result of interaction between the protein linked to the Enhancer and promoter linked. Proteins that facilitate this cycle is called activators, and those that impede called repressors.

Eukaryotic gene transcription polymerase I and III begins the same way, but the promoter and transcription activator protein sequence change.

Dates are "strong" and "weak" is often used to describe the promoters and accessories, in accordance with its effects on transcription rates and, thus, in gene expression. Changing promoter strength may have harmful effects on cells, which often leads to disease. For example, some tumors contribute to the virus to transform cells through the inclusion of healthy strong promoters in close proximity to stimulate growth genes, and translocations in cancer cells, and some genes are "Disabled" in the proximity of strong promoters or amplifiers.

Enhancer sequences to do what its name implies, measures to improve the speed with which the transcription of genes and their consequences can be very powerful. Accessories may be thousands of nucleotides of the promoters with whom they interact, they put them in the immediate vicinity of the DNA cycle. This relationship is the result of interaction between the protein linked to an amplifier and a promoter linked. Proteins that facilitate this cycle is called activators, and those that impede called repressors.

Eukaryotic gene transcription polymerase I and III begins the same way, but the promoter and transcription activator protein sequence change.

Strand extension

As soon as he initiates transcription, unwinds the double helix of DNA and RNA polymerase reads the template Strand adding nucleotides to the 3 'end of the growing chain. At a temperature of 37 degrees Celsius, the new nucleotides are added at a rate of about 15-20 amino acids in the second bacteria (Dennis & Bremer, 1974), while eukaryotes move much slower rate of about five-eight amino acids per second (Izban & Luse, 1992 ).

As soon as he initiates transcription, unwinds the double helix of DNA and RNA polymerase reads the template Strand adding nucleotides to the 3 'end of the growing chain. At a temperature of 37 degrees Celsius, the new nucleotides are added at a rate of about 15-20 amino acids in the second bacteria (Dennis & Bremer, 1974), while eukaryotes move much slower rate of about five-eight amino acids per second (Izban & Luse, 1992 ).



Transcription termination

Terminator sequences near the ends of coding sequences. Bacteria have two types of these sequences. Rho-independent terminator, an inverted repeat sequences are transcribed, it can be concluded at myself in the brackets, resulting in RNA POL pause and as a result of the release of transcripts. In addition, Rho-dependent terminators, to use a factor called Rho, which is actively unwinds DNA-RNA hybrid formed during transcription, which will release the new synthesis of RNA.

Eukaryotes, termination of transcription occurs by several processes, depending on the exact polymerase used. For Paul and gene transcription ceases to use the factor of termination, through a mechanism similar to Rho-dependent termination of bacteria. Transcription POL III gene transcription ends after the termination sequence, which includes a section polyuracil, through a mechanism similar to prokaryotic Rho independent termination. Termination POL II transcripts, however, is more complex.

Transcription POL II genes may persist for hundreds or even thousands of nucleotides after the coding sequence. RNA cleaved complex, which appears to be linked with the polymerase. The split seems, along with the termination of transcription and occurs in the consensus sequence.MRNA polyadenylated POL II above, in the end of 3 ', as a result of political (A) tail, this process should be the Division and coordinates its activities with the termination.

Both polyadenylation and termination of the use of the same sequence, consensus, and the interdependence of the processes demonstrated in the 1980's through the work of several groups. A group of scientists, working with the mouse globin genes showed that mutations introduced in the sequence aataaa consensus, as is known, necessary for sex (except for), inhibits both polyadenylation and transcription termination. We measured the degree of completeness of hybridization of transcripts from various fields (A) consensus sequence of mutants with wild-type transcripts and were able to see a decrease of hybridization signal, suggesting that the termination was proper inhibited. Therefore polyadenylation to the conclusion that it was necessary for the termination (Logan ET. Al, 1987). The second group Similar results were obtained with the help of the monkey virus SV40 (Simian Virus 40). There have been mutations in the field () site, resulting in mRNAs accumulate to a level much higher than the wild type (Connelly & Manley, 1988).

The exact relationship between the Division and the termination must be defined. The gap model assumes that this has caused the termination, suggest that the polymerase activity is affected when it passes through the consensus sequence at the cleavage site, possibly through changes in factors associated with transcriptional activation. Thus, research in the field of transcription prokaryotes and eukaryotes remains focused on unraveling the molecular details of this complex process, these data will allow us to better understand how gene transcription and silent.