Türkçe ansiklopedi, sözlük, genel başvuru ve bilgi sitesi   
 
  Yardım
  Rastgele    

Human Genome

Enlarge picture
A graphical representation of the normal human karyotype.
The human genome is the genome of Homo sapiens, which is composed of 24 distinct pairs of chromosomes (22 autosomal + X + Y) with a total of approximately 3 billion DNA base pairs containing an estimated 20,000–25,000 genes. [1] The Human Genome Project has produced a reference sequence of the euchromatic human genome, which is used worldwide in biomedical sciences. The human genome is much more gene-sparse than was initially predicted at the outset of the Human Genome Project, with only about 1.5% of the total length serving as protein-coding exons, with the rest of the genome comprised by RNA genes, regulatory sequences, introns and controversially so-called junk DNA.[2]

Features

Chromosomes

Enlarge picture
The human genome is composed of 23 pairs of chromosomes (46 in total), each of which contain hundreds of genes separated by intergenic regions. Intergenic regions may contain regulatory sequences and non-coding DNA.


There are 24 distinct human chromosomes: 22 autosomal chromosomes, plus the sex-determining X and Y chromosomes. Chromosomes 1–22 are numbered roughly in order of decreasing size. Somatic cells usually have one copy of chromosomes 1–22 from each parent, plus an X chromosome from the mother, and either an X or Y chromosome from the father, for a total of 46.

Genes

There are an estimated 20,000–25,000 human protein-coding genes . The estimate of the number of human genes has been repeatedly revised down from initial predictions of 100,000 or more as genome sequence quality and gene finding methods have improved, and could continue to drop further. [4]

Surprisingly, the number of human genes seems to be less than a factor of two greater than that of many much simpler organisms, such as the roundworm and the fruit fly. However, human cells make extensive use of alternative splicing to produce several different proteins from a single gene, and the human proteome is thought to be much larger than those of the aforementioned organisms.

Most human genes have multiple exons, and human introns are frequently much longer than the flanking exons.

Human genes are distributed unevenly across the chromosomes. Each chromosome contains various gene-rich and gene-poor regions, which seem to be correlated with chromosome bands and GC-content. The significance of these nonrandom patterns of gene density is not well understood.

In addition to protein coding genes, the human genome contains thousands of RNA genes, including tRNA, ribosomal RNA, microRNA, and other non-coding RNA genes.

Regulatory sequences

The human genome has many different regulatory sequences which are crucial to controlling gene expression. These are typically short sequences that appear near or within genes. A systematic understanding of these regulatory sequences and how they together act as a gene regulatory network is only beginning to emerge from computational, high-throughput expression and comparative genomics studies.

Identification of regulatory sequences relies in part on evolutionary conservation. The evolutionary branch between the human and mouse, for example, occurred 70–90 million years ago.[5] So computer comparisons of gene sequences that identify conserved non-coding sequences will be an indication of their importance in duties such as gene regulation. [6]

Another comparative genomic approach to locating regulatory sequences in humans is the gene sequencing of the puffer fish. These vertebrates have essentially the same genes and regulatory gene sequences as humans, but with only one-eighth the "junk" DNA. The compact DNA sequence of the puffer fish makes it much easier to locate the regulatory genes.[7]

Other DNA

Protein-coding sequences (specifically, coding exons) comprise less than 1.5% of the human genome.<ref name="IHSGC2001" /> Aside from genes and known regulatory sequences, the human genome contains vast regions of DNA the function of which, if any, remains unknown. These regions in fact comprise the vast majority, by some estimates 97%, of the human genome size. Much of this is comprised of:

repeat elements

transposons

pseudogenes

However, there is also a large amount of sequence that does not fall under any known classification.

Much of this sequence may be an evolutionary artifact that serves no present-day purpose, and these regions are sometimes collectively referred to as "junk" DNA. There are, however, a variety of emerging indications that many sequences within are likely to function in ways that are not fully understood. Recent experiments using microarrays have revealed that a substantial fraction of non-genic DNA is in fact transcribed into RNA,[8] which leads to the possibility that the resulting transcripts may have some unknown function. Also, the evolutionary conservation across the mammalian genomes of much more sequence than can be explained by protein-coding regions indicates that many, and perhaps most, functional elements in the genome remain unknown.[9] The investigation of the vast quantity of sequence information in the human genome whose function remains unknown is currently a major avenue of scientific inquiry. [10]

Variation

Most studies of human genetic variation have focused on single nucleotide polymorphisms (SNPs), which are substitutions in individual bases along a chromosome. Most analyses estimate that SNPs occur on average somewhere between every 1 in 100 and 1 in 1,000 base pairs in the euchromatic human genome, although they do not occur at a uniform density. Thus follows the popular statement that "we are all, regardless of race, genetically 99.9% the same", [11] although this would be somewhat qualified by most geneticists. For example, a much larger fraction of the genome is now thought to be involved in copy number variation. [12] A large-scale collaborative effort to catalog SNP variations in the human genome is being undertaken by the International HapMap Project.

The genomic loci and length of certain types of small repetitive sequences are highly variable from person to person, which is the basis of DNA fingerprinting and DNA paternity testing technologies. The heterochromatic portions of the human genome, which total several hundred million base pairs, are also thought to be quite variable within the human population (they are so repetitive and so long that they cannot be accurately sequenced with current technology). These regions contain few genes, and it is unclear whether any significant phenotypic effect results from typical variation in repeats or heterochromatin.

Most gross genomic mutations in germ cells probably result in inviable embryos; however, a number of human diseases are related to large-scale genomic abnormalities. Down syndrome, Turner Syndrome, and a number of other diseases result from nondisjunction of entire chromosomes. Cancer cells frequently have aneuploidy of chromosomes and chromosome arms, although a cause and effect relationship between aneuploidy and cancer has not been established.

Genetic disorders

For more details on this topic, see Genetic disorder.


These conditions are caused by abnormal expression of one or more genes that matches a clinical phenotype. The disorder may be caused by a gene mutation, an abnormal number of chromosomes, or triplet expansion repeat mutations. Defective genes can be inherited from the parents, in which case it is known as a hereditary disease. There are around 4,000 known genetic disorders, with the most common being cystic fibrosis.

Studies of genetic disorders is often performed by means of population genetics. Treatment is performed by a geneticist-physician trained in clinical genetics. The results of the Human Genome Project are likely to provide increased availability of genetic testing for gene-related disorders, and eventually improved treatment. Parents can be screened for hereditary conditions and counselled on the consequences, the probability it will be inherited, and how to avoid or ameliorate it in their offspring.

One major gross effect on human phenotypes derives from gene dosage, whose effects play a role in disorders caused by duplication, omission, or disruption of chromosomes. For example, those afflicted with Down syndrome, or trisomy 21, experience high rates of Alzheimer's disease, an effect thought to be related to the overexpression of the Alzheimer's-related amyloid precursor protein whose gene is located on chromosome 21.[13] By contrast, Down's syndrome sufferers experience lower rates of breast cancer, possibly due to the overexpression of a tumor-suppressor gene.[14]

Evolution

See also:  and
Comparative genomics studies of mammalian genomes suggest that approximately 5% of the human genome has been conserved by evolution since the divergence of those species approximately 200 million years ago, containing the vast majority of genes.[9][10] Intriguingly, since genes and known regulatory sequences probably comprise less than 2% of the genome, this suggests that there may be more unknown functional sequence than known functional sequence. A smaller, but large, fraction of human genes seem to be shared among most known vertebrates.

The chimpanzee genome is 95% identical to the human genome. On average, a typical human protein-coding gene differs from its chimpanzee ortholog by only two amino acid substitutions; nearly one third of human genes have exactly the same protein translation as their chimpanzee orthologs. A major difference between the two genomes is human chromosome 2, which is equivalent to a fusion product of chimpanzee chromosomes 12 and 13.[15]

Humans have undergone an extraordinary loss of olfactory receptor genes during our recent evolution, which explains our relatively crude sense of smell compared to most other mammals. Evolutionary evidence suggests that the emergence of color vision in humans and several other primate species has diminished the need for the sense of smell.[16]

Mitochondrial genome

The human mitochondrial genome, while usually not included when referring to the "human genome", is of tremendous interest to geneticists, since it undoubtedly plays a role in mitochondrial disease. It also sheds light on human evolution; for example, analysis of variation in the human mitochondrial genome has led to the postulation of a recent common ancestor for all humans on the maternal line of descent. (see Mitochondrial Eve)

Due to the lack of a system for checking for copying errors, Mitochondrial DNA (mtDNA) has a more rapid rate of variation than nuclear DNA. This 20-fold increase in the mutation rate allows mtDNA to be used for more accurate tracing of maternal ancestry. Studies of mtDNA in populations have allowed ancient migration paths to be traced, such as the migration of Native Americans from Siberia or Polynesians from southeastern Asia. It has also been used to show that there is no trace of Neanderthal DNA in the European gene mixture.[17]

Epigenome

See also:




A variety of features of the human genome that transcend its primary DNA sequence, such as chromatin packaging, histone modifications and DNA methylation, are important in regulating gene expression, genome replication and other cellular processes.[18][19] These "epigenetic" features are thought to be involved in cancer and other abnormalities, and some may be heritable across generations.

See also

References

1. ^ International Human Genome Sequencing Consortium (2004). "Finishing the euchromatic sequence of the human gspot.". Nature 431 (7011): 931-45. PMID 15496913.  [1]
2. ^ International Human Genome Sequencing Consortium (2001). "Initial sequencing and analysis of the human genome.". Nature 409 (6822): 860-921. PMID 11237011.  [2]
4. ^ Science 316 p 1113 25-May-2007, probably in the range 20,488-20,588. (note, this is a news article in Science magazine reporting on a conference presentation. It is not a peer-reviewed publication, and therefore its figures should not be considered "authoritative")
5. ^ Nei M, Xu P, Glazko G (2001). "Estimation of divergence times from multiprotein sequences for a few mammalian species and several distantly related organisms.". Proc Natl Acad Sci U S A 98 (5): 2497-502. PMID 11226267. 
6. ^ Loots G, Locksley R, Blankespoor C, Wang Z, Miller W, Rubin E, Frazer K (2000). "Identification of a coordinate regulator of interleukins 4, 13, and 5 by cross-species sequence comparisons.". Science 288 (5463): 136-40. PMID 10753117.  Summary
7. ^ Meunier, Monique. Genoscope and Whitehead announce a high sequence coverage of the Tetraodon nigroviridis genome (English). Genoscope. Retrieved on 2006-09-12.
8. ^ "...a tiling array with 5-nucleotide resolution that mapped transcription activity along 10 human chromosomes revealed that an average of 10% of the genome (compared to the 1 to 2% represented by bona fide exons) corresponds to polyadenylated transcripts, of which more than half do not overlap with known gene locations.Claverie J (2005). "Fewer genes, more noncoding RNA.". Science 309 (5740): 1529-30. PMID 16141064. 
9. ^ "...the proportion of small (50-100 bp) segments in the mammalian genome that is under (purifying) selection can be estimated to be about 5%. This proportion is much higher than can be explained by protein-coding sequences alone, implying that the genome contains many additional features (such as untranslated regions, regulatory elements, non-protein-coding genes, and chromosomal structural elements) under selection for biological function." Mouse Genome Sequencing Consortium (2002). "Initial sequencing and comparative analysis of the mouse genome.". Nature 420 (6915): 520-62. PMID 12466850. 
10. ^ The ENCODE Project Consortium (2007). ""Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project"". Natuer 447: 799-816. 
11. ^ from Bill Clinton's 2000 State of the Union address [3]
12. ^ [4]
13. ^ Armstrong R, Cairns N, Myers D, Smith C, Lantos P, Rossor M (1996). "A comparison of beta-amyloid deposition in the medial temporal lobe in sporadic Alzheimer's disease, Down's syndrome and normal elderly brains.". Neurodegeneration 5 (1): 35-41. PMID 8731380. 
14. ^ Kwak HI, Gustafson T, Metz RP, Laffin B, Schedin P, Porter WW. "Inhibition of breast cancer growth and invasion by single-minded 2s.". Carcinogenesis epub. PMID 16840439. 
15. ^ "Human chromosome 2 resulted from a fusion of two ancestral chromosomes that remained separate in the chimpanzee lineage" The Chimpanzee Sequencing and Analysis Consortium (2005). "Initial sequence of the chimpanzee genome and comparison with the human genome.". Nature 437 (7055): 69-87. PMID 16136131. 
"Large-scale sequencing of the chimpanzee genome is now imminent."Olson M, Varki A (2003). "Sequencing the chimpanzee genome: insights into human evolution and disease.". Nat Rev Genet 4 (1): 20-8. PMID 12509750. 
16. ^ "Our findings suggest that the deterioration of the olfactory repertoire occurred concomitant with the acquisition of full trichromatic color vision in primates." Gilad Y, Wiebe V, Przeworski M, Lancet D, Pääbo S (2004). "Loss of olfactory receptor genes coincides with the acquisition of full trichromatic vision in primates.". PLoS Biol 2 (1): E5. PMID 14737185. 
17. ^ Sykes, Bryan (2003-10-09). Mitochondrial DNA and human history (English). The Human Genome. Retrieved on 2006-09-19.
18. ^ [5]
19. ^ [6]

External links

In biology the genome of an organism is its whole hereditary information and is encoded in the DNA (or, for some viruses, RNA). This includes both the genes and the non-coding sequences of the DNA.
..... Click the link for more information.
Editing of this page by unregistered or newly registered users is currently disabled due to vandalism.
If you are prevented from editing this page, and you wish to make a change, please discuss changes on the talk page, request unprotection, log in, or .
..... Click the link for more information.
Figure 1: A representation of a condensed eukaryotic chromosome, as seen during cell division.]] A chromosome is a single large macromolecule of DNA, and constitutes a physically organized form of DNA in a cell.
..... Click the link for more information.
An autosome is a non-sex chromosome. It is an ordinarily pairedIn the case of higher ploidy levels than the usual diploid, there will be the same number of an autosome as the ploidy level itself. For example, in a pentaploid, there will be five copies of each autosome.
..... Click the link for more information.
The X chromosome is one of the two sex-determining chromosomes in many animal species, including mammals (the other is the Y chromosome). It is a part of the XY sex-determination system and X0 sex-determination system.
..... Click the link for more information.
The Y chromosome is the sex-determining chromosome in humans and most other mammals. In mammals, it contains the gene SRY, which triggers testis development, thus determining sex.

Overview

Most mammals have one pair of sex chromosomes in each cell.
..... Click the link for more information.
Editing of this page by unregistered or newly registered users is currently disabled due to vandalism.
If you are prevented from editing this page, and you wish to make a change, please discuss changes on the talk page, request unprotection, log in, or .
..... Click the link for more information.
In molecular biology, two nucleotides on opposite complementary DNA or RNA strands that are connected via hydrogen bonds are called a base pair (often abbreviated bp).
..... Click the link for more information.
A gene is a locatable region of genomic sequence, corresponding to a unit of inheritance, which is associated with regulatory regions, transcribed regions and/or other functional sequence regions.
..... Click the link for more information.

..... Click the link for more information.
Euchromatin is a lightly packed form of chromatin that is rich in gene concentration, and is often (but not always) under active transcription. Unlike heterochromatin, it is found in both eukaryotes and prokaryotes.
..... Click the link for more information.
Health science is the applied science dealing with health, and it includes many subdisciplines. See also health science academic disciplines.

There are two approaches to health science: the study and research of the human body and health-related issues to understand
..... Click the link for more information.
Proteins are large organic compounds made of amino acids arranged in a linear chain and joined together by peptide bonds between the carboxyl and amino groups of adjacent amino acid residues.
..... Click the link for more information.
An exon is any region of DNA within a gene that is transcribed to the final messenger RNA (mRNA) molecule, rather than being spliced out from the transcribed RNA molecule. Exons of many eukaryotic genes interleave with segments of non-coding DNA (introns).
..... Click the link for more information.
A non-coding RNA (ncRNA) is any RNA molecule that is not translated into a protein. A previously used synonym, particularly with bacteria, was small RNA (sRNA). However, some ncRNAs are very large (e.g. Xist).
..... Click the link for more information.
A regulatory sequence (also called regulatory region or ~ element) is a promoter, enhancer or other segment of DNA where regulatory proteins such as transcription factors bind preferentially. They control gene expression and thus protein expression.
..... Click the link for more information.
Introns, derived from the term "Intervening Sequences", are non-coding sections of DNA. Once this DNA section has been transcribed as a pre-mRNA sequence, the introns will be spliced out, then the mRNA will be translated into a protein.
..... Click the link for more information.
In molecular biology, "junk" DNA is a collective label for the portions of the DNA sequence of a chromosome or a genome for which no function has yet been identified. About 80-90% of the human genome has been designated as "junk", including most sequences within introns and most
..... Click the link for more information.
Figure 1: A representation of a condensed eukaryotic chromosome, as seen during cell division.]] A chromosome is a single large macromolecule of DNA, and constitutes a physically organized form of DNA in a cell.
..... Click the link for more information.
An autosome is a non-sex chromosome. It is an ordinarily pairedIn the case of higher ploidy levels than the usual diploid, there will be the same number of an autosome as the ploidy level itself. For example, in a pentaploid, there will be five copies of each autosome.
..... Click the link for more information.
The XY sex-determination system is the sex-determination system found in humans, most other mammals, some insects (Drosophila) and some plants (Ginkgo). In the XY sex-determination system, females have two of the same kind of sex chromosome (XX), and are called
..... Click the link for more information.
The X chromosome is one of the two sex-determining chromosomes in many animal species, including mammals (the other is the Y chromosome). It is a part of the XY sex-determination system and X0 sex-determination system.
..... Click the link for more information.
The Y chromosome is the sex-determining chromosome in humans and most other mammals. In mammals, it contains the gene SRY, which triggers testis development, thus determining sex.

Overview

Most mammals have one pair of sex chromosomes in each cell.
..... Click the link for more information.
A somatic cell is generally taken to mean any cell forming the body of an organism: the word "somatic" is derived from the Greek word sōma (σώμα), meaning "body". Somatic cells, by definition, are not germline cells.
..... Click the link for more information.
A gene is a locatable region of genomic sequence, corresponding to a unit of inheritance, which is associated with regulatory regions, transcribed regions and/or other functional sequence regions.
..... Click the link for more information.
Gene finding typically refers to the area of computational biology that is concerned with algorithmically identifying stretches of sequence, usually genomic DNA, that are biologically functional.
..... Click the link for more information.
elegans

Binomial name
Caenorhabditis elegans
Maupas, 1900

Caenorhabditis elegans (IPA:
..... Click the link for more information.
D. melanogaster

Binomial name
Drosophila melanogaster
Meigen, 1830[1]

Drosophila melanogaster (from the Greek for black-bellied dew-lover
..... Click the link for more information.
Alternative splicing is the RNA splicing variation mechanism in which the exons of the primary gene transcript, the pre-mRNA, are separated and reconnected so as to produce alternative ribonucleotide arrangements.
..... Click the link for more information.
The term proteome was coined by Mark Wilkins first in 1994 in the symposium: "2D Electrophoresis: from protein maps to genomes" in Siena, Italy, and was subsequently published in 1995 (1) , which was part of his PhD thesis.
..... Click the link for more information.


This article is copied from an article on Wikipedia.org - the free encyclopedia created and edited by online user community. The text was not checked or edited by anyone on our staff. Although the vast majority of the wikipedia encyclopedia articles provide accurate and timely information please do not assume the accuracy of any particular article. This article is distributed under the terms of GNU Free Documentation License.