You are here Glossary homepage/Search > Applications > Genomics
 
Genomics glossary
Evolving terminology for emerging technologies
Comments? Revisions? Questions? mchitty@healthtech.com
Last revised December 26, 2001 
View a Printer-Friendly Version of this Web Page!
The strength of genomic studies lies in the global comparisons between biological systems rather than detailed examination of single genes or proteins. Genomic information is often misused when applied exclusively to individual genes. If one is interested only in one particular genes, there are many more conclusive experiments that should be consulted before using the results from genomic datasets.  Therefore, genomic data should not be used in lieu of traditional biochemistry, but as an initial guidelines to identify areas for deeper investigation and to see how those results fit in with the rest of the genome.  Moreover, most genomics datasets give relative rather than absolute information, which means that information about a single gene has little meaning in isolation. [Dov Greenbaum, Mark Gerstein et. al. "Interrelating Different Types of  Genomic Data" Dept. of Biochemistry and Molecular Biology, Yale Univ. 2001] http://bioinfo.mbb.yale.edu/e-print/omes-genomeres/text.pdf.

Basic genetics & genomics starts with a brief explanation of differences between genetics and genomics.

Genetic Variations  is closely intertwined with this glossary. Other related glossaries are Applications Functional Genomics, Sequencing, Biology Gene DefinitionsMaps Sequences, DNA & beyond, and ultimately, Proteins, Protein structure.  Additional references to organizations appear in the In-depth glossary, after the Bibliography.

agricultural genomics: See crop genomics

annotation: Bioinformatics glossary

applied genomics: Applications are at varying stages of development and include clinical genomics, pharmacogenomics, plant and animal breeding, drug discovery and development and microbial genomics.

biochemical genomics: Functional genomics glossary  

biocomplexity: The investigation of complex adaptive systems in all disciplines of biology. The ubiquity in nature of complex non- linear systems has already driven the search for unifying mathematical principles that can be applied to the different branches of life sciences. This effort will be of foremost importance for developing a fundamental understanding of life, and it will have great integrative power because the very same tools will be applicable to a range of problems and disciplines. The study of biocomplexity is (1) driven by theory and quantitative modeling that is explicitly linked to empirical studies and experiments, (2) interdisciplinary in nature within the life sciences, and (3) will engage researchers in bioinformatics, mathematics, physics, and computer science. We are making progress in understanding complex biological systems, but it requires a strong departure from the more familiar reductionist approach - this approach examines the parts while assuming that the behavior of the whole is simply the sum of their individual actions. To understand the whole one must understand the complex integration of these interactions, and in this sense the whole is more than the sum of the  parts. [Office of the President, Univ. of Michigan "Challenges and Opportunities in Understanding the Complexity of Living Systems: The Biocomplexity Initiative, 2000]  http://www.umich.edu/pres/committees/commreport/Biocomplexity_overview.html  Related terms complex, complexity

chemical genomics, chemogenomics: Drug discovery and development glossary.

combinatorial genomics: Similar in concept to required for recognition of all items in long- term memory combinatorial chemistry, this technology shuffles portions of genes to give a vast number of new combinations, which are then screened for a desired function. Enhanced function, or even new functions, can be generated by repeating the cycle circuits many times to "evolve" optimized recombinants in vitro. [Department of Defense Critical  Technologies Part III: Developing Critical Technologies Section 3: Biological Technology, July 1999]  http://www.dtic.mil/mctl/data3/p3sec03.pdf

comparative genomics: Functional genomics glossary.

complex: It has become common to use complicated and complex interchangeably … The essence of ‘complicated’ is hard to figure out. ..Complex, on the other hand is a term reserved for systems that display properties that are not predictable from a complete description of their components, and that are generally considered to be qualitatively different from the sum of their parts. [Editorial, "Complicated is not complex" Nature Biotechnology 17: 511 June 1999] Would it be fair to say that Mendelian genetics is linear, while genomics and polygenic diseases/traits are nonlinear?

According to the Oxford English Dictionary one of the meanings of complicated is complex, though it also means not easy to unravel or separate. Both complex and complicated are contrasted with simple. Whatever the original senses of these two words, the above distinction seems a useful one now. Related term complexity. Narrower terms complex diseases, complex genomes; In-depth complex phenotypes, complex traits.

complex diseases: Diseases characterized by risk to relatives of an affected individual which is greater than the incidence of the disorder in the population. [NHLBI] Are complex diseases essentially the same as polygenic diseases?

Genome scans for other complex diseases have met with limited success, in part because of the difficulty in detecting the small individual contribution to phenotype made by many different genes. Recent theoretical work suggests that this problem may be circumvented by performing association tests with a large number of markers (on the order of 30,000) spread across the genome. To make such a large number of tests feasible, one needs markers that can be scored without resorting to gel- based techniques, and this has led to intense interest in developing methods to identify and detect single- nucleotide polymorphism markers or SNPs. SNPs are typically biallelic markers that differ from each other by one nucleotide (this difference can be a transition, transversion or even a deletion). There are several techniques available to score SNPs, and we are exploring the use of the 5' nuclease- detection system or TaqMan to score SNPs. ["Human genetics" David Botstein Lab, Stanford Univ.] http://genome-www.stanford.edu/group/botlab/humans.html

Related terms Genetic variations glossary Omes & omics phenome, phenomics

complex genomes: Is there a specific definition of complex genomes?  Or is it a more general category (beyond viral, bacterial,  microbial?)

complex phenotypes: Those that exhibit familial clustering, which suggests at least some genetic component, but they do not occur in Mendelian proportions in pedigrees. Such phenotypes may have relatively simple underlying genetics, but their transmission appears complex because of its context, including interactions with other loci and with the environment. Alternatively, complex phenotypes may have multiple genetic and environmental causes. Most chronic, common diseases are complex by this definition. Complex phenotypes may be continuous in distribution, like height or blood pressure, or they may be dichotomous, like affected and not affected. The complexity arises from the fact that many genetic and environmental factors may interact with each other in unpredictable ways, such that the association between the phenotype and any single factor taken by itself may be imperceptible. Nonlinear interactions, including epistasis and genotype by environment interactions, mean that the expression of the phenotype may not accurately predicted from knowledge of the individual effects of each of the component factors considered alone, no matter how well understood the separate components may be. 

Genetic Architecture of Complex Phenotypes Initiative, NIGMS, NIH, US (part of its Initiative Supporting Quantitative Approaches to Complex Biological Problems, June 9,1998] http://www.nigms.nih.gov/news/announcements/complexity.html

complex trait: Has a genetic component that is not strictly Mendelian (dominant, recessive, or sex linked) and may involve the interaction of two or more genes to produce a phenotype, or may involve gene environment interactions." [NHLBI] Related term genetic architecture Genetic variations glossary

complexity:: Currently there are more than 30 different mathematical descriptions of complexity. However we have yet to understand the mathematical dependency relating the number of genes with organism complexity. [J. Craig Venter et. al. "The sequence of the Human Genome" Science 291 (5507): 1347, Feb. 16, 2001] Related term biocomplexity, complex.

crop genomics: Agricultural biotechnology continues to benefit from the insights being developed during the era of the Human Genome Project. The proliferation of mapping, genotyping, and diagnostic methodologies has rapidly expanded the analytical tools available to crop scientists for the analysis and utilization of plant genomes. Related terms  environmental genomics;  Model & other organisms glossary Arabidopsis

ARS [Agricultural Research Service] Genome Database, Cornell Univ. US http://ars-genome.cornell.edu/

FAO Glossary of Biotechnology and Genetic Engineering, Food and Agricultural Organization, United Nations, 2000.  About 5000 terms. http://www.fao.org/DOCREP/003/X3910E/X3910E00.HTM#Toc

DNA Forensics: DNA evidence takes an ever- increasing profile in criminal investigation and trials. The goal of this program is to provide an integrated examination of evidence available from DNA and the prospects for further enhancements in technology, databases, and procedures that may address issues of current controversy. DNA Forensics: Profiling PCR June 27- 28, 2002 Washington DC  

The primary forensic technology is Gene amplification & PCRRelated terms Genetic variations glossary DNA fingerprinting, markers, microsatellites, VNTR.

DNA forensics, Oak Ridge National Laboratory, US http://www.ornl.gov/hgmis/elsi/forensics.html

DNA fingerprinting - Recombinant DNA chapter of MIT's Biology Hypertextbook http://esg-www.mit.edu:8001/esgbio/rdna/rdnadir.html

deductive genomics: Functional Genomics glossary.

environmental genomics: This programme will apply genomics to the natural environment, using sequence data to advance and test evolutionary and ecological theory, and so provide a better understanding of ecosystem function in the context of biodiversity (intraspecific genetic variation, species richness and perceived redundancy). 

The programme aims to advance our understanding of how organisms perceive change (at the molecular and genetic level) and make functional responses within their local environment, and to what extent variation in these responses is adaptive. These measures may be short- term adaptations, quorum sensing, regulated gene expression, survival, reproduction etc. Alternatively, they may be longer- term adaptations in respect to horizontal gene flow, community diversity, species abundance and the evolution of speciation. Genomics also allows investigation of adaptation at the level of individual versus community diversity. Current advances in molecular techniques allow the development of predictive studies of evolutionary biology and ecology. As we enter the post- genomics era, sequence data and technological advances in genomics will facilitate the direct interrogation of habitat diversity and specific cellular activity, and in particular patterns of gene expression in different habitats and in response to environmental change. [Natural Environmental Research Council UK, Environmental Genomics Programme Summary 2001] http://www.nerc.ac.uk/funding/thematics/envgen/

evolutionary genomics: Functional Genomics glossary.

forensics: See DNA forensics

function, functional genomics:  Functional Genomics glossary.

gene: Gene definitions

genetic architecture: Genetic variations glossary

genetic variations, human: Genetic variations glossary

genetics: Clinical genomics  See also Basic genetics & genomics

genome: The complete set of chromosomal and extrachromosomal genes of an organism, a cell, an organelle or a virus; the complete DNA component of an organism. [IUPAC Biotech]

The fundamental concepts of genome, genotype and phenotype are not defined in a satisfactory manner within the biological literature. Not only are there inconsistencies in usage between various authors, but even individual authors do not use these concepts in a consistent manner within their own writings. We have found at least five different notions of genome, seven of genotype, and five of phenotype current in the literature. Our goal is to clarify this situation by (a) defining clearly and precisely the notions of genetic complement, genome, genotype, phenetic complement, and phenotype; (b) examining that of phenome; and (c) analysing the logical structure of this family of concepts. [M. Mahner, M. Kary "What exactly are genomes, genotypes and phenotypes? And what about phenomes?" Journal of  Theoretical Biology 186 (1): 55- 63, May 1997]

All the DNA contained in an organism or a cell, which includes both the chromosomes within the nucleus and the DNA in mitochondria. [NHGRI] Size expressed by the number of base pairs. [DOE].

Narrower terms: Gene Definitions chromosomal genome, mitochondrial genome

First used by H. Winkler in 1920, was created by elision of the words GENes and chromosOMEs, and that is what the term signifies: the complete set of chromosomes and their genes. [V McKusick "Genomics: Structural and Functional studies of genomes" Genomics 45:244-249 Oct. 15 1997]  

Genome databases: See Databases & software directory.

genomic DNA: The entire length of DNA, including non coding regions. (Sometimes called gDNA.) [CHI Bioinformatics]

DNA which includes exons and introns, coding and noncoding regions. Compare with cDNA.  Gene definitions 

genomic data: [Genomic datasets] are best used to identify "outlier" genes that are particularly highly expressed or have especially many interactions rather than to focus on the individual measurements for a particular gene. A gene that makes a particularly large number of interactions may indicate that it is a key component of the cell. One numerical technique that is particularly useful with regard to dealing with this information is expressing results through ranks - i.e. not giving the number of interactions of a particular gene product, but how it ranks when compared with others.  Furthermore, it provides a powerful way to combine many different heterogeneous sources of information into a common and statistically robust numerical framework (Gerstein & Levitt 1997; Gerstein & Hegyi 1998; Qian et. al. 2001). ... Many websites focus on providing a lot of information for a single gene sequence or protein, in a "non- genomic" fashion. Rather, such sites should be designed to simultaneously display and manipulation large populations of genes. In the absence of such an 'omic interface, it is important that information resources at least accommodate bulk downloading of standardized data.  [Dov Greenbaum, Mark Gerstein et. al. "Interrelating Different Types of  Genomic Data" Dept. of Biochemistry and Molecular Biology, Yale Univ. 2001] http://bioinfo.mbb.yale.edu/e-print/omes-genomeres/text.pdf.

Related terms: Expression glossary; Omes & Omics glossary interactome; Proteomics glossary

genomics: Generation of information about living things by systematic approaches that can be performed on an industrial scale. [Roger Brent "Genomic biology" Cell 100: 169-183 Jan 2, 2000]  http://www.molsci.org/htdocs/publications/omicbio/brentcell.html

If there were any questions previously, the publication of the draft sequence for the human genome has made it very clear that genomics is going to play an increasingly critical role in the development of pharmaceuticals in the years to come. Such specialties as bioinformatics, gene expression monitoring, functional genomics, proteomics, and pharmacogenomics will all need to be melded into efficient strategies to accelerate drug discovery and development. These tools are being brought to bear on the key applied tasks of target identification and validation, as well as clinical evaluation of compounds being developed. Impact of Genomics on Medicine  May 13- 14, 2002, Munich, Germany

Q: After combining all the biological and medical data resulting from different technologies, you end up with much more than the raw genetic sequence. Do we call the sum of these multiple data types genomics, or is there a better word for it? Dr. [Charles] Cantor: People have been inventing terms like "functional genomics," "physiological genomics," and "physiomics." Genomics itself is just a parts list: A car has four tires, two axles, and so on. How well you can predict what will go wrong, knowing the parts list, is arguable. It has some predictive value, but it doesn't have fully predictive value. We need to integrate the parts list with all the other information. To me, that integration is not genomics, it's medicine. It's biology.  [Incyte Genomics "Inside Genomics Archives" Interview with Charles Cantor, 2001] http://www.incyte.com/insidegenomics/int/int/int_int_0003/int_int_0003_8.shtml

Coined by Thomas H. Roderick [of the Jackson Laboratory, Maine, US] in 1986 in Bethesda, MD during a discussion of a name for a planned new journal (Genomics) that was to include sequencing data, discovery of new genes, gene mapping, and new genetic technologies. According to Roderick, the term genomics "also had the comparative aspect of genomes of various species, their evolution, and how they related to each other. Although we didn’t come up with the term ‘functional genomics we thought of the genome as a functioning whole beyond just single genes of sequences spread around a chromosome." [B Kuska "Beer, Bethesda, and Biology" JNCI 90(2): 93 Jan 21, 1998]

Although I haven't found any references to "genomics" prior to 1987, "genomic" is easily found in Medline from 1966 on, and probably could be located in journals earlier than that.

genomics- commercialization: Commercialization of Genomics report, August 2001 Cambridge Healthtech Institute http://www.chireports.com/upcoming-reports.asp

genotype: The genetic constitution of an organism as revealed by genetic or molecular analysis, i.e. the complete set of genes, both dominant and recessive, possessed by a particular cell or organism. [IUPAC Biotech]

The observed alleles at a genetic locus for an individual. [NHLBI] 

An organism’s genetic makeup, as revealed through molecular analysis. [CHI SNPs Update]

May refer to identifying one or more, up to the entire gene sequence of an organism. Compare phenotype.

genotyping: The determination of relevant nucleotide- base sequences in each of the two parental chromosomes. [CHI SNPs Update]

genotyping technologies: Sequencing glossary

haplotype: A particular pattern of sequential SNPs found on a single chromosome. These SNPs tend to be inherited together over time and can serve as disease- gene markers. The examination of single chromosome sets (haploid sets), as opposed to the usual chromosome pairings (diploid sets), is important because mutations in one copy of a chromosome pair can be masked by normal sequences present on the other copy. [CHI SNP Update]

The linear, ordered arrangement of alleles on a chromosome.  Haplotype analysis is useful in identifying recombination events. [NHLBI]  

The resulting genotype for a chromosomal locus constructed by combining the allele assignments from two or more genetically linked allele systems. Each individual will have two haplotypes for the 'megalocus', representing the markers on the two homologous chromosomes, and the haplotypes will segregate in a pedigree following Mendelian inheritance. [C. Helms, Washington Univ. St. Louis, US "Haplotyping" Nov. 1990] http://hdklab.wustl.edu/lab_manual/14/14_4.html

From  “haploid genotype.”  

Related terms  Cell biology glossary diploid, haploid, ploidy; Sequencing glossary haplotyping, haplotyping technologies. Narrower term: Genetic variations glossary SNP haplotype

high throughput genomics: Genomic studies are now approaching "industrial" speed and scale, thanks to advances in gene sequencing and the increasing availability of high- throughput methods for studying genes, the proteins they encode, and the pathways in which they are involved. Data can now be acquired on many genes at once, either sequentially or in parallel. It is also possible to expand the range of genomic effects being examined. The abundance of new data available means that more targets should now be identifiable, but the key to finding such new targets will be to use the right combination of technologies and have them ideally integrated. [CHI High Throughput Genomics] Related terms Bioinformatics high throughput Drug discovery & development target validation.

Human Genome Project HGP : An international effort formally begun in October 1990. The project was planned to last 15 years, but rapid technological advances have accelerated the expected completion date to 2003. Project goals are to discover all the approximate 30,000 human genes (the human genome) and make them accessible for further biological study ..  Another project goal is to determine the complete sequence of the 3 billion DNA bases in the human genome. As part of the HGP, parallel studies are being carried out on selected model organisms such as the bacterium E. coli to help develop the technology and interpret human gene function. The Department of Energy's Human Genome Program and the National Institutes of Health's National Human Genome Research Institute (NHGRI) together make up the U.S. Human Genome Project. [Oak Ridge National Lab, HGP "FAQ"  About the Human Genome Project 2001]  http://www.ornl.gov/hgmis/faq/faqs1.html  http://www.ornl.gov/hgmis/project/about.html

The total human genome, contained in a set of 23 chromosomes, is now estimated to contain 3,164.7 million letters (or nucleotides). In humans the actual part of the genome that codes for proteins makes up less than 2% of the genome Compared with the organisms whose genomes have been sequenced before, humans have a particular abundance of proteins involved in cell structure, defense and immunity, DNA copying, the synthesis of RNA and proteins, and communication between cells. Humans have an unusually high number of complex proteins that fit into more than one functional category and many proteins that are embedded in the surface of cells. [NHGRI "Summary of the Initial Sequencing and Analysis of the Human Genome" press release, Feb. 11, 2001] http://www.nhgri.nih.gov/NEWS/summary_of_sequence.html

Horace Freeland Judson writes in "Talking about the genome" (Nature 409:769, 15 Feb. 2001) "The language we use about genetics and the genome project at times limits and distorts our own understanding, and the public understanding. Look at the phrase - or marketing slogan - 'the human-genome project'. In reality, of course we have not just one human genome but billions. ... Then, too, the entire phrase - the human- genome project: singular, definite, with a fixed end- point, completed by 2000, packaged so it could be sold to legislative bodies, to the people, to venture capitalists. But we knew from the start the genome project would never be complete. The maps, or the sequences, are just the start of many lines of research, polyphiloprogenitive, multiply multiple genome projects."

The human genome sequencing effort has enabled researchers to search for genes that encode the proteins that run our cellular machinery and have a role in disease progression in order to identify targets for therapeutic intervention. The emphasis is now shifting from identifying to validating and prioritizing targets and selecting compounds directed against those targets. This conference will explore the world of information technology that is impacting genomics and the life- science industry as a whole, as well as the latest efforts to create operating systems for biology.  Human Genome Discovery: Commercial Implications Feb. 25-27, 2002, Santa Clara, CA

Related terms In-depth DDBJ, DOE, EMBL, GenBank, NCBI, NHGRI, RIKEN, Sanger Centre; Sequencing glossary  resequencing

Human Genome Project Links
International Human Genome Sequencing Consortium special issue: Nature 409 (6822) 15 Feb., 2001   http://www.nature.com/nature/journal/v409/n6822/ http://www.nature.com/genomics/human/papers/analysis.html

Celera Genomics special issue: Science 291 (5507) Feb. 16, 2001 http://www.sciencemag.org/content/vol291/issue5507/index.shtml

Human Genome Project Working Draft Univ. of California Santa Cruz, US, http://genome.ucsc.edu/

GenomeWeb, Medical Research Council, UK.   http://www.hgmp.mrc.ac.uk/GenomeWeb/

Human Genome Project Dept. of Energy (DOE), Oak Ridge National Lab, US http://www.ornl.gov/hgmis/

Human Genome Central, Sanger Centre, UK  http://www.ensembl.org/genome/central/

Human Genome Project, NHGRI, NIH, US http://www.nhgri.nih.gov/HGP/

DOE’s Primer on Molecular Genetics is a useful introduction to the HGP,  the concepts and technologies underlying genomics. http://www.ornl.gov/hgmis/publicat/primer/intro.html

industrial genomics: Related term: high throughput genomics

integrative genomics: The journal Functional & Integrative Genomics is devoted to large- scale studies of genomes and their functions, including systems analyses of biological processes. Topics covered include: – Whole genome analysis, – Comparative genomics, – Genetic variation, – Human disease genes and loci, – Proteomics, – Bioinformatics, – Expression profiling, – Behavioral genomics, – Structural genomics, – Chemical genomics, – Pharmacogenomics, – Integrative genomics, – Large- scale analysis of biological systems. [Springer website, Functional & Integrative Genomics - Aims & scope, 1999] http://link.springer.de/link/service/journals/10142/aims.htm

Mendelian genetics: Classical genetics, focuses on monogenic genes with high penetrance, the tip of the iceberg of genetics. It is useful to remember that Mendelian genetics itself was a true paradigm shift, and not at all intuitively obvious.

This is poignantly described in Robin Henig's artfully crafted biography of Gregor Mendel The Monk in the Garden. Mendel was not recognized by scientists until 1900 --  35 years after his initial publication and 16 years after his death.  Those who heard his talks did not seem to understand them.  Some of the reprints he sent out have vanished.  Others were found (years later) with leaves uncut and unread.

It is also instructive to remember that Mendelian genetics were quite applicable to the breeding of plants and animals (including racehorses) with serious economic implications. This may well have encouraged Mendel's superiors to let him pursue his work with peas.

Mendelian genetics from MIT's Biology Hypertextbook http://esg-www.mit.edu:8001/esgbio/mg/mgdir.html

microbial genomics: Includes microbial sequencing projects and post- sequencing, functional  genomic projects, plant or animal pathogens, environmentally interesting microbes, and related technology development. [Interagency Report on the Federal Investment in Microbial Genomics, White House, 2000] http://www.ostp.gov/html/microbial/start.htm   

Related terms In-depth Beowulf Genomics; Omes & omics glossary microbiome

nanogenomics: The overall objective of this meeting is to bring together investigators who have a common interest in the development and use of nanotechnology for understanding genome organization and function. This meeting provides a unique opportunity for established investigators and students to interact and exchange ideas regarding the use of new and developing technologies to study the nanostructural aspects of the genome, nuclear architecture and biomolecular machinery required for genome- scale management of gene expression. This meeting is particularly relevant in light of the availability of complete model genome sequence data and the implications that sequence elements, both known and unknown, may have on genome structure and organization within the nucleus. It is especially important that scientific communication keep pace with discovery, particularly with respect to advances across disciplines which may have profound impact on the pace of breakthrough discovery. [Jackson Laboratory, US Genomics Meets Nanoscience, October 9 - 12, 2001, Bar Harbor Maine] http://www.jax.org/courses/documents/nano_0901.html

network genomics: The information provided by completely sequenced genomes can yield insights into the multi- level organization of organisms and their evolution. At the lowest level of molecular organization individual enzymes are formed, often through assembly of multiple polypeptides, and at a higher level, sets of enzymes group into metabolic networks. Such context information of metabolic networks combined with genomic context on co- occurrence of genes, fusion of genes or gene- order are powerful approaches in prediction of new functional features and analysis. [Christian Forst "On Network Genomics" CHI's Functional Genomics Oct. 9-10, 2001, Cambridge MA  http://www.functionalgenomics2001.com/fgen.htm

penetrance: The probability of expressing a phenotype given a genotype. Penetrance is described as either "complete" or "incomplete" … .Penetrance may also be dependent on a susceptible individual’s current age… incomplete penetrance is usually a matter of chance or modifiers in the genetic background. [NHLBI] 

Mendelian genetics focuses on genes with high penetrance. These were the easiest genes to identify. Related terms Genetic variations glossary.

phenotype: The observable structural and functional characteristics of an organism determined by its genotype and modulated by its environment. [IUPAC Biotech]

The observed manifestation of a genotype, which may be expressed physically, biochemically or physiologically. [NHLBI]

Refers to all biological consequences from the presence of the mutation in question. Phenotype is logically the subject of functional genomics. In its broadest definition, phenotype includes phenomena at all organizational levels of biology, and is defined as the consequence of mutations in one or more genes relative to an non- mutated or wild type genotype in a given organism. At the molecular level, for example, phenotype includes all temporal and spatial aspects of gene expression as well as related aspects of the expression, structure, function and spatial localization of proteins. The latter has acquired the name "proteomics"   [Univ. of California - Davis "What is Genomics", c. 1999] http://genomics.ucdavis.edu/what.html#compare

Systematic collection and organization of data on phenotypes is still at an early stage.  The International Mouse Mutagenesis Consortium (IMMC), writing in the Human Genome issue of Science notes that improved phenotyping technologies are needed, as are "more efficient and reliable methods for archiving, managing, analyzing, displaying and disseminating the complex phenotype data sets resulting from mutagenesis programs, and that there are no large-scale phenotype databases. Phenotype vocabularies seem to be in the works and a Mouse Phenome Project is based at Jackson Labs, US.  [IMMC JH Nadeau et. al "Functional Annotation of Mouse Genome Sequences" Science 291: 1251-1255 Feb. 16, 2001] 

See note on variant meanings for phenotype, genome and genotype under genome, Journal of Theoretical Biology 1997 article.

Phenotype databases See Databases & software directory

Compare genotype. Related terms genetic architecture; Microarrays glossary phenotype array; Omes & omics glossary phenome, phenomics; Drug discovery & development glossary phenotypic screening; Functional genomics phenotypic profiling; Model & other organisms glossary

phenotypic screen: See under chemical genomics Drug discovery & development glossary

phenotyping: A test that measures some aspect of an organism's functions, for example, the amount of a certain drug needed to inhibit the growth of HIV in a test- tube culture. If HIV has developed resistance to a certain drug, higher- than- normal amounts of that drug will be necessary to inhibit viral activity. Phenotyping assays for HIV test the effect of one drug at a time on the virus in vitro. Results may not correspond to what happens in vivo where the virus is exposed to a combination of antiretroviral agents because marked patient- to- patient variability in the concentrations of the agents may be present and factors such as drugs binding to blood proteins make direct comparison of the in vitro phenotype result and in vivo response incompatible. Phenotyping testing results are expressed as IC50 values [Inhibitory Concentration 50]  the concentration of drug required to reduce the virus' growth in cell culture by 50%. [Visible Genetics NewsRoom Press Kit Glossary of  Terms]  http://www.visgen.com/News_Room/Press_Kit/glossary_of_terms.shtm

phylogenomics: Functional Genomics glossary.

physiological genomics: Functional Genomics glossary.

polygenic: Genetic disorder resulting from the combined action of alleles of more than one gene (e.g. heart disease, diabetes and some cancers). Although such disorders are inherited, they depend on the simultaneous presence of several alleles; thus the hereditary patterns are usually more complex than those of single gene disorders. [DOE] See also multigenic, oligogenic, pleiotropy. Genetic variations glossary

[Axel Kahn] also suggests that the notion of interacting genetic factors in polygenic conditions remains an uncertainty. "Thus far, when we have looked at what we have thought of as multifactorial polygenic conditions, we have only seen diseases with several monogenic causes. Thus for each of a number of conditions we can identify several separate genes, each of which on its own in different patients is associated with diseases which we classify as diabetes or Alzheimer’s disease or obesity. We simply have not really started looking at the additive or multiplicative contributions of several factors to disease inheritance." J. Hodgson "Crystal gazing the new Biotechnologies" Nature Biotechnology 18: 29-31 Jan 2000]

population genetic, population genomics: Genetic variations glossary

post-genomic: The genome era is generally regarded to have started on 28 July 1995, with the publication of the genome of the bacterium Haemophilus influenzae. ["A point of entry into genomics" Nature Genetics 23:273 Nov. 1999]

The "post- genomic era" holds phenomenal promise for identifying the mechanistic bases of organismal development, metabolic  processes, and disease, and we can confidently predict that bioinformatics research will have a dramatic impact on improving  our understanding of such diverse areas as the regulation of gene expression, protein structure determination, comparative  evolution, and drug discovery. The availability of virtually complete data sets also makes negative data informative: by mapping entire pathways, for example, it becomes interesting to ask not only what is present, but also what is absent. [David Roos "Bioinformatics -- Trying to Swim in a Sea of Data" Science 291:1260-1261 Feb. 16, 2001]

Post-genomic can also refer to the increasing emphasis on functional genomics. With an increasing number of organisms for which we have (more or less) complete genomes we are beginning to see glimpses of the power of having fully mapped sequences. Still, in most contexts talk about being "post- genomic" seems a little premature. "Post Mendelian" seems more accurate as we move from an era in which genetics has been rooted in monogenic diseases with high penetrance to a greater awareness (but limited understanding) of polygenic diseases (and traits) often with relatively low penetrance. However with the publication of the draft human sequences in Feb. 2001 we are beginning to be truly "post- genomic".

post-Mendelian: See under post- genomic

structural genomics: Structural genomics glossary.

whole genome: There are several reasons for completely sequencing a genome.  The first and most simple reason is that it provides a basis for the discovery of all the genes. Despite the power of cDNA analysis and the enormous value in interpreting genome sequence, it is now generally recognized that a direct look at the genome is needed to complete the inventory of genes. Second, the sequence shows the long- range relationships between genes and provides the structural and control elements that must lie among them. Third, it provides a set of tools for future experimentation, where any sequence may be valuable and completeness is the key. Fourth, sequencing provides an index to draw in and organize all genetic information about the organism.  Fifth, and most important over time, is that the whole is an archives for the future, containing all the genetic information required to make the organism (the greater part of which is not yet understood). [C. elegans Sequencing Consortium “Genome sequence of the nematode C. elegans; A platform for investigating biology” Science 282: 2013 Dec 11 1998]  Narrower term whole genome index

whole genome index: All the data related to genes, with the information indexed and catalogued in a uniform manner ... [and with] access [to] information about what tissues [those] genes are known to be expressed in, and under what circumstances. The family that the protein encoded by the gene belongs to would be quickly identified, as would valuable information about how genetic variation, or other processes, alter the structure and function of that protein. The active sites on the protein, and the types of compounds that are most likely to interfere with its activity, would be available through a few strokes on a keyboard. ... Without a complete gene index for humans (as well as other medically important organisms), the data- integration problem will be huge, and those researchers who have not adapted their systems to allow for the constant and dramatic changes ahead will spend much of their time just making sure their old data remain usable [CHI Bioinformatics]

Bibliography

[CHI High Throughput Genomics] Genomic Report, Dec. 2001.

Nature “Genome Gateway  http://www.nature.com/genomics/

List of Terms

In-depth Genomics glossary

Beowulf Genomics: The Wellcome Trust established the Beowulf Genomics Initiative in January 1998.  The initiative was established to ensure that the genome sequences of microbial pathogens were available to as wide a range of researchers as is possible.  So far, [Sept. 2000] this initiative has supported the sequencing of 16 bacterial genomes and participated in the funding of two further bacterial genomes, 3 protozoan parasites and one fungus. [Beowulf Genomics, Wellcome Trust, UK, Sept 2000]  http://www.beowulf.org.uk/ Related term microbial genomics.  Any relation to Beowulf computing? Computers & computing

DDBJ: DNA DataBank of Japan Shares information daily with EMBL and GenBank. http://www.ddbj.nig.ac.jp/

DOE Department of Energy: Human Genome Initiative announced by DOE OHER [Office of Health and Environmental Research] in 1986. Congressionally chartered DOE advisory committee, HERAC [Health and Environmental Research Advisory Committee], recommended a 15 year multidisciplinary, scientific, and technological undertaking to map and sequence the human genome in 1987. DOE and NIH outlined plans for cooperation on genome research in 1988. [Oak Ridge National Laboratory (ORNL), Tennessee, US "Major events in the US Human Genome Project"] http://www.ornl.gov/hgmis/project/timeline.html

EMBL (European Molecular Biology Laboratory: Main laboratory is in Heidelberg, Germany, with outstations in Hamburg, Grenoble, France (access to high powered instruments for structure studies) and Hinxton, UK (bioinformatics). Supported by 14 European countries and Israel, shares data daily with DDBJ and GenBank.  http://www.embl-heidelberg.de/

GenBank: Located at NCBI, shares information daily with DDBJ and EMBL. NIH genetic sequence database, an annotated collection of all publicly available DNA sequences. Currently estimated (early 2000) that over 2 million bases are deposited here each day. This growth will only accelerate in the future.  http://www.ncbi.nlm.nih.gov/Genbank/index.html

Now accommodates >10 10 nucleotides and more than doubles in size every year. [David Roos "Bioinformatics -- Trying to Swim in a Sea of Data" Science 291:1260-1261 Feb. 16, 2001]

Genetic Annotation Initiative: Genetic variations glossary

Genome Analysis Pipeline: Purpose: Submit a sequence and get back the results of (1) gene and exon model predictions, (2) GRAIL annotated features, and eventually (3) BLAST analysis. This tool is currently undergoing development and testing. Genome Centers wanting to help us refine the options available through this tool are encouraged to contact us. Hosted by the Computational Biosciences Section, Oak Ridge National Laboratory, Oak Ridge, Tennessee, US. http://grail.lsd.ornl.gov/tools/pipeline/

Genome Catalog, Genome Channel: Functional annotation pipeline is being applied to the genome sequences of human, mouse, and over 23 other organisms. This analysis integrates experimental data and predictions around a genome sequence framework. The data is periodically obtained from the GenBank/ EMBL/ DDBJ collaboration and processed through a large- scale computational framework consisting of several analysis modules. Gene models and other features are predicted. Links are made to other databases and experimental data. The results are stored in the Genome Annotation Data Warehouse. There are two major set of interrelated interfaces to this annotated genomes and the links to external databases: Genome Catalog , an HTML browsing and querying interface with summary and detail data about annotation organized around chromosome, contigs, submitted GenBank clones, GenBank annotated genes, GRAIL- EXP gene models, GENSCAN gene models, STS markers, and other features and Genome Channel. a JAVA interactive browser interface provides a rich graphical view of contigs, clones, genes, and other annotation features. [Oak Ridge National Lab, TN, US] http://genome.ornl.gov/GCat/

Genomes to Life: An ambitious program to achieve the most far-reaching of all biological goals: a fundamental, comprehensive, and systematic understanding of life. DOE's proposed Genomes to Life program would make important contributions in the quest to venture beyond characterizing such individual life components as genes and other DNA sequences toward a more comprehensive, integrated view of biology at a whole- systems level. The DOE offices of Biological and Environmental Research and Advanced Scientific Computing Research have formed a strategic alliance to meet this grand challenge. [US Dept. of Energy 'Genomes to Life"]  http://doegenomestolife.org/

HUGO: Human Genome Organization, an international organization of scientists involved in the Human Genome Project, the global initiative to map and sequence the human genome. Established 1989. http://www.gene.ucl.ac.uk/hugo/

International Human Genome Sequencing Consortium: Published the draft human genome sequence in Nature 15 Feb. 2001 See also Human Genome Project.

International Nucleotide Database: Composed of  DDBJ, EMBL and GenBank.

Joint Genome Initiative: Collaboration between Los Alamos National Lab, Lawrence Livermore National Lab and Oak Ridge National Lab. Organized in 1997. http://www.jgi.doe.gov/

NCBI National Center for Biotechnology Information Bioinformatics glossary

NHGRI  National Human Genome Research Institute http://www.nhgri.nih.gov  The National Center for Human Genome Research (NCHGR) became a NIH Institute in 1997 Original NIH funding of genome projects was through NIGMS (National Institute of General Medical Sciences) in 1985. [Oak Ridge National Laboratory (ORNL), Tennessee, US "Major events in the US Human Genome Project"] http://www.ornl.gov/hgmis/project/timeline.html

RIKEN: Rikagaku Kenkyusy Institute of Physical and Chemical Research, Japan. http://www.riken.go.jp/eng/

Sanger Centre, UK: A genome research centre founded by the Wellcome Trust and the Medical Research Council. Our purpose is to further the knowledge of genomes, particularly through large scale sequencing and analysis.   http://www.sanger.ac.uk/

 


Cambridge
Healthtech Institute
1037 Chestnut Street
Newton Upper Falls, Ma 02464
Phone:
617-630-1300
Fax:  617-630-1325
Email: chi@healthtech.com