| You are here Glossary
homepage/Search > Biology > Genetic variations Genetic variations glossary Evolving terminology for emerging
technologiesComments? Questions? Revisions? mchitty@healthtech.com Last revised December 27, 2001 Our knowledge of genetic variations has been so profoundly influenced
by Mendelian genetics that it is difficult to speculate about the
ways in which our thinking will need to change with further insights
into genomics. We have far to go in teasing apart the multiple variables
of complex traits and diseases, the relationships between hereditary, somatic
and environmental factors, and in making the transition from focusing on monogenic diseases with high penetrance to polygenic conditions with greatly
varying degrees of penetrance. In addition some fairly common words
(allele, polymorphism, wild- type) may carry an explicit (or more frequently
implicit) connotation of "normal" and/ or functional, dating from the early days of
genetics when only mutant phenotypes revealed the presence of genetic
variations. Currently identified disease related genetic variations are relatively rare. Gene expression studies are giving some insight into
clusters of alleles which may be linkable to diseases and phenotypes. Recent work on SNPs seems promising,
but powerful new methods for integrating data and detecting variants undetectable
by current technologies are still needed. Related Glossaries include Applications Functional
Genomics Pharmacogenomics
Sequencing Informatics Algorithms,
Bioinformatics Technologies Chromatography
& electrophoresis, Gene
amplification & PCR, Microarrays
Biology Expression, Gene definitions,
Maps-
genomic & genetic, Sequences
DNA & beyond Additional definitions appear in the In-depth
glossary, after the Bibliography. alleles: One of several alternate forms of a gene which occur
at the same locus on homologous chromosomes and which become separated
during meiosis and can be recombined following fusion of gametes. [IUPAC
Biotech, IUPAC Compendium] Mutually exclusive forms of the same gene, occupying the same locus
on homologous chromosomes, and governing the same biochemical and developmental
process. [MeSH] A related individual or strain contains stable, alternative forms of
the same gene which differs from the presented sequence at this location
(and perhaps others). superceded by 'Variation'. Allele will become illegal
from April 15th, 2000 [DDBJ/ EMBL/ GenBank Feature Table] http://www.ebi.ac.uk/embl/Documentation/FT_definitions/feature_table.html Related terms mutation, polymorphisms, SNP ; In-depth bi-allelic, di-
allelic, multiple- allelism, tri- allelic. Narrower term slightly deleterious
allele. Broader term variants From alleomorph, which is from the Greek meaning one
another form, used by Bateson & Saunders 1902 [OED] The word gene
didn't come along until 1911, coined by W. Johanssen. Alleles databases See Databases & software directory under
'variations'. association studies: Looking at particular genes or variations in two
groups (e.g., affected patients and controls or responders and nonresponders) to
establish an association with a phenotype by finding significant
variations in the two groups.
[CHI SNPs Update] In human genetic linkage studies frequently involve the comparison of
allele frequencies for a marker locus between a disease population and
in a control population. When statistically significant differences in
the frequency of an allele(s) are found between a disease and control population,
the disease and allele(s) are said to be in association. [NHLBI] Narrower
term In-depth random genome-wide association studies. Related term linkage bioinformatics: Bioinformatics glossary biological markers: Measurable and quantifiable biological parameters
(e.g. specific enzyme concentration, specific hormone concentration, specific
gene phenotype distribution in a population, presence of biological substances)
which serve as indices for health - and physiology related assessments,
such as disease risk, psychiatric disorders, environmental exposure and
its effects, disease diagnosis, metabolic processes, substance abuse, pregnancy,
cell line development, epidemiologic studies, etc. [MeSH] Also referred
to as "biomarkers". biomarkers: Pharmacogenomics glossary
Related term surrogate markers Related terms biological markers, genetic markers; Broader term markers. crossing over: See under genetic recombination. DNA marker: A cloned chromosomal locus with allelic variation
that can be followed directly by a DNA- based assay such as Southern blotting
or PCR. [NHLBI] detection technologies: See genetic variation - detection technologies. EST Expressed Sequence Tags Gene definitions fingerprinting: SEE In-depth DNA fingerprinting. forensics: Genomics glossary functional polymorphisms: Promoter and non- silent codon changes. http://www.cgr.ki.se/cgb/groups/brookes/Articles/Human%20Genetic%20Bi.pdf
Compare non-functional polymorphisms. gene-based SNPs: Not just coding SNPs but also intron SNPs and
promoter SNPs. [CHI SNPs Update] gene-disease associations: The discovery of gene- disease associations
requires scanning these same hundreds of thousands of SNPs in as many as 1,000
individuals, possibly using microarray technology.. Validation of gene- disease
association studies requires measuring very few SNPs in thousands of
individuals. [CHI SNPs Update] genetic architecture: The full range of genetic effects on a
trait, with emphasis on the context dependence of the expression of those
genes in manifesting a phenotype. A full cataloging of the genetic architecture
of complex phenotypes consists of a description of all of the genetic and
environmental factors that affect the phenotype, along with the magnitude
of their individual effects and the magnitudes of interactions among the
factors. It is, in principle, possible to describe the genetic components
in terms of Mendelian segregation and location along a genetic map. Environmental
factors are, in general, much less easily partitioned into separate factors
whose individual effects and interactions can be sorted out. ... Genetic
architecture is less a fundamental biological property of the phenotype than a
characteristic of a phenotype in a particular population. The genetic
architecture is a moving target that changes according to gene and genotype
frequencies, the distributions of environmental factors, and such biological
properties as age and sex. The implication of the population dependence is that
the predominant genetic factors contributing to a complex phenotype may seem to
differ from population to population. This is one explanation for the apparent
heterogeneity sometimes found in the results of genetic linkage studies in
different populations. Alternative explanations include insufficient statistical
power in the linkage tests and the possibility that superficially identical
complex phenotypes in different populations may actually have different
biological causes. It is not now possible, in most cases, to differentiate among
these various explanations. [NIH Program
Announcement Genetic architecture of complex phenotypes, June 9, 1998] .
http://grants.nih.gov/grants/guide/pa-files/PA-98-078.html Related term complex traits genetic markers: A phenotypically recognizable genetic trait
which can be used to identify a genetic locus, a linkage group, or a recombination
event [MeSH] Consist of specific nucleotide patterns. [NHLBI] Related term ESTs Gene
definitions UniGene (database) has marker information genetic recombination: Happens during the cell division
(meiosis) that occurs during the formation of sperm and egg cells. In this
process, chromosomes pair up and may swap portions of genetic material in a
phenomenon known as crossing over. The chromosomes then reassemble
and separate, with each containing some material from the other. The chromosomes
are then divvied out into individual sex cells. During crossing over, it is more
likely that far- apart genes will be separated by a break than those that are
close together. The genes that tend to stay together are said to be linked and
therefore may serve as markers for one another — a pattern that is of
particular interest when, for example, one of the genes is a disease gene. [CHI
SNPs Update] Production of new arrangements of genes
by various mechanisms such as assortment and segregation, crossing over,
gene conversion, transformation, conjugation, transduction, F-duction,
or mixed infection of viruses. [MeSH] Related terms recombinant,
recombination genetic variation - detection technologies: Includes microarrays,
In-depth: dHPLC, DNA fingerprinting, DNA footprinting, DNA ligase enzymes, endonucleases,
SSCP, SSEP Related terms: Genomics
glossary genotype, haplotype,
scanning, scoring. genetic variation - human: The race is on to secure worthwhile
information from the published Human Genome. As the emphasis shifts from
de novo sequencing to understanding differences in individuals, a variety
of approaches will be used to most quickly unlock the value of genomics.
While there are many finishing lines across wide- ranging boundaries, to
claim the prize it is still imperative to get there first. That there are
fewer genes than supposed has ironically increased complexity, rather than
reducing it. More than ever technological feats must be united with clinical
significance. This area of study has evolved into a science that impacts
daily lives Human Genetic
Variation September 17-18, 2001, Cambridge MA genome scan: A traditional genome scan uses microsatellite
markers and linkage analysis to scan genomic DNA. Now being combined
with genome- wide SNP scans (using hundreds of thousands of markers
in thousands of patients) and large-scale
association studies [CHI
SNPs] Compare In-depth candidate gene studies genotype: Genomics glossary
Narrower terms: genetic variation detection, mutation detection genotyping, genotyping technologies: Sequencing
glossary haplotype: Genomics glossary haplotyping, haplotyping technologies: Sequencing
glossary idiomorphism: A polymorphism
or any type of variation in DNA sequence occurring with less than 1% frequency.
[Nicholas Schork to Malorye Branca, personal communication Sept. 1999]
Compare with SNP. linkage: Two loci are physically
connected to one another on the same chromosome at a distance that is measured
at less than 50% recombination. Two traits are linked when they fail
to be transmitted to offspring independently from one another. The
more closely linked two loci are to one another, the greater the chance
that both loci will be transmitted to offspring together. The tendency
for genes that are located close to each other on the same chromosome to
be inherited together. [NHLBI] The proximity of two or more markers (e.g. genes, RFLP markers) on a chromosome; the closer together the markers are,
the lower the probability that they will be separated during DNA repair
or replication processes (binary fission in prokaryotes, mitosis or meiosis in eukaryotes), and hence the greater the probability that they will be
inherited together. [DOE] Related terms In-depth linkage analysis, linkage disequilibrium. localize: Gene definitions locus, loci (plural): Gene definitions markers: 1. (DNA) A fragment of known size used as reference
for analytical purposes. 2. (genetic) A gene with known phenotype and mapped
position. 3. (chromatography) A reference substance
co- chromatographed
with the sample to assist in identifying the components. [IUPAC Compendium] A segment of DNA with an identifiable physical location on a chromosome
whose inheritance can be followed. A marker can be a gene, or it can be
some section of DNA with no known function. Because DNA segments that lie
near each other on a chromosome tend to be inherited together, markers
are often used as indirect ways of tracking the inheritance patterns of
genes that have not yet been identified, but whose approximate locations
are known. [NHGRI] Narrower terms biological markers, biomarkers, DNA markers, ESTs, genetic
markers, polymorphisms, In-depth DNA fingerprints, microsatellite markers, microsatellite
repeats, microsatellites, minisatellite repeats, RFLPs, restriction enzymes,
SSRs, STRs,
STSs, satellites Related terms Maps- genomic & genetic
Types of genetic maps are differentiated largely by the type of marker used. minisequencing: Sequencing glossary
In-depth molecular variants: Narrower terms alleles, mutations, polymorphisms,
SNPs. Broader term variants. multifactorial: Two or more genes, together with an environmental
effect, work together to lead to a phenotype. [NHLBI Glossary,] Compare
multigenic, polygenic,
oligogenic. multigenic: Traits controlled by
more than one allele. Compare multifactorial, oligogenic, polygenic. multiplicative: See under epistasis. mutation: A heritable change in the nucleotide sequence of genomic
DNA (or RNA in RNA viruses), or in the number of genes or chromosomes in
a cell, which may occur spontaneously or be brought about by chemical mutagens
or by radiation (induced mutation). [IUPAC Bioinorganic] A related strain has an abrupt, inheritable change in the sequence at
this location Mutation superceded by 'Variation' Mutation will become illegal
from April 15th, 2000. [DDBJ/ EMBL/ GenBank Feature Table]
http://www.ebi.ac.uk/embl/Documentation/FT_definitions/feature_table.html Any type of change (including insertions, deletions, point mutations,
and rearrangements in the nucleotide sequence of DNA) which leads to variations
in the population. Genetic changes that have been associated with disease
risk or were caused by damage inflicted by external agents (such as radiation)
are particularly described as mutations. [CHI SNPs] A change, deletion, or rearrangement in the DNA sequence that may lead
to the synthesis of an altered inactive protein the loss of the ability
to produced the protein. If a mutation occurs in a germ cell, then it is
a heritable change in that it can be transmitted from generation to generation.
Mutations may also be in somatic cells and are not heritable in the traditional
sense of the word, but are transmitted to all daughter cells. [NHLBI] Related terms alleles, polymorphisms, SNPs. Narrower terms In-depth
deletions, duplications, frame shift mutations, leaky mutations, loss of function
mutations, null mutations, neutral mutations, point mutations, suppressor mutations; Functional
genomics glossary targeted mutation. Mutation databases see Databases & software directory under
'variations'. HUGO MUTATION DATABASE INITIATIVE: Issues, Databases, and Perspectives for the New Millennium
(special issue) Human Mutation Vol.15 (1) January 2000 http://www.wiley.com/products/subject/life/genetics/genetics_humu_mdi.html Special issue Mutation Research DNA Repair 40 years of DNA
Repair Volume 485 Issue 1 (25 February 2001) http://www.mutationresearch.com/mutat/show/ mutation detection: See genetic variation detection. mutation rate: The frequency with which a mutation occurs within
an organism or gene. In general, rates of spontaneous mutation vary between
one in 104 and one in 108 per gene per generation,
and can be considerably increased by mutagens. [IUPAC Biotech] non-functional polymorphisms: Intron sequence differences. http://www.cgr.ki.se/cgb/groups/brookes/Articles/Human%20Genetic%20Bi.pdf oligogenic: A trait is considered to be oligogenic when two or
more genes work together to produce the phenotype. Implies that ‘few’ genes
are involved and should be contrasted with a polygenic trait, which
implies that many genes are involved in phenotype expression. [NHLBI] Compare multifactorial,
multigenic, polygenic phene: See Databases &
software directory under OMIA Online Mendelian Inheritance in Animals phenotype: Genomics glossary
Related term genetic architecture. pleiotropy: Gene definitions polygenic: Genomics glossary Related
terms multifactorial, multigenic, oligogenic In-depth
epistasis, polymorphisms: Genetic variations, broadly encompassing any of
the many types of variations in DNA sequence that are found within a given
population. Specific subtypes of polymorphisms include mutations, point
mutations, and SNPs. [CHI SNPs] A term formulated by population geneticists to describe loci at which
there are two or more alleles that are each present at a frequency of at
least 1% in a population of animals. The term has been co- opted for use
in transmission genetics to describe any locus at which at least two alleles
are available for use in breeding studies, irrespective of their actual
frequencies in natural populations. [NHLBI] A gene that exists in more than one version (allele), and where
the rare allele can be found in more than 2% of the population. [NHGRI]
Polymorphisms have the sense of being more neutral than mutations. Related
terms alleles, association, linkage, mutations, population genetics, SNPs. Narrower terms:
In-depth functional polymorphisms, idiomorphisms, non- functional polymorphisms;
SNP-
human, microsatellite repeat polymorphisms, RFLPs, SSRs simple sequence repeats,
STRs, tandem
repeats. Related terms: CEPH, International SNP Map Working Group,
linkage analysis, SSCP, SSEP. Broader term: variants population genetics: The study of the genetic composition of
populations and of the effects of factors such as selection, population
size, mutation, migration, and genetic drift on the frequencies of various
genotypes and phenotypes. [MeSH] Until recently, a small rather esoteric
specialty. population genomics: From genomic polymorphisms, to changes in gene
expression, to altered protein activity there is a probabilistic link between misspellings
in the genetic code and changes in biological function. In some cases, this link
is strong and depends on a single gene but often it is weak and multigenic.The
hypothesis driving many population studies is that my carefully phenotyping and
genotyping large populations will allow these associations to be elucidated. We
are investigating this hypothesis in our collaboration with large
epidemiological studies (e.g the Channing Laboratory) and in developing
efficient algorithms for exploring genetic association (e.g. Bayesian model
induction). [Children's Hospital, Boston, US Informatics Program
"Population studies using genomic data, 2001] http://www.chip.org/chip/research/population.html positional cloning: Functional
genomics glossary primer extension: Gene
amplification & PCR glossary protein polymorphisms: Polymorphisms in exons (protein coding
DNA). cSNPs are a subset of protein polymorphisms. recombinant: The result of a crossover
in a doubly heterozygous parent such that alleles at two loci that were
present on opposite homologs are brought together on the same homolog.
The term is used to describe the chromosome as well as the animal in which
it is present. [NHLBI] Related terms genetic recombination; recombinant DNA technology Technology
& instrumentation overview; recombinant antibodies, recombinant DNA, recombinant proteins Pharmaceutical
biology. recombination: The formation of
new combinations and arrangements of genes during meiosis; recombination
is achieved by crossing over, independent assortment, and segregation.
[NHLBI] Narrower ? term genetic recombination. repeats: Narrower terms: In-depth include
microsatellite repeats, minisatellite repeats, short tandem repeats, satellites,
tandem repeats, VNTRs. SNP Single nucleotide polymorphism: SNPs involve the change of
one DNA base to another. SNPs and point mutations are structurally
identical, differing only in their frequency. Variations that occur in 1% or
less of a population are considered point mutations, and those occurring in more
than 1% are SNPs. SNPs can occur in coding regions of the genome (cSNPs),
in regulatory regions (rSNPs), or, most commonly, in "junk DNA"
regions, in which case they are referred to as anonymous SNPs. [CHI SNP
Update] SNPs are single base pair positions in genomic DNA at which different
sequence alternatives (alleles) exist in normal individuals in some
population(s), wherein the least frequent allele has an abundance of 1% or
greater. Thus single base insertion/ deletion variants (indels) would not
formally be considered to be SNPs. ... In practice, the term SNP is typically
used more loosely than required by the above definition. ... Complications with
the above definition also exist. Specifically, some people might not want to
consider disease predisposing single base variants to be SNPs - but the above
definition would encompass such things as recessively acting, low penetrance,
dominant, quantitative trait loci, or risk associated alleles, since all of
these will occur in some normal (non- diseased) individual. Also the 'some
population' component of the definition is limited by practical challenges of
attaining and surveying representative global population samples. Consequently,
claims of non- polymorphic sequences should always be accompanies by statements
of the actual populations and the numbers of chromosomes tested. Overall, it is
therefore apparent that the term 'SNP' is being widely and imprecisely used as a
catch-all label for many different types of subtle sequence variation. [Anthony
Brooks "The essence of SNPs" Gene 234: 177-186, 1999] http://www.cgr.ki.se/cgb/groups/brookes/Articles/essence_of_snps_article.pdf. Narrower terms SNPs- human, single amino acid polymorphisms SAPS; In-depth anonymous SNPs, cSNPs, candidate SNP, pSNP, rSNP,
SNP haplotypes, synonymous
SNP. Related terms idiomorphism, protein polymorphisms SNP Consortium,
SNP discovery, SNP scans, SNP databases See Database & software directory under 'variations'. SNP discovery: Sequencing glossary SNP discovery - commercialization: One of the fastest- growing areas
of genomic research. This field has advanced much more rapidly than ever
anticipated, and extensive progress has been made in mapping
SNPs both in the public (The SNP Consortium) and private arenas. Also, throughput
capacity is rapidly increasing in this field. Now, the focus is shifting to
validating SNPs and to using SNP data to identify novel disease
targets. Commercial Implications of Advances in the Identification, Mapping, and Application of Single Nucleotide Polymorphisms,
CHI Genomic Reports, Oct. 2001. http://www.chireports.com/content/reports/snpupdate01.asp SNPs - human: Human Mutation- Special Issue SNP 2000:Third International Meeting on Single Nucleotide Polymorphism and Complex Genome Analysis
Vol. 17 (4) April 2001 http://www.wiley.com/products/subject/life/genetics/genetics_humu_snp2000.html A single site in a nucleotide sequence that contains two to four allelic
variations within a population at relatively high frequencies (1.0% by
convention). [S. Sunyaev "SNP frequencies in human genes" Trends in Genetics
16:8): 335-337 August 2000] About 30 million SNPs are thought to exist, making them much better markers
than alternative markers, such as micro- satellite repeats or short tandem
repeats. But it has been the discovery that some SNPs are linked to particular
diseases that has fueled the rising interest in this field. ... to determine
whether and how particular SNPs correlate to specific conditions, one may need
to study hundreds of thousands of SNPs in thousands of patients. At this point
in time, this remains a costly prospect. Also, the software tools are only now
emerging to deal with the analysis challenges. [CHI Bioinformatics] SNP scans: See under genome scans. scanning: Technology used to discover new or unknown SNPs. [M Phillips
CHI Nucleic Acid Detection Technologies conference, June 7-9, 2000] Related term
genome scan. scoring: Determining, by comparison the base pairs (genotypes)
at the locus for many individuals for particular SNPs that have already
been discovered. Related terms: genotyping Single Amino acid Polymorphisms SAPs: Additional factors of complexity are polymorphisms at the protein sequence level. While some of these
polymorphisms are linked to disease states, most are not, yet have in many cases a direct or indirect effect on the activities of the
proteins. [Rolf Apweiler, Amos Bairoch "The Human Proteomics Initiative of SIB and
EBI" Bioinformer 5:3 Autumn 1999] http://bioinformer.ebi.ac.uk/newsletter/archives/5/hpi.html Polymorphisms, which differ by a single amino acid. Related term: cSNPs surrogate markers: Pharmacogenomics
glossary variants: Includes narrower terms: alleles, mutations, polymorphisms,
SNPs and even narrower terms In-depth glossary deletions,
duplications, enhancements, frame- shift mutations, idiomorphisms, indels,
insertions, leaky mutations, loss of function mutations, null mutations, point
mutations, reassortments, VNTRs. Variants seems to be replacing these terms in sequencing databases. There is no single technology at present that will detect all the types
of abnormality (large deletions, rearrangements, base substitutions, small insertions and deletions, amplifications, and epigenetic changes such as
methylation) that are present in cancer cells. [PSA. Futreal et. al "Cancer
and genomics" Nature 409: 850-852, 15 Feb. 2001] validation of gene-disease associations: See under gene- disease
associations wild-type: The most frequently encountered genotype in natural
breeding populations. [IUPAC Biotech] To what extent is this a theoretical concept?
Is it as slippery and elusive as the concept of "normal" in many
cases in clinical medicine? The term "wild-type" was fixed in the lexicon in the early days of
fruit-fly genetics, when one could go out and catch one; now it means the
original line of normally functioning individuals. [HF Judson, Eighth Day of Creation Cold Spring Harbor Laboratory Press 1996
p. 276] Related term: Proteins glossary wild-
type proteins Bibliography [CHI SNP Update] Commercial Implications of Advances in the Identification, Mapping, and Application of Single Nucleotide
Polymorphisms,
CHI Genomic Reports, Oct. 2001. http://www.chireports.com/content/reports/snpupdate01.asp [CHI SNPs] Single
Nucleotide Polymorphisms: Commercial and Scientific Prospects Cambridge Healthtech Institute, Malorye Branca and Kenneth Rubenstein Report
2, Genomic Technologies, 1999. [SNP Consortium] Glossary, 2001.
30+ definitions. http://snp.cshl.org Alpha
glossary index IUPAC definitions are reprinted with the
permission of the International Union of Pure and Applied Chemistry. In-depth Genetic variations glossary aggregate: Related term segregation allele-specific hybridization (ASH): A method of SNP detection.
ASH technologies use oligonucleotides that differ at a single base position
corresponding to the SNP to be detected. In some instances, two oligos
are provided, one for a "wild-type" or normal allele and the
other for the SNP. In other instances, four oligos, corresponding to each of the
four possible bases at the SNP position, are provided. ASH technology shows up
in several microarray products. [CHI SNP
Update] Related terms Gene
amplification & PCR glossary anonymous SNPs : SNPs that have
no known effect on gene function. Thought to be the most common type
of SNPs and possibly valuable as markers for linkage disequilibrium
studies, when they are relatively close to the gene being sought.
[CHI SNPs] bi-allelic: In principle, SNPs could be bi-, tri-, or
tetra-allelic polymorphisms. However, in humans, tri- allelic and tetra-
allelic SNPs are rare almost to the point of non- existence, and so SNPs
are sometimes simply referred to as bi- allelic markers (or di- allelic to be
etymologically correct). This is somewhat misleading because SNPs are only
a subset of all possible bi- allelic polymorphisms (e.g., multiple base
variations). [Anthony Brooks "The essence of SNPs" Gene 234: 177-186,
1999] http://www.cgr.ki.se/cgb/groups/brookes/Articles/essence_of_snps_article.pdf. Variant of di-allelic.
Related terms allele, SNPs. cSNPs: Located within protein
coding sequences and may interfere with gene and related protein function
by altering the protein’s amino acid composition. Generally thought
to be the least common and most valuable type of SNP. At least
200,000 are estimated to be found. Many companies are patenting these
for use in the candidate gene approach. [CHI SNPs] Broader term SNP CEPH: Centre d’Etude du Polymorphism Humain, Paris FRANCE. Collects pedigrees appropriate for reference genetic
mapping. These are characterized by the availability of a large number
of offspring (average 8.5) and both sets of paternal and maternal grandparents.
The structure of these pedigrees renders them “linkage phase known”. [NHLBI]. candidate gene studies: These studies, in contrast to genomewide
scans, focus on particular SNPs thought to have a functional effect or to
be involved in specific conditions. They are generally considered more practical
than genomewide scans [CHI SNPs Update] Related term Gene definitions
candidate gene candidate SNP: Particular
SNPs thought to have a functional effect. correlation studies: For SNPs
will likely be attempts to correlate phenotype or drug response with candidate
SNPs. May or may not be carried out with population validation studies.
[CHI SNPs] DGGE: Chromatography &
electrophoresis glossary dHPLC Denaturing HPLC: Chromatography
& electrophoresis glossary DNA fingerprinting: A procedure
in which multilocus band patterns of a DNA sample are generated by digestion
of the DNA with restriction enzymes followed by electrophoresis and visualization
by hybridization with probes specific for repetitive sequences. In
forensic medicine the probes used are "core" sequences specific for simple
tandem repetitive sequences (MINISATELLITE REPEATS or VNTRs). The multilocus
band patterns, known as DNA fingerprints, are evaluated for similarities
with DNA from an individual. [MeSH] Related term Genomics
glossary forensic applications DNA footprinting: A method for determining
the sequence specificity of DNA- binding proteins. DNA footprinting utilizes
a DNA damaging agent which cleaves DNA at every base pair; DNA cleavage
is inhibited where the ligand binds to DNA. [MeSH (Rieger et al., Glossary
of Genetics: Classical and Molecular, 5th ed)] DNA ligase enzymes: Can link two
adjacent oligonucleotide probes that are hybridized to a template.
A SNP detection technology. [CHI SNP] deletions: A genetic rearrangement
through loss of segments of DNA or RNA, bringing sequences which are normally
separated into close proximity. This deletion may be detected using
cytogenetic techniques and can also be inferred from the phenotype, indicating
a deletion at one specific locus. [MeSH ‘gene deletion’] A type of mutation caused by loss of one or more nucleotides from a DNA segment. Deletions can be very large, encompassing many genes and megabases of DNA, to the point of producing a visible cytological abnormality in a chromosome. Small deletions within a gene can alter the reading frame, and thus the amino acid sequence of the encoded protein
[Mouse Genome Informatics, Jackson Lab] If a particular nucleotide (or series of
nucleotides) is not copied during DNA replication deletions arise. Deletions
may have no effect, results in disease, or in rare cases be beneficial.
[CHI SNPs] Related term indels; Functional
genomics In-depth Cre-lox di-allelic: See under bi-allelic. direct approach: See candidate gene approach. Alternative
to shotgun sequencing. Sequencing glossary duplication: A particular kind of
mutation: production of one or more copies of any piece of DNA, including
a gene or even an entire chromosome. [NHGRI] An additional copy of a DNA segment present in the genome. Gene duplication is the source of
paralogous genes. [Mouse Genome Informatics] Narrower term: whole genome duplication [need definition] endonucleases: A high throughput
fragment analysis SNP scanning technology [M Phillips CHI Nucleic Acid
Technologies conference, June 2000] epigenetic: Gene definitions epistasis: : Two or more genes interacting
with one another in a multiplicative (the effects of alleles at loci which
together contribute to a phenotype when their combination is not equal
to the sum of the individual contribute of each allele by itself) fashion. [NHLBI] founder
populations: Those in which an identifiable current population derives from
a relatively small group of founders. The small size of the founder group
suggests that one will find fewer genes and fewer alleles involved in
susceptibility to a given disease within the population. [CHI SNPs Update] frame-shift mutation: A type of
mutation in which a number of nucleotides not divisible by three is deleted
from or inserted into a coding sequence, thereby causing an alteration
in the reading frame of the entire sequence downstream of the mutation.
These
mutations may be induced by certain types of mutagens or may occur spontaneously.
[MeSH]. Related term reading frames. Sequences,
DNA & beyond. functional cloning: Functional
genomics glossary Genetic Annotation Initiative: Until recently, genetic dissection of
phenotypes has been limited largely to investigations in model organisms.
However, advances in genomic technologies have provided an important opportunity
to identify cancer genes through genetic analysis. A key to performing
such analyses is the ability to detect variation, which will facilitate
the tracking of transmission and identification of alternative forms of
a gene. The NCI has established the Genetic Annotation Initiative (GAI) to delineate, characterize, and validate gene variants. The GAI is
part of the Cancer Genome Anatomy Project (CGAP), National Cancer Institute, US.
http://www.nci.nih.gov/initiatives/gai.html
See also Bioinformatics glossary
annotation genetic drift: Random fluctuations
in gene frequencies, particularly in small populations. [CHI SNP]. Hardy-Weinberg law:
A principle of population genetics that predicts genetic equilibrium in large
populations, assuming standard variables. GH Hardy was a British
mathematician and Wilhelm Weinberg a German physician. HUGO Mutation Database
Initiative: http://ariel.unimelb.edu.au/~cotton/mdi.htm
Variation databases and related sites indels: See under insertion. indirect approach: See random genome-wide association studies. insertion: Several nucleotides
can be added to a sequence, resulting in an insertion. Effects of
an insertion are variable. [CHI SNPs] Insertions and deletions
can be hard to tell apart and are sometimes referred to collectively as
“indels”. International SNP Map Working Group: Identifies and localizes 1.42.
millions SNPs in the human genome. ["A map of
human genome sequence variation containing 1.42 million single nucleotide
polymorphisms" International SNP Map Working Group Nature
409:928-933, 15 Feb. 2001] Related terms SNP maps, SNPs. http://www.nature.com/cgi-taf/DynaPage.taf?file=/nature/journal/v409/n6822/full/409928a0_fs.html leaky mutation: Some function remains,
but not at the level of the wild type allele. [Philip McClean , Intermediate
Genetics PLSC 431, North Dakota State University, 2000] http://www.ndsu.nodak.edu/instruct/mcclean/plsc431/mutation/mutation4.htm linkage analysis: Studying the genetics of families to find out
where in the genome there are particular genes that may be associated with
a disease. This approach is most effective with monogenic disorders that
have high penetrance. Tallies up within family associations
at a particular locus. In cases of genes with many functional polymorphisms,
linkage analysis with a linked marker can identify the gene, but association
analysis will not. May require large numbers of affected families, often
many thousands, to achieve statistically significant results for complex
diseases. [CHI SNPs] Related terms linkage, linkage disequilibrium. linkage disequilibrium: The tendency of closely spaced alleles to be
inherited together. Linkage disequilibrium reduces the number of polymorphic markers
that must be studied when using random markers to detect association between a
gene and a trait. In the absence of linkage disequilibrium, only a causative polymorphism
shows any appreciable difference between the case and the control group.
However, in the presence of linkage disequilibrium, polymorphisms that are
physically near a causal polymorphism will also show a difference in frequency
between cases and controls, and an enhanced association with the trait in
questions. [CHI Breaking Bottlenecks] When alleles
at two distinctive loci occur in gametes more frequently than expected
given the known allele frequencies and recombination fraction between the
two loci, the alleles are said to be in linkage disequilibrium. Evidence
for linkage disequilibrium can be helpful in mapping disease genes since
it suggests that the two may be very close to one another. [NHLBI] Related terms
haplotype, haplotyping technologies, linkage, linkage analysis. loss of function mutation: Wild
type alleles typically encode a product necessary for a specific biological
function. If a mutation occurs in that allele, the function for which it
encodes is also lost. The general term for these mutations is loss-of-function
mutations. [Philip McClean , Intermediate Genetics PLSC 431, North Dakota
State University, 2000] http://www.ndsu.nodak.edu/instruct/mcclean/plsc431/mutation/mutation4.htm microsatellite markers: See microsatellite
repeat polymorphisms microsatellites: Consist of tandem repeats, which
contain repetitive runs of the same short base sequence (e.g., GTA, GTA, GTA…).
Among individuals, these sections of DNA may vary in the number of repeats they
contain and can serve as markers and signs of genetic variation. [CHI SNPs
Update] minisatellite repeats: Tandem arrays
of moderately repetitive (5- 50 repeats) short (10- 60 bases) DNA sequences
found dispersed throughout the genome and clustered near telomeres. Their
degree of repetition is two to several hundred at each locus. Loci number
in the thousands but each locus shows a distinctive repeat unit. Minisatellite
repeats are often called variable number of tandem repeats [VNTRs]. [MeSH] Also
known as SSRs Related terms tandem repeats. Broader term polymorphisms multiple allelism: The existence of several known alleles
of a gene. [Schwindlein] negative selection: Many mutations
are deleterious to the fitness of an organism. These will be selected against
and eventually lost from the population. [S. Sunyaev “SNP frequencies
in human genes” Trends in Genetics 16:8): 335-337 August 2000] neutral mutation: Substitutions
that have no selective advantage or disadvantage. [S. Sunyaev “SNP
frequencies in human genes” Trends in Genetics 16:8): 335-337 August 2000] nucleotide diversity: The number of base differences between
two genomes, divided by the number of base pairs compared. A sensitive
indicator of biological and historical factors that have affected the human
genome. [A. Chakravarti "...to a future of genetic medicine" Nature 409:
822-823, 15 Feb. 2001] Related terms population genetics, population genomics null mutation: A mutation where
function is totally lost. pSNPs: If a cSNP or an rSNP
leads to an altered amino acid, which in turn leads to altered protein
function or expression and an observable change in the organism’s phenotype,
the SNP may be labeled a pSNP. [CHI SNPs] Broader term SNP phenocopy: A phenotype that is not genetically controlled
but looks like a genetically controlled phenotype. An environmentally
induced phenotype that resembles the phenotype produced by a mutation.
[Edinburgh] pleiotropism or pleiotropy: Gene definitions. point mutations: A variation in just a single nucleotide. These
are the most common type of genetic variation. Point mutations that occur
in greater than 1% of the members of a group or species are called SNPs.
[CHI SNPs] A point is a single nucleotide on a chromosome. [PhRMA] In classical genetics, a point mutation was originally defined as a change in
an inherited trait that was not accompanied by any chromosomal change that could
be seen with a light microscope (even in the giant polytene chromosomes of Drosophila
larval salivary glands). In current usage, point mutations are usually
understood to involve only one base pair, but to include both substitutions
(transitions and transversions), and the addition or deletion of a single base
pair. A point mutation can result in missense (amino acid substitution),
nonsense (insertion of a stop codon), or frameshift (either positive or
negative). [Richard Ham, Molecular Basis for Mutation, Univ. of Colorado, US
Fall 2000] http://www.colorado.edu/MCDB/MCDB2150Fall/notes00/L0006.html positional candidates: Functional
genomics glossary Not to be confused with the candidate gene strategy. rSNPs: These SNPs affect regulatory
regions that govern gene expression. Thought to be relatively uncommon
and potentially as valuable as cSNPs. Limited commercial activity
at this time. [CHI SNPs] Broader term SNP RFLP Restriction Fragment Length Polymorphism:
Restriction fragment length polymorphism, which can be used in SNP
genotyping. This method relies on enzymatic cleavage of DNA followed by electrophoresis.
[CHI SNPs Update] A variant among individual organisms in
the size and number of DNA fragments generated by the actions of restriction
enzymes. These variations can be detected by the differences in the distribution
of DNA fragments during electrophoresis. [NHLBI] Broader term marker. random genome-wide association studies: Looks at many random
SNPs in the hope that some will be in linkage disequilibrium with a particular
gene or genes. [CHI SNPs] Broader term association studies. reassortment:
The rearrangement of genes from two distinct strains to produce a novel viral
strain. [CHI SNPs Update] regional scanning: Developed
by Genset. Used linkage studies to identify sequence regions from
the public domain and their own studies to narrow the parts of the genome
they would subsequently scan. Then using high-density mapping, making
a first pass using a map with SNPs every 30- 40 kb in the regions of interest
and then increasing the density to every 5-10 kb as they closed in on the
genes of interest. Can only be used for studies of disease genes
for which family linkage data is available. [CHI SNPs] See also Maps-
genomic & genetic restriction enzymes: Endonucleases which recognize specific base sequences within a DNA helix, creating a double- strand break of DNA. Type I restriction enzymes bind to these recognition sites but subsequently cut the DNA at different sites. Type II restriction enzymes both bind and cut within
their recognition or target sites. [IUPAC Biotech] A group of
enzymes isolated from bacteria that cut DNA molecules at specific sites
characterized by certain nucleotide sequences. [NHLBI] Broader term markers. restriction fragment length polymorphism:
See
RFLP. The SNP Consortium (TSC): Established (April 1999) to identify
SNPs and add them to the public domain rather than patenting them for commercial
use. A joint enterprise of pharmaceutical companies (AstraZeneca,
Bayer, Bristol-Myers Squibb, F. Hoffmann-La Roche, Glaxo Wellcome, Aventis/Hoechst
Marion Rouse, Novartis, Pfizer, Searle, and SmithKline Beecham) and the
Wellcome Trust, with sequencing carried out by three public genomics institutes
(Sanger Centre, Washington University School of Medicine, and the Whitehead
Institute) with Stanford University Human Genome Center contributing mapping
and Cold Spring Harbor Laboratory coordinating bioinformatics activities.
[CHI SNP] Motorola joined in October 1999 and IBM in March 2000. SNP Consortium
http://snp.cshl.org http://www.wellcome.ac.uk/en/1/biovensnp.html Within two years of its establishment, the consortium had far exceeded its
initial goals, and its final map will have one SNP for every 5,000 to 10,000
base pairs. The technology used in this project was finely tuned, and 95% of the
SNPs called (i.e., identified) on the consortium map can be validated using a
gold standard. [CHI Bioinformatics] SNP haplotypes: There has been much interest in the recent publication
of the single nucleotide polymorphism (SNP) map of the human genome,
particularly with reference to the role it might play in disease gene mapping
and developing a better understanding of patterns of linkage
disequilibrium (LD) across chromosomes. For these types of analyses with samples
of population data, SNP haplotypes are essential. However, current genotyping
technology cannot determine phase. This has prompted the development of
statistical methodology to infer phase, allowing haplotypes to be reconstructed
from the available genotype data. Existing methods utilise parsimony and maximum
likelihood, implemented via the E-M algorithm. The disadvantage of these
approaches is that they do not assign uncertainty to phase assignment, leading
to over- confidence in the inferences made by subsequent analyses using the
extracted haplotype data. [Andrew Morris "A Bayesian MCMC method for
extracting phase information from SNP genotypes" Statistics &
Probability Seminars, Univ. of Bristol, 16 Nov. 2001] http://www.stats.bris.ac.uk/Seminars/Abstracts/morris.html SNP maps: Maps genomic & genetic SSCP Single Strand Conformational
Polymorphism: High throughput fragment analysis, a technique for screening
for SNPs. [CHI SNPs] SSEP Single Strand Electrophoretic Polymorphism:
A
technique for screening for SNPs. [CHI SNPs] SSRs Simple Sequence Repeats:
Stretches of 1 to 6 nucleotide units repeated in tandem and randomly spread in eukaryotic
genomes. SSR
are very polymorphic due to the high mutation rate affecting the number of repeat units. Such
length-polymorphisms can be easily detected on high resolution gels (e. g. sequencing gels), by running PCR
amplified fragments obtained using a unique pair of primers flanking the repeat (Weber and May 1989). SSR have
several advantages over other molecular markers. For example, (i) microsatellites allow the identification of many
alleles at a single locus, (ii) they are evenly distributed all over the genome, (iii) they are co-dominant, (iv) little
DNA is required and (v) the analysis can be semi-automated and performed without the need of radioactivity. In our
lab as well as in others (Gianfranceschi et al. 1998, Guilford et al. 1997, Maliepaard et al. 1998) SSRs for the
genetic analysis of apple have been developed. We focused on the isolation and characterisation of (GA)n repeats,
since those are the most abundant class of repeats in plants, after (TA)n, which have some technical disadvantages
that hinder the selection of TA rich sequences. [Luca Gianfranceschi
"Simple Sequence Repeats: Markers of Choice in Apple Breeding, ETHZ,
Switzerland, INRA France] http://www.inra.fr/Internet/Projets/DARE/textes/focus/genetics/geneticspublication/ssr.htm STRs Short Tandem Repeats : Often
serve as polymorphic markers. [CHI SNPs] STSs Sequence-Tagged Sites:
Unique short sequence of DNA, typically less than 400 bases long.
Detected by PCR. Allows identification of where in the genome the
studied sequence is localized. ESTs are STSs derived from cDNA.
Useful for orienting the physical mapping and sequence data reported from
different laboratories. [DOE] A short DNA segment that occurs only once
in the human genome and whose exact location and order of bases are known.
Because each is unique, STSs are helpful for chromosome placement of mapping
and sequencing data from many different laboratories. STSs serve as landmarks
on the physical map of the human genome. [NHGRI] Related terms vectors, cloning Cell
biology glossary positional cloning Functional
genomics. STS databases see Databases &
software directory. satellite: Many tandem repeats (identical
or related) of a short basic repeating unit; many have a base composition
or other property different from the genome average that allows them
to be separated from the bulk (main band) genomic DNA. [DDBJ/ EMBL/ GenBank
Feature Table] http://www.ebi.ac.uk/embl/Documentation/FT_definitions/feature_table.html
Narrower term microsatellite, minisatellite. segregation: The principle
that the two partners of a chromosome pair are separated during meiosis
and distributed randomly to the germ cells. Each germ cell has an
equal chance of receiving either chromosome. [NHLBI]. Related terms Mendelian,
aggregate. ‘slightly’ deleterious alleles:
An allelic variant subject to negative selection, but the selection coefficient
is relatively low. The frequency of a slightly deleterious allele in a
population is subject both to the stochastic fluctuations of genetic drift,
depending on population size, and to a very weak negative selection.
Unlike strongly deleterious alleles, which are quickly eliminated by selection,
slightly deleterious alleles can be kept in a population for a long time
owing to drift. Due to selective pressure, they are predominantly observed
at low frequencies (in comparison to purely neutral alleles. [S. Sunyaev “SNP frequencies in human genes” Trends in Genetics 16:8): 335-337 August
2000] suppression: Second mutation at
a site distinct from the first mutation reverses, at least partially, the
phenotypic expression of the first mutation. [CHI Functional Genomics] The restoration of the wild- type phenotype
in an organism possessing a mutationally altered genotype. The effects
of the mutation may be suppressed by a second ‘suppressor’ mutation on
a different gene, by a suppressor mutation on the same gene but located
at a distance from the site of the primary mutation, or by the presence
of a cytoplasmic suppressor due to a change in non chromosomal DNA. [Metathesaurus] suppressor mutation: A reversion to original phenotype but by a
mutation in another gene. [Kenneth G. Wilson, Genetics, Miami Univ. US] http://www.cas.muohio.edu/~wilsonkg/genetics/Gene_code/mutation/mutation.htm synonymous SNP: When, despite
the substitution of a base in a cSNP, the cSNP still codes for the same
amino acid. [CHI SNPs] Substitutions in coding regions that result
in the same amino acid. [S. Sunyaev “SNP frequencies in human genes”
Trends in Genetics 16:8): 335-337 August 2000] Narrower term: SAP single amino acid polymorphism tandem repeats: Multiple copies
of the same base sequence on a chromosome; used as a marker in physical
mapping. [DOE] Related term microsatellite repeat polymorphisms, STR,
satellite tri-allelic: Blood groups
(A, B, O) are an example of tri-allelism. Related terms allele,
bi-allelic. VNTRs Variable Number of
Tandem Repeats: See minisatellite repeats.
|