|
PhD Thesis
Targeted mapping of the chicken
genome
Hester Mary Wain
A thesis submitted in partial
fulfilment of the
requirements of the Universtiy of Hertfordshire
for the degree of Doctor of Philosophy
Division of Molecular Biology,
Institute for Animal Health,
Compton Laboratory,
Compton,
Berkshire. RG20 7NN
University of Hertfordshire
Hatfield,
Hertfordshire.
Zoology Department,
University of Leicester,
Leicester,
Leicestershire.
May 1997
Abstract
There is now a very comprehensive chicken linkage map containing well
over 800 loci. However, in order to use this map to identify the genes
responsible for traits it is necessary to progress from linkage information
to physical clones of the relevant region. The identification of chicken
yeast artificial chromosome (YAC) clones and the gene of interest is the
first step towards physical cloning of a candidate gene but at present
the efficiency of this process would be greatly improved if the density
of genetic markers in the targeted region could be increased. The initial
approach to this problem was to place as many known genes as possible on
the map, by using expressed sequence tags (ESTs) as markers. This placed
seven novel chicken genes on the linkage map. However, this method was
inefficient, so a method of saturating the genome with markers was applied.
Random amplified polymorphic DNA (RAPD) analysis was investigated as a
means of increasing marker density over the whole genome, but was also
inefficient at producing large numbers of markers. Then the potential of
representational difference analysis (RDA) was investigated to target markers
to specific regions of the chicken genome. As an initial test of the method
the genomes of two inbred lines were compared. This provided a relatively
undemanding test of the technique and generated large numbers of polymorphic
clones. As a more stringent test of the method, and to provide marker loci
in a region of particular interest, a second comparison was made targeting
chicken chromosome 16. Chromosome 16 contains the major histocompatibility
complex (MHC), nucleolar organiser region (NOR) and Rfp-Y complex. This
succeeded in targeting 10 out of 18 RDA clones to chromosome 16. It appeared
possible to target RDA towards the direct identification of a trait gene
or genes and as a model for this, RDA was applied to the identification
of genes affecting resistance to Marek's disease. This generated four RDA
loci which were located on chromosome 1 three of which showed association
with both mortality and Marek's disease virus quantitative PCR product
values. These RDA loci appear to identify a novel region of the chicken
genome containing a trait gene conferring resistance to Marek's disease.
Thus RDA appears to provide a viable solution to the generation of targeted
markers, which will aid YAC identification in specific areas.
Acknowledgements
I would like to thank Professor John Bourne and the Institute for Animal
Health, Compton for allowing me to undertake the research for this degree.
I would also like to thank my supervisor Dr Nat Bumstead for all his
help and support, especially with the proof reading of this thesis and
his belief that PCR is not controlled by the tooth fairies! My thanks also
go to Dr Ayo Toye, for generating the chicken YAC library, letting me use
it and giving positive encouragement in the final stages of writing up.
I would also like to thank all those within the Avian Genetics lab. especially
Julie Sillibourne and Nigel Salmon without whose technical support everything
would have been a great deal harder.
I would also like to acknowledge the help of my external supervisor
Dr Terry Burke and all those who have previously provided markers for the
Compton chicken linkage map, without which a great deal of this work would
have been impossible.
I am also grateful to all other staff at the Institute who have made
my working time more pleasant: particularly the librarians Chris Gibbons
and Diane Collins, the DID Jim White-Cooper, Bernard Clark and formerly
Dave Hawkins, the Computing Dept. Roger Harrison, Fiona Wycherley, the
canteen staff and Alison Thomson and everyone in Media and Wash-up. I would
specifically like to thank Christine Jones for her understanding of my
need for isotope and Dr Pete Kaiser for his reference checking, proof reading
and bullying.
I would like to say a special thank you to Janene, Thomas and Simon
Bumstead for tolerating my somewhat unexpected intrusions into their lives
and Marie Quick for encouraging me to start this PhD. Another very special
thank you must go to my parents Drs Chris and Bill Wain without whom I
would never have got this far.
Finally I am also extremely appreciative of my husband Graham for putting
up with my computer problems, long hours away from home and continuous
Dial-a-pizzas for dinner.
Declaration
This thesis is wholly the result of my own work. No part of it has been
submitted to any other board for another qualification. The views expressed
are those of the author and not of the University.
Contents
The HTML contents of this thesis vary from the
original publication, due to size contraints. For any further information
please email
Hester.
Title page
Abstract
Acknowledgements
Declaration
Contents
List of Tables (removed)
List of Figures (removed)
Abbreviations
Genes and Loci
Chapter One
Introduction
1.1 The Chicken
1.2 Genomic structure and organisation of the chicken
1.2.1 Repetitive Sequences
1.3 Genomic Mapping
1.3.1 Genetic Mapping
1.3.1.1 Restriction Fragment Length Polymorphism
1.3.1.2 Single-Stranded Conformational Polymorphism
1.3.1.3 VNTR-Hypervariable Minisatellites
1.3.1.4 Microsatellites
1.3.1.5 Random Amplified Polymorphic DNA markers
1.3.1.6 Amplified Fragment Length Polymorphism
1.3.1.7 Targeted Mapping
1.3.1.8 Expressed Sequence Tags
1.3.1.9 Subtractive hybridisation
1.3.1.10 RFLP subtraction
1.3.1.11 Differential Display-Reverse Transcription (DD-RT)
1.3.1.12 Genomic Mismatch Scanning
1.3.1.13 Representational Difference Analysis
1.3.2 Physical Mapping
1.3.2.1 Pulse Field Gel Electrophoresis
1.3.2.2 Radiation Hybrids
1.3.2.3 Flow Karyotyping
1.3.2.4 Chromosome microdissection
1.3.2.5 Yeast Artificial Chromosomes (YACs)
1.3.2.6 Sequence Tagged Sites (STS)
1.3.2.7 Fluorescent In-Situ Hybridisation (FISH)
1.4 Traits of interest in the chicken
1.4.1 Production Traits
1.4.2 Disease Traits
1.4.2.1 The major histocompatibility complex (MHC)
1.4.2.2 Salmonellosis
1.4.2.3 Infectious bursal disease virus
1.4.2.4 Marek's disease virus
1.4.2.5 Newcastle disease
1.4.2.6 Infectious bronchitis virus
1.4.2.7 Avian leukosis and sarcoma virus
1.4.2.8 Coccidiosis
1.4.2.9 Fowl cholera
1.5 Objectives
1.6 Approaches
1.6.1 Expressed Sequence Tags (EST)
1.6.2 Random Amplified Polymorphic DNA markers (RAPD)
1.6.3 Representational Difference Analysis
Chapter Two
Materials and Methods
2.1 Materials
2.2 Bacterial strains and plasmids
2.2.1 cDNA Library IAHchB1
2.2.2 Transformation of IAHchB1 by electroporation of
Escherichia coli MC1061/P3
2.2.3 Cloning RDA products into pGEM-T vector
2.3 Chicken Strains and Crosses
2.4 DNA Preparation
2.4.1 Genomic DNA Extraction
2.4.2 Plasmid DNA Preparation-ABI Protocol
2.4.3 Plasmid DNA Preparation-Hybaid maxiprep Protocol
2.5 Restriction Endonuclease Digestion
2.6 Electrophoresis
2.6.1 Agarose Gel Electrophoresis
2.6.2 ABI Genescan electrophoresis
2.6.2.1 Creation of a Matrix File
2.6.2.2 RAPD-PCR Electrophoresis
2.6.3 Sequence Gel Electrophoresis
2.7 PCR Amplification
2.7.1 Plasmid insert PCR Amplification
2.7.2 Colony PCR of RDA products
2.8 Gel purification of PCR products
2.9 Automated Sequencing
2.9.1 Sequence Analysis
2.10 Southern Blotting
2.10.1 Probe preparation and hybridisation
2.10.2 Nick translation system (Gibco BRL)
2.10.3 Prime-It RmT random primer labelling kit (Stratagene)
2.11 Mapping Techniques
2.11.1 Restriction Fragment Length Polymorphism Analysis
2.11.1.1 Mapping Segregation Analyses
2.11.2 RAPD Analysis
2.11.2.1 RAPD PCR Reaction
2.11.2.2 Genescan Analysis
2.12 Representational Difference Analysis
2.12.1 RDA comparison of Line N and line 15I
2.12.1.1 Screening and characterisation of RDA clones
2.12.2 RDA targeted to chromosome 16
2.12.2.1 BamHI Representation
2.12.2.2 TaqI Representation
2.12.2.3 NheI Representation
2.12.3 RDA targeted to Marek's Disease Resistance Genes
Chapter Three
Mapping Expressed Sequence Tags
3.1 Introduction
3.2 Results
3.2.1 Mapping ten novel chicken genes
3.2.2 RFLP Mapping
3.3 Discussion
3.3.1 Analysis of clones from the bursal cDNA library
3.3.2 RFLP Analysis and Mapping of clones
Chapter Four
Random Amplified Polymorphic DNA Analysis
4.1 Introduction
4.1.1 Choice of RAPD Primer
4.2 Results
4.2.1 Creation of the Matrix File
4.2.2 RAPD Analyses using fluorescent primers
4.3 Discussion
Chapter Five
Representational Difference Analysis comparison of line N
and line 15I
5.1 Introduction
5.2 Results
5.2.1 Comparison of Line N and line 15I
5.2.2 Colony PCR of RDA products
5.2.3 Screening and characterisation of RDA clones
5.2.4 Mapping RDA Clones
5.2.5 YAC Hybridisation
5.2.6 Nucleotide Sequences
5.3 Discussion
Chapter Six
Targeted mapping of chromosome 16 by Representational
Difference Analysis
6.1 Introduction
6.2 Results
6.2.1 BamHI Representation
6.2.2 TaqI Representation
6.2.3 NheI Representation
6.2.4 Screening and characterisation of RDA clones
6.2.5 Mapping RDA Clones
6.2.6 Nucleotide Sequence
6.2.7 YAC Hybridisation
6.3 Discussion
Chapter Seven
Resistance to Marek's disease targeted by Representational
Difference Analysis
7.1 Introduction
7.2 Experimental Design
7.3 Results
7.3.1 RDA using BamHI representation
7.3.2 Screening and Characterisation of RDA clones
7.3.3 Mapping RDA Clones
7.3.4 Analysis of association of RDA clones to MD resistance
7.3.5 Segregation of hV32 in the F2 population
7.4 Discussion
Chapter Eight
General Discussion
8.1 The Chicken
8.2 Genome Mapping
8.3 Saturation Mapping
8.4 Random amplified polymorphic DNA (RAPD) analysis
8.5 Microsatellite Markers
8.6 Amplified fragment length polymorphisms (AFLP)
8.6 EST mapping
8.7 Subtractive Hybridisation
8.8 Differential display-reverse transcription (DD-RT)
8.9 RFLP subtraction
8.10 The RDA Approach
8.11 The Future of Mapping
8.12 Future Work
References
Appendices (removed)
Abbreviations and Acronyms
°C degree(s) Celsius
A adenine
aa amino acid
ABI Applied Biosystems (Perkin Elmer)
AceDB A C.elegans database
AFLP Amplified fragment length polymorphism
AIDS Acquired Immunodeficiency syndrome
ALSV Avian leukosis and sarcoma viruses
ALSV-A Subgroup A Avian leukosis and sarcoma viruses
Alu-PCR PCR of inter Alu fragments
ALV Avian leukosis virus
Amp Ampicillin
APC Antigen presenting cell
App. Appendix
APS Ammonium Persulphate
ARS Autonomous Replication Sequence
BCE Before common era
bp base pair
BSA Bovine Serum Albumen
C cytosine
C band chromatin band
cDNA complementary Deoxyribonucleic acid
CH3COONH4 Ammonium acetate
CHEF Clamped homogeneous electrophoresis field
cM centi-Morgan(s)
cm centimetre
CMRP Compton mapping reference population
contig contiguous
COS7 Monkey kidney cell line
CpG Cytosine and guanine dinucleotide
CR1 Chicken Repeat 1
Cys Cysteine
dCTP 2'-deoxycytidine triphosphate
DD-RT Differential Display-Reverse Transcriptase
DMF Dimethyl formamide
DNA Deoxyribose nucleic acid
dNTP deoxynucleoside triphosphate
DOP-PCR Degenerate oligo. primed PCR
dsb Double stranded break
dT deoxy-Thymine
DTT dithiothreitol
dUTP 2'-deoxyuridine triphosphate
EDTA Ethylenediaminetetra acetic acid
EE EDTA and EPPS buffer
EPPS N-(2-hydroxyethy)piperazine-N'-(3-propanesulfonic
acid)
EMBL European molecular biology laboratory
ES Embryonic stem cell
EST Expressed sequence tag
F1 First cross
F2 Intercross between two F1
FAM 5-carboxyfluorescein (ABI fluorescent label)
Fig Figure
FISH Fluorescent In Situ Hybridisation
g Acceleration due to gravity
G guanine
GCG Genetics computer group
GCN4 DNA binding protein of ds DNA
GDB Genome database
GDRDA Genetically Directed Representational Difference Analysis
GL Suggestive linkage
GMS Genomic mismatch scanning
GTE Glucose, tris, EDTA buffer
HAT Hypoxanthine/aminopterin/thymidine
HepG2 Liver cancer cell line
HEX 6-carboxy-2',4',7'4,7-Hexachlorofluorescein
(ABI fluorescent label)
His Histidine
HPRT Hypoxanthine phosphoribosyltransferase
HSL Highly significant linkage
HVT Herpes virus of turkeys
IAH Institute for Animal Health
IBDV Infectious bursal disease virus
IBV Infectious bronchitis virus
IFGT Irradiation and fusion gene transfer
IPTG Isopropyl-[beta]-D-thiogalactopyranoside
JBAM 2nd RDA oligo for BamHI derived fragments
JTAQ 2nd RDA oligo for TaqI derived fragments
kb kilobase
kV kilo volts
l litre
[lambda] Wavelength
L-agar Luria-Agar
LOD Logarithms of odds ratio
LR Long Ranger
LTR Long terminal repeat
M Molar
MAS Marker assisted selection
Mb Megabase
Mbq Mega bequerel
µCi micro Curie
MD Marek's disease
MDV Marek's disease virus
mer Oligomer
mF milli Farad
µg microgram
mg milligram
MgCl2 Magnesium chloride
MGD Mouse genome database
MHC Major Histocompatiblity Complex
MIC Micro-chromosome
min minute(s)
µl microlitre
ml millilitre
µm micrometer
MPC Biomagnetic separator
MQ Milli-Q pure water
mRNA messenger Ribonucleic Acid
N Normal
NaCl Sodium chloride
NaClO4 Sodium perchlorate
NaOH Sodium hydroxide
NBAM 3rd RDA oligo for BamHI derived fragments
ng nano gram
nmol nano mole
NOR Nucleolar organiser region
NTAQ 3rd RDA oligo for TaqI derived fragments)
OD Optical Density
oligo Oligomer
p short arm of chromosome
P Probability
PB Qiagen's PB buffer
PBS Phosphate Buffered Saline
PCR Polymerase Chain Reaction
PCR-ELA PCR-Enzyme Linked Assay
PCR-SSP PCR-Sequence Specific Primer
PEG Polyethylene Glycol
pers. com. Personal communication
PFGE Pulsed field gel electrophoresis
pfu plaque forming units
pg pico grams
PI Post infection
pmol pico moles
Poly A Poly Adenylate
PVP Polyvinylpyrrolidone
q long arm of chromosome
QTL Quantitative Trait Locus
rad Unit of Radiation dose
RAPD Random Amplified Polymorphic DNA
RBAM 1st RDA oligo for BamHI derived fragments
RDA Representational Difference Analysis
rDNA Ribosomal DNA
RE Restriction Endonuclease
RFLP Restriction Fragment Length Polymorphism
RH Radiation hybrid
RNA Ribonucleic Acid
RNaseA Ribonuclease A
RNHE 1st RDA oligo for NheI derived fragments
rpm revolutions per minute
RPRL Regional poultry research laboratory
RT Room Temperature
RTAQ 1st RDA oligo for TaqI derived fragments
SB Special buffer
SDS Sodium Dodecyl Sulphate
sec Second(s)
Seq Ed Sequence editor
SL Significant linkage
ss single stranded
SSC Saline sodium citrate
SSCP Single Stranded Conformational Polymorphism
STS Sequence tagged sites
T Thymine
Ta Annealing Temperature
TAE Tris-acetate EDTA
TAP2 Transporter associated with antigen processing 2
Taq Thermus aquaticus DNA polymerase
TB Terrific broth
TBE Tris-borate EDTA
TE Tris EDTA
Te Extension Temperature
TEMED N,N,N',N'-tetramethylethylenediamine
Tet Tetracyclin
Tm Melting Temperature
Tris 2-amino-2-(hydromethyl) propane-1, 3 diol
tRNA transfer RNA
U Unit
UK United Kingdom
UTR Untranslated region
UV Ultraviolet
V Volt
VNTR Variable number tandem repeat
vs. versus
[Omega] Ohm
WG-RH Whole genome radiation hybrid
X-Gal 5-Bromo-4-chloro-indoyl-[beta]-D galactoside
YAC Yeast artificial chromosome
YS Yoshida sarcoma
Genes and Marker Loci
Loci/prefixes
ADL Microsatellite markers from East Lansing
COM Marker generated at IAH Compton
MCW Microsatellite markers from Wageninen
28S 28S ribosomal protein
60S 60S ribosomal protein
B-F Chicken MHC Class I locus
B-L Chicken MHC Class II locus
B-G Chicken MHC Class IV locus
BAT8 (homologue of G9a) function unknown
bcl-x bcl2 family, apoptosis gene
BG32.1 Chicken MHC Class IV locus marker 32.1
Bu-1 antigen expressed on B cells (marker=CHB6)
C4 MHC class III gene
CMYC28 Chicken myelocytomatosis viral oncogene 28
CNBP Cellular nucleic acid binding protein
COL4a1 Collagen type 4, alpha 1 chain
Db Eumuelanin restrictor gene
ERV Endogenous retrovirus
ev21 endogenous virus locus
FN1 fibronectin 1 gene
G6PD Glucose 6 phosphate dehydrogenase gene
G9a (homologue of BAT8)
GDI GDP dissociation inhibitor
GDID4 GDP dissociation inhibitor for the rho (GDI) protein
GDP Guanosine diphosphate
hg high growth gene
HLA Human leukocyte antigen loci A-D
HMG1 High mobility group 1
HPRT Hypoxanthine phosphoribosultransferase
Insr Insulin receptor gene
ITPR2 Inositol tri-phosphate receptor 2
Ity Salmonella typhimurium resistance gene
jcpk Juvenile congenital polycystic kidney disease
Lec Lectin type C gene
Ly-4 receptor gene on lymphocytes
MHCII[beta] Major Histocompatiblity Complex II [beta]
Ml Eumuelanin extension gene
NF2 Neurofibromatosis 2 gene
NOR Nucleolar organiser region
Nramp Natural resistance associated macrophage protein
nude Nude locus (mouse chromosome 1)
Pg Pattern gene
pol polymerase gene
PSF PTB associated splicing factor (marker SFPQ)
PTB Polypyrimidine tract binding
pudgy Pudgy locus
RAG-1 Recombination activation gene 1
RAG-2 Recombination activation gene 2
Rfp-Y Restriction fragment pattern-Y
SFPQ PTB associated splicing factor (PSF)
SRE Sterol regulatory element
TAP2 Transporter associated with antigen processing 2
TET Tetracycline resistance gene
TFRC transferrin receptor gene
Th-1 antigen expressed on T cells
tottering tottering gene
TUBA Tubulin alpha
tv-a receptor locus for ALSV-A
unl unlinked
VIL villin gene
Xic X inactivation centre
Xist X-inactive specific transcripts
Chapter One
Introduction
1.1 The Chicken
The domestic chicken (Gallus gallus domesticus) originated from
south-western Asia as a descendant of the Red Jungle Fowl and was first
introduced into China in about 1400 before the common era (BCE). Chickens
are also depicted in Babylonian carvings of about 600 BCE and are mentioned
by ancient Greek writers, particularly Aristophanes in 400 BCE (reviewed
by Stockbridge, 1995). Since that time, chickens were kept in small flocks
for home consumption until the 20th Century when poultry farming
became commercialised. A modern poultry farm may contain from several hundred
thousand to over a million chickens, either layers for egg production or
broilers for the meat industry. The chicken industry is now a multi-million
pound business with a gross output value of £1339 million in the
UK in 1991 (Law and Payne, 1996), which depends on high-production egg
layers, very rapidly growing broilers and disease-free stock. These three
factors have been the targets of poultry breeders who have tried to select
the best genetic traits in their stock, possibly eradicating some of the
main poultry pathogens.
However, disease is still a major cause of loss in production and still
proves of great economic concern to poultry farmers, and from the point
of view of the birds' welfare is unethical. Control of disease has been
improved by good husbandry and the use of vaccines, but, it would be far
more efficient if the birds could be bred to be disease resistant. Therefore
it would be useful to know the identity of the gene or genes affecting
the resistance traits for each disease, as with this detailed information
it will be possible to directly select for the required trait. To this
end linkage maps of the chicken have been developed, with the intention
of generating enough marker loci to identify linkage with the resistance
trait, and ultimately by positional cloning to isolate the gene concerned.
Once resistance genes have been isolated and characterised it will be possible
to use them to screen commercial breeding stock and selectively breed for
resistance. Knowledge of the gene may also suggest improved pharmaceutical
or immunological therapies.
Chickens are animals of agricultural importance as well as a valuable
model organism. Their use as a model organism is due to the many inbred
lines of chickens, which are well characterised for disease and production
traits. The parent stocks produce large numbers of progeny which eases
genetic experimentation. All commercially bred chickens have accurately
detailed pedigrees and are of low cost per animal. Because it is possible
to produce and assess large progeny families, the chicken is well suited
to genome analysis, particularly as the red blood cells are nucleated,
facilitating large scale good quality DNA extraction. Chickens have a small
genome, about half the size of mammals' but three times as big as that
of the pufferfish (Elgar
et al., 1996). The chicken genome is therefore
more easily accessible to cloning than those of mammalian species, and
yet retains sufficient sequence homology to mammals to provide comparisons
which could elucidate function. It is not yet clear whether this similarity
also applies to gene order between these species.
Chickens can harbour a number of different types of pathogen; viruses,
bacteria and parasites, all of which cause major diseases in poultry. A
wide variety of resistance mechanisms operate against these pathogens.
The identification and characterisation of these resistance genes in chickens
could help elucidate basic mechanisms for resistance in other organisms
as well as chickens.
1.2 Genomic structure and organisation of the chicken
The genome of the domestic chicken has a haploid number of 39 chromosomes;
the ten largest are referred to as macro-chromosomes, and the other 29
are termed micro-chromosomes (MICs) (Yamashina, 1944). In chickens chromosomes
have been numbered in size order, the biggest first. The large number of
MICs is typical of avian species (Abbott and Yee, 1975). In comparison
to man, the first six chromosomes are of similar size, the largest being
8 µm. However the MICs are much smaller (the smallest being about
7 Mb) than the smallest human chromosome, which contains about 50 Mb of
DNA (Bloom and Bacon, 1985). There is a size difference of 23 times between
the largest and the smallest chromosomes in the chicken. Chickens, like
other avian species, differ from mammals in that the female is the heterogametic
sex (ZW) and the male is the homogametic sex (ZZ), the Z and W chromosomes
displaying heteromorphism. The chicken chromosomes are mostly euchromatic
with the exceptions of a large terminal C-band (chromatin-band) on the
Z chromosome and an almost totally heterochromatic W chromosome, with small
C-bands on most of the MICs (reviewed by Fecheimer (1990)). The chicken
genome is relatively small, about 1.2 x 109 bases (Olofsson
and Bernardi, 1983), less than half that of the mouse and human genomes.
This makes the chicken's genomic structure and organisation particularly
interesting, as evolution appears to have pruned the genome to a minimal
size. Alternatively it is possible that mammalian genomes have expanded
in the 300 million years since splitting from the avian lineage. However
whatever the mechanism of change between the two lineages, the reason for
this difference is still unknown. One theory is that small genome size
was favoured by directional selection in birds in order to cope with the
metabolic demands of flight (Hughes and Hughes, 1995). However, one aspect
of the chicken's unique genome is the relative paucity of repetitive sequences.
1.2.1 Repetitive Sequences
The chicken genome like that of other animals contains repetitive sequences.
In the genomes of many animals, for example the amphibian Xenopus,
there is a short interspersion pattern of 0.3 kb repetitive sequences with
2 kb single copy sequences (Davidson et al., 1973). However, the
genomes of avian species show organisation similar to that of the long
period interspersion pattern of the Drosophila genome (Crain et
al., 1976). Estimates vary as to the degree of repetition in the chicken
genome. Eden and Hendrick (1978) concluded that 87% was single copy and
13% repetitive sequences, 42% of which contained ten to twenty copies and
58% contained about 1500 copies. Another study by Epplen et al.
(1978) using reassociation kinetics obtained similar results showing that
the chicken genome contained 15% highly repetitive sequences, 10% intermediate
repetitive sequences and 70% single copy sequences, of which 28% are 2.3
kb single copy DNA pieces interspersed with 1.5 kb long repetitive fragments.
Olofsson and Bernardi (1983) fractionated the genome using density gradient
centrifugation and similarly concluded that the genome was composed of
84% unique sequences and 13% repetitive sequences. Arthur and Straus (1983),
however, concluded that the genome contained 34% of unique sequences interspersed
with repeated sequences, 38% of long stretches of unique sequences and
19% of foldback elements. Sequences complementary to mRNA were found to
be randomly distributed with respect to the interspersion patterns, implying
that the distribution of the repetitive sequences does not parallel that
of the structural genes.
The only characterised chicken repeat sequences identified so far are
the chicken repeat 1 (CR1) elements, which are middle repetitive sequences
discovered by Stumph et al. (1981). CR1 elements are an ancient
group showing six sub-families in chickens, with four sharing a common
progenitor (Vandergon and Reitman, 1994). Over 95 CR1 elements have been
identified, of an average length of 300 bp, the largest being 2.3 kb (Burch
et
al., 1993). They have truncated 5' ends and a consensus 3' end containing
two or more repeats of an eight nucleotide sequence. CR1 elements are non-long
terminal repeat (LTR) retrotransposons (Burch et al., 1993), with
a
pol-like open reading frame encoding reverse transcriptase which
is responsible for dispersal of the element throughout the genome. There
are an estimated 100,000 copies of CR1 throughout the chicken genome (Vandergon
and Reitman, 1994). CR1 elements are also represented in other avian genomes.
Nine orders have so far been studied all of which contain CR1 elements
(Chen
et al., 1991). All CR1 elements have two regions of high homology
(Stumph et al., 1984), which extend to other avian species. These
regions appear to be conserved, the first coding for a silencer domain
and the second for a nuclear protein binding domain located at the 3' end
of the element, probably involved in transposition (Chen et al.,
1991).
Reptiles have also been shown to possess CR1 elements (Vandergon and
Reitman, 1994), showing that CR1 elements existed before the divergence
of birds and reptiles. CR1 elements are not randomly distributed but occur
mostly in G-C rich regions of the genome (Olofsson and Bernardi, 1983).
These regions are also the most gene-rich and at least 16% of the [beta]-globin
gene cluster contains CR1 elements (Reitman et al., 1993). CR1 elements
are associated with a number of genes, particularly near transition regions
of chromatin structure, for example the ovalbumin gene (Stumph et al.,
1983).
In mammals repeat elements are highly useful as, like the Alu
repeats of man, they can be used to characterise the genome by hybridisation
or PCR. PCR amplification of inter-Alu fragments (Alu-PCR)
has been used for high resolution physical mapping of radiation hybrids,
cosmids and yeast artificial chromosomes (Monaco et al., 1991; Aburatani
et
al., 1996). However, CR1 repeats in chickens are less common and also
much less conserved than the repeats in mammals and it seems unlikely that
they can be used for these purposes. Similarly although chickens possess
simple microsatellite repeats, there are far fewer of these than in humans
and mice. For example in chickens estimates of the numbers of (AC)n repeats
range as low as 7,000, almost 10 fold less than in mammals and they appear
not to be uniformly distributed throughout the genome (Toye, 1993).
1.3 Genomic Mapping
The first published genetic map was of the Drosophila X chromosome
(Sturtevant, 1913). Since then, genome mapping technology has greatly improved
and genetic maps for over one hundred species have been generated.
Two types of genomic maps have been established for many animals. Linkage
maps are determined from the frequency of meiotic exchange among linked
polymorphic markers in sexual crosses. Physical maps relate traits to actual
chromosomal location. Ultimately, the aim is to merge these two maps into
one giving both physical and linkage data.
1.3.1 Genetic Mapping
Genetic or linkage maps are necessary for the location of genes responsible
for monogenic and polygenic traits. Linkage analysis can be applied to
backcrosses of the same breed, or interspecies crosses can be used to generate
more polymorphisms. The murine genome has been analysed using interspecific
crosses (Avner et al., 1988). The advantages of interspecific crosses
are that they maximise the number of possible genetic polymorphisms, and
that pedigree analysis can be aligned with linkage analysis in order to
place loci on a particular chromosome (Copeland and Jenkins, 1991).
In chickens, genome mapping was first achieved by the analysis of phenotype
and limited maps established (Hutt, 1949). The first genetically studied
crosses were generated by Bateson and Saunders (1902) but it was not until
1936 that Hutt generated the first chicken linkage map. Chickens of differing
phenotypes were crossed and the linkage between different genes established.
For example, the eumelanin extension gene (Ml) lies between the loci of
the eumelanin restrictor (Db) and the pattern gene (Pg) (Carefoot, 1987).
This system is inherently limited as these markers require expression as
phenotypes and are only present in rare lines of chickens. These morphological
polymorphisms are also infrequently observed, as only one or two occur
in any one line. This makes segregation analysis difficult as no single
population is capable of giving scores for each polymorphism, thus leading
to ambiguity in the results.
Bumstead and Palyga (1992) published the first map of the chicken genome
derived from Restriction Fragment Length Polymorphism (RFLP) analysis.
68 White leghorn progeny were generated from a backcross between an
F1 (Line 15I x line N) female and her male line 15I parent.
An initial map of 100 markers was developed from segregation analyses,
covering a minimum of 585 cM of the chicken genome and identifying 18 linkage
groups.
More recently, Crittenden et al. (1993) crossed a Red Jungle
Fowl and a White Leghorn in order to maximise the chance of finding DNA
polymorphisms in the backcross. It was determined that RFLP were observed
between the parental lines with in the order of 91% of those enzymes used,
compared to 50% in the inbred White Leghorn lines used by Bumstead and
Palyga (1992). This gave a polymorphic population providing a good resource
for collaborative mapping. A variety of marker types were used, including
linkage mapping of feather colour and blood group loci, RFLP analysis and
RAPD (Random Amplified Polymorphic DNA) analysis. Further analysis of this
intercross has provided a linkage map (Levin et al., 1994b). Recent
comparisons of the Compton and East Lansing maps has shown that there is
in fact greater recombination in the Compton population, containing 2700
cM in comparison to 2100 cM in the East Lansing population (Mariani et
al., In Press). Within the Compton Linkage map there are over 100 genes
and 300 other loci, and now fewer than 2% of markers do not show linkage
(of LOD>3.0).
There are two main strategies for genetic mapping; saturation or random
mapping and targeted mapping. Saturation or random mapping entails the
generation of large numbers of markers in an attempt to cover the whole
genome densely, thus ensuring that any trait of interest must lie near
to at least one marker. Examples of techniques which achieve this are:
microsatellite analysis, RAPD STS, AFLP, RFLP and SSCP. Targeted mapping
is the isolation of markers in a specific region, or for a specific trait.
This is achieved by a variety of methodologies including EST, which can
be targeted by the isolation of cDNAs from specific tissues, as well as
more refined methods of subtractive hybridisation and representational
difference analysis.
Markers generated by these methods can be used for comparative mapping,
but are defined as two classes based on their polymorphic nature. Comparative
mapping is the positioning of identical markers on maps of different species
or strains. These markers are termed anchor or index loci and are mostly
used to define new linkage maps. Type I anchor loci are coding sequences
which are relatively conserved across different species. However, this
sometimes means that they contain no identifiable polymorphisms. So, although
they are particularly useful in interspecific mapping for the identification
of genes they cannot always be used. Type II anchor loci are defined as
high resolution polymorphisms. These are typified by microsatellite and
minisatellite polymorphisms which have traditionally not been used as loci
for interspecific mapping, due to their non-coding nature. However, O'Brien
and Graves (1991) have shown that these type II anchor loci can also play
an important role in the identification of syntenic regions between maps
of different species.
1.3.1.1 Restriction Fragment Length Polymorphism
RFLPs are generated by the digestion of DNA with restriction endonucleases.
RFLP was used by Donis-Keller et al. (1987) in the construction
of the first genetic linkage map of the human genome. RFLP are often visualised
by hybridisation of radioactive cDNA probes to the digested genomic DNA
fragments. RFLP analysis of the chicken B-F and B-L genes has been used
to confirm serological B-typing (Juul-Madsen et al., 1993).
There are disadvantages with RFLP analysis, as often a number of probe
and enzyme combinations have to be tested in order to generate significant
numbers of RFLP. This method generally uses radioactive probes, and requires
a large amount of target DNA. In fact, RFLPs detect about 1 in 10000 polymorphic
nucleotides in the human genome (Soller and Beckmann, 1986). The problem
with searching for direct associations between RFLP and traits of economic
value is the low likelihood of finding one; at best 1:200, but probably
1:20000 (Soller and Beckmann, 1986). RFLP markers are also difficult to
transfer between different linkage maps, as what is polymorphic in one
population may well not be in another. This, however, is true for any randomly
selected type of marker, but RFLPs do generate a reasonable number of markers
which can be easily placed onto a map. This technique has been extensively
employed by Bumstead and Palyga (1992) to place new markers on the chicken
linkage map.
1.3.1.2 Single-Stranded Conformational Polymorphism
SSCP involves the digestion of genomic DNA by restriction endonucleases,
denaturation with NaOH and EDTA and electrophoresis on a non-denaturing
polyacrylamide gel. Polymorphisms are detected as a mobility shift of the
single stranded DNA, due to conformational changes in alternative secondary
structures (Nakabayashi and Nishigaki, 1996). Even single-base substitutions
can be detected as shifts in electrophoretic mobility (Orita et al.,
1989a). This technique has been successfully used to detect three point
mutations in highly conserved residues found in mildly-affected cystic
fibrosis patients (Dean et al., 1990).
SSCP-PCR is a development of this technique, where the sequences to
be examined are amplified by PCR using labelled primers for the clones
of interest; these could include fluorescently-labelled primers. The samples
are then denatured and analysed by gel electrophoresis (Orita et al.,
1989b). This technique is usually used for genetic screening, as it requires
sequence data for the design of the primers. However, if RFLP analysis
shows no polymorphism with a particular clone, this method can be used
once the genomic sequence has been determined permitting design of primers
to the most variable region possible, usually introns or 3' UTR. But it
is costly in primer synthesis and often many primer pairs have to be tested
before a polymorphism is detected.
1.3.1.3 VNTR-Hypervariable Minisatellites
Minisatellites or VNTR (Variable number tandem repeats) are regions
of DNA containing simple tandem repeats of 10-15 bp segments. They are
detected by hybridisation of a probe to a core sequence within the repeat.
This generates a multi-band pattern or fingerprint which can vary between
individuals (Jeffreys et al., 1985). These band patterns are analysed
by radioactive labelling of the products and electrophoresis on a polyacrylamide
gel. This technique is mainly used in forensic and paternity investigations
but can also be used for mapping. Fingerprinting usually detects a large
number of loci from a single probing but these loci are mostly unlinked
and scattered throughout the chicken genome; however, hybridisation with
cloned single locus minisatellites shows that some could be clustered (Bruford
et
al., 1994). However, this necessitates knowledge of the core sequence
and the identification of polymorphisms between the parents of the cross
before it can be used for linkage mapping. VNTR analysis is also time consuming
and involves the use of large amounts of DNA and radioactive label.
1.3.1.4 Microsatellites
Microsatellite markers consist of di-, tri-, or tetra-nucleotides repeated
up to 40 times. Oligonucleotides are synthesised to sequences flanking
these repeats and can be used in PCR, to amplify loci from genomic DNA
(Holmes et al., 1993). Microsatellites are now being used as markers
for linkage analysis as they can be highly polymorphic, polyallelic, very
abundant and usually behave as single locus markers (Weber, 1990). Disadvantages
include the large amount of labour required and initial expense of the
oligonucleotide primers necessary to identify enough microsatellite markers
(Rafalski and Tingey, 1993). Microsatellites are useful type II anchor
loci as they are often polymorphic between more than one linkage cross.
However, within chickens less than 50% of microsatellites are polymorphic
between two inbred lines (Khatib et al., 1993). This is particularly
noticeable in the Compton mapping reference population where microsatellite
polymorphisms occur in less than 40% of primers (Bumstead pers. com.).
However, many microsatellites have been used in the chicken linkage maps
and recently a third map has been generated, based entirely on microsatellites
(Crooijmans et al., 1996). With increased use of fluorescence technology,
using the ABI Genescan system, many fluorescent primers can be used in
each PCR reaction and the products separated by electrophoresis. While
re-loading gels three or four times greatly increases the productivity,
cloning and screening the initial microsatellite is still a time-consuming
process.
1.3.1.5 Random Amplified Polymorphic DNA markers
RAPD markers can be generated using short arbitrary primers to amplify
genomic DNA, giving a genotype-specific pattern of bands. This is discussed
later in section 1.6.2.
1.3.1.6 Amplified Fragment Length Polymorphism
AFLP is based on selective amplification of digested genomic DNA by
a series of extended primers. The genomic DNA is digested with two enzymes,
a frequent and a rare cutter, e.g. MseI and EcoRI.
Adaptors, with a core sequence and the appropriate overhangs for the enzymes
used are ligated to the ends of the fragments. Pre-amplification of these
fragments by PCR is achieved using primers with a core sequence, an enzyme
specific sequence and a random-extension nucleotide. This generates a pool
of fragments which are re-amplified by two primers of core sequence, enzyme
specific sequence and this time further random-extension nucleotides. The
primer with the rare cutting enzyme sequence is end-labelled before the
reaction so that those fragments only will be detected by electrophoresis
and autoradiography (Vos et al., 1995). Fragments generated by MseI
digestion will be small enough to amplify by PCR, while those with one
MseI
and one EcoRI end will be much rarer but will also easily amplify.
During the second amplification, further selection of fragments occurs
using the extension nucleotides and finally only those fragments generated
by the
EcoRI-labelled primer will be resolved visually by autoradiography
following polyacrylamide gel electrophoresis. The use of two enzymes also
enables the AFLP reaction to be fine-tuned for the production of the maximum
number of bands. This technique was originally developed by plant geneticists
but can easily be transferred to the analysis of the chicken genome. The
development of many polymorphic bands for each reaction should enable large
numbers of marker loci to be mapped for a relatively small amount of work
and input DNA. However, all these markers will be anonymous, and to clone
one requires the extraction of DNA from a gel matrix. This method can be
visualised by silver staining rather than by radioactive labelling, but
the complexity would not then be further reduced, and might need a further
PCR step, which will ultimately be more expensive.
1.3.1.7 Targeted Mapping
Recently, a number of strategies have been directed towards the isolation
of candidate genes. In comparison with the saturation mapping approach,
these techniques would not necessarily place other informative loci on
the map, but would significantly reduce the amount of work needed to isolate
a candidate gene.
1.3.1.8 Expressed Sequence tags
EST are partial sequences of cDNA clones. These will be discussed later
in section 1.6.1.
1.3.1.9 Subtractive hybridisation
Subtractive hybridisation has been used to find candidate tumour suppressor
genes. This entails the positive selection of cDNA expressed only in non-tumour
cells of interest, when radioactively-labelled cDNA is hybridised with
tumour cell mRNA. Unhybridised labelled cDNA can be recovered and used
as a probe to screen a cDNA library constructed from non-tumour cells.
New suppressor genes were identified in this way (Lee et al., 1991).
This method also takes up considerably more time than that of, for example,
differential display reverse transcriptase (DD-RT) see below, although,
in comparison, it does have the advantage of being a positive selection
technique. However, it does rely on the selection of a small amount of
unique DNA from a large amount of initial DNA. For this technique to work
effectively for the isolation of candidate genes, the pattern of tissue
expression must be known for each gene. This is very difficult to achieve
with anything but tumours or highly expressed tissue specific genes.
1.3.1.10 RFLP subtraction
RFLP subtraction (Rosenberg et al., 1994) purifies small restriction
fragments from one genome (the tracer) from sequences which reside on fragments
in a related genome (the driver). The genomic DNA is digested with
HindIII,
and then the driver is ligated to biotinylated adaptors, whilst the tracer
is ligated to unlabelled adaptors. Both are PCR-amplified separately and
then mixed together, the driver in 100 times excess of the tracer. After
hybridisation, the biotinylated driver and its heteroduplexes are removed
by streptavidin-coated beads, leaving the tracer-specific DNA. This technique
can be used to generate DNA for library construction or to obtain markers
near to a gene of interest. However, only tiny amounts of DNA will be generated
and small under-represented sequences may well be lost as there is no enrichment.
Sequences of interest may also be lost in the removal of heteroduplexes
from the mixture as the tracer-specific DNA is not positively selected.
1.3.1.11 Differential Display-Reverse Transcription (DD-RT)
DD-RT is a method of analysing changes in gene expression in cells
at different stages of differentiation, thus isolating genes expressed
at a particular stage (Bauer et al., 1993). The mRNA is systematically
amplified and the 3' termini visualised by electrophoresis on a denaturing
polyacrylamide gel. An anchored oligo dT primer is used to anneal the mRNA
polyA tails, in conjunction with a decamer oligodeoxynucleotide in solution.
The decamer is arbitrary in sequence, so that it can anneal at different
positions in relation to the first primer for PCR amplification. Thus,
when the RNAs from two or more relevant cell types are compared, a small
fraction of differentially expressed mRNAs are revealed. Probes can be
made from the products, and the genes isolated by genomic library screening
(Liang et al., 1993).
DD-RT is useful as it displays subsets of mRNA as short cDNA bands,
enabling visualisation of cell mRNA composition. Alterations in gene expression
can be simply detected between parallel samples as variable band patterns.
The cDNA can easily be sequenced and compared to sequences in databases,
or used for probing (Liang and Pardee, 1992). This technique has the advantage
of being quick to apply. It is also non-selective, isolating all genes
present at any given stage. Again this technique relies on knowledge of
tissue expression patterns and times during which the gene of interest
could be isolated, which may not be known or fully understood. For the
isolation of genes associated with development during particular stages
this technique can be very useful, but as a general technique it is too
complex and technically demanding.
1.3.1.12 Genomic Mismatch Scanning
Genomic mismatch scanning (GMS) enables mapping in two related individuals.
The genomic DNA of one individual only is methylated at GATC sites, while
the other remains untreated. The two samples of DNA are then denatured,
mixed, and hybridised in solution. The samples are digested at fully methylated
and fully unmethylated sites leaving only whole heterohybrids. Base-mismatch
hybrids are nicked using a single mismatch repair system. The DNA is then
degraded by digestion, leaving mismatch-free heterohybrids. These hybrids
should be from regions of identity by descent. This DNA can be labelled
and used for probing DNA from the whole genome, yielding positive results
with identity by descent and negative results at sites of meiotic recombination
(Nelson et al., 1993). This methodology is very prone to failure
as there are many separate enzymatic steps, which must initially be optimised
in order to generate the complete methylation of one genome. GMS is performed
on the whole genome, without reducing its complexity, thus at hybridisation
not all homoduplexes will have time to reanneal. The under-represented
sequences which differ between the two genomes are also unlikely to be
detected as there is no enrichment for target sequences.
1.3.1.13 Representational Difference Analysis
RDA is a new technique developed by Lisitsyn et al. (1993) and
will be discussed later in section 1.6.3.
1.3.2 Physical Mapping
Physical mapping relates genes and markers to their actual location
on a specific chromosome. Physical mapping of chicken genes is difficult
as cytological discrimination between the 29 MICs is difficult on chromosome
spreads. Physical mapping in other species is now achieved using techniques
such as fluorescent in situ hybridisation (FISH) of single copy
molecular clones to metaphase chromosomes (Korenberg et al., 1992).
1.3.2.1 Pulse Field Gel Electrophoresis
PFGE is a technique for the separation of large segments of DNA, up
to 1 Mb, using alternating changes in the angle of an electric field to
separate the DNA on an agarose gel (Schwartz and Cantor, 1984). There are
now many variations of the basic concept, the most common being CHEF (Clamped
homogeneous field electrophoresis), in which the agarose gel is surrounded
with a series of electrodes (Lognonne, 1993). Modern systems can resolve
fragments from 10 kb to 10 Mb, helping to close the gap between the standard
cloning systems in plasmid, phage and cosmid vectors and genetic linkage
maps. However, PFGE still cannot resolve the chromosomes of larger genomes
like the chicken, as even the smallest MIC is over 7 Mb in size (Bloom
et
al., 1993). Since the development of yeast artificial chromosome clones,
however, PFGE has become essential to all mapping labs.
1.3.2.2 Radiation Hybrids
Initially hybrids were made by fusion of somatic cells with either
single or total alien chromosomes. However, a more useful method of irradiation
and fusion gene transfer (IFGT) was initiated by Goss and Harris (1975)
to produce a resource for mapping genes, and was further refined by Cox
et
al. (1990). Goss and Harris (1975) discovered that genes could be rescued
from cells killed by X-rays if the dose was increased so that double stranded
breaks (dsb) occurred along the chromosome. A dose of 1500 rad is adequate
to kill the cells whilst 4000 rad was sufficient to give a dsb between
two loci. Cox
et al. (1990) applied this technique to the
production of radiation hybrids (RH) from a Chinese hamster-human somatic
cell hybrid, containing a single copy of human chromosome 21. The somatic
cell hybrid was irradiated with 8000 rad to kill the cell and produce dsb
and then rescued by non-irradiated enzyme-deficient hamster cells. The
dsb recombine with the chromosomes in the healthy cell to give RH clones
containing fragments of human chromosome 21.
Walter et al. (1994) took this technique further to develop
whole genome radiation hybrids (WG-RH). Human fibroblasts were irradiated
with 3000 rad and rescued by thymidine kinase-deficient hamster cells.
These WG-RH were grown in HAT (hypoxanthine/aminopterin/thymidine) medium
to select for the human chromosomes and were initially analysed for the
presence of chromosome 14 markers by FISH. In the forty-four cell lines
produced, a marker retention of 20-50% was attained, so in theory a set
of about 100 WG-RH should provide good coverage of the whole genome. WG-RH
are a powerful resource, as they enable the mapping of YACs and ESTs which
are non-polymorphic within the linkage map. The amount of recombination
can be controlled by the radiation dosage (rad), as increased dose leads
to increased dsb and therefore increased recombination. However, good coverage
of the chicken genome in WG-RH might be difficult, again due to the large
numbers of MICs which need large doses of radiation to induce enough dsb
for significant recombination. It is possible that some of the MICs would
remain whole within the RH clones and would quickly disappear. There is
an added complication in that the RH clones will eject the alien chromosomes
fairly quickly, so once made aliquots of each clone should be stored frozen
while large amounts of DNA are prepared as a future resource.
1.3.2.3 Flow Karyotyping
Flow karyotyping is a technique used to separate individual chromosomes.
A large number of chromosomes are isolated as a suspension, and stained
with fluorescent dyes. The chromosomes are passed singly through a laser
beam and sorted by their fluorescence pattern. The fluorescence pattern
depends on DNA content, as the different dyes have differing affinities
for certain sequences. The dye Hoechst 33258 binds preferentially to A/T
sequences, whilst chromomycin A3 prefers G/C regions. This technique is
very sensitive and has to be carefully optimised, only using the most pure
chromosomes in order to prevent loss of resolution (Gray et al.,
1975). Once single chromosomes are obtained they can be used as a resource
for PCR amplification, cloning into single chromosome libraries and direct
selection of yeast artificial chromosome clones (YACs) into single chromosome
pools. However, although flow karyotyping has been used with great success
for man and plants, the large numbers of MICs in the chicken genome prevent
resolution beyond the first ten chromosomes. Flow cytometry can also be
highly variable and needs to be verified by conventional cytogenetic methods,
which can be problematical in the chicken.
1.3.2.4 Chromosome microdissection
Chromosome microdissection has been achieved with improved laser technology
(Greulich, 1992). Chromosomes are held in an optical trap, whilst they
are dissected by laser microbeam, and then removed by glass needle. This
has the advantage of removing a specific piece of a specific chromosome.
Microdissected fragments could be used in the construction of a contiguous
YAC library.
Unfortunately, there is little information to aid selection of the
correct chromosome in the chicken, as after the ten largest chromosomes
the MICs are cytologically indistinguishable. This technique yields miniscule
amounts of DNA (a few hundred femtograms to a picogram) which makes cloning
of the dissected fragments difficult. However, PCR amplification of the
fragments by DOP-PCR (Degenerate oligonucleotide primed-PCR) in a general
and non-specific manner, using 22-mer degenerate primers in a dual annealing
temperature protocol, can generate enough DNA for cloning the dissected
chromosome (Arumuganathan et al., 1994). The actual region of the
dissected fragment is still not that small, in the order of tens of megabases,
which still leaves a significant gap from the 700 kb sequences achievable
by conventional molecular techniques. This technique can generate enough
DNA to be used as a probe for FISH to specific chromosomes.
1.3.2.5 Yeast Artificial Chromosomes (YACs)
YACs are constructed by inserting yeast chromosomal fragments into
plasmid vectors. These yeast fragments include an autonomous replication
sequence (ARS), centromere and telomeres, as well as selectable markers.
These constructs are linearised and transformed into yeast cells where
they replicate as synthetic chromosomes (Burke et al., 1987). Large
fragments of exogenous DNA up to 2 Mb in length (Little, 1992) can be cloned
faithfully into these vectors, although on average in the available chicken
library they contain about 630 kb of inserted DNA (Toye et al.,
In Press).
YACs have been used in many cases to isolate genes, for example the
Duchenne muscular dystrophy gene (Kunkel et al., 1985), where the
gene is greater than 106 bp in length. YACs have also been used
for fine mapping around the murine Xist (X-inactive specific transcripts)
sequence, which could not be achieved by interspecies backcross mapping
(Heard et al., 1993). YAC clones for the Xic (X inactivation
centre) region were isolated from the total genomic DNA library by probing
with Xist. These isolates were then mapped by RFLP, giving localised
marker loci, spanning a 1 Mb region. Construction of a chicken YAC library
has been completed and this library is now being characterised in this
laboratory (Toye et al., In Press).
Characterisation of YAC clones initially involves their alignment into
a contig of a particular chromosomal region using hybridisation or PCR
amplification of markers known to be present in that region. After verification
of these results, end cloning of the YAC clones and their subsequent hybridisation
can be used to check the contig alignment. Once a panel of YAC clones have
been identified, these are subject to a number of techniques to refine
the search for the candidate gene. This is initially achieved by targeting
coding regions using CpG island mapping and exon trapping. CpG islands
are short unmethylated CpG-rich sequences often positioned at the 5' end
of vertebrate genes and can be identified by restriction mapping. They
have been shown to be present in 14 out of 20 chicken genes selected at
random from the EMBL DNA database (McQueen
et al., 1996). Exon trapping
utilises RNA splice sites to remove exons and insert them into vectors
containing an intron. These constructs are transfected into COS7 cells
and if the original splice sites can pair with those in the intron, mRNA
is produced. This is then PCR amplified and cloned to isolate the original
exon which can be investigated more fully (Buckler
et al., 1991).
YACs are very important as cloning vectors because they can also be
used for germ-line transmission of DNA into mouse embryonic stem (ES) cells.
YAC clones containing the human hypoxanthine phosphoribosyltransferase
(HPRT) gene (size 670 kb) were fused as yeast spheroplasts with an HPRT-deficient
ES cell line and selected in HAT medium for the expression of HPRT. These
transformed ES cells were then injected into mouse blastocysts and the
subsequent chimaeric mice were shown to express the human HPRT gene (Jakobovits
et
al., 1993). Copy number dependent and position independent expression
has also been shown by expression of the YAC derived tyrosinase gene in
transgenic mice (Schedl et al., 1993). This technique has been adapted
by Choi et al. (1993), by co-lipofection of the YAC clone into the
ES cells in order to reduce the amount of yeast chromosomal DNA being introduced.
Choi et al. (1993) also succeeded in creating chimaeric mice with
a human heavy chain immunoglobulin gene which should lead to mice producing
fully human monoclonal antibodies, a significant improvement on current
technologies.
The use of YAC clones to produce transgenic animals has a further advantage
as gene mutations can be engineered within the YAC by homologous recombination
to create deletion mutants or knockouts. Constructs are designed, containing
the mutant gene with an antibiotic resistance gene inserted in the middle,
edged with antibiotic susceptibility genes. Once introduced into the yeast
this construct will insert into the YAC. If homologous recombination has
taken place the mutated gene with the antibiotic resistance will have inserted
in the place of the original gene. In contrast, randomly inserted genes
would still carry the ends encoding antibiotic susceptibility.
1.3.2.6 Sequence Tagged Sites (STS)
STS were proposed Olson et al. (1989) as an ideal way of integrating
all markers into physical maps, by making sets of PCR-based markers. STS
are 200-500 bp tracts of unique single-copy sequence which can be amplified
by two 20-mer primers. Details of the primers, reaction conditions and
product are available to all researchers, so any DNA clone can be tested
for STS content. STS are particularly useful in the construction of YAC
contigs, although they cannot be relied upon in isolation (reviewed by
Ward and Davies, (1993)). However, Olson's original concept of an STS map,
with a coverage of one marker per 100 kb of the human genome by 1995, has
proved unrealistic. STS provide a relatively expensive procedure as they
rely on the production of primer pairs and the sequencing of large numbers
of clones. Although STS give some information about the genomic sequences
they are not useful as anchor loci, and are only really useful in the characterisation
of YAC or cosmid clones.
STS are currently being employed for sequence scanning in the pufferfish
(Fugu rubripes) genome. This entails random sequencing of the well
characterised cosmid library to give at least 50 STS per cosmid. The small
size of this genome, 400-500 Mb means that there are a maximum of 8 genes
per cosmid. This approach should lead to STS which are within the genes,
and when compared to other genome databases these genes will be identified
by homology. Once genes are identified, conservation of synteny can be
examined (Elgar et al., 1996). Although the Fugu genome is
considerably smaller than those of other vertebrates, including the chicken,
it should still contain the same number of genes, estimated at 100,000.
Thus randomly sequencing this genome should lead to the isolation of many
genes, and comparative mapping will enable these to be mapped in other
organisms.
1.3.2.7 Fluorescent In-Situ Hybridisation (FISH)
FISH enables the localisation of regions of DNA to a particular chromosome
or chromosomes in metaphase or interphase cells following their detection
by fluorescent label. Probes of any size can be used, from cDNAs or cosmids
to YACs, but the bigger the clone the more fluorescence can be detected.
The probe is usually labelled with biotin-16-dUTP and hybridised overnight
to pre-treated slides of chromosome spreads. After washing at the required
stringency, fluorescein-avidin is added and after a further washing biotinylated
anti-avidin is added. This acts as a FISH "sandwich" to increase the number
of binding sites for fluorescein-avidin, which is again added before the
slides are sealed. The slides are viewed by fluorescence microscopy and
the results photographed (Korenberg et al., 1992). A single copy
gene should hybridise to both chromatids of an homologous pair of chromosomes,
usually visualised as two fluorescent dots. However it is difficult to
obtain good data from this technique in birds, due to the large number
of chromosomes and the small amount of spread achieved with the metaphase
chromosomes. It is also impossible to cytologically distinguish between
the 29 pairs of MICs, so if a FISH probe hybridises to MIC, there is no
way of deciding which chromosome it is. However, this technique is very
useful for determining the chimaerism of the YAC libraries, and is being
employed in our laboratory for this purpose. FISH can also show to which
size of chromosome the probe hybridises.
1.4 Traits of interest in the chicken
Chickens exhibit a number of traits differing between the commercial
lines available. These include associations of particular lines with increased
productivity and disease resistance. This gives an initial starting point
for the selection of birds in which a trait of interest is expressed.
1.4.1 Production Traits
The main production traits of interest in chickens are rapid weight
gain and increased egg-lay. To this end, two types of commercial chickens
are produced, broilers and egg-layers. These traits are often polygenic
but may also be affected by other non-hereditary factors, which makes the
identification of genes difficult. In order to map the genes controlling
production traits, markers which are linked to the quantitative trait,
termed Quantitative Trait Loci (QTL), are identified within a large population
(about 1000 per sire). Quantitative traits are measurements of a phenotypic
characteristic such as body weight, shank length (Dunnington et al.,
1993) or abdominal fat deposition (Plotsky et al., 1993). The marker
loci used are often derived from DNA fingerprints which are randomly distributed
throughout the genome. QTLs can subsequently be used directly in the screening
of commercial flocks for marker assisted selection (MAS), as they segregate
with the trait of interest. The identification of genes controlling these
complex production traits requires very large numbers of progeny and the
QTL linkage is only as good as the number of markers in those regions.
The final outcome of QTL mapping is usually identification of a number
of highly significant areas of chromosomes which then need to be studied
in greater detail using YAC contigs to identify putative candidate genes,
for the trait under investigation.
1.4.2 Disease Traits
Chickens suffer from a number of diseases, some of which are fatal,
all of which affect the growth, production and welfare of the birds. Thus
disease resistance is an important economical consideration to both the
poultry farmer and breeder. The most important diseases of chickens are
those for which vaccination is now proving useless against very virulent
strains and those which could affect man. Resistance to disease is fundamental
to survival for all organisms. However, two species will not necessarily
share the same mechanism of resistance to the same disease. Disease resistance
will vary across a population, and is particularly clear in comparisons
between the inbred chicken lines at the IAH, which have been studied by
Bumstead et al. (1991). There are three basic mechanisms of genetic
resistance: recognition of the pathogen by the immune system, under the
control of the MHC complex; metabolic variants which affect the penetration
or reproduction of the pathogen; and incorporation of pathogen DNA into
the host genome which prevents infection.
1.4.2.1 The major histocompatibility complex (MHC)
A major element of disease resistance is the major histocompatibility
complex (MHC) which in chickens lies on chromosome 16. Chromosome 16 is
a MIC which has been shown by silver-staining to contain the nucleolar
organiser region (NOR) containing the 5.8S, 18S and 28S ribosomal RNA genes
(rDNA) for protein synthesis. Chickens only have one copy of rDNA consisting
of a cluster of 145 40 kb repeats, which represents about 0.5% of the chicken
genome (Bloom et al., 1993). The NOR was shown to exist on the same
chromosome as the chicken MHC by trisomy mapping of a bird expressing B6,
B13
and B15 antigens (Bloom and Bacon, 1985). Estimates of
the size of the microchromosome range from 8 Mb upwards; however, the NOR
occupies a large portion of the chromosome, perhaps as much as 6 Mb (Bloom
and Bacon, 1985). The MHC region is therefore perhaps 2 Mb or 0.17% of
the genome.
The MHC B complex in chickens was first identified by the ability
of leukocytes to give strong graft rejection (Schierman and Nordskog, 1961).
It was subsequently discovered to contain three loci, B-F, B-L
and
B-G (Pink et al., 1977), producing class I, II and IV
antigens respectively. It has been shown that frequent recombination occurs
between
B-F and B-G regions, although none has been detected
between
B-F and B-L (Hála et al., 1976). Class
I antigens are expressed on the surface of virtually all cells including
leukocytes and erythrocytes, as a single transmembrane polypeptide chain
folded into three extracellular domains associated non-covalently with
[beta]2
-microglobulin. Class I presents antigens for recognition
by cytotoxic T cells. This region is homologous to HLA-A, B, and C genes
in man. Class II antigens are expressed on antigen presenting cells (APC),
B-cells and macrophages, as heterodimers with two non-variable immunoglobulin-like
domains near the membrane and two variable domains, furthest from the cell.
The class II antigen is recognised by T helper cells, which become activated
to stimulate T cell proliferation. This region is homologous to the HLA-D
genes in man. Class IV antigens are expressed on the cell surface of erythrocytes
and are unique to avian species. They consist of two glycoproteins which
are highly polymorphic, but of unknown function. However, they are distantly
related to butyrophilin and myelin-oligodendrocyte glycoprotein (Kaufman
et
al., 1995). At present, no regions have been identified for class V
loci (Sander, 1993). MHC haplotypes have traditionally been determined
by serotyping each bird by haemagglutination of the B-G antigens
(and in some cases
B-F antigens), but give no information for the
other MHC loci.
The chicken MHC region has now been more extensively explored and there
are chromosome 16 linkage maps for both the Compton mapping reference population
(Bumstead and Palyga, 1992) and the East Lansing mapping reference population
(Levin et al., 1994b), as well as a resource of cosmid clusters
(Guillemot et al., 1988). The class II genes are interspersed with
the class I genes along a relatively small region of chromosome 16, the
B-F/B-L
region, represented in a cosmid cluster of 130 kb. There are at least 5
class II [beta] genes in total, two of which are known to be expressed
and lie within the B-F/B-L region about 8 kb apart. The intron/exon
structure for class II is similar to that in mammals, but with much smaller
introns. The total gene sizes are less than 2 kb in comparison to 8-20
kb in man (Guillemot and Auffray, 1989). At present, only one class II
[alpha] gene has been found (Kaufman et al., 1993), about 5 cM away
from the B-F/BL region. There are four class I genes, two
within the
B-F/B-L region (Kaufman et al., 1993) which
lie in a 20 kb segment of the chromosome. Between these two genes, in the
B-F/B-L
region, lies the TAP2 (Transporter associated with antigen processing-2)
gene (Bumstead et al., 1994b). In the same
B-F/B-L
region are also located one of many B-G genes and the gene encoding
the [beta] subunit of a GTP binding protein (Guillemot and Auffray, 1989).
There are many B-G genes, a number of which are transcribed. However,
their positions on the chromosome are not as yet defined, although one
clone maps to the same linkage position as TAP2 (Mariani
et al.,
In Press). Another gene mapping to this linkage position is the chicken
homologue of G9a (BAT8), which is linked to the class III genes
in man (Spike and Lamont, 1995). One of the few identified class III genes,
C4, also lies within this region (Kaufman pers. com.).
There is a second MHC region, termed the Rfp-Y system (Briles
et
al., 1993), which was originally unlinked to the B-complex.
This region contains two closely-linked class I and two class II genes
(Miller et al., 1994a). Recent work (Miller et al., 1996)
using trisomy mapping has shown that the Rfp-Y system contains two
class I and three class II genes and lies on chromosome 16, in the same
cosmid cluster as a class II MHC gene and the rDNA genes (NOR). Between
the two class I [alpha] genes lies a lectin, type-C gene, Lec (Bernot
et
al., 1994).
Thus the MHC region in chickens is significantly smaller than that
in mammals, although it appears to contain two regions of class I and class
II genes, these are interspersed while the mammalian regions are separate.
The association of the MHC region, or genes associated with it, with
disease resistance is shown in a number of diseases some of which are discussed
below.
1.4.2.2 Salmonellosis
Salmonellosis is caused by the colonisation of the gut following ingestion
of the Salmonella bacterium. This can lead to a systemic infection
of the chicken with high morbidity and mortality by species such as S.
typhimurium, S. gallinarum and S. pullorum. Alternatively
chickens can be infected by S. enteriditis which causes little clinical
infection in the birds, but is excreted into the eggs which when eaten
uncooked by man can result in infection. At present control is by oral
administration of antibiotics. Salmonellosis resistance in mice has been
shown to be provided by the presence of a glycine at amino acid position
105 in a variant of the Nramp1 protein, formerly known as Ity (Vidal
et
al., 1993). The Nramp protein is expressed in macrophages as a putative
transmembrane protein. However, although expressed in chickens, Nramp1
does not control resistance to Salmonellosis in the cross studied (Hu et
al., 1996). Experiments have shown that chicken lines 61
,WL and N were resistant to all serotypes tested, whereas lines 15I, 72
and C were susceptible (reviewed by Bumstead et al. (1991)). Crosses
and backcrosses between resistant and susceptible birds showed that resistance
is dominant and not linked to the sex or the MHC chromosomes, and is probably
controlled by a single gene.
1.4.2.3 Infectious bursal disease virus
IBDV (Infectious bursal disease virus) is a birnavirus which mostly
affects chicks 3-6 weeks old, causing lesions within the bursa, and subsequent
immunosuppression. This usually leads to secondary infection by opportunistic
pathogens, which causes increased suffering for the birds and great expense
for the farmers. There are a number of different strains, some of which
cause high mortality and are termed very virulent. Bacon (1987) found some
resistance associated with the MHC; as haplotypes B2 or
B21
were
more resistant than haplotypes B5,B15
or B12. However, Bumstead et al. (1993) examined
11 inbred and partially inbred lines of chickens and determined by F1
and
F2 crosses that resistance was partially dominant and did not
involve the MHC. Although there is a vaccine against this virus, in 1989
a very virulent strain of the virus reached Britain and, despite vaccination
with "hot" strains, has increased mortality by 1.5-2%. In 1994 the National
Farmers' Union estimated that IBDV had cost the industry £15 million
(Law and Payne, 1996). It would be very useful to identify genes for resistance
to this disease as they could be used directly to screen birds for resistance
or used to initiate a more informed approach to vaccine development.
1.4.2.4 Marek's disease virus
MDV (Marek's disease virus) is a herpes virus, causing a lymphoproliferative
disease in chickens leading to paralysis and tumours. The initial disease
phase is acute and occurs about 3 days PI involving virus replication in
B lymphocytes, and as a consequence of an immune reaction to antigen particles
the T lymphocytes become activated. These T lymphocytes disperse around
the body giving a persistent viraemia. Some lymphocytes are transformed
and proliferate as lymphomas in visceral organs. After two weeks a second
cytolytic infection occurs in the feather follicle epithelia resulting
in the shedding of cell-free virus into the environment (Payne, 1996).
MDV can be controlled by administration of the Rispens vaccine or the HVT
vaccine, which is a live, attenuated herpes-virus of turkeys. However,
there are now a number of very virulent strains of MDV which are unaffected
by vaccination and are increasingly causing high mortality, up to 80% in
comparison to 10% associated with the classical from of MDV (Payne, 1996).
In 1994 the National Farmers' Union estimated that MDV had cost the industry
£3 million (Law and Payne, 1996), however this is a gross under-estimate.
The genetic differences within the B-complex have been associated
with resistance to a number of diseases; the initial association of the
MHC was with MDV resistance. The B21 haplotype has been
shown by a number of experimenters to confer resistance to MDV (Hansen
et
al., 1967; Longnecker et al., 1976; Briles et al., 1977;
Powell, 1984; Hedemand et al., 1993). However, the B19
and
B2 haplotypes tend to confer susceptibility. The B21
haplotype
resistance is not associated with the bursa, but these birds have a higher
virus-neutralising-antibody titre than susceptible birds which are immunosuppressed.
At 7 days post-infection the virus titre falls in the blood and spleen
whilst there is development of humoural antibody and cell-mediated immunity.
Non-MHC genes also have an effect on the bird's resistance to disease.
This area has again been most intensively studied for MDV, where two lines
of birds, from the Regional Poultry Research Laboratory (RPRL) line 6 and
7, have identical MHC haplotypes of B2 , yet show very
different resistances to MDV. Line 6 was shown by Stone (1969) to suffer
from only 6% mortality in comparison to 100% of line 7 birds with MDV.
It has been shown by thymic transplant from line 7 to thymectomised line
6 birds that lymphomas will occur in cells derived from line 7. However,
no resistance is shown if the transplant is from line 6 to line 7. Transplants
of bursa or spleen show no alteration in resistance, indicating that line
6 resistance is associated with its T cells (Powell et al., 1982).
Line 6 birds show a high level of immune response which correlates with
viral disappearance not seen in line 7 birds. It has also been shown that
macrophages in uninfected line 6 birds have a higher phagocytic index than
those of line 7 birds (Powell et al., 1983). The genes which control
these differences between lines 6 and 7 are not fully understood, but a
number of candidate genes have been suggested. These include Ly-4
(Payne, 1991), Bu-1, and Th-1 (Gilmour et al., 1976),
alkaline phosphatase (Yotova et al., 1990), polyesterase (Bachev
and Lalev, 1990), thermal regulatory genes (Yotova, 1988) and immune response
and competence genes (Okado and Yamamoto, 1987). If the genes controlling
resistance can be identified, breeders will be able to select genetically
resistant stock and it should be possible to target vaccines or chemical
prophylaxes to both the very virulent and normal forms of the virus. At
present these candidate genes appear to have some effect on resistance
but do not appear to be the major genes involved.
1.4.2.5 Newcastle disease
Newcastle disease, originally imported from the far east, is caused
by a myxovirus which can infect chickens as well as turkeys and occasionally
man. It has a 2-7 day incubation period culminating in lassitude of the
birds, respiratory distress, diarrhoea and drooping wings. Egg production
is greatly affected and there is high mortality in young chicks. Oral administration
of live vaccine has significantly reduced the incidence of this disease.
1.4.2.6 Infectious bronchitis virus
IBV (Infectious bronchitis virus) is a highly contagious respiratory
disease of chickens caused by a coronavirus. The respiratory infection
reduces egg production and quality and can lead to secondary infections
with other micro-organisms and thus death. This is controlled to some degree
by vaccination. Resistance to IBV is also non-MHC derived. However, it
does not appear to be controlled by a single gene as the levels of infection
and organs affected are considerably different between the inbred lines
of birds (Bumstead
et al., 1991).
1.4.2.7 Avian leukosis and sarcoma virus
ALSV (Avian leukosis and sarcoma virus) are a group of retroviruses
which are characterised by autonomous proliferation of blood cell precursors.
They are acquired by infection with exogenous virus or vertical transmission
from the hen. The infection leads to erythroblastosis at a few months of
age or to lymphoid leukosis in the more mature birds. This can also result
in tumours, as in Rous sarcoma virus infection. Resistance to infection
induced by the MHC has also been observed for the ALVS group (reviewed
by Schierman and Collins (1987)). Rous sarcoma virus-induced tumours have
regressed dramatically in birds of haplotype B6 or B2;however,
haplotypes B5 or B13 are susceptible.
It has been determined that the genes influencing regression lie within
the B-F region of the MHC and are also associated with inhibition
of the development of metatastic tumours. There is a small amount of resistance
to ALV shown in birds with B2 haplotype whereas those
with B5 haplotype have increased erythroblastosis and
a higher incidence of haemangiocarcinomas. ALSV resistance is due to a
lack of a cell surface receptor for the specific sub-group of virus reviewed
by Bumstead
et al. (1991). Recently the chicken gene tv-a, which
confers susceptibility to infection by ALSV subgroup A, was isolated by
Young et al. (1993).
1.4.2.8 Coccidiosis
Coccidiosis is caused by an apicomplexan protozoan, Eimeria,
which inserts itself intra-cellularly into the gut wall, where it reproduces.
It is spread by ingestion of contaminated faeces, and so is prevalent in
litter-kept chickens. Infected chickens show a reduction in growth rate
and feeding efficiency, which can lead to increased fatality. There has
been some control of infection using attenuated vaccines such as Paracox,
a vaccine based on an initial infection with small numbers of parasites
which produces a strong protective immune response. MHC haplotype affects
the resistance of birds to coccidiosis, where those with B21
haplotype
are more resistant to Eimeria maxima than those with B15
haplotype.
However this differs for the various species of Eimeria
(Bumstead
et
al., 1991). Resistance to coccidiosis is also determined by non-MHC
as well as MHC genes. Although these have not yet been defined, it has
been shown that different lines of birds show differing resistance to coccidiosis.
However, it again seems that the genes involved act differently for each
parasite as, for example, those birds with resistance to
E. maxima
show susceptibility to
E. tenella, (reviewed by Bumstead et al.
(1991).
1.4.2.9 Fowl cholera
Fowl cholera is caused by the gram-negative, facultative aerobe Pasturella
multocida. This disease has a sudden onset with fever leading to haemorrhagic
septicaemia. Oral antibiotics can be administered, but this has little
effect on those birds already showing the disease. Lamont et al.
(1987) have also shown that MHC haplotype affects the resistance of birds
to fowl cholera. Lines of birds homozygous for MHC haplotype and the F1
and F2 progeny were tested by intra-nasal administration of
P.
multocida. Homozygote B1 haplotype birds demonstrated
significantly higher resistance to the bacteria than B19homozygote
birds.
It should be noted that resistance to different diseases associated
with the MHC, is not generated by a single haplotype, thus suggesting an
important role for non-MHC genes in the determination of resistance (Lamont,
1993).
1.5 Objectives
The genetic map of the chicken is now essentially complete, containing
more than 800 loci at an average spacing of around 3 cM (Bumstead and Palyga,
1992; Levin et al., 1994b). A YAC library has now been constructed
which contains 16,000 clones of an average insert size of 0.58 Mb giving
a coverage equivalent to 10 copies of the genome (Toye et al., In
Press). However, although it is now possible in principal to move from
mapping information to the equivalent YAC clones, to do this effectively
requires a means of efficiently increasing the density of markers in relevant
regions of the map. Once this is achieved, it will be possible to isolate
specific genes using the framework of the markers. In conjunction with
the generation of markers, it is now possible to directly select for candidate
genes, by positional cloning, which could code for traits of interest.
Positional cloning is now used as the established approach to determine
disease genes. It proceeds from an existing disease to identifying the
responsible gene's chromosomal locus, then to recovering the gene, and
finally to discovering the mutational alteration that accounts for the
disease (Collins, 1992). Variants at a locus defined by a candidate gene
could help to determine their effects on a trait of interest.
The aim of this project was to develop techniques for generating this
high density of marker loci. Possible ways to approach this include adding
known genes to the map whose position or possible function can be compared
with that of other organisms; the addition of large numbers of markers
to the map, by saturation or random mapping leading to enough markers such
that there is one every centi-Morgan; and targeting markers to regions
of the genome which contain genes of interest, in order to increase the
marker density just within that region.
1.6 Approaches
In order to isolate the genes responsible for inherited traits it will
be necessary to clone the chromosomal region containing the trait locus.
In practice, this means isolating YAC clones containing the marker loci
used to map the trait gene. In order to do this, refined mapping of the
chicken genome using a large number of markers is needed. Three major areas
of investigation have been undertaken:
i. Isolation of cDNAs which might code for candidate traits by ESTs.
ii. Total saturation of the genome with large numbers of anonymous
markers by RAPD.
iii. Targeted isolation of markers, which could include candidate genes,
in specific regions by RDA.
1.6.1 Expressed Sequence Tags (EST)
ESTs of genes have been used as an effective way of generating further
loci on genomic maps which provide useful genetic information rather than
just being anonymous markers. ESTs are the sequences generated from cDNA
clones and act as type I anchor loci within the linkage map (O'Brien and
Graves, 1991). The chance of cloning a candidate gene can to some extent
be controlled by the choice of tissue for mRNA extraction which will generate
the ESTs. Candidate genes can be isolated as ESTs and homology with known
genes in other species identified and the inference of possible association
to traits pursued. Comparative mapping of ESTs can determine possible regions
in which a gene of interest may lie, as it is already mapped in that area
in a different species (Adams et al., 1991). However, beyond using
a specific tissue for mRNA extraction, there is no other method for targeting
the required candidate gene.
In order to genetically map ESTs in chickens it was necessary to find
polymorphisms between the parents of the Compton reference mapping population,
this is usually achieved by RFLP analysis. However, this is time consuming
and not always productive. Ultimately with the construction of a WG-RH
library all chicken ESTs will be mapped, as there is no need for polymorphisms
between the two lines. Using ESTs derived either from tissue known to express
the gene product of interest, or from comparative mapping synteny, it is
unlikely that the candidate gene will be isolated, but this will at least
lead to further placement of informative loci on the chicken linkage map.
1.6.2 Random Amplified Polymorphic DNA markers (RAPD)
RAPD analysis should lead to the saturation of the genome with markers
to a level of at least one marker every 0.5 cM, without the requirement
of previous genetic information and using few expensive oligonucleotides.
RAPD markers are usually generated by the amplification of random DNA segments
with single short arbitrary primers. An oligo is used as both forward and
reverse primer in PCR amplification, at a low annealing temperature (in
the region of 36°C) (Williams et al., 1990). This procedure
generates large numbers of bands for each individual's genomic DNA (resolved
by gel electrophoresis) and may identify multiple polymorphisms between
individuals. Polymorphisms detected by RAPD are due to point mutations,
which affect the binding of the primer to the DNA and also to insertions
and deletions between primer binding sites. Although these markers are
anonymous, once a large number have been generated, high density genetic
maps can be constructed. This system should rapidly generate large numbers
of closely spaced markers. Each primer should be capable of detecting a
number of polymorphisms leading to the identification of multiple informative
loci for a small amount of initial resource. The benefits of RAPD technology
include its simplicity and ease of use in the laboratory. It does not require
any previous sequence data, but will give easily interpreted results. It
is also less costly in time than other methods, for example RFLP. However,
changes in enzyme manufacturer, primer or enzyme concentration, or thermal
cycling equipment, can give inconsistent results (MacPherson et al.,
1993; Meunier and Grimont, 1993). RAPD markers have been used for gene
cloning, medical diagnostics and trait introgression in breeding programs
(Williams et al., 1990). Levin et al. (1993) used RAPD markers
to generate new markers on the Z chromosome of the chicken in order to
identify sex-linked traits. RAPD markers have also been useful in determining
phylogenetic relationships between species as demonstrated by Barral et
al. (1993) with the Shistosoma genome.
1.6.3 Representational Difference Analysis
Representational difference analysis (RDA) uses subtractive and kinetic
enrichments to purify restriction endonuclease fragments present in one
population of DNA fragments but not in another (Lisitsyn et al.,
1993). It does not rely on knowledge of the sequences involved but can
be used to compare complex DNAs elucidating the sequence differences between
them. In comparison to subtractive hybridisation, this method is far quicker
and more successful, as the complexity of DNA is first reduced by the representation
step, and the regions of interest are enriched. A representation of the
genome is made by digestion with a specific endonuclease, ligation of enzyme-specific
initial adaptors (termed R) and amplification of fragments by PCR. This
generates a pool of fragments referred to as amplicons. Two species of
DNA are used; DNA containing the target sequence, the "tester" and the
non-target-containing DNA, the "driver". The adaptors are subsequently
removed from the amplicons, and new adaptors (termed J) ligated only to
the tester amplicons which are then mixed, melted and reannealed with excess
non-target DNA (driver) amplicons. During treatment with Taq DNA
polymerase, only those amplicons with adaptors can be filled in at the
3' ends. When the samples are amplified by PCR, exponential amplification
only occurs with the tester homoduplex DNA. Single-stranded DNA is not
amplified, and any heteroduplexes are only amplified linearly. This hybridisation
and amplification step can be repeated many times alternating between two
sets of adaptors (termed N and J) to greatly increase the enrichment of
the target DNA. Disadvantages of this system mainly concern the resource
of target and non-target DNA, which particularly in some experiments, should
come from the same individual, or at least a related individual.
More recently, Lisitsyn et al. (1994) have refined this technique
by using either congenic strains or two-generation crosses to generate
genetic markers linked to a trait of interest. They used Genetically Directed
Representational Difference Analysis (GDRDA) to target three polymorphisms
to within less than 1 cM of the mouse nude locus on chromosome 1.
Thus, RDA can be used to isolate candidate genes directly or to target
specific regions of genomic maps, in order saturate these regions with
markers for ease of YAC contig alignment.
RDA has been used to isolate a number of candidate genes from a variety
of diseases which may be caused by viruses, including the identification
of human herpes virus 6 from patients with multiple sclerosis (Challoner
et
al., 1995), a flavivirus from patients with hepatitis strain GB (Simons
et
al., 1995) and a herpes-like virus from AIDS patients with Kaposi's
sarcoma (Chang et al., 1994). RDA has also been applied to the isolation
of candidate genes for different cancers; Schutte et al. (1995)
identified a tumour suppressor gene deleted in pancreatic carcinoma, while
Hino et al. (1995) found 4 candidate genes for renal carcinomas
in the Eker rat. However when Tsuchiya et al. (1994) tried to use
RDA to isolate genes specific to the Yoshida sarcoma (YS), not expressed
in the long term survival Yoshida variant, they discovered that the two
were not as closely related as had been believed and generated nine genes
specific to YS.
RDA has also been adapted to the isolation of candidate differentiation
sequences from tissue using cDNA, initially by finding the two genes RAG-1
and RAG-2 which are known to be responsible for activating V(D)J recombination
(Hubank and Schatz, 1994). More recently the technique has been applied
to the isolation of sequences present on a single chromosome. This has
been successful in isolating sequences specific to the bovine-Y chromosome
(Wigger
et al., 1996), mouse chromosome 10 (Baldocchi et al.,
1996) and wheat chromosome 6 (Delaney et al., 1995).
RDA is a powerful tool which can be used to isolate sequences on specific
chromosomes, candidate genes and previously undetected viruses. This technique
depends on a number of enzymatic steps, which could lead to failure. However,
with the decrease in DNA complexity and enrichment of target sequences,
RDA could also prove highly successful for chickens as well as mammals
and plants.
Chapter Two
Materials and Methods
2.1 Materials
Chemicals, stock solutions, media, oligonucleotides and the thermal
profiles used for PCR reactions are detailed in Appendix A.
2.2 Bacterial strains and plasmids
Details of bacteria and plasmids used are shown in Table 2.1.
Table 2.1 Descriptions of bacterial strains and plasmid.
| Strain |
Description |
Source |
| MC1061/P3 |
araD139 [Delta](araABC-leu)7679 galU
galK
[Delta]lacX74 (r-K,m+K)
rpsL
thi
mcrB {P3: amber ampr, amber tetr, Kmr} |
Tregaskes and Young (1997) |
| JM109 |
recA1,
endA1, gyrA96, thi, hsdR17
(rK-,mK+), relA1, supE44, [Delta](lac-proAB),
[F', traD36, proAB,
lacIqZ[Delta]M15] |
Promega |
| pCDM8 |
tetr, ampr |
Tregaskes and Young (1997) |
| pGEM-T |
pGEM-5Zf(+) digested with EcoRV and 3' terminal thymidines
added. ampr |
Promega |
2.2.1 cDNA Library IAHchB1
The cDNA library, IAHchB1 in pCDM8, was used as a source of clones
for sequencing and mapping. The library was constructed as detailed in
Tregaskes and Young (1997), from RNA extracted from a 17-day embryonic
whole bursa and has insert sizes of 0.5 to 3.5 kb. Thus any random clone
could contain part or all of any gene expressed in the bursa. This library
would therefore include ubiquitously expressed genes, as well as those
specific to the tissue. The Bursa of Fabricius is a major organ in the
immunological system of the bird mainly concerned with the multiplication
and development of B lymphocytes, which are involved in the control of
disease through antibody production. Thus expressed genes specific to this
tissue could be of great interest as they could code for proteins associated
with the immune system or the development of B lymphocytes. Any of these
genes could code for proteins involved in host resistance, and so should
be fully characterised.
2.2.2 Transformation with IAHchB1 by electroporation of Escherichia
coli MC1061/P3
A single colony of Escherichia coli MC1061/P3 was picked
and grown up overnight in 5 ml L-broth (160 rpm, 37°C). Two ml of this
overnight culture were used to seed 200 ml of L-broth which was then incubated
(160 rpm, 37°C) until the OD600 measured 0.5. The culture
was placed on ice for 30 min, with occasional swirling, and then transferred
to a pre-cooled sterile 250 ml centrifuge bottle. The cells were pelleted
by centrifugation at 2500 g for 15 min. The supernatant was discarded,
and the pellet resuspended in 200 ml pre-cooled (4°C) sterile distilled
water. The cells were replaced on ice for 30 min, and then centrifuged
as before. The cell pellet was resuspended in 100 ml cold sterile distilled
water, and returned to the ice for another 30 min. The cells were centrifuged
as before, the supernatant removed and the pellet resuspended in 5 ml of
pre-cooled sterile filtered (using a 0.2 µm pore filter) 10% glycerol.
This cell suspension was transferred to a 50 ml Falcon tube and left on
ice for 30 min. The cells were then centrifuged at 700 g, 15 min. The supernatant
was removed and the pellet resuspended in 500 µl of 10% glycerol
and stored on ice until used.
Aliquots of 100 µl of competent cells were added to varying amounts
(10 ng, 50 ng and 100 ng) of the IAHchB1 library plasmid DNA ( initial
concentration 500 µg/ml) in pre-cooled tubes and electroporated,
using a Biorad Gene Pulser apparatus with pulse controller at 2.5 kV, 200
[Omega] and 25 mF. One ml of SOC medium was immediately added to the cuvette,
and the entire contents were transferred to a 5 ml Sterilin tube containing
1.5 ml SOC. The tubes were incubated (160 rpm, 37°C) for one hour before
plating onto pre-warmed L-agar plates, containing 7.5 µg/ml tetracycline
and 12.5 µg/ml ampicillin. The plates were incubated overnight at
37°C. Colonies were selected at random, and preparations of plasmids
made from these.
2.2.3 Cloning RDA products into pGEM-T vector
All Representational Difference Analysis (RDA) products were cloned
into the pGEM-T (Promega) TA cloning vector following the manufacturer's
protocol. Approximately 8 ng of un-purified RDA re-amplification product
was ligated to 50 ng of pGEM-T at 15°C for 3 hours. Once ligated, 2
µl (11.6 ng) of the product was transformed into 50 µl of E.
coli
JM109 competent cells (Promega). Initially, the tubes were kept on ice
for 20 min before heat-shock treatment at 42°C for 45 sec, and then
returned to ice for 2 min. 900 µl of SOC broth was added with subsequent
incubation with agitation at 37°C for one hour. The cells were then
plated onto L-agar plates containing 100 µg/ml Amp, 40 µg/ml
X-GAL and 46 µg/ml IPTG. After incubating the plates overnight at
37°C they were placed at 4°C for one hour to enhance blue/white
colony identification.
2.3 Chicken Strains and Crosses
The inbred lines of chickens used for all the experiments are white
leghorn chickens kept at Compton. The individual lines described here,
lines N, 15I, 61 and 72 were all imported from the
Regional poultry research laboratory (RPRL), East Lansing. The coefficients
of inbreeding for all the lines used are detailed in App. A. All the inbred
lines produce two generations a year, the female siblings are artificially
inseminated by pooled semen samples from the male siblings at two weeks
after the start of lay, in order to synchronise the flock production. Line
15I has been closed since its importation in 1962 and had been inbred for
17 generations by 1978 . Lines 72 and 61 have been
closed since their importation in 1972 and had been inbred for 7 generations
by 1978 . Line N has been closed since its importation in 1982, it is not
known how many generations of inbreeding have occurred.
Chickens from the Compton Mapping Reference Population were derived
from a backcross from line N and line 15I birds, as described by Bumstead
and Palyga (1992). These lines are inbred and differ in their resistance
to salmonellosis, line N being resistant, and line 15I susceptible. One
of the F1 progeny (B989), a line N x line 15I female, was backcrossed
to a line 15I male parent (B988). This mating produced 113 backcross progeny,
which have been extensively used to construct a genetic map in this population.
A second population of birds, the Marek's Disease (MD) reference population,
has been established using line 61 and line 72 .
These two White Leghorn lines show differences in their susceptibility
to MD: line 61 is resistant to MD, whilst line 72 is
susceptible. In this population F1 birds were backcrossed to
the line 72 parent to give 85 progeny for the generation of
a new linkage map and the mapping of resistance. F1 birds were
also mated to give an F2 population of 42 birds for further
analysis of disease resistance. All the birds used were infected with MD
virus and the infection characterised in terms of viraemia by quantitative
PCR and plaque assays, as well as mortality and tumour development
Quantification of Marek's disease virus by PCR has been developed (Bumstead
et.al.
In press). The PCR assay used two MDV specific primers to amplify a 279
bp product, which was fluorescently labeled to enable quantifiable detection
by ABI Genescan gel electrophoresis. Primers to chicken interferon were
used as a control assay for cell number and PCR effectiveness. The PCR
assay was calibrated and its linear range determined by comparing the amount
of PCR product for a range of dilutions of an MDV-transformed cell line.
The relationship of PCR value to the number of viral genomes was also determined
by applying the PCR assay to a series of dilutions of a plasmid containing
the template sequence. This was carried out in addition to the usual viral
plaque assay, with which it correlated well. The PCR assay was then tested
on samples of lymphocytes from pure line 61 and 72
birds as well as F2 birds of a cross between these two lines.
The PCR assay gave repeatable and consistent results which in fact correlated
better with the fate of the birds than did the plaque assay results.
There is no data for the phenotypes of the F1 birds with
respect to Marek's disease resistance, as all available birds were used
to generated the F2 population. However, the phenotype data
for the pure lines and the backcross birds is shown in App. A. Approximately
half the backcross birds show some resistance to Marek's disease according
to the quantitative PCR values.
2.4 DNA Preparation
2.4.1 Genomic DNA Extraction
DNA was extracted from 0.5 ml of chicken blood collected from the wing
vein using 3% sodium citrate as an anti-coagulant. The blood was treated
with 1.5 ml of 1% saponin in PBS, mixed and left at room temperature (RT)
for 10 min before washing with PBS. The nuclei were then pelleted by centrifugation
at 1000 g for 5 min. This was repeated twice to ensure that the nuclei
were free of cytoplasmic contamination. After centrifugation, the pellet
was resuspended in 2 ml TE with 10% SDS. Protein and RNA were removed from
the pellets by proteinase K digestion (20 mg/ml) for two hours at RT, followed
by addition of RNase A (100 mg/ml) and mixed by rotation at RT overnight.
The samples were phenol-extracted twice and chloroform:iso-amyl alcohol
extracted once, before precipitation with 0.5 ml 2 M sodium acetate and
8.75 ml ethanol at -20°C overnight. The DNA was pelleted by centrifugation
at 7800 g for 15 min, washed with 2 ml 70 % ethanol, air-dried and resuspended
in 2 ml TE.
2.4.2 Plasmid DNA Preparation-ABI Protocol
Plasmid DNA was prepared using the following modified mini alkaline-lysis/PEG
precipitation procedure. Overnight cultures of 5 ml of each clone were
grown up in Terrific broth containing 7.5 µg/ml tetracycline and
12.5 µg/ml ampicillin. After removal of 850 µl for glycerol
stock preparation, the remaining culture was pelleted by centrifugation
at 1000 g for 3 min. Glycerol stocks were made by the addition of 150 µl
glycerol to the removed cultures, which were then vortexed briefly and
snap frozen in a dry-ice and ethanol mixture before storage at -70°C.
The cell pellet was resuspended in 200 µl of GTE buffer, and lysed
by addition of 300 µl of 0.2 N NaOH and 1% SDS. The tube was inverted
and incubated on ice for 5 min. The solution was then neutralised with
300 µl of 3 M potassium acetate (pH 4.8) and inverted, before being
placed on ice for 5 min. Cellular debris was pelleted by centrifugation
for 10 min at 7000 g. The supernatant was removed and digested with 20
µg/ml RNase A at 37°C for 20 min and then extracted twice with
400 µl chloroform. The extracted DNA was precipitated with an equal
volume of isopropanol at RT and immediately pelleted by centrifugation
for 10 min at 7000g. After washing with 70% ethanol, the pellet was dissolved
in 32 µl Milli-Q water and further precipitated by the addition of
8 µl 4 M NaCl and 40 µl 13% PEG8000, with incubation
on ice for 20 min. The DNA was pelleted by centrifugation at 4°C for
15 min. After washing in 70% ethanol the pellet was dried and resuspended
in deionised water at 1 µg/µl. This preparation was stored
at -20°C.
2.4.3 Plasmid DNA Preparation-Hybaid maxiprep Protocol
Individual bacterial colonies were picked into two 10 ml of cultures
of L-Broth, containing 100 µg/ml ampicillin and incubated with agitation
overnight at 37°C. The following day 850 µl of each was removed
and a glycerol stock prepared as previously described. The rest of the
culture was centrifuged for 5 min at 1000 g to pellet the cells before
resuspension in 250 µl Milli-Q water per tube. The combined 500 µl
was then placed in a 1.5 ml microfuge tube and spun at 7000 g for 1 min,
the supernatant removed and the tube re-spun to remove any remaining liquid.
The pellet was then resuspended in 200 µl of pre-lysis solution and
vortexed before addition of 400 µl of alkaline lysis solution. The
tubes were inverted 15 times and left to stand at RT for 5 min before addition
of 300 µl of ice-cold neutralising solution and re-vortexing for
5 sec. The tubes were left on ice for 5 min and then centrifuged for 5
min at 7000 g. The supernatant was transferred to a clean 2 ml tube and
900 µl of Hybaid binding buffer (containing guanidine thiocyanate)
added. The tubes were again inverted 15 times and left to stand for 3 min
at RT. The upper aqueous layer was discarded and the binding beads transferred
to the spin filter and centrifuged for 3 min at 7000 g. The eluent was
removed and 500 µl wash solution added to the spin filter before
centrifugation as before. The eluent was again removed and the filter centrifuged
for a final 5 min to ensure all liquid was removed. The filter was then
transferred to a new tube and 100 µl TE was carefully stirred into
the beads to elute the DNA followed by centrifugation for 5 min at 7000
g. This gave an approximate DNA concentration of 0.4 µg/µl.
2.5 Restriction Endonuclease Digestion
DNA samples were digested with various restriction endonucleases according
to the manufacturer's instructions (of 10 U of enzyme/µg of DNA).
See App. A for details of enzymes used.
2.6 Electrophoresis
2.6.1 Agarose Gel Electrophoresis
Plasmid DNA and PCR products were resolved by electrophoresis at 4
V/cm for one hour in 1 x TBE, 1% agarose gels with 10 ng/ml of ethidium
bromide. Genomic DNA digests were resolved by electrophoresis at 1 V/cm
overnight in 1 x TAE, 0.6% agarose gels with 30 ng/ml of ethidium bromide.
Bands were visualised using a UV transilluminator, and the resulting pattern
recorded using a GDS5000 digital camera system (Mitsubishi).
2.6.2 ABI Genescan electrophoresis
2.6.2.1 Creation of a Matrix File
The ABI Genescan system enables the electrophoresis and detection of
fluorescent products. A matrix file was made so that background fluorescence
could be reduced in the Genescan gel to enable good resolution of the fluorescent
products. Prior to using the Genescan for analysis, the matrix file was
created to ensure optimum fluorescent detection. An identical gel to those
used for Genescan analysis, a 12 cm, 5.5 % "Long ranger", 1 x TBE gel,
was set up and run for 5 ¼ hours at 800 V, on filter set B ([lambda]=
531, 545, 560, 580 nm) in the ABI electrophoresis apparatus. The dye primer
matrix standard kit P/N 401114 was used, loading two samples for each dye
colour. The samples were loaded, 5 µl per lane, alongside samples
of 0.5 µl of Genescan ROX 2500 standard which were either native,
or denatured with the addition of an equal amount of deionised formamide
and heated before loading.
Raw data was collected using the Genescan (ABI) collection and analysis
programs. The matrix file was constructed according to ABI instructions.
Once the gel had run, the tracking was checked and a new matrix created
for all four dye colours. This matrix was entitled "5.5% LR 1 x TBE Matrix"
and was used in all subsequent Genescan reactions.
2.6.2.2 RAPD-PCR Electrophoresis
All the RAPD-PCR samples were denatured with the addition of an equal
volume of deionised formamide, mixed with 0.5 µl of Genescan ROX
2500 standard, heated to 94°C for 2 min and kept on ice until loading.
The samples were loaded on a 12 cm, 5.5 % Long ranger gel which was run
in the ABI electrophoresis apparatus for 6-8 hours at 800 V on filter set
B. The Genescan collection software was set up according to ABI instructions.
2.6.3 Sequence Gel Electrophoresis
Samples were loaded onto a 6% polyacrylamide sequencing gel (ABI 373A
Sequencer), and electrophoresed in 1 x TBE for 14 hours at 2500 V. The
apparatus was set up according to ABI protocols using Filter set A ([lambda]=
531, 560, 580, 610 nm) and the data collected using the 373A collection
package.
2.7 PCR Amplification
2.7.1 Plasmid insert PCR Amplification
PCR amplification was carried out using diluted plasmid DNA (1:10)
approximately 100 ng, in a 50 µl PCR reaction mix of 10 x buffer,
100 µM dNTPs, 2 mM MgCl2, 25 pmol of each primer (oligonucleotides
12 and 16 for pCDM8 and oligonucleotides 147 and 151 for pGEM-T, see App.A).
The reaction mix was overlaid with 50 µl of mineral oil before the
addition of 2.5 U Taq DNA polymerase after the tubes had reached
the denaturation temperature. Amplification was for 25 cycles with an annealing
temperature of 55°C. The thermal profile in program 1 was followed
(App. A).
2.7.2 Colony PCR of RDA products
Colonies were picked and screened for inserts by colony PCR. Each colony
was picked directly into a 50 µl PCR reaction mix of 10 x buffer,
100 µM dNTPs, 2 mM MgCl2, 1U Taq DNA polymerase,
25 pmol of each primer (147 and 151), and amplified for 25 cycles with
an annealing temperature of 55°C. PCR products were analysed by agarose
gel electrophoresis. Each picked colony was also streaked onto L-agar plates,
containing 100 µg/ml ampicillin, in an ordered array and also inoculated
into 1 ml of L-Broth, containing 100 µg/ml ampicillin, for incubation
with agitation at 37°C overnight. The overnight cultures were spun
down at 7000 g for 1 min and resuspended in L-Broth with 20% glycerol,
followed by snap-freezing in ethanol and dry-ice, and storage at -70°C.
2.8 Gel purification of PCR products
Clones containing two plasmids were initially PCR amplified by colony
PCR described above using primers 147 and 151. A 40 µl sample of
each product was separated by electrophoresis on a 2% low melting point
agarose gel. The insert DNA was excised as a band from the gel and placed
into a 0.5 ml tube. Once the weight was obtained, 3 M NaCl was added to
a final concentration of 30 mM and the agarose melted at 68°C for 10
min. The temperature was then reduced to 39°C and agarase (Sigma) added
to a concentration of 50 U/ml. After incubation for one hour 60 µl
of water was added along with 100 µl 4 M ammonium acetate and 200
µl propan-2-ol. The tubes were centrifuged for 30 min at 7000 g,
the supernatant discarded and the pellet washed with 70% ethanol. The DNA
pellet was resuspended in 40 µl of water and reamplified as detailed
in the pGEM-T sequencing protocol.
2.9 Automated Sequencing
Clones were partially sequenced, from each end of the insert, using
the Applied Biosystems 373A DNA automated sequencing system. Dye-deoxy
terminator chemistry was used to label the products of Taq DNA polymerase
cycle sequencing. Two different ABI kits were used as the technology was
developed from the PRISMTM ready reaction dyedeoxyTM
terminator cycle sequencing kit to the PRISMTM ready reaction
dyedeoxyTM terminator cycle sequencing FS kit.
The PRISMTM ready reaction dyedeoxyTM terminator
cycle sequencing kit was used following the ABI protocol to sequence clones
in pCDM8. One microgram of double-stranded DNA template was used in each
20 µl cycle sequencing reaction. Oligonucleotides 12 and 16 (see
App. A) were used as primers for the reaction. In each sequencing reaction
1 µl of cDNA was mixed with 9.5 µl of terminator premix, 3
µl of one primer and Milli-Q water up to a total volume of 20 µl.
This was then overlaid with 50 µl mineral oil and placed into a Hybaid
Omnigene PCR machine. The thermal cycling reaction was as described in
App. A for ABI Seq. The samples were then extracted with a phenol: H2O:chloroform
mix (68:18:14) at room temperature, and precipitated with 2M sodium acetate
and ethanol, to remove excess dye.
Clones in pGEM-T were sequenced using ABI PRISMTM dyedeoxyTM
terminator cycle sequencing ready reaction FS kit (Perkin Elmer). The protocol
was similar to that used previously except that only 8 µl of reaction
mix was added. To obtain 1 µg of double stranded DNA each clone was
initially PCR amplified in a standard reaction, using primers 147 and 151
before precipitation of the product with an equal volume of 4 M ammonium
acetate and propan-2-ol at RT. This precipitated product was centrifuged
for 30 min at 7000 g, resuspended in 5 µl Milli-Q water and used
directly for sequencing. The PCR product was sequenced using 7 pmol of
one of two nested primers: oligonucleotides 156 and 155. Each 20 µl
reaction was precipitated on ice for 10 min with the addition of 2 µl
of 3 M sodium acetate (pH 4.6) and 50 µl 95% ethanol, centrifuged
at 7000 g, and air-dried. The pellet was resuspended in 4 µl of a
5:1 deionised formamide: 50 mM EDTA mixture and denatured at 90°C for
2 min. These samples were loaded and analysed as described earlier.
2.9.1 Sequence Analysis
The 373A software enables direct determination of the sequence. Sequences
were edited using SeqEd (ABI) and converted into GCG format. Previously
sequenced genes with homology to the clones screened in this project were
identified by GCG (Program manual for the Wisconsin package, version 8,
September 1994, Genetics Computer Group, 575 Science Drive, Madison, Wisconsin,
USA 53711) FASTA analysis, in the EMBL database.
2.10 Southern Blotting
Agarose gels were blotted using a method based on that of Southern
(1975). Gels were washed in 500 ml blotting solution A (1.5 M NaCl, 0.5
M NaOH) twice for 30 min and then 500 ml of blotting solution B (0.02 M
NaOH, 1 M CH3COONH4) twice for 30 min. The gel was
placed on a 3 mm filter paper wick on a platform over a reservoir of 1
l of blotting solution B. Hybond-N (Amersham) membrane was carefully laid
over the gel, followed by three layers of filter paper. A stack of paper
towels were placed over this, and held in position with a weight. Blotting
took place overnight. The filter paper and towels were then removed, the
nylon membrane marked and air-dried before baking under vacuum for two
hours at 80°C.
2.10.1 Probe preparation and hybridisation
Southern blots were soaked overnight in Milli-Q water, and placed into
Hybaid tubes. Blots of less than 30 cm2 were probed in 20 ml
plastic universals. Prehybridisation buffer (50% formamide) was added at
42°C, 0.02 ml/cm2 for blots in Hybaid tubes and 0.25 ml/cm2
for
blots in 20 ml plastic universals. Three universals were placed into each
Hybaid tube during hybridisation. The tubes were placed in a preheated
(42°C) Hybaid oven and rotated overnight. Two methods of probe preparation
were used; Nick translation (Gibco BRL) and Prime-It RmT random primer
labelling kit (Stratagene). Probes were labelled with [[alpha]-32P]
dCTP (3000 Ci/mmol) (Dupont NEN), 10 µCi/µl (0.37 MBq/µl).
2.10.2 Nick translation system (Gibco BRL)
Blots were probed either with labelled PCR products (4 µl) or
labelled whole plasmids (1 µl) containing about 1 µg DNA. The
following basic protocol was used; a mixture of 2.5 µl solution A,
19.5 µl water, 2 µl of [[alpha]-32 |