Background

Pigeon lactation was first noted in the literature in 1786 when John Hunter described pigeon milk as being like “..granulated white curd” [1]. This curd-like substance is produced in the crop of male and female pigeons and regurgitated to the young. Like the mammary gland, the pigeon crop undergoes significant changes to the tissue structure during lactation. Several histological studies have characterised these changes and determined that pigeon milk consists of desquamated, sloughed crop epithelial cells [2, 3]. The process of pigeon milk production begins when the germinal cell layer of the crop rapidly proliferates in response to prolactin [4, 5], and this results in a convoluted, highly folded epithelial structure that then coalesces as it out-grows the vasculature, to form the nutritive cell layer that is sloughed off to produce the milk. This nutritive cell layer contains lipid-filled vacuoles [2, 3, 5, 6]. The lipid content of pigeon milk consists mainly of triglycerides, along with phospholipids, cholesterol, free fatty acids, cholesterol esters and diglycerides [7]. The triglyceride content decreases across the lactation period, from 81.2% of total lipid at day one, to 62.7% at day 19, whereas the other lipids increase, which suggests the cellular lipid content decreases towards the end of the lactation period, but the cell membrane-associated lipids remain constant [7].

Several studies have investigated the differences in gene expression between ‘lactating’ pigeon crop tissue and non-‘lactating’ crop tissue [6, 8, 9]. Nearly three decades ago, Horseman and Pukac were the first to identify that mRNA species differ in response to prolactin injection in the crop [8]. Specifically, they identified and characterised gene expression and protein translation of the prolactin-responsive mRNA anxI cp35 and the non-prolactin-responsive isoform, anxI cp37 [9, 10]. In addition, a recent global gene expression study in our laboratory [6] showed that genes encoding products involved in triglyceride synthesis and tissue signalling were up-regulated in the ‘lactating’ crop. We proposed that the evolution of the processes that result in the production of pigeon milk has built upon the more general ability of avian keratinocytes to accumulate intracellular neutral lipids during the cornification of the epidermis [11] in order to produce a nutritive substance for their young [6].

The mechanism of avian epidermal cornification and lipid accumulation is not well-characterised. However, studies have shown that antibodies against mammalian cornification proteins, which are relatively well-characterised, can cross-react with avian and reptilian species [12, 13], which suggests similarities in cornification proteins amongst vertebrate species. Cultured chicken keratinocytes have been shown to express beta-keratins (feather, scale and claw keratins), alpha-keratins (type I and II cytokeratins) and the cornified envelope precursor genes envoplakin and periplakin, as well as accumulating neutral lipids [11]. Mammalian keratinocytes differ from avian keratinocytes in that they are unable to accumulate intracellular neutral lipids [11], and can express alpha-keratins (cytokeratins) but not beta-keratins, which expanded from early archosaurians [14]. There are many cornification-associated proteins characterised from mammalian epidermal tissues. The proteins that form the cornified envelope include keratins, S100 proteins, small proline-rich proteins (SPRRs), late cornified envelope (LCE) proteins, annexins, involucrin, loricrin, filaggrin, desmoplakin, envoplakin, periplakin, trichohyalin, cystatin A, elafin and repetin [15]. Trans-glutaminase enzymes, some of which require cleavage by proteases and an increase in intracellular calcium concentration to become active, cross-link the cornified envelope proteins to form a ceramide lipid-coated protective barrier to the epidermis [16]. Many of the cornified envelope genes are present in the “epidermal differentiation complex” (EDC) which was first identified on chromosome 1q21 in humans [17]. Interestingly, the EDC region has been identified in an avian species (chicken), and is linked to the genes for beta-keratins, but lacks the LCE proteins [18].

Here we present an analysis of the pigeon crop transcriptome to show that pigeon milk production involves a specialised cornification process and de novo synthesis of lipids that accumulate intracellularly.

Results

Differentiation of the ‘lactating’ crop

Immunohistological analysis of the proliferating cells of the pigeon crop in its resting state and during nesting demonstrated the morphological changes that occur in preparation for pigeon milk production (Figure 1). As the crop changed in preparation for lactation, the number and depth of rete pegs increased and the lamina propria became progressively more extended and narrow, which increased the surface area of the crop. During lactation the crop was highly proliferative, which resulted in the accumulation and sloughing of large tracts of cornified epithelium (Figure 2). All lactating parents in this study (48 birds) fed their young every four hours over the lactation period. Histology revealed a cycle of production and turnover of cornified epithelium over the four-hour period (Figure 2). The squabs milk intake reduced gradually toward the end of the lactation period, which lasted approximately fourteen days.

Figure 1
figure 1

The pigeon crop differentiates during nesting. The non-‘lactating’ cropsac (A) differentiates during nesting (B&C), such that the lamina propria (*) becomes progressively more extended and narrow, and the number and depth of rete pegs (^) increases as the cropsac further differentiates. Proliferating cells are stained red with an antibody to proliferating cell nuclear antigen, and non-proliferating cells are counterstained blue with hematoxylin. Scale bar = 100 μm.

Figure 2
figure 2

Pigeon milk production is a four hour cycle. Pigeon milk is produced by proliferation of the crop germinal epithelium (proliferating cells of the germinal epithelium are stained red with an antibody to proliferating cell nuclear antigen) which results in the accumulation of the differentiated lipid and protein containing cells (counterstained blue with hematoxylin) every 4 hours during lactation. (A) The crop ready to regenerate pigeon milk after it has been regurgitated to the young. (B) Pigeon milk (blue) has been produced and is nearly ready to separate from the germinal epithelium. (C) The pigeon ‘milk has started to separate from the germinal epithelium, ready for regurgitation to the young. Scale bar = 100 μm.

Analysis of transcriptional changes over the lactation period compared to non-‘lactating’ crop revealed no differentially expressed probes at pre-hatch (time points −8 and −2), large differences at hatch (time point 0) and 2 days post-hatch (time point +2) (17.2 and 48.8% of all probes differentially expressed, respectively), and no difference above what could be expected by chance (5%) at 10 days post-hatch (time point +10) (2.7% of probes differentially expressed) (Additional file 1: Table S1). Any effect of sex was ruled out by comparing males to females at non-‘lactating’ and ‘lactating’ time points. There was no difference above what could be expected by chance (Additional file 1: Table S2).

Cornification genes are differentially expressed in the ‘lactating’ pigeon crop

Analysis of cornification-associated genes in the draft pigeon genome identified an epidermal differentiation complex (EDC) on scaffolds 1246 and 683, respectively (Figure 3). Transcriptional analysis of these EDC genes and other cornification-associated genes in the pigeon crop at time points 0 and +2 revealed differential expression of 43 genes in 0, +2 or both ‘lactating’ pigeon crops compared with non-‘lactating’ crop (Table 1). Thirteen of these genes were up-regulated and 30 were down-regulated. Notably, the majority of cornification-associated genes up-regulated in the ‘lactating’ crop were keratins, constituting eight of the thirteen up-regulated genes. Five of these eight keratins were beta-keratins and three were alpha-keratins. Conversely, eight of the 30 down-regulated cornification-associated genes were alpha-keratins, and none were beta-keratins. Phylogenetic analysis of the beta-keratins (Figure 4), which were all part of the pigeon EDC (Figure 3) separates them into several groups. Feather, claw and scale keratins share a common ancestor related to chicken beta-keratin, from which feather keratins (none differentially expressed in ‘lactating’ crop) formed their own clade, and claw (ns and 10.5 to 12-fold up-regulated) and scale keratins (ns and 3.2-fold up-regulated) formed another monophyletic clade. Putative pigeon keratins formed three more clades not containing a chicken homolog, and ORF 683_38 formed a clade of its own. GenBank IDs of keratins with the highest amino acid identity to the pigeon keratins are found in Additional file 2.

Figure 3
figure 3

The pigeon epidermal differentiation complex. The pigeon EDC is located on (A) scaffolds 1246 and (B) 683 of the draft pigeon genome. It is bound by nicastrin and cathepsin S, and contains putative genes for the cornified envelope precursors repetin, cornulin, involucrin and filaggrin. In addition, the S100 genes S100A11, S100A4 and two copies of S100A9 are present. Two putative keratin-associated proteins (KAPs) are present, and clusters of beta keratins, feather keratins, scale keratins and claw keratins.

Figure 4
figure 4

Phylogenetic analysis of putative pigeon keratin proteins. Neighbour-joining minimum evolution trees of putative pigeon (A) alpha and (B) beta keratins showing the relationship to chicken keratins are presented. Putative pigeon keratins are given the prefix p followed by the ORF number, and chicken keratins are given the prefix c. For a list of GenBank IDs corresponding to the chicken keratins refer to Additional file 2. (A) Putative pigeon alpha keratins form two distinct groups. The first consists of type II cytokeratins (brown) and the second consists of type I cytokeratins (blue). (B) Putative pigeon beta keratins form several distinct groups including groups of feather (green), claw (purple) and scale (orange) keratins. Additionally there are several smaller groups that contain no chicken homolog.

Table 1 Differential expression of cornification-associated genes in the ‘lactating’ crop

Phylogenetic analysis of the alpha-keratins separates them into type I and type II (Figure 4). Seven type I keratins and two type II keratins were down-regulated, and two type II keratins were up-regulated in ‘lactating’ crop (Table 1). Notably, all of the type I putative pigeon keratins were constrained to scaffold988, whereas the type I keratins included 15 putative genes on scaffold748 and two on scaffold988 (Figure 4). All of the chicken alpha-keratins had a closely related putative pigeon homolog.

Aside from keratins, there were several other differentially expressed cornification-associated genes in the pigeon crop. Up-regulated cornification-associated genes in the ‘lactating’ crop at 0 and +2 timepoints included transglutaminase 5 (3.1 and 2.7-fold), S100-A10 (1.3 and 1.6-fold) and the pigeon lactation-specific transcript, cp35 (5.7 and 23-fold), and its isoform cp37 (ns and 2.7-fold). Probes on exons 3, 4, 6, 12, 13, 14 and 15 of transglutaminase 4 were up-regulated (1.2 to 1.7 and 1.3 to 2.4-fold), whereas probes on exons 7, 10 and 11 were down-regulated (−1.5 and −1.4 to −1.6-fold), which suggests it is likely to be alternatively spliced in the pigeon crop. The transglutaminase genes transglutaminase 6-like and tissue transglutaminase were also down-regulated by 2-fold and 2.1 to 4.2-fold, respectively. Several of the S100 genes were down-regulated in the ‘lactating’ crop by 1.2 to 10.2-fold, including S100-A14-like, S100-A16-like, S100-A9-like, S100-A4 and S100P (Table 1). The annexin genes annexin A2, A5, A6, A7, A8, A11 and A13 were down-regulated by 1.1 to 21-fold (Table 1). The envelope precursor protein-encoding genes desmoplakin, periplakin, envoplakin, epiplakin and sciellin and cystatin A were all down-regulated in the ‘lactating’ crop by 1.3 to 2.6-fold (Table 1).

The proteases calpain-15 (ns and 6.6-fold) and calpain 9 isoform 1 (ns and 1.8-fold) were up-regulated in ‘lactating’ crop, whilst calpain-5 was down-regulated by 2.4 to 4.4-fold. Cathepsins H, C, B-like, S, Z, L and D were all down-regulated by between 1.2 and 1.8-fold.

The epithelial cell-derived antimicrobial peptide encoding gene beta defensin 5 was up-regulated by 2.3 to 4.8-fold in ‘lactating’ crop at timepoints 0 and +2, respectively (Additional file 3).

Differential expression of cornification-associated genes in the cornified epithelial cells of the pigeon crop

Analysis of differentially expressed cornification-associated genes revealed that six genes were up-regulated and none were down-regulated in the cornified cell layer compared to the underlying cell layers (Table 2). The most highly up-regulated gene was cornulin (1004-fold up-regulated), followed by cp35 (20-fold up-regulated). ly, transglutaminase 6-like (15-fold), annexin A8 (4-fold), S100-A9-like (3.5-fold), and S100-A16-like (2-fold) were up-regulated in cornified cells.

Table 2 Differential expression of cornification-associated genes in the cornified cell layer of the ‘lactating’ crop

Triglyceride synthesis is up-regulated in the ‘lactating’ pigeon crop

Examination of lipid droplets in ‘lactating’ pigeon crop showed that lipid was present throughout the differentiated epithelium, and was perinuclear (Figure 5). To investigate whether lipids could potentially be synthesised de novo, the expression of genes linked to milk lipid synthesis in the mouse mammary gland [19] were examined in the ‘lactating’ pigeon crop. Thirty-four mouse mammary gland-linked lipid synthesis genes were differentially expressed in the pigeon crop, including 7 variants of genes investigated in the mouse study. Expression patterns of milk lipid synthesis genes were similar in pigeon crop and mouse mammary gland, although pigeon crop expressed different variants of many genes of the triglyceride synthesis and fatty acid synthesis pathways in comparison to the mouse mammary gland. In particular, the triglyceride synthesis genes Agpat1 and Dgat1 were up-regulated in the lactating mouse mammary gland compared to pregnant mouse mammary gland [19], whereas Agpat3, Agpat9 and Dgat2 were up-regulated in the ‘lactating’ pigeon crop compared to non-‘lactating’ crop. The fatty acid synthesis gene Elovl1 was up-regulated in lactating mouse [19], whereas Elovl6 was up-regulated in ‘lactating’ pigeon crop. The lactating mouse mammary gland showed up-regulation of 5 different Fabp gene variants, whereas the ‘lactating’ pigeon crop up-regulated only Fabp5. Both lactating mouse and pigeon crop showed up-regulation of the same fatty acid transporter gene, Slc27a4, the fatty acid translocase, Cd36, and down-regulation of fatty acid transporter Slc27a1.

Figure 5
figure 5

Lipids form multiple intracellular droplets in the ‘lactating’ crop. A fluorescent Oil Red O stain visualised by confocal microscopy reveals the location of neutral lipid in the ‘lactating’ pigeon crop. (A) Neutral lipid (red) is present throughout the differentiated epithelium of the ‘lactating’ pigeon crop. Scale bar = 60 μm. (B) Lipid droplets are perinuclear (nucleus stained blue, DAPI). Scale bar = 10 μm.

Discussion

This is the first genome-wide pigeon crop transcriptome study to investigate the molecular mechanism of pigeon milk production. Here we show that differential expression of cornification-associated proteins and de novo lipid synthesis genes in the pigeon crop during lactation contribute to a highly specialised process that leads to the production of pigeon milk.

In preparation for lactation, the pigeon crop increases in surface area through an increase in rete pegs and extension of the lamina propria (Figure 1). This hyperplasia followed by desquamation results in large numbers of lipid-rich differentiated cells accumulating in the crop lumen, in the form of a curd-like substance, which provides nourishment for the young (Figure 2). Although the process of terminal differentiation, from the basal layer through to the desquamated layer takes days in mammals [20], it appears that the epidermal cells of the pigeon crop undergo a terminal differentiation program within the space of four hours. We have previously described this histologically [6]. The 1004-fold up-regulation of cornulin and 15-fold up-regulation of transglutaminase 6 (Table 2), both late epidermal differentiation markers [21], in the cornified cell layer of the ‘lactating’ crop demonstrates the presence of terminally differentiated cells in the lactating pigeon crop epithelium.

Up-regulation of several beta keratins and three alpha keratins in the ‘lactating’ crop (Table 1) suggests an important function for keratin in the formation of pigeon milk. Beta-keratins are specific to archosaurians [22], and are found in the pigeon EDC (Figure 3), whereas alpha-keratins are ubiquitously expressed in eukaryotes. Phylogenetic analysis of the putative pigeon beta-keratins places the majority of up-regulated beta keratins in claw and scale beta keratin groups (Table 1, Figure 4). Beta-keratins have been suggested to have evolved from alpha-keratins to form a new class of matrix proteins that have a structural role in cornification [22]. Hence, it appears that in addition to alpha-keratins, beta-keratins have an important structural role in ‘lactating’ pigeon crop cells. Unlike alpha-keratins, beta-keratins form their own filament-matrix structures [23] which negates the need to express matrix proteins to form cornified beta-keratin epidermis. The down-regulation in ‘lactating’ crop of the typical mammalian cornified envelope precursors desmoplakin, envoplakin, periplakin, sciellin and cystatin A (Table 1) suggests that beta-keratins could play an alternative role to these matrix proteins in the ‘lactating’ crop.

Alpha-keratins are cross-linked to matrix proteins by transglutaminase enzymes, which are activated by proteolytic cleavage and increased intracellular calcium concentration [15]. S100 proteins play a role in the establishment of the calcium gradient in epithelial cell layers [24] and can also be substrates for transglutaminase themselves [25]. Both transglutaminase and S100 protein-encoding genes are up-regulated in ‘lactating’ crop (Table 1). Interestingly, prostate transglutaminase (transglutaminase 4) is differentially expressed in ‘lactating’ crop. Putative exons 7, 10 and 11 were down-regulated, while seven other putative exons were up-regulated, (Table 1) which suggests there could be multiple splice variants; this is the case for human transglutaminase 4 in cancer tissues [26]. In addition, transglutaminase 5, which is expressed in mammalian cornifying epithelium [27] is up-regulated (Table 1). This is in contrast to mammalian cornifying tissues that express transglutaminases 1 and 3 in addition to transglutatminase 5[28]. The up-regulation of the proteases calpain-15 and calpain 9 isoform 1 in the ‘lactating’ crop (Table 3) could suggest a role for these enzymes in the proteolytic activation of transglutaminases 4 and 5, as calpains are thought to activate transglutaminase 1 [15]. Additionally, cathepsin D has been suggested as an activator of transglutaminase, but this does not appear to be the case in the pigeon crop, as cathepsin D and six other cathepsin genes are down-regulated (Table 3).

Table 3 Differential expression of protease genes in the ‘lactating’ crop

In addition to being a substrate for transglutaminase, S100 proteins can interact with annexins and form part of the cornified envelope [29]. S100-A10 is up-regulated in ‘lactating’ crop, as are the pigeon lactation-specific annexin gene cp35 and its isoform cp37 (Table 1), which could indicate roles for these genes in the formation of the cornified envelope. Cp35 is expressed 20-fold higher in cornified cells of the ‘lactating’ crop, which suggests it has a function in the cornified cells, along with the S100 protein-encoding genes S100-A16-like and S100-A9-like (Table 2).

Accumulation of neutral lipids in keratinocytes is a unique trait of avian species [11]. The pigeon makes use of this ability in the crop to produce a lipid-rich milk [30] for the young. It was suggested by Garrison and Scow [31] that the lipids in pigeon milk were sequestered from another organ due to the increase in lipoprotein lipase activity in the prolactin-stimulated crop. We have also shown previously that there is up-regulation of genes involved in the oxidation of imported triglycerides [6], and in this study we found that lipoprotein lipase was up-regulated in ‘lactating’ crop (Table 4). However, the current study showed that lipid synthesis in the ‘lactating’ pigeon crop is a combination of importation and de novo synthesis of lipids, and results in the perinuclear accumulation of neutral lipid droplets (Figure 5).

Table 4 Differential expression of mammalian milk triglyceride synthesis genes in the ‘lactating’ crop

Table 4 shows that genes involved in triglyceride synthesis in the mouse mammary gland during lactation [19] are also differentially expressed in the ‘lactating’ crop. The majority of genes involved in de novo lipid synthesis in the mouse are also expressed in the pigeon, but there are three gene variants that are expressed in the pigeon and not in the mouse. The pigeon expresses Agpat3, Agpat9 and Dgat 2 (Table 4), whereas the mouse expresses Agpat1 and Dgat1[19], which suggests that both the mechanism of lipid synthesis and crop cornification in the pigeon varies from that of mammals. The differences in the specific combinations of genes expressed may be reflected in the differences in triglycerides produced by each species. Amongst mammalian species there are differences in the fatty acid composition of milk triglycerides [32]. However, a comparison of the major fatty acid components of pigeon milk; oleic acid, linoleic acid and palmitic acid [30], reveals these are also the major fatty acid components of mammalian milk fat. There is a difference in the expression of ELOVL genes involved in fatty acid synthesis in the mouse mammary gland and in the pigeon crop (Table 4). In mouse and human, the ELOVL1 gene is up-regulated during lactation [19, 33], whereas the pigeon crop up-regulates ELOVL6 during lactation (Table 4). It has been shown that de novo synthesis of fatty acids in the mammary gland can change in response to dietary availability [34]. Therefore, the difference in ELOVL gene expression between mammals and pigeons could be due to differences in the dietary availability of triglycerides/fatty acids in the pigeon diet. ELOVL6, up-regulated in ‘lactating’ crop, has been shown to have high elongation activity on C16:0 long chain fatty acids, and also some activity on C18:1 and C18:2 long chain fatty acids [35], which are the major fatty acid components of pigeon milk. This suggests that a large proportion of pigeon milk fatty acids could be synthesised de novo in the crop. One of the major differences between pigeon milk fatty acids and mammalian milk fatty acids is the lack of very long chain fatty acids, which are synthesised de novo by ELOVL1 [35].

Here we have shown that pigeon milk is the result of a specialised cornification process that produced large numbers of lipid-laden, cornified cells with a very rapid four hour cycle of hyperplasia followed by desquamation in the ‘lactating’ pigeon crop.

Conclusions

This study has expanded our knowledge of pigeon milk production, in particular, the mechanism of cornification and lipid production in the crop. Pigeon lactation is a highly specialised process that utilises the normal keratinocyte cellular processes to produce a targeted nutrient solution for the young at a very high turnover rate.

Methods

Pigeon tissue sample collection

Thirty-two breeding pairs of King pigeons were purchased from Kooyong Squab Producers (Moama, New South Wales). They were housed in temperature-controlled cabinets (between 21–24°C) with a 12 hour light cycle (lights on 6 am), and supplied with nest bowls and nesting materials. Pigeons had ad libitum access to pigeon mix (pro-vit-min, Ivorsons, Geelong) and water. Control non-‘lactating’ pairs (ctrl, 13 birds) were culled prior to mating. Breeding pairs were culled at different lactation time points whereby squab hatch was designated as time zero. Time points pre-hatch have the prefix ‘-‘ and post-hatch have the prefix ‘+’. Specifically, breeding pairs were euthanised at 8 days pre-hatch (−8, n = 10 birds), 2 days pre-hatch (−2, n = 10 birds), at hatch (0, n = 14 birds), 2 days post-hatch (+2, n = 10 birds), and 10 days post-hatch (+10, n = 4 birds). Whole crop tissue samples were snap frozen in liquid nitrogen and separate samples of all crops were fixed in 10% neutral buffered formalin or snap frozen in optimal cutting temperature (OCT) compound for histology. Samples of pigeon crop from a time 0 pair were fixed in PaxGene (Qiagen) fixative according to the manufacturer’s instructions for laser dissection microscopy, to investigate gene expression differences between basal and proliferating cell types. Samples of other whole tissues (brain, pituitary, thymus, esophagus, trachea, proventriculus, gizzard, heart, kidney, duodenum, ileum, jejunum, pancreas, spleen, cecum, bone marrow, muscle and skin) were snap frozen in liquid nitrogen and used for the construction of a pooled tissues cDNA library. The blood and spleen of the ten day old squab were removed; the blood into vacutainers coated with EDTA dipotassium salt and the spleen into sterile media (DMEM with 10% FCS, 100 U/mL penicillin, 100 μg/mL streptomycin, 500 μg/mL fungizone).

All work using animals was conducted in accordance with the Australian Code of Practice for the Care and Use of Animals for Scientific Purposes (7th edition), and in accordance with institutional animal ethics guidelines (CSIRO AAHL Animal Ethics Committee).

Pigeon splenocyte stimulation

The squab spleen was minced through a 70 μm filter using a syringe plunger into 15 mL phosphate buffered saline (PBS). The blood was diluted in PBS, and the cell suspensions were layered slowly over the same volume of Lymphoprep (Axis-Shield, Oslo, Norway). After centrifugation the cells were removed from the interface of the gradient and washed twice with 50 mL PBS + 10% FCS. The cells were seeded on a 24-well plate (Nunc) at 5 × 105 cells/mL. To each well 10 μg/mL concanavalin A (Astral Scientific) was added and the cells were incubated at 37°C in 5% CO2. After 24 hours the cells were pelleted and re-suspended in 1 mL TRIreagant RT (Molecular Research Center) for RNA extraction and synthesis of the immune library.

cDNA library synthesis

Double-stranded cDNA (dscDNA) libraries were syn-thesised from poly(A)+ RNA extracted from the female crop tissue samples and the pooled tissues and immune cells. Total RNA was extracted with TRIreagant RT (Molecular Research Center) according to the manufacturer’s instructions, with an additional high salt precipitation buffer step to remove glycoproteins. Poly(A)+ RNA was twice purified from total RNA using Dynal beads (Life Technologies), according to the manufacturer’s instructions. 200 ng poly(A)+ RNA was fragmented at 70°C for 30 s in a 20 μl reaction containing 18 μl fragmentation solution (100 mM Tris–HCl pH 7.0, 100 mM ZnCl2). The reaction was stopped on ice with 2 μl of 0.5 M EDTA pH 8.0 and 28 μl of 10 mM Tris–HCl pH 7.5 and cleaned up with an RNeasy column (Qiagen). The sequencing libraries were synthesised from the fragmented poly(A)+ RNA according to the Roche cDNA Rapid Library Preparation Method Manual, Rev. Jan 2010 beginning section 3.2. Briefly, fragmented RNA was reverse transcribed into cDNA and made double-stranded using a mixture of DNA polymerase I, ligase and RNase H and blunt ended with T4 DNA Polymerase (Roche cDNA synthesis system).

High throughput sequencing, assembly, microarray design and annotation

Sequencing libraries were prepared from the dscDNA libraries using a Rapid Library Preparation Kit (Roche). Sequencing beads were generated with a SV-emPCR Kit (Roche) and sequenced on a 454 GS FLX using the titanium chemistry (Roche). Each sample was sequenced in a separate region. The raw reads from all regions were combined (430654) and assembled with Newbler v2.3 shotgun assembler (Roche). The resulting contigs (10463) and remaining Singletons (71997) were used to design unique microarray probes with OligoArray 2.1 [36]. The microarray probes were annotated via the source contigs or read sequences by a series of BLAST searches using an E-value of 10-3 as cut-off for all searches. The first search used BLASTX [37] with all sequences against a local copy of the non-redundant protein database (dated 11 April 2012). All non-matched sequences were then used in a BLASTN query [38] against the non-redundant nucleotide database (dated 13 April 2012). Finally, the remaining unmatched sequences were used as queries in a TBLASTX search [37] against the nucleotide database.

Identification of putative pigeon cornification-associated full-length genes

Cornification-associated genes were identified by literature search, and Raw 454 reads or assembled contigs were used as local megaBLAST [38] queries against the Columba livia draft genome [39] to identify in which scaffold each gene of interest was present. The scaffolds of interest were then submitted to a Hidden Markov Model gene prediction program (FGENESH, Softberry; http://linux1.softberry.com/berry.phtml) using parameters for chicken (aves) to identify predicted full-length gene sequences. Where scaffolds were too large to be processed by FGENESH, the region of the scaffold with the BLAST match was submitted. Microarray probes were mapped to the predicted gene sequence by local BLAST. Non-redundant, non-overlapping microarray probes matching predicted gene coding sequences were identified by megaBLAST against the predicted gene sequences and the Columba livia draft genome sequence.

Phylogenetic analysis of pigeon alpha and beta keratins

Phylogenetic trees were constructed separately for alpha- and beta-keratins. The evolutionary relatedness was inferred using the Minimum Evolution method [40]. The percentage of replicate trees in which the associated taxa clustered together in the bootstrap test (1000 replicates) was calculated [41]. The tree was drawn to scale, with branch lengths in the same units as those of the evolutionary distances used to infer the phylogenetic tree. The evolutionary distances were computed using the JTT matrix-based method [42] and are in the units of the number of amino acid substitutions per site. The ME tree was searched using the Close-Neighbor-Interchange (CNI) algorithm [43] at a search level of 1. The Neighbor-joining algorithm [44] was used to generate the initial tree. All positions containing alignment gaps and missing data were eliminated in pairwise sequence comparisons (Pairwise deletion option). Phylogenetic analyses were conducted in MEGA4 [45] after alignment in ClustalX [46].

Laser dissection microscopy and RNA amplification

PaxGene fixed time 0 pigeon crop of a female and male breeding pair were dehydrated through fresh ethanol and xylene using an automated processor (Leica), and embedded in paraffin according to the PaxGene manufacturer’s instructions. Sections of 4 μm were cut by microtome and floated on to laser dissection slides (Leica #11505158 membrane slides PEN-membrane 2 um). The cornified crop epithelial cells and the basal cells of 5 serial sections of each crop were laser dissected using a Leica LMD6000 machine and collected by gravity into 500 μl PCR tubes. The dissected cells were dissolved in QIAzol by pipetting up and down, and RNA was extracted using the RNeasy Lipid Tissue kit according to the manufacturer’s instructions, and eluted in 30 μl water. RNA was quantified using a Bioanalyzer RNA Pico chip, and an equal amount of RNA of each of the four samples was used for two rounds of RNA amplification using an Ambion MessageAmp II aRNA Amplification Kit, according to the manufacturer’s instructions.

Microarray hybridisation, scanning and data pre-processing

RNA was extracted from whole frozen pigeon crop tissue according to the manufacturer’s instructions (Qiagen RNeasy Lipid Tissue kit). RNA quality and quantity was measured using a Bioanalyser RNA Pico chip and 5 μg of this RNA was used to synthesise first-strand cDNA with oligodt primer according to the manufacturer’s instructions (Invitrogen SuperScript SuperMix) which was then purified using a PCR purification kit (Qiagen). cDNA was synthesised and purified from whole crop RNA and from amplified laser dissected sample RNA. All cDNA samples were labelled with Cy3 using a Roche One-Color DNA Labelling Kit according to the manufacturer’s instructions. The labelled microarray probes were re-suspended with a sample tracking control and hybridisation buffer and loaded on 12-plex 135 k custom pigeon microarrays (ArrayExpress ID A-MEXP-2257). These were hybridised for 20 hours in a NimbleGen Hybridisation Station (Roche) at 42°C and then washed using the NimbleGen wash buffer kit (Roche) according to the manufacturer’s instructions. Each subarray was scanned at 2 μm on autogain with a NimbleGen MS200 microarray scanner (Roche). Sample tracking controls and control spots were used to autoalign a grid over each subarray using NimbleGen MS200 Software (Roche).

Microarray normalisation and statistical analysis

Robust Multichip Average (RMA) analysis [47] was used to background correct and normalise spot signal intensity. To compare datasets hybridised to different slides, the data were subjected to the non-parametric CombatR algorithm to remove batch effects [48]. The datasets were exported into GeneSpring (Agilent) and differentially expressed genes were identified using an unpaired Welch t-test assuming unequal variances with a Benjamini and Hochberg post-hoc test, with a false discovery rate of p = 0.05. The comparison of cell layers from laser dissected RNA omitted the post-hoc test as there were only two samples per group. All microarray data has been deposited into ArrayExpress (accession numbers E-MTAB-1317 for whole crop and E-MTAB-1318 for laser dissected cell layers).

Proliferating cell nuclear antigen immunohistochemistry

Formalin fixed pigeon crop was dehydrated through ethanol and xylene and embedded in paraffin. Sections of 4 μm were dewaxed in xylene and rehydrated through ethanol. Antigen retrieval was performed using the Dako PT Link (97°C for 30 min) while immersed in Target Retrieval Solution High pH (Dako). Following retrieval the sections were quenched with hydrogen peroxide. Sections were then incubated for 1 h with primary antibody (PCNA, 1:8000; Abcam ab29). This was followed by a Mouse linker (EnVision Flex+, Dako) for 15 minutes to enhance the staining. Horseradish peroxidase conjugated secondary antibody (Envision Flex+, Dako) was then applied for 20 minutes. Sections were stained with 3-amino- 9-ethylcarbazole (AEC) substrate chromogen (Dako) for 10 min, and counterstained with Lillie-Mayer’s haematoxylin.

Oil Red O staining and confocal microscopy

70 μm sections of formalin-fixed ‘lactating’ crop tissue were sectioned by vibrating microtome (Leica VT1200S). Sections were stained with Oil Red O according to the method of Lillie and Ashburn [49] and nuclei were labelled with DAPI for 15 min. Following a water wash, sections were mounted with Vectashield (Abacus, Australia). Samples were imaged sequentially for each dye with a Leica SP5 confocal microscope (Leica Microsystems, Sydney).