Tutorial for Windows and Macintosh
Mitochondrial DNA Typing
© 2007 Gene Codes Corporation
Gene Codes Corporation 775 Technology Drive, Ann Arbor, MI 48108 USA 1.800.497.4939 (USA) +1.734.769.7249 (elsewhere) +1.734.769.7074 (fax) www.genecodes.com
[email protected]
Mitochondrial DNA Typing Open a Project Template .............................................................................................................. 3 Import Sample Data..................................................................................................................... 4 Comparing the Data..................................................................................................................... 4 Validate mtDNA Profiles ............................................................................................................... 7 Create an Analyst Report.............................................................................................................. 7 Export mtDNA Profile................................................................................................................... 8 Evaluate Multiple Samples............................................................................................................ 9
Gene Codes Corporation ©2007
Mitochondrial DNA Typing
p. 2 of 10
Mitochondrial DNA Typing Tutorial There are two versions of Sequencher, the standard version, which has become the favorite for sequence analysis since its release in 1991, and the forensic version. The forensic version of Sequencher has specific functionality to meet the needs of forensic scientists analyzing mitochondrial (mtDNA) sequence. It is the standard for mtDNA analysis in forensic laboratories across the country. An mtDNA profile is a list of the differences between the mtDNA sequence of a forensic sample and the sequence of a standard reference. The accepted standard sequence for comparison is the revised Cambridge Reference Sequence, rCRS. In most instances the entire mitochondrial genome is not used for comparison, but just the region around the Dloop (16024-576), the control region that does not code for proteins. This region, which crosses the origin of replication, includes 1122 bases. The region around the D-loop has a high degree of variation, particularly in the two hyper variable regions, HV1 (16024–16365) and HV2 (73–340), and so is useful in forensic science. This tutorial builds on the more generic Reference Sequence Tutorial. Please run through the Reference Sequence Tutorial first if you are not yet familiar with the applications covered there. The Mitochondrial DNA Typing Tutorial begins with forensic samples that have already been assembled and edited. You are now ready for validation and reporting of your results. This tutorial is structured to give you an overview of the tools available for mtDNA sequence analysis in a forensic environment. It is not a substitute for standard protocols. The samples in this tutorial consist of four separate Sequencher projects. They are two samples, 90 and 94, prepared by two analysts. In this tutorial we will: Confirm the reported differences in the Variance Table. Validate the reports from the two analysts. Create an export of the mtDNA profiles in a format that is compatible with the CODIS data format.
OPEN A PROJECT TEMPLATE • •
Launch Sequencher. From the File menu, choose New Project From Template > rCRS.
Sequencher opens a new project that contains the user preferences, sequences, and contigs contained in the Template Project rCRS. You can see in the Comments section of the Project window that rCRS is the revised Cambridge Reference Sequence. The sequence has already been configured to correspond to the numbering of the mitochondrial genome around the HV1 and HV2 regions.
Gene Codes Corporation ©2007
Mitochondrial DNA Typing
p. 3 of 10
•
Double-click on the rCRS sequence icon to open the sequence.
Note that after base position 16569, the sequence numbering starts again at base position 1 which is selected above. •
Close the rCRS sequence window to return to the Project window.
IMPORT SAMPLE DATA Now that you have your reference sequence you are ready to import your sample data. • • • •
From the File menu, choose Import > Sequencher Project… Navigate to Applications/Sequencher 4.8/Sample Data/Mitochodrial Sequence/mtDNA Profiles. Select 90.SPF and click Open. Repeat to import 082790.SPF and click Open.
The two imported contigs contain data from the same sample, but it was sequenced in two different labs. The process of editing and assembling sample data into a contig is described in detail in the Reference Sequence Tutorial.
COMPARING THE DATA Because both contigs are assembled to the same Reference Sequence, the rCRS, it is possible to compare equivalent base positions. •
Select both contigs and then select Contig > Compare Consensus to Reference.
Gene Codes Corporation ©2007
Mitochondrial DNA Typing
p. 4 of 10
•
In the bottom left corner of the Variance Table, click on the open elevator icon widths for the display of the full names of the contigs.
to optimize the column
Each row in the Variance Table represents a position in the Reference Sequence that has a difference in one of the sample sequences. Each column represents a sample sequence. The sample sequences differ from the Reference Sequence at positions 73 and 195. Every difference that is reported in one is confirmed in the other. However, the fact that each of the cells that contain the name of a contig is shaded in pink indicates that the data in the contigs does not cover the full length of the rCRS. •
Double-click on the G in column 082790.
This opens the Variance Table in Review mode so that you can now see all of the data that supports each difference reported in the table. You can also customize the layout of the windows, so that Sequencher will remember how you like to work. •
Rearrange the windows so that you optimize the view of the Chromatograms, the Contig Editor, and the Variance Table. The following is an example.
Gene Codes Corporation ©2007
Mitochondrial DNA Typing
p. 5 of 10
•
From the Window menu, select Remember Window Layout > Single Variance Table Review.
Now, whenever you execute the Variance Table Review mode, Sequencher will display the windows in the configuration that works optimally for your projects. When you select the cell to the right in the Variance Table, you display the equivalent base position in the next contig. When you move your cursor down in the Variance Table, you update the display to the next difference in the same contig. • •
Move your cursor within the four cells in the Variance Table to navigate to all of the differences in the contig. Close the Variance Table and the Chromatogram and Contig Editor windows to return to the Project window.
Because this Variance Table, in contrast to the more simple Variance Tables created from the Sequence menu Compare To commands, is based on a comparison to a consensus sequence, the results of the table will vary depending on the method for calculating the consensus. • • •
Change the method of calculating the consensus from Plurality by selecting Contig > Consensus Inclusively. With the two contigs selected, again execute the Compare Consensus to Reference command from the Contig menu. Expand the name columns by clicking on the open elevator button.
When the consensus is computed inclusively, the rare ambiguous base also makes a contribution to the consensus. Thus 082790 now has two ambiguities, one at position 16,163 and the other at 183. •
Double-click on the “N” cell at reference position 16,163.
Sequencher opens the Contig and Chromatogram Editor windows in the configuration saved from your earlier work. The chromatogram data shows some “G” noise, but there is “A” support in each of the five chromatograms. •
In the Variance Table, type “A” over the ”N” at position 16,163.
Sequencher will make the appropriate edits in the Contig Editor, the Chromatogram Editor, and the Variance Table; however, although there are no longer any differences to the Reference Sequence at base position 16,163, this row will persist until you click on the Refresh button. • •
Click on the “N” in the Variance Table cell at position 183. With your cursor still in the Variance Table, type an “A” to support the chromatogram data.
Now all of the data between the two analysts support each other, so you are now ready to create a report. •
Close the Variance Table and the Chromatogram and Contig Editor windows to return to the Project Window.
Gene Codes Corporation ©2007
Mitochondrial DNA Typing
p. 6 of 10
VALIDATE MTDNA PROFILES In order to submit data to some databases, such as the CODIS database, you must validate your results. Sequencher has several reports and views to facilitate the validation of the mtDNA profiles. •
Select the two contigs, 90 and 082790, and then select Contig > Validate mtDNA Profiles.
The top of the Validate mtDNA Profiles view shows you that you have complete agreement between the work done by the two analysts. The rest of the document gives you additional information concerning the two reports, including the range of bases for which you have data on both the HV1 and HV2 side of the origin, and alerts (in red text) that let you know if your data fails to meet the requirements for HV1 and HV2 coverage.
The warning of incomplete coverage for HV1 and HV2 alerts you when your data does not cover all bases between 16024 – 16365 or between 73 – 340. Because the experimental data for both HV1, 16024 to 16235, and HV2, 33 to 255, do not cover the full range of these regions, you are given the “Incomplete Coverage for HV1, HV2” warning. Once you are satisfied with the validation of your data, you may create a report and export the profile data in CMF format. While you cannot generate reports, save, or export in the demo version of Sequencher, you may review an example of the report below.
CREATE AN ANALYST REPORT A text version of the mtDNA profile validation may be saved or printed for your records. •
Click on the Create Report button at the top of the window.
Sequencher presents you with the following report to save or print:
Gene Codes Corporation ©2007
Mitochondrial DNA Typing
p. 7 of 10
•
Close the Analyst Report window.
EXPORT MTDNA PROFILE Once you have validated your results, you are ready to export the profile in the CMF format that is compatible with the CODIS database. •
Click on the Export CMF button at the top of the Validate mtDNA Profiles window.
You are presented with a dialog that allows you to enter the additional data required for the CMF export. The dialog is divided into two portions. The top half contains required information, the bottom half optional. If you float your cursor over any of the fields for entry, Sequencher will provide hints for entering the data to specification. Below is the tip that opens when you hover your cursor over the Specimen ID.
•
Complete the data for each of the required fields.
Gene Codes Corporation ©2007
Mitochondrial DNA Typing
p. 8 of 10
• • •
Click on the Export button. Select a location to Save to or Cancel. (Note that the file created on export is in xml format.) Close the Validate mtDNA Profiles window.
EVALUATE MULTIPLE SAMPLES It is also possible to evaluate multiple samples in the same project. • • • • • • •
From the File menu, choose Import > Sequencher Project… Navigate to Applications/Sequencher 4.8/Sample Data/Mitochodrial Sequences/mtDNA Profiles. Select 94.SPF and click Open. Repeat to import 120294.SPF and click Open. Navigate to Select > Select All. Select Contig > Consensus By Plurality. Select Contig > Compare Consensus to Reference.
The resulting Variance Table displays the results from both sets of data. The pink X’s reflect positions where there is a difference in one sample that is not supported by data in the sample with the pink X. For instance, at position 73 there is an A to G difference in samples 90 and 082790, but we don’t know if this difference also occurs in samples 120294 and 94, because there is no data covering this region. •
Click on the Total button in the lower left-hand corner of the Variance Table.
The samples are now sorted by their total number of differences. This has the effect of grouping the sequences with similar differences. • •
Double-click on any of the cells to enter the Review mode. Navigate across the Variance Table to view the data at positions 195 and 200.
A Variance Table with more than one sample source gives you an opportunity to compare a variety of chromatogram data that yield different base calls. This can often be valuable information from which you can draw stronger conclusions. The Variance Table allows for the comparison of hundreds of contigs. In contrast, the Validate mtDNA Profiles window specifically functions to compare two equal mtDNA profiles. If you select two different mtDNA profiles, the Validate mtDNA Profiles window will alert you that the two sets of data do not support each other. • •
Close the Variance Table and the Chromatogram and Contig Editor windows to return to the Project window. Select contigs 90 and 94.
Gene Codes Corporation ©2007
Mitochondrial DNA Typing
p. 9 of 10
•
From the Contig menu, select Validate mtDNA Profiles.
You will get the following display:
The yellow background in the reference column tells you that the data in one mtDNA profile is unsupported by the other. The yellow background in the mtDNA profile columns brings your attention to conflicts between the data. •
Close the project using File > Close Project.
Gene Codes Corporation ©2007
Mitochondrial DNA Typing
p. 10 of 10