Physical Traits

Physical Traits
Prev	Chapter 4. Baboon Data: Analyzed	Next

HORMONE_KITS (KITS used to assay HORMONE concentration)

This table contains one row for each kit or protocol that may be used to assay for the concentration of a hormone in a sample. In addition to general information about the kit or protocol (e.g. the assayed hormone, description/discussion), this table also records information regarding how different kits/protocols for the same hormone can be compared to each other.

The Correction column indicates how — or if — results from this kit/protocol should be used with results from other kits/protocols that assay the same hormone. To do this, it indicates what adjustments should be made to correct the "raw" results from this kit. The ESTROGENS, GLUCOCORTICOIDS, HORMONE_RESULTS, PROGESTERONES, TESTOSTERONES and THYROID_HORMONES views all use this column and the corrected_hormone() function to calculate a "corrected" concentration, so all values in this column must be legal correction inputs for that function. Specifically: the text in this column must be readable as a mathematical expression for how the "raw" value should be adjusted, and when referring to the "raw" value, the string %s must be used. See corrected_hormone() for examples of how these values should be recorded.

The system will return a warning for any rows whose Correction does not include a %s.

Tip

When no correction is needed, do not set the Correction column to NULL. "No correction needed" is indicated with a Correction of %s.

When the Correction column is NULL, this is interpreted to mean that it is unknown how to use or compare the kit's results with data from other kits. In related views that provide a "corrected" concentration, this corrected concentration for these results may be NULL (as in HORMONE_RESULTS), or the result may be omitted entirely (as in ESTROGENS, GLUCOCORTICOIDS, etc.).

Example 4.2. Kits with no Correction, and NULL Correction

For a few years, the concentration of "hormone X" was measured using kits made by the Mojo Jojo corporation. A few years later, the Mojo Jojo kit was discontinued, and Hormone X was instead measured using a kit from Utonium, Incorporated. Analyses suggested that results from the Utonium kit are more accurate than the Mojo Jojo kit. The Utonium kit's Correction would therefore be set to %s, and ideally the Mojo Jojo kit's data would be corrected to allow comparison with the Utonium data. However, the actual measurements from the two kits are highly inconsistent with each other, so a reliable correction factor for the Mojo Jojo kit cannot be calculated. Because of this, the Correction column is set to NULL for the Mojo Jojo kit. In a view showing concentrations of hormone X (similar to how ESTROGENS shows concentrations of estrogen), assay results from the Utonium kit are included, while results from the Mojo Jojo kit are omitted.

Columns in HORMONE_KITS

Kit

A unique identifier for this kit or protocol. This is an automatically generated sequential number that uniquely identifies the row.

This column is automatically maintained by the database, cannot be changed, and must not be NULL.

Hormone

The HORMONE_IDS.Hormone whose concentration is assayed by this kit or protocol.

This column may not be NULL.

Correction

A string of text indicating how to "correct" results from this kit/protocol so that its results may be used alongside results from other kits/protocols that assay the same hormone.

This column may be NULL, when an appropriate correction has not been determined or is not possible.

Descr

A textual description or discussion about the kit or protocol, including any miscellaneous comments or notes.

This column may be NULL. This column may not be empty, it must contain characters, and it must contain at least one non-whitespace character.

Sys_Period

The timestamp range during which this row's data are considered valid. See The Sys_Period Column for more information.

HORMONE_PREP_DATA (Lab PREParations performed on hormone samples)

This table contains one row for every laboratory preparation that was performed on a sample as part of a specific series. Each preparation is recorded with a date, and textual comments may be also be noted.

Tip

Always use the HORMONE_PREPS view in place of this table. It contains additional related columns which may be of interest.

Each row's preparation must have taken place after the hormone sample was freeze-dried and sifted. That is, each row's Procedure_Date cannot be before the related HORMONE_SAMPLE_DATA.FzDried_Date and Sifted_Date columns.

Note

The freeze-drying and sifting of fecal samples that are recorded in HORMONE_SAMPLE_DATA are arguably preparatory procedures. Those preparations are not included here because they affect the whole sample. Recording them in this table would incorrectly indicate that the effect of those preparations is limited to a single series.

The procedure cannot occur more than once in the same series. That is, the Procedure must be unique to the HPSId.

The system will return a warning if an ethanol extraction is recorded in the same series as any other preps^[177].

Columns in HORMONE_PREP_DATA

HPId

A unique identifier for this preparation. This is an automatically generated sequential number that uniquely identifies the row.

This column is automatically maintained by the database, cannot be changed, and must not be NULL.

HPSId

The HORMONE_PREP_SERIES.HPSId of the series to which this preparation belongs.

This column may not be NULL.

Procedure

The HORMONE_PREP_PROCEDURES.Procedure performed for this preparation.

This column may not be NULL.

Procedure_Date

The date on which this preparation finished. If a preparation spans multiple days, then the latest date should be used here.

This column may be NULL, when the date is unknown.

Comments

Comments or miscellaneous information about this preparation.

This column may be NULL. This column may not be empty, it must contain characters, and it must contain at least one non-whitespace character.

Sys_Period

The timestamp range during which this row's data are considered valid. See The Sys_Period Column for more information.

HORMONE_PREP_SERIES (SERIES of Lab PREParations performed on hormone samples)

This table contains one row for each series of laboratory preparations and hormone measurements performed on a specific sample.

It seems worthwhile to use an example to illustrate the meaning of a "series" in this context.

Example 4.3. A familiar "series" of events

One day, Little Miss Muffet decides she wants to have a snack. She gets a bowl of curds and whey, sits down on a tuffet, and proceeds to eat. Soon a spider comes along and sits next to her, frightening her and causing her to run away. Video dramatizations of the event often show her dropping, spilling, or otherwise losing the curds and whey.

Taken together, these events comprise a "series" whose component events could be divided into two groups: preparatory events (get food, sit on tuffet, get scared by spider) and results (run away, lose food). Each preparatory event is preparatory for both of the results, and neither result is dependent or contingent on the other result.

To accurately record these events in a database, Miss Muffet's three preparatory events should be connected to each of the two results, and vice versa. Ideally, separate tables of preparations and results should be in a "one-to-one" or "one-to-many" relationship, but the nature of these data prohibits such an arrangement. This inconvenient "many-to-many" relationship can be addressed by designating each of the many events as components of a single "series".

Similar to Miss Muffet, the process of measuring hormones extracted from a tissue sample is divided into preparatory procedures (in HORMONE_PREP_DATA) and results (in HORMONE_RESULT_DATA), and multiple preparation events may apply to each of multiple results. Their troublesome "many-to-many" relationship is managed by this table, in which related events are grouped into a single series.

This table includes a Series column, which identifies the series for the sample. The first row for a sample in this table must have a Series of 1, the second should be 2, and so on. To allow editing or reordering this value, this rule is only checked after the current transaction is committed.

Columns in HORMONE_PREP_SERIES

HPSId

A unique identifier for this series. This is an automatically generated sequential number that uniquely identifies the row.

This column is automatically maintained by the database, cannot be changed, and must not be NULL.

TId

The HORMONE_SAMPLE_DATA.TId of the sample that was worked on in this series.

This column may not be NULL.

Series

A positive integer that identifies the series for this sample.

This column may not be NULL.

Sys_Period

The timestamp range during which this row's data are considered valid. See The Sys_Period Column for more information.

HORMONE_RESULT_DATA (RESULTs of HORMONE concentration assays)

This table contains one row for the result of every assay performed to measure a hormone concentration in a sample. In addition to the assay's result — the concentration in nanograms of hormone metabolite per gram of fecal sample — each row records the series (from HORMONE_PREP_SERIES) to which this result belongs, the date of the assay, the identity of the "kit" or protocol used to perform the assay, and the mass of sample from which this hormone was extracted. Textual comments may be also be noted.

Tip

There are many views that may be preferred in place of this table. The HORMONE_RESULTS view adds some related columns that make this table more legible for human consumption. Also, there are several views dedicated to specific hormones (ESTROGENS, GLUCOCORTICOIDS, PROGESTERONES, TESTOSTERONES, THYROID_HORMONES) that show assay results with relevant information about the sample and any preps involved with generating the result.

Each row's assay must have taken place after all related preparatory procedures were performed. That is, each row's Assay_Date cannot be before the Procedure_Date of any HORMONE_PREP_DATA rows from the same series (with the same HPSId).

When assay results are generated in the lab, a sample may undergo more than one assay for the same hormone. When there are multiple results, it becomes necessary to determine what the "right" concentration for the sample actually is. Often, it may be best to use the average of all results. In some cases, the results from one kit may be universally preferred over results from another kit. And so on. Laboratory and data managers are presumed to be better-qualified to make such decisions, so those decisions should be made before adding data to this table. To help ensure that this is occurring, a hormone sample cannot have more than one result for each hormone, regardless of the series and the Kit. That is, the combination of the related HORMONE_PREP_SERIES.TId and HORMONE_KITS.Hormone must be unique^[178].

The mass of fecal sample that was extracted and measured in the assay is recorded in the Grams_Used column. This column should not be NULL. However, in some rare circumstances the mass can be unknown, in which case the column will be NULL. Regardless, this is expected to be rare. The system will return a warning for any assay whose Grams_Used is NULL.

Columns in HORMONE_RESULT_DATA

HRId

A unique identifier for this assay result. This is an automatically generated sequential number that uniquely identifies the row.

This column is automatically maintained by the database, cannot be changed, and must not be NULL.

HPSId

The HORMONE_PREP_SERIES.HPSId of the series to which this assay result belongs.

This column may not be NULL.

Kit

The HORMONE_KITS.Kit used to perform this assay.

This column may not be NULL.

Assay_Date

The date that the assay was performed.

This column may be NULL, when the date is unknown.

Grams_Used

The initial mass of fecal sample from which the assayed hormone was extracted.

This column may be NULL, when the mass is unknown.

Raw_ng_g

The "raw" concentration of the hormone, in nanograms hormone per gram of fecal sample, determined by this assay.

This column may not be NULL.

Comments

Comments or miscellaneous information about this assay.

This column may be NULL. This column may not be empty, it must contain characters, and it must contain at least one non-whitespace character.

Sys_Period

The timestamp range during which this row's data are considered valid. See The Sys_Period Column for more information.

HORMONE_SAMPLE_DATA (Tissue SAMPLEs used for HORMONE analysis)

This table contains one row for every tissue sample used for hormone analysis; generally for hormone analysis these will be fecal samples. For each sample, the table records data that are only relevant to hormone analysis and would thus be inappropriate for inclusion in the TISSUE_DATA table.

Tip

Always use the HORMONE_SAMPLES view in place of this table. It contains additional related columns which may be of interest.

For various logistical reasons, it is often not practical for lab personnel to use the database's unique identifier for a sample (its TId) in the lab. Instead, they use a system of their own. The unique identifier used by hormone lab personnel — the "Hormone Sample ID" — is indicated in the HSId column^[179].

To analyze a fecal sample for its hormone content, the sample must first be freeze-dried. Following that, the dry sample is sifted into a fine powder, at which point it is ready for whatever preparations are necessary for hormone analysis. The dates that the fecal sample is freeze-dried and sifted are recorded in this table, in the FzDried_Date and Sifted_Date columns, respectively.

A fecal sample cannot be sifted before it is freeze-dried; its Sifted_Date must be on or after its FzDried_Date.

This table attempts to keep an ongoing record of a fecal sample's remaining mass in the Avail_Mass_g column. It is left to the user to judge this column's accuracy, which depends greatly on how diligently the lab personnel keep the data manager(s) informed of changes. To assist users in making these judgments, the date that the Avail_Mass_g was last updated is recorded in the Avail_Date column. A sample's remaining mass cannot be recorded without also recording this date; the Avail_Mass_g and Avail_Date columns must both be NULL or both non-NULL.

Preparing a fecal sample for hormone extraction and any subsequent handling of the sample must be after the sample was collected. That is, all dates in this table (FzDried_Date, Sifted_Date, Avail_Date) must be after the sample's related TISSUE_DATA.Collection_Date.

Columns in HORMONE_SAMPLE_DATA

TId

The TISSUE_DATA.TId of the tissue sample and the unique identifier for the row.

This column cannot be changed and must not be NULL.

HSId

The "Hormone Sample ID", another unique identifier for this sample. This is a number, created and maintained by lab personnel.

This column may not be NULL.

FzDried_Date

The date that this sample was freeze-dried.

This column may be NULL, when this date is unknown.

Sifted_Date

The date that the freeze-dried sample was sifted.

This column may be NULL, when this date is unknown.

Avail_Mass_g

The mass of this sample, in grams, that is available for use as of the Avail_Date.

This column may be NULL, when the remaining mass (if any) is unknown.

Avail_Date

The date that the Avail_Mass_g was determined.

This column may be NULL, when the remaining mass (if any) is unknown.

Comments

Comments or miscellaneous information about this sample.

This column may be NULL. This column may not be empty, it must contain characters, and it must contain at least one non-whitespace character.

Sys_Period

The timestamp range during which this row's data are considered valid. See The Sys_Period Column for more information.

HYBRIDGENE_ANALYSES

A table listing each analysis that has been performed to generate genetic hybrid scores for individual baboons, with basic information about each analysis.

Each analysis combines statistical techniques with the genetic data available at the time to estimate what proportion of each individual's genome came from ancestry of a specified other species^[180]. These estimates are the so-called “hybrid scores”. After several years have elapsed, more individuals are available for scoring, which prompts a new analysis. For many reasons, each analysis may yield somewhat different scores for the same individual. A more-recent analysis does not necessarily negate or supersede an older one, so all analyses are stored here.

The HYBRIDGENE_ANALYSES.Date must be after the BIOGRAPH.Entrydate of all individuals scored in that analysis in HYBRIDGENE_SCORES. Similarly, the system will return a warning for each individual scored in an analysis where the related Date is before the individual's LatestBirth.

Column Descriptions

HGAId (HybridGene_Analyses Identifier)

A unique integer that identifies the analysis.

This column is automatically maintained by the database, cannot be changed, and must not be NULL.

Date

The date the hybrid scores were generated by this analysis.

This column may not be NULL.

Analyzed_By

The LAB_PERSONNEL.Initials of the person who performed the analysis.

Tip

It's technically possible to have more than one person involved with an analysis, but even in such cases there will certainly be a lead whose initials should fill this column.

This column may not be NULL.

Software

The HYBRIDGENE_SOFTWARE.Software used to perform the analysis.

This column may not be NULL.

Marker

The MARKERS.Marker type used in the analysis.

This column may not be NULL.

Comments

Notes or comments about the analysis.

This column may not be empty, it must contain characters, and it must contain at least one non-whitespace character.

This column may be NULL.

Sys_Period

The timestamp range during which this row's data are considered valid. See The Sys_Period Column for more information.

HYBRIDGENE_SCORES

A table listing all the hybrid scores determined by genetic hybridity analyses. Hybridity analyses use statistical tools that may also determine upper and lower confidence intervals^[181]. This table also stores those values, if any.

The combination of Sname and HGAId must be unique.

In some analyses, upper and lower confidence intervals are not generated, in which case the Upper_Conf and Lower_Conf will be NULL. The system will return a warning in this case.

If either Upper_Conf or Lower_Conf is provided, then the other must also; the Upper_Conf and Lower_Conf must both be NULL or both non-NULL.

When the confidence columns are not NULL, the individual's Score must be greater than its Lower_Conf (inclusive), and less than its Upper_Conf (inclusive).

Column Descriptions

HGSId (HybridGene_Scores Identifier)

A unique integer that identifies the row.

This column is automatically maintained by the database, cannot be changed, and must not be NULL.

HGAId (HybridGene_Analyses Identifier)

The HYBRIDGENE_ANALYSES.HGAId of the analysis in which this score was determined.

This column may not be NULL.

Sname

The BIOGRAPH.Sname of the scored individual.

This column may not be NULL.

Score

The individual's hybrid score for this analysis. This value must be between 0 and 1 (inclusive).

This column may not be NULL.

Lower_Conf

The lower confidence interval for the hybrid score. This value must be between 0 and 1 (inclusive).

This column may be NULL, but only when the analysis did not generate a lower confidence.

Upper_Conf

The upper confidence interval for the hybrid score. This value must be between 0 and 1 (inclusive).

This column may be NULL, but only when the analysis did not generate a higher confidence.

Sys_Period

The timestamp range during which this row's data are considered valid. See The Sys_Period Column for more information.

HYBRIDMORPH_OBSERVERS (Observers of hybrid morphology scores)

This table records the observers for the hybrid morphology reports, one row for each observer.

A report cannot have the same observer more than once; each HMRId-Observer must be unique.

In practice, an observer does not score an individual more than once on the same date. However, the indicated Date for a score might not be be the actual date of the observation, for reasons discussed below. Because of this, each combination of Observer and related HYBRIDMORPH_REPORTS.Date and Sname should be unique but occasionally will not. The system will return a warning for each non-unique Sname-Date-Observer that appears in these data.

Column Descriptions

HMOId

A unique identifier for this observer. This is an automatically generated number that uniquely identifies the row.

This column is automatically maintained by the database, cannot be changed, and must not be NULL.

HMRId

The HYBRIDMORPH_REPORTS.HMRId of this report.

This column may not be NULL.

Observer

The OBSERVERS.Initials of the observer.

This column may not be NULL.

Sys_Period

The timestamp range during which this row's data are considered valid. See The Sys_Period Column for more information.

HYBRIDMORPH_REPORTS (Hybrid morphology score collection events)

This table contains one row for each time an individual's hybrid morphology is assessed and scored. For details about these scoring events, see the Amboseli Baboon Research Project Monitoring Guide.

Observers do not always record the exact date of the scoring. Instead, they may only record the year and month. When this occurs, the Date is recorded as the first day of the indicated month. It is not possible to discriminate between this and a report that was truly collected on the first day of that month.

An individual cannot be scored before they were first observed, nor after they were last seen alive; the year and month of the Date cannot be before the year and month of the individual's Entrydate, nor can the Date be after the individual's Statdate.

A hybrid score report includes the name of the baboon group in which the scored individual lives. Recorded in the Grp column, this value is provided by the observer(s) and is useful for finding the original paper sheet where the scores were recorded.

Tip

This table's Grp is usually but not necessarily the individual's Grp in MEMBERS on that date. To find the individual's group on the day of scoring, use MEMBERS.

It is difficult to recognize in real time when a group fission/fusion has begun or ended, so the observer(s) might record a Grp that no longer exists. The system will return a warning for any row in this table whose Grp did not exist on the indicated Date.

Column Descriptions

HMRId

A unique identifier for this report. This is an automatically generated number that uniquely identifies the row.

This column is automatically maintained by the database, cannot be changed, and must not be NULL.

Sname

The BIOGRAPH.Sname of the individual whose hybridity is assessed.

This column may not be NULL.

Date

The date of this report.

This column may not be NULL.

Grp

The GROUPS.Gid of the baboon group in which the observer(s) indicated that this scoring occurred.

This column may not be NULL.

Sys_Period

The timestamp range during which this row's data are considered valid. See The Sys_Period Column for more information.

HYBRIDMORPH_SCORE_DATA (Hybrid morphology scores)

This table contains the hybrid morphology scores that the observer(s) recorded for each of the various traits. Each row is a single score for a single trait.

Tip

Use the HYBRIDMORPH_SCORES view instead of this table. It shows the scores in one row per report, and can also be used to insert, update, or delete data from this table.

An individual cannot be scored more than once at a time for a single trait; every HMRId-Trait must be unique.

Scores are recorded for seven traits: 1) coat color, 2) hair length, 3) body shape, 4) head shape, 5) tail length and thickness, 6) tail bend at the "hook", and 7) muzzle skin. See the Amboseli Baboon Research Project Monitoring Guide for more information.

Which traits are scored depends on the age and sex of the individual. Adults — females age 4 or higher, males age 5 or higher — are scored on all seven traits. Juveniles — females under 4, males under 5 — are only scored on the first five. It is an error to record a score for those last two traits if the scored individual is a juvenile on the related Date. Because the Date may not be exactly right, this rule is enforced with "month"-level precision. E.g. a female cannot have any scores for those last two traits on any date in any month prior to the month in which she becomes 4 years old.

If a report has scored an individual for any trait, there should be scores recorded for all the traits, juveniles and the last two traits notwithstanding. The system will return a warning for any HMRId that does not have scores for all of its expected traits.

Column Descriptions

HMRId

The HYBRIDMORPH_REPORTS.HMRId of the report in which this score was recorded.

This column cannot be changed and must not be NULL.

Trait

The trait being scored. This column must be one of the following:

HYBRIDMORPH_SCORE_DATA.Trait Values

Code	Meaning
`COLOR`	Coat color
`HAIR_LENG`	Hair length
`BODY_SHAPE`	Body shape
`HEAD_SHAPE`	Head shape
`TAIL_LENG`	Tail length and thickness
`TAIL_BEND`	Tail bend at the "hook"
`MUZZLE`	Muzzle skin

This column cannot be changed and must not be NULL.

Score

The individual's score for this trait. This number must be a multiple of 0.5, and must be between 0 and 2.

This column may not be NULL.

Sys_Period

The timestamp range during which this row's data are considered valid. See The Sys_Period Column for more information.

Prev	Up	Next
Group Membership and Life Events	Home	SWERB Data