|   DataDownload Data
 Variables
 DocumentationAbout Slave PUMS
 User's Guide
 Revision History
 Citation and Use
 Site ToolsContact UsSearch This Site
 Site Map
 Feedback
 Staff
 Mailing Address
   | User’s Guide The Public Use Microdata Samples of the Slave Population
        are a series of six samples taken from the 1850 and 1860 Censuses of
        Slave Inhabitants. The questions included in the Census of Slave Inhabitants
        were relatively few: schedules from both years asked slaves’ age,
        sex, color, and disabilities including whether deaf, dumb, blind, insane
        or idiotic. Enumerators were allowed to record slaves’ names in
        the 1860 census if the slave was over 100 years of age. Slaveholders also
        provided information about the number of slaves who had been manumitted
        or escaped in the prior year. In 1860, the number of houses provided to
        slaves on the slaveholding was also tallied. We have provided two samples from 1850 and three samples
        from 1860. The 1850 samples include: 
        a Flat sample, where all slaveholdings had an equal probability of
          being included, andan Urban/Group Quarters sample, where slaves from urban areas and
          those living in large holdings were sampled under different rules (see
          below).a sample linking 1850 Slave data to the 1850 Free population data available in IPUMS-USA The 1860 samples include: 
         a Flat sample,a Group Quarters sample, where slaves living in large holdings were
          sampled under different rules, anda Complete Count sample that includes the complete population of slaves
          in selected southern counties. All samples other than the 1850 linked sample have the same file structure, and all except
        for the 1860 Complete Count sample are nationally representative (when
        weights are applied). We recommend that most researchers use the Flat samples.
        They are adequate for almost any purpose, and the use of weights is optional.
        The 1850 Urban/Group Quarters sample and the 1860 Group Quarters sample
        are intended for more specialized purposes. In particular, they provide
        a much more precise sample of slaves living in large holdings. The 1850
        Urban/Group Quarters sample also provides an oversample of those living
        in cities. The final sample, the 1860 Complete Count dataset, should be
        used with great caution--it is not nationally representative. Data FormatAll six datasets have the same rectangular column-format structure. The
        unit of analysis is the individual slave (or free person, in the case of the 1850 linked sample). Since the file structure is
        rectangular, all holding-level variables—such as slaveholder type
        or number of fugitives from the holding—are attached to all of the
        individual slaves within each holding.
 Users making tabulations of any holding-level variables
        should do so with caution. To make an accurate tabulation of the number
        of slaveholders from any of the slave samples, for instance, one would need to select only the first
        slave record in each holding (using the variable SLAVENUM). Sample weights must be applied for all analyses of the 1850
        Urban/Group Quarters sample. Sample weights are optional in when using
        the 1850 and 1860 “flat” samples, and the 1860 Group Quarters
        sample. The 1860 Complete Count sample covers only selected southern counties
        and therefore cannot be made representative even with the use of weights. 1850 Flat SampleThis sample contains approximately 1-in-20 sample of the slaves enumerated
        in 1850. The sampling strategy for this sample is based on the manuscript
        page from the slave schedule. Each page contains 84 lines in the 1850
        schedules. Each line describes one or more slaves of a particular age,
        sex, color and disability. (Typically, each line contained information
        for only one slave.) We randomly generated a “window” of 4
        sample points on every page (e.g., lines 24, 25, 26, and 27).
 To ensure that holdings had an equal probability of being
        included in the sample regardless of their size, a holding was entered
        only if a sample point fell on the line containing the holding’s
        first slave. When the sample point fell on any other line, the holding
        was skipped. If the holding was included, all slaves were entered. Because
        this sample is a flat sample, the use of sample weights is not required.
      1850 Urban/Group Quarters SampleFor most of the slaveholding United States, this sample is also an approximately
        1-in-20 sample. Also included, however, is an oversample of cases from
        the counties that contained the South’s ten largest cities –
        Baltimore, Maryland, New Orleans, Louisiana, St. Louis, Missouri, Louisville,
        Kentucky, Charleston, South Carolina, Washington, DC, Richmond, Virginia,
        Mobile, Alabama, Savannah, Georgia, and Norfolk, Virginia. In these ten
        counties, the sampling rate is 1-in-4.
 Additionally, we employed a different sampling strategy
        for slaveholdings including 100 of more slaves. Like the 1850 Flat sample,
        the sampling windows were only accepted if it fell on the first line of
        a slaveholding, provided the slaveholding had fewer than 100 slaves. If the slaveholding had 100 or more slaves, however, a modified
        group-quarters sampling strategy was followed, much like that used in
        the IPUMS-USA samples. When the sampling window fell within a slaveholding
        with 100 or more slaves, data would be recorded for the slaves within
        the sampling window. All slaves in the sampling window were included,
        even if no slave in the window was on the first line of a holding. When
        the sampling winding included the first slave in a holding of 100 or more,
        only those in the window were included (i.e., we did not include ANY intact
        holdings of 100 or more slaves in the group quarters samples). The chief advantage of this approach is that the number
        of large slaveholdings that are represented in the sample is far higher
        than would be the case otherwise. This increased representation ultimately
        increases precision of any analysis of slaves living in large holdings.
        Because the 1850 Urban/Group Quarters Sample contains two distinct sampling
        rates, users must apply sample weights to obtain representative statistics
        of the slave population (see the WEIGHT variable for details). 1850 Linked Slave/Free Population SampleThis dataset is a 1-in-100 sample of the slave and free population in 1850. The dataset contains all persons present in the original 1850 IPUMS sample, regardless of whether or not they were slaveholders. For all free persons, there are variables identifying serial and person number in the original 1850 IPUMS data as well as slaveholder status.  For all slave persons, there are variables indicating serial number of the slaveholding household in which they lived, location of the slaveholding on the slave schedules, and age, race, and sex of slave. Using the SERIAL and PERNUM variables, this data can easily be integrated with the 1850 Free Population data available at the IPUMS-USA website.
 1860 Flat SampleThis sample contains approximately 1-in-20 of the slaves enumerated. The
        sampling strategy for this sample is based on the manuscript page from
        the slave schedule. Each page contains 80 lines in the 1860 schedules.
        Each line describes one or more slaves of a particular age, sex, color
        and disability. (Typically, each line contained information for only one
        slave.) As with the other samples, we randomly generated a “window”
        of 4 sample points on every page. Only cases where the window fell on
        the first line of the slaveholding were included. When the sample point
        fell on any other line, the holding was skipped. If the holding was included,
        all slaves were entered. Because this sample is a flat sample, the use
        of sample weights is not required.
 1860 Group Quarters SampleThis also is a 1-in-20 sample, but unlike the 1860 Flat Sample, group
        quarters sampling rules are used here for slaveholdings that contained
        at least 100 slaves. When the sampling window fell within a slaveholding
        with 100 or more slaves, data would be recorded for the slaves within
        the sampling window. All slaves in the sampling window were included,
        even if no slave in the window was on the first line of a holding. When
        the sampling winding included the first slave in a holding of 100 or more,
        only those in the window were included (i.e., we did not include ANY intact
        holdings of 100 or more slaves in the group quarters samples). Like the
        flat samples, this sample does not require the use of weights.
 1860 Complete Count SampleThis sample is contains all data from a random series of microfilm reels
        from across the South. Within the selected reels, data from all slaves
        was recorded and appears within this sample. The selected reels include
        1 reel from 12 different states and the District of Columbia. As this
        sample is not representative, it will be primarily useful for scholars
        analyzing aspects of slavery for places that appear in this sample.
 |  |