Data revision history
State Summary/State Characteristics Longitudinal Data File (PUSUM)
Robert E. Molyneux
Abstract
This page documents a longitudinal set of state summary public library data constructed from two sources. One is the annual Public Library Survey formerly collected by the National Center for Education Statistics (NCES) under the Federal State Cooperative System (FSCS) program and more recently from the continuing series published by the Institute of Museuem and Library services. This compilation is of the State Summary/State Characteristics Data File, that is Public Use Summary (hence, I conclude, PUSUM which convention NCES and IMLS have named files in this series). The dataset now contains 884 observations (an observation being summary data from the libraries in one state for one year), with 66 variables per observation, although not all variables were collected all years. These data are from the states, the District of Columbia, and occasionally territories.
The summary data were originally compiled and published for the years Fiscal Year (FY) 1987-FY 2008 although for reasons discussed shortly, the set of data discussed here only has the data from FY 1992-FY 2008 or 17 years. The data are available in formats discussed at the bottom of the page under Distribution and Formats.
The state summary data are fundamentally different from the Public Library Data File which is a compilation of data from 10,000 public libraries collected from FY 1987-FY 2008. These summary data are the result of analyzing aspects of those data and other data. Given that not all libraries respond to all survey items, NCES and IMLS have imputed those missing data. Documentation is carried in the various sources cited in the list of NCES and IMLS Public Library Survey publications elsewhere on this site. PUSUM contains imputed data given that it is attempting to compensate for the missing values for purposes of arriving at national assessments of the state of public libraries. Knowing this, could you replicate these summary data using the Public Library Data File either in its unimputed or imputed forms? Probably not. The documentation on how the imputations were done is sketchy but improving but does involve data that are masked in the Public Library Data file for privacy reasons so IMLS has access to data suppressed for individual libraries. Data are suppressed in smaller libaries, for instance, salary information when the staff is sufficiently small to cause a privacy concern. Of course, you could do your own imputations. More telling, though, is the fact that these summary files are also constructed using the Restricted Use data, in part. These data are not available for public use.
A further complication follows from this practice of imputation. The documentation for several years (here 2007) specifically warns:
Comparisons to data prior to FY 92 should be made with caution, as earlier data files do not include imputations for nonresponse, and the percentage of libraries responding to a given item varied widely among states. (p. 13)According to the documentation from earlier years, there are supposed to be summary data from FY 1987-FY 1991. 1991 is made up of a summary from the reporting libraries so that comparing FY 1991 with FY 1992 is likely to show discontinuities that are an artifact of the imputations not from any change in the libraries. For FY 1987-FY 1989, in spite of the documentation's saying there are summary data, they are sketchy. FY 1987 has a few variables reported for the 19 states' summaries reported. Summary data for FY 1988 were not with the rest of the FY 1988 data and I don't know where they are. FY 1989 does have summary data in a separate file as does FY 1990. All imputations could be taken out of the post-1992 data but experimenting with that idea resulted in derivative datasets with very few values. So, we get some data imputed by methods that are not clear or we get vanishly few data.
Variables
The tables below list the variables that were dealt with in constructing PUSUM. The intent was to group all like variables in the various years in one variable name. Rarely was there any ambiguity in reading the documentation from year to year.
Two variables have been added to each year's data in order to ease processing: state and year. (NCES/IMLS variables are in upper case while any non-NCES/IMLS data are in italics. year was hard coded in one of the programs and is the nominal year of collection and the same as the fiscal year that appears on publications. The NCES/IMLS library data have been issued as annual publications without explicit acknowledgement in the data of previous years or years that might follow. State is the two-digit postal code. This variable was made from various variables in the dataset as noted below. All NCLIS datasets use the same conventions.
A spreadsheet (Open Office, xls) presents these variables in another format.
The following table has details on the variables in PUSUM. The convention followed is to use the 2008 variable name except where the data are no longer collected and the short definitions and notes come, largely, from the documentation for FY 2008. The wording differs on occasion for variables documented in the earlier years but the definitions overall are pretty consistent. Imputation flags listed in the documentation are not included in this list nor in PUSUM.
| Variable | Short definition | Years | History | Notes |
|---|---|---|---|---|
| state | Two-letter State Code | ALL | Created from STABR | Used for processing. |
| year | Year of data collection | ALL | Inserted by code | Used for processing. |
| STABR | Two-letter Federal Information Processing Standards (FIPS) State Code | ALL | STABR (1992-) | See PUB_FIPS, below. |
| Variable | Short definition | Years | History | Notes |
|---|---|---|---|---|
| POPU_LSA | Population of the Legal Service Area | ALL | POPU (1992-97) POPU_LSA (1998-) |
|
| POPU_UND | Unduplicated population of the legal service area for the library (constructed variable). | ALL | PERIOD_POP (1992-1997) POPU_UND (1998-) |
|
| POPU_ST | Official state population estimate | ALL | PERIOD_EST (1992-1997) POPU_ST (1998-) |
| Variable | Short definition | Years | History | Notes |
|---|---|---|---|---|
| CENTLIB | Number of central libraries | ALL | CENTLIB (1992-) | |
| BRANLIB | Number of branch libraries | ALL | BRANLIB (1992-) | |
| BKMOB | Number of bookmobiles | ALL | BKMOB (1992-) |
| Variable | Short definition | Years | History | Notes |
|---|---|---|---|---|
| MASTER | ALA-MLS Librarians | ALL | MASTER (1992-) | |
| LIBRARIA | Total number of FTE employees holding the title of librarian. | ALL | LIBRARIAN (1992-97) LIBRARIA (1998-) |
|
| OTHPAID | All other paid FTE employees. | ALL | OTHPAID (1992-) | |
| TOTSTAFF | Total paid FTE employees | ALL | TOTEMP (1992-97) TOTSTAFF (1998-) |
| Variable | Short definition | Years | History | Notes |
|---|---|---|---|---|
| LOCGVT | Operating income from local government | ALL | LOCGVT (1992-) | |
| STGVT | Operating income from state government | ALL | STGVT (1992-) | |
| FEDGVT | Operating income from federal government | ALL | FEDGVT (1992-) | |
| OTHINCM | Other operating income (i.e., income not included in LOCGVT, STGVT, and FEDGVT) | ALL | OTHINCM (1992-) | |
| TOTINCM | Total income (Sum of LOCGVT, STGVT, FEDGVT, and OTHINCM) | ALL | TOTINCM (1992-) |
| Variable | Short definition | Years | History | Notes |
|---|---|---|---|---|
| SALARIES | Salaries and wages for all library staff | ALL | SALARIES (1992-) | |
| BENEFIT | Employee benefits for all library staff | ALL | BENEFIT (1992-) | |
| STAFFEXP | Total staff expenditures (SALARIES plus BENEFIT) | ALL | TOTEXP (1992-97) STAFFEXP (1998-) |
|
| PRMATEXP | Operating expenditures for print materials (including books, serial back files, current serial subscriptions, government documents) | 2003- | PRMATEXP (2003-) | |
| ELMATEXP | Operating expenditures for library materials in electronic format | 1995- | ELMATEXP (1995-) | |
| OTHMATEX | Operating expenditures for all other library materials (microform, audio, video, DVD, and new formats) | 2003- | OTHMATEX (2003-) | |
| TOTEXPCO | Total expenditures on library collections. | ALL | TOTEXPCOL (1992-97) TOTEXPCO (1998-) |
|
| OTHOPEXP | Other operating expenditures (i.e., expenditures not included in STAFFEXP and TOTEXPCO) | ALL | OTHOPEXP (1992-) | |
| TOTOPEXP | Total operating expenditures (Sum of STAFFEXP, TOTEXPCO, and OTHOPEXP) | ALL | TOTOEXP1 (1992-97) TOTOPEXP (1998-) |
| Variable | Short definition | Years | History | Notes |
|---|---|---|---|---|
| CAPITAL | Expenditures for capital outlay | ALL | CAPITAL (1992-) |
| Variable | Short definition | Years | History | Notes |
|---|---|---|---|---|
| CAP_REV | Capital revenue for major capital expenditures | 2003- | CAP_REV (2003-) | |
| FCAP_REV | Federal government capital revenue | 2005- | FCAP_REV (2005-) | |
| LCAP_REV | Local government capital revenue | 2005- | LCAP_REV (2005-) | |
| OCAP_REV | Other capital revenue | 2005- | OCAP_REV (2005-) | |
| SCAP_REV | State government capital revenue | 2005- | SCAP_REV (2005-) |
| Variable | Short definition | Years | History | Notes |
|---|---|---|---|---|
| BKVOL | Number of books and serial volumes | ALL | BKVOL (1992-) | |
| EBOOK | Electronic books (E-books) (digital documents, including non-serial government documents in digital format) | 2003- | EBOOK (2003-) | |
| AUDIO | Number of audio materials | ALL | AUDIO (1992-) | |
| VIDEO | Number of video materials | ALL | VIDEO (1992-) | |
| DB_LOC | Locally licensed databases | 2006- | DB_LOC (2006-) | |
| DB_ST | State licensed databases | 2006- | DB_ST (2006-) | |
| DB_OTH | Other licensed databases | 2006- | DB_OTH (2006-) | |
| DATABASE | Databases (locally mounted or remote, full-text or not, for which temporary or permanent access rights have been acquired) | 2003- | DATABASE (2003-) | |
| SUBSCRIP | Number of current serial subscriptions | ALL | SUBSCRIPT (1992-97) SUBSCRIP (1998-) |
|
| ESUBSCRP | Current electronic serial subscriptions | 2003- | ESUBSCRP (2003-) |
| Variable | Short definition | Years | History | Notes |
|---|---|---|---|---|
| HRS_OPEN | Total annual public service hours for all service outlets | ALL | DUPLI (1992-97) HRS_OPEN (1998-) |
| Variable | Short definition | Years | History | Notes |
|---|---|---|---|---|
| VISITS | Total annual library visits | ALL | ATTEND (1992-1997) VISITS (1998-) |
ATTEND was defined as "Annual attendance in library" which, I understand, is regarded as the same as VISITS today. |
| REFERENC | Total annual reference transactions | ALL | REFERENCE (1992-1997) REFERENC (1998-) |
|
| REGBOR | Registered Borrowers | 2006- | REGBOR (2006- |
| Variable | Short definition | Years | History | Notes |
|---|---|---|---|---|
| TOTCIR | Total annual circulation transactions | ALL | TOTCIR (1992-) |
| Variable | Short definition | Years | History | Notes |
|---|---|---|---|---|
| LOANTO | Total annual loans provided to other libraries | ALL | LOANTO (1992-) | |
| LOANFM | Total annual loans received from other libraries | ALL | LOANFM (1992-) |
| Variable | Short definition | Years | History | Notes |
|---|---|---|---|---|
| KIDCIRCL | Total annual circulation (including renewals) of all children's materials in all formats to all users | ALL | KIDCIRCL (1992-) | |
| KIDATTEN | Total annual attendance at all programs intended primarily for children. Includes adults who attend programs intended primarily for children. | ALL | KIDATTEN (1992-) | |
| TOTATTEN | Total annual attendance at all library programs. | 2004- | TOTATTEN (2004-) | |
| TOTPRO | Total library programs. | 2004- | TOTPRO (2004-) | |
| KIDPRO | Total children's programs. | 2005- | KIDPRO (2005-) |
| Variable | Short definition | Years | History | Notes |
|---|---|---|---|---|
| ELACCEXP | Operating expenditures for electronic access | 1995-2002 | ELACCEXP (1995-2002) | |
| ELMATS | Number of library materials in electronic format | 1995-2002 | ELMATS (1995-2002) | |
| STFTERMS | Internet terminals used by staff only | 1998-2002 | STFTERMS (1998-2002) | |
| GPTERMS | Internet terminals used by general public | 1998- | GPTERMS (1998-) | |
| ERES_USR | Users of electronic resources per typical week | 1999-2005 | ERES_USR (1999-2005) | |
| PITUSR | Users of public Internet terminals per year | 2006- | PITUSR (2006) |
| Variable | Short definition | Years | History | Notes |
|---|---|---|---|---|
| STARTDAT | Reporting period start date | ALL | PERIOD_PSM (1992-1997) STARTDAT (1998-) |
Month/year format, for example: "07/2003" |
| ENDDAT | Reporting period end date | ALL | PERIOD_PEM (1992-1997) ENDDAT (1998-) |
Month/year format, for example: "06/2003" |
| PUB_FIPS | Two-digit FIPS State Code. | ALL | PUB_FIPS (1992-) | This is a numerical code (Alabama = 01). STABR uses a character state code (Alabama = AL). |
| YR_SUB | FSCS submission year of public library data in four-digit format (YYYY) | ALL | YR (1992-97) + 1900 YR_SUB (1998-) |
YR is a two-digit year format and YR_SUB is four digits. |
| OBEREG | Bureau of Economic Analysis Code | ALL | OBEREG (1992-) | 00-U.S. Service Schools 01-New England (CT ME MA NH RI VT) 02-Mid East (DE DC MD NJ NY PA) 03-Great Lakes (IL IN MI OH WI) 04-Plains (IA KS MN MO NE ND SD) 05-Southeast (AL AR FL GA KY LA MS NC SC TN VA WV) 06-Southwest (AZ NM OK TX) 07-Rocky Mountains (CO ID MT UT WY) 08-Far West (AK CA HI NV OR WA) 09-Outlying Areas (AS FM GU MH MP PR PW VI) |
Distribution and Formats
| Format | Filename | Size |
|---|---|---|
| SAS | pusum.sas7bdat | 465KB |
| CSV | pusum.csv | 407KB |
| Open Office | pusum.ods | 427KB |
| Excel | pusum.xls | 472KB |



