The postsecondary researcher data files contain commonly requested longitudinal student-level data from the postsecondary educational environment. The data is organized into 5 datasets: base, enrollment, program, coursework, and awards. These datasets reflect information about postsecondary student demographics, enrollments, programs, courses, cumulative progress, and degrees/certifications (referred to as awards). Data from the 2010-present school years are available.

The primary data sources for the postsecondary researcher data are the Student Transcript and Academic Record Repository (STARR) and the National Student Clearinghouse (NSC). STARR is a collection of student data from all Michigan public institutions and some independent institutions. NSC is a national database of postsecondary student information from most colleges nationwide. The information collected from STARR is more detailed than that obtained from NSC, but we use the NSC data to track the progress of Michigan's students who attend college out of state, and as a backup to ensure we obtain the most complete data possible for Michigan's postsecondary students. The STARR and NSC data are merged together and standardized, and these merged datasets serve as the source for the researcher files. More specifically, CEPI submits a student list to NSC for the past 8 cohorts of Michigan high school graduates and for all Michigan postsecondary students. NSC matches the student list to their Student Tracker database and CEPI uses the returned results.

When merging these data sources, student x year x institution enrollment and awards records are first identified from the STARR and NSC. Where only a STARR record or both STARR and NSC have a matching record, data from STARR are incorporated into the panel. After having done this, any remaining unique NSC records are then added to the panel. Most commonly, these NSC records will come from out-of-state and independent/private institutions although community colleges and, to a lesser degree, four-year institutions will also show records.

Image
Flowchart of STARR record categorization

While this approach greatly reduces duplicate records, there are potential shortcomings. Academic Year is based on Start and End dates that are received from STARR & NSC and there are known data quality concerns around the End date where data is received differently between STARR & NSC. When those dates fall into a different Academic Year, overreporting of the outcomes from those data may occur. A second artifact is the mixing of STARR and NSC data for the same student at the same college in the same school year if, for example, a STARR enrollment record and an NSC award record are received. NSC enrollment records at that college/school year are dropped, but the NSC award record is kept, even if it was not reported in STARR.

This strategy of combining the two data sources, plus the method of data collection used, results in something of a convenience sample for the overall postsecondary dataset. Included are the universe of students enrolled at a Michigan public institution regardless of whether or not they attended Michigan public high school. Additionally, K12 Michigan public high school attendees who attended an out-of-state postsecondary institution or an in-state independent/private postsecondary institution are included, meaning a comprehensive of post-secondary outcomes for Michigan public K12 students is achieved. Beyond this, K12 students who dual-enroll will also show records.

Overall Data Table Notes

Table Hierarchy

This is the general hierarchy of the tables:

Student Base 
EnrollmentsA student may have multiple associated enrollments.
CourseworkAn enrollment session may have multiple associated courses.
ProgramsAn enrollment session may have multiple associated academic programs.
AwardsA student may have multiple associated awards, but awards do not link directly to enrollment session records.
Independent IHE Reporting OptionsIHEs and their reporting option designation. This table is mostly informational, and exists outside of the normal 5 table structure.

Standard Table Joins

The below grids show how the tables link together, but other joins may be appropriate depending on the requirements of the research.

Student BaseJoins toEnrollments, Coursework, Programs, Awards
RIC=RIC
EnrollmentsJoins toCoursework, Programs
RIC=RIC
IPEDS_CODE=IPEDS_CODE
YEAR_SESSION=YEAR_SESSION
ProgramsJoins toCoursework
RIC=RIC
IPEDS_CODE=IPEDS_CODE
YEAR_SESSION=YEAR_SESSION

Though awards do not have a direct one-to-one link to enrollment records, most student awards can be linked to a corresponding set of enrollments as shown below. 

AwardsMay join toEnrollments
RIC=RIC
IPEDS_CODE=IPEDS_CODE
Independent IHE Reporting OptionsMay join toAny dataset with an IPEDS Code
IPEDS_CODE=IPEDS_CODE

Post-Secondary Student Base

Highlights

Record Level: Student-level 
Record Count: ~2.6 million observations 
Years Covered: 2010-Present* 
Population Coverage: Students enrolled in or receiving an award from a Michigan public institution

The base dataset contains information on student demographics: race, gender, and date of birth. The base population includes students who have enrolled in or received an award from a Michigan public institution (and some independent institutions) during the 2010 academic year or later. The data will include the entire academic history of any such student, which may include enrollments or awards prior to the 2010 academic year.

Students are identified with a research identification code (RIC), which is unique in nearly all cases in the dataset. However, due to linking and unlinking over time as data was moved from the source datasets to the final datasets, some students may have duplicated or conflicting data. In these cases, both records are retained within the final dataset, allowing researchers to decide how to eliminate duplicate RICs. In addition, in some cases there are discrepancies between STARR and NSC data.

In addition, the data file includes a stable date of birth (latest reported date of birth) for each student, and indicators for where conflicts in demographic data exist (i.e., if conflicting dates of birth, race, or gender have been reported for any given student over time).

* Note that, because the academic histories of students might extend before the 2010 academic year, the years covered for other postsecondary files may differ. For example, there are student enrollments dating back as early to 1900 in the enrollment datafile.

Post-Secondary Student Enrollment

Highlights

Record Level: Student by school year session by institution 
Record Count: 1,792,406 observations/year beginning in 2010 
Years Covered: 2010-Present* 
Population Coverage: Students with an enrollment record at a Michigan public institution

The enrollment dataset includes one record per student, for each institution attended and school year session (for example, fall 2010-2011). For each record, session beginning and end dates are provided. Additionally, the dataset includes variables that reflect academic progress, such as session GPA and credits attempted/earned, in addition to variables reflecting academic progress. The dataset also includes flags for full-time enrollment and whether any courses were taken for remedial purposes. As with the base file, students are identified using a RIC, which is unique within a student-year-IHE-session for nearly all cases.

In this dataset, Spring and Winter sessions are treated the same by CEPI. Researchers should note that in the 2016-17 collection, CEPI began renaming all Winter sessions to Spring to reflect that. Therefore, you will see no Winter sessions from the 2016-17 enrollments on. The figure below illustrates how different sessions were categorized. For example, sessions categorized as “Late Summer” have a start date between 4/15 and 7/31 and an end date between 7/16 and 8/31. Some sessions span more than one session range; a session falling entirely within an overlap range is put into the later second session of the range (i.e., the end date determines the session for these cases). Any sessions with a start date between 4/15-EndYear and 8/1-EndYear and ending on or after 7/16-EndYear are moved to the subsequent school year.

Image
Timeline of session coverage in data

In addition, researchers should note the following definitions of student transfers, which apply to those with ENROLLMENT_TYPE = “TransferIn”. CEPI has two principal definitions of a transfer:

  1. If an institution tells us that a student is a transfer, that student is a transfer, full stop. This stems from an Enrollment Type of “TransferIn”. CEPI tries to find the student’s previous college or university, but if they cannot find one then their previous enrollment is labeled as ‘Unknown’.
  2. CEPI looks at all enrollments within the academic year of the report and finds the most recent enrollments for the students before that point. They look for the following to be true. If these are true, then the enrollment pattern is considered a transfer by CEPI’s definition: - The previous enrollment is at a different institution - The previous enrollment is at the same Enrollment Level (Undergraduate vs Graduate) - The student has stopped enrollment at that previous institution - The student has not previously been enrolled at their current institution (that is, readmissions are not eligible to be transfers)

CEPI then takes all transfers discovered by methods 1 and/or 2, and compiles them into the data file.

View the enrollment data: record counts by institution for 2010-2019

* Years covered includes any year for which there is data available in this file. However, researchers should note that these data encompass the academic histories of any student receiving an award beginning with the 2010 school year. As such, the base file “years covered” encompasses 2010, but all other files may include earlier records as well.

Post-Secondary Student Programs

Highlights

Record Level: Student by program by school year session by institution 
Record Count: 1,994,786 observations/year beginning in 2010 
Years Covered: 2010-Present* 
Population Coverage: Students enrolled in or receiving an award from a Michigan public institution, as well as some out of state students from NSC.

The programs dataset includes one record per student, for each program the student is working toward at all institutions and school year session (for example, fall 2010-2011). For each record, session beginning and end dates are provided. Additionally, the dataset includes variables that reflect program type and description (such as “Major in Computer Science”). Program CIP codes and descriptions are also provided but due to reporting issues, many records reflect a null value.

CEPI conducts some deduplication of program records, as follows:

  • Drop records with a null program name if there is a corresponding record where all other key fields match and the program name field contains a value.
  • Drop records with a null program CIP code if there is a corresponding record where all other key fields match and the program CIP code field contains a value.
  • If both a STARR and an NSC program record exist where all key fields are identical, keep only the STARR record.

* Years covered includes any year for which there is data available in this file. However, researchers should note that these data encompass the academic histories of any student enrolled in or receiving an award beginning with the 2010 school year. As such, the base file “years covered” encompasses 2010, but all other files may include earlier records as well.

Codebook

Showing 201 - 300 of 1138 results
Category Sort descending Variable Description
Category: Assessment Score Variable: SCI_RSCH_Z_SCORE Description: 

A standardized score assigned to each student that describes each students relationship to the mean of all students with a score.

Category: Assessment Score Variable: SOC_THETA_SE Description: 

The Theta standard error for the social studies portion of the assessment.

Category: Assessment Score Variable: SCI_OVERALL_AGP Description: 

The adequate growth percentile (AGP) for the science portion of the assessment.

Category: Assessment Score Variable: MTH_PL_NAME Description: 

The performance level name for the mathematics portion of the assessment.

Category: Assessment Score Variable: SCI_SGP_LEVEL_DESC Description: 

The student growth percentile (SGP) description for the science portion of the assessment.

Category: Assessment Score Variable: RDG_PL_CHANGE Description: 

Performance level change is used to compare student performance from year to year for the reading portion of the assessment . Transition categories are: Significant Decline, Decline, Maintaining, Improvement, or Significant Improvement. Abbreviations are not used in this field, rather the actual descriptions are used.

Category: Assessment Score Variable: WRI_SS Description: 

The scaled score for the writing portion of the assessment.

Category: Assessment Score Variable: WRI_SS_SE Description: 

The scaled score standard error for the writing portion of the assessment.

Category: Assessment Score Variable: SOC_SS Description: 

The scaled score for the social studies portion of the assessment.

Category: Assessment Score Variable: MTH_PL_RANGE Description: 

A distinction which further differentiates whether the student was Low, Mid or High within their designated performance level for the mathematics portion of the assessment.

Category: Assessment Score Variable: MTH_SGP_LEVEL_DESC Description: 

The student growth percentile (SGP) description for the mathematics portion of the assessment.

Category: Assessment Score Variable: SCI_PL_NAME Description: 

The performance level name for the science portion of the assessment.

Category: Assessment Score Variable: ELA_SGP Description: 

The student growth percentile (SGP) for the English language arts portion of the assessment.

Category: Assessment Score Variable: RDG_PL_NAME Description: 

The performance level name for the reading portion of the assessment.

Category: Assessment Score Variable: MTH_THETA_SE Description: 

The Theta standard error for the mathematics portion of the assessment.

Category: Assessment Score Variable: ELA_PL_NAME Description: 

The performance level name for the English language arts portion of the assessment.

Category: Assessment Score Variable: SOC_SS_SE Description: 

The scaled score standard error for the social studies portion of the assessment.

Category: Assessment Score Variable: MTH_RSCH_Z_SCORE Description: 

A standardized score assigned to each student that describes each students relationship to the mean of all students with a score.

Category: Assessment Score Variable: MTH_OVERALL_AGP Description: 

The adequate growth percentile (AGP) for the mathematics portion of the assessment.

Category: Assessment Score Variable: MTH_PL_CHANGE Description: 

Performance level change is used to compare student performance from year to year for the mathematics portion of the assessment . Transition categories are: Significant Decline, Decline, Maintaining, Improvement, or Significant Improvement. Abbreviations are not used in this field, rather the actual descriptions are used.

Category: Assessment Score Variable: SCI_PL Description: 

The performance level for the science portion of the assessment.

Category: Assessment Score Variable: ELA_SS_SE Description: 

The scaled score standard error for the English language arts portion of the assessment.

Category: Assessment Score Variable: SCI_SGP Description: 

The student growth percentile (SGP) for the science portion of the assessment.

Category: Assessment Score Variable: MTH_SS Description: 

The scaled score for the mathematics portion of the assessment.

Category: Assessment Score Variable: RDG_VALID Description: 

Flag denoting whether the student's test was valid for the reading portion of the assessment.

Category: Assessment Score Variable: RDG_THETA_SE Description: 

The Theta standard error for the reading portion of the assessment.

Category: Assessment Score Variable: ELA_SGP_LEVEL_DESC Description: 

The student growth percentile (SGP) description for the English language arts portion of the assessment.

Category: Assessment Score Variable: SCI_AGP_YRS_PROJ Description: 

The timeframe in which students are expected to reach proficiency.

Category: Assessment Score Variable: ELA_AGP_YRS_PROJ Description: 

The timeframe in which students are expected to reach proficiency.

Category: Assessment Score Variable: SOC_SGP Description: 

The student growth percentile (SGP) for the social studies portion of the assessment.

Category: Assessment Score Variable: SOC_SGP_LEVEL_DESC Description: 

The student growth percentile (SGP) description for the social studies portion of the assessment.

Category: Assessment Score Variable: SOC_PL Description: 

The performance level for the social studies portion of the assessment.

Category: Assessment Score Variable: SCI_SS Description: 

The scaled score for the science portion of the assessment.

Category: Assessment Score Variable: SCI_VALID Description: 

Flag denoting whether the student's test was valid for the science portion of the assessment.

Category: Assessment Score Variable: RDG_PL Description: 

The performance level for the reading portion of the assessment.

Category: Assessment Subscore Variable: SUBSCORE_RPT_LEVEL_DESC Description: 

The description of the reporting level.

Category: Assessment Subscore Variable: SUBSCORE_PCT_CORRECT Description: 

The number of questions answered correctly divided by the total questions for this reporting level.

Category: Assessment Subscore Variable: SUBSCORE_RPT_LEVEL_TYPE Description: 

The reporting level type. Used as a means of organizing content standards and expectations. Varies by subject.

Category: Assessment Subscore Variable: SUBSCORE_SCALED_SCORE Description: 

The scaled score value for this reporting level.

Category: Assessment Subscore Variable: SUBSCORE_RPT_LEVEL_ID Description: 

The reporting level identifier value used by MDE.

Category: Assignments Variable: ASSIGNED_GRADE_06 Description: 

Flag designating that the staff member was assigned to teach sixth grade.

Category: Assignments Variable: IS_HIGHLY_QUALIFIED Description: 

This field reports whether or not a staff meets the requirements for being highly qualified. It is a required field only for teachers assigned to a core academic subject area and specific instructional paraprofessionals/aides as defined by NCLB and the MI Department of Education.

Category: Assignments Variable: ASSIGNED_GRADE_05 Description: 

Flag designating that the staff member was assigned to teach fifth grade.

Category: Assignments Variable: ASSIGNED_GRADE_09 Description: 

Flag designating that the staff member was assigned to teach ninth grade.

Category: Assignments Variable: ASSIGNMENT_CODE Description: 

The code that indicates the position held by the employee.

Category: Assignments Variable: ASSIGNED_GRADE_10 Description: 

Flag designating that the staff member was assigned to teach tenth grade.

Category: Assignments Variable: ASSIGNED_ISD_CODE Description: 

The state-assigned five-digit number, as recorded in the Educational Entity Master (EEM), that identifies the intermediate school district (ISD) or educational service agency (ESA) in which the district or program is located. This is the ISD that the educator is assigned to.

Category: Assignments Variable: ASSIGNED_GRADE_K Description: 

Flag designating that the staff member was assigned to teach Kindergarten.

Category: Assignments Variable: ASSIGNED_GRADE_11 Description: 

Flag designating that the staff member was assigned to teach eleventh grade.

Category: Assignments Variable: ASSIGNED_GRADE_02 Description: 

Flag designating that the staff member was assigned to teach second grade.

Category: Assignments Variable: EFFECTIVENESS_DESC Description: 

Description used to categorize Educator Effectiveness ranking. This is a locally determined label assigned as a result of annual educator evaluations.

Category: Assignments Variable: FTE Description: 

The amount of time required to perform an assignment stated as a proportion of a full-time position and computed by dividing the amount of time employed by the time normally required for a full-time position.

Category: Assignments Variable: FID_FUNCTION_CODE_DESC Description: 

The accounting/function description for the employee as determined for accounting purposes by the school district.

Category: Assignments Variable: SUMMARY_GROUP_NAME Description: 

The name for a grouping of assignment codes. The data from this grouping are similar to the data published on CEPI's website.

Category: Assignments Variable: ASSIGNED_GRADE_01 Description: 

Flag designating that the staff member was assigned to teach first grade.

Category: Assignments Variable: FEDERAL_GRANT_PARTICIPATION Description: 

This field identifies teachers funded by Title I or Title II Part A who teach core academic subjects in a targeted-assistance program.

Category: Assignments Variable: FID_FUNCTION_CODE Description: 

The accounting/function code for the employee as determined for accounting purposes by the school district.

Category: Assignments Variable: ASSIGNED_GRADE_RK Description: 

Flag designating that the staff member was assigned to teach Retentive Kindergarten.

Category: Assignments Variable: ASSIGNED_GRADE_04 Description: 

Flag designating that the staff member was assigned to teach fourth grade.

Category: Assignments Variable: ASSIGNED_GRADE_08 Description: 

Flag designating that the staff member was assigned to teach eighth grade.

Category: Assignments Variable: EDUCATIONAL_SETTING_DESC Description: 

The description for the educational setting (e.g., Bilingual Education/ELL or Special Education for various age groups, etc.)

Category: Assignments Variable: IS_MDE_TEACHER Description: 

Flag identifying teachers according to the MDE definition.

Category: Assignments Variable: ASSIGNMENT_DESCRIPTION Description: 

The description of the position held by the employee.

Category: Assignments Variable: ASSIGNED_GRADE_03 Description: 

Flag designating that the staff member was assigned to teach third grade.

Category: Assignments Variable: ASSIGNED_GRADE_07 Description: 

Flag designating that the staff member was assigned to teach seventh grade.

Category: Assignments Variable: ASSIGNED_GRADE_12 Description: 

Flag designating that the staff member was assigned to teach twelfth grade.

Category: Assignments Variable: IS_PRINCIPAL Description: 

Flag identifying principals according to the REP manual.

Category: Awards Variable: SCHOOL_START_YEAR_AWARDED Description: 

The numeric start year of the school year during which the award was conferred. Example: 2016

Category: Awards Variable: AWARD_CIP_CATEGORY Description: 

The highest-level category description of the discipline or field of study of the student's program, as defined in the 2-digit Classification of Instructional Programs (CIP) by the US Department of Education's National Center for Education Statistics (NCES).

Category: Awards Variable: IS_STARR_AWARD Description: 

Designates if the award record came from the STARR collection.

Category: Awards Variable: AWARD_CIP_CODE Description: 

The code indicating the discipline or field of study for which the award was conferred. As submitted by the IHE, sourced from the Classification of Instructional Programs (CIP), by the US Department of Education's National Center for Education Statistics (NCES).

Category: Awards Variable: AWARD_LEVEL_CATEGORY Description: 

The reporting category of the level of achievement the student has received upon graduation/completion.

Category: Awards Variable: AWARD_DESCRIPTION Description: 

The descriptive title for the academic award, as submitted by the IHE.

Category: Awards Variable: AWARD_CIP_DESCRIPTION Description: 

The description of the discipline or field of study of the student's program, as defined in the Classification of Instructional Programs (CIP) by the US Department of Education's National Center for Education Statistics (NCES).

Category: Awards Variable: AWARD_DATE Description: 

The year, month and day in which the academic award was conferred.

Category: Awards Variable: SCHOOL_YEAR_AWARDED Description: 

The academic school year during which the award was conferred. Example: 2016-2017

Category: Awards Variable: AWARD_LEVEL Description: 

The level of achievement the student has received upon graduation/completion.

Category: Census Block Variable: CENSUS_BLOCK Description: 

The state code, county code, census tract and census block numbers merged into one field as described in the Notes section on the previous tab.

Category: Census Block Variable: COLLECTION Description: 

General collection (Fall, Spring, Year) or SRM identifier.

Category: Course Taking Variable: COURSE_SECTION_NUMBER Description: 

The number assigned to differentiate among distinct occurrences ofcourses that have the same Course Subject Abbreviation and Course Number butare considered to be different courses.

Category: Course Taking Variable: IS_STARR_RECORD Description: 

Designates if the session record came from the STARR collection.

Category: Course Taking Variable: COURSE_TYPE_CODE Description: 

The two-digit code for the course type that represents the level and rigor of the instruction provided throughout the reported course.

Category: Course Taking Variable: LOCAL_COURSE_TITLE Description: 

The title assigned by the educating entity to identify a particular course.

Category: Course Taking Variable: RPIC_3 Description: 

The Research Personnel Identification Code (RPIC), as assigned in the Registry of Educational Personnel (REP) Application, for each teacher responsible for some or all of the instruction of the course being reported. Third of up to 3 possible.

Category: Course Taking Variable: ACADEMIC_SCHOOL_YEAR Description: 

The school year in which the data was reported by the district.

Category: Course Taking Variable: COURSE_FUNDING_CODE Description: 

The code for any programs that provide funding to the student course.

Category: Course Taking Variable: COMPLETION_STATUS_CODE Description: 

The standardized code representing a student's final status for the course being reported.

Category: Course Taking Variable: IS_VIRTUAL_DELIVERY Description: 

Flag that indicates whether the student is receiving instruction via a virtual delivery method. This could be virtual learning, online learning or computer courses; distance learning; or self-scheduled virtual learning.

Category: Course Taking Variable: COLLEGE_CREDIT Description: 

The number of college credit hours earned in a dual/concurrent enrollment course that is eligible for Section 64b Incentive funding.

Category: Course Taking Variable: NUMERIC_GRADE Description: 

The final numeric grade awarded for participation in the course.

Category: Course Taking Variable: IS_REMEDIAL Description: 

Y/N indicator of whether the course is a remedial course.

Category: Course Taking Variable: COURSE_TITLE Description: 

The name or title of the course, as defined by the IHE.

Category: Course Taking Variable: MOD_CREDITS_EARNED Description: 

Total of STARR Credits Earned for the course, or NSC FTE Days converted to Credits Earned for the session.

Category: Course Taking Variable: IS_REMEDIAL_SCIENCE Description: 

Y/N indicator of whether the course is a remedial science course.

Category: Course Taking Variable: IS_REMEDIAL_READING Description: 

Y/N indicator of whether the course is a remedial reading course.

Category: Course Taking Variable: LETTER_GRADE Description: 

The alphabetical grade earned in the course.

Category: Course Taking Variable: COURSE_SUBJECT Description: 

The alphabetic abbreviation of the academic department or discipline offering the course, as defined by the IHE.

Category: Course Taking Variable: IS_REMEDIAL_ESL Description: 

Y/N indicator of whether the course is a remedial English as a Second Language (ESL) course.

Category: Course Taking Variable: GRADE_SCALE_CODE Description: 

The grading scale used by the institution for the enrolled course, according to Postsecondary Electronic Standards Council (PESC) standards.

Category: Course Taking Variable: RPIC_2 Description: 

The Research Personnel Identification Code (RPIC), as assigned in the Registry of Educational Personnel (REP) Application, for each teacher responsible for some or all of the instruction of the course being reported. Second of up to 3 possible.