[UCDP] Non-State Conflict Dataset (v. 22.1)

“This document describes the Non-State Conflict Dataset, a project within the Uppsala Conflict Data Program (UCDP) at the Department of Peace and Conflict Research, Uppsala University. The UCDP Non-State conflict project has been developed with support from the Human Security Report Project, Simon Fraser University, in Vancouver, Canada.” (codebook, p. 3)

I/ Conflict variable
I/1/ Unit of conflict

| Event (< 0 or 1 death) | Conflict (< 25 deaths) | War (< 100 deaths) | Episode (< 500 deaths) |
| | X | | |
“A non-state conflict is defined by the Uppsala Conflict Data Program (UCDP) as ‘the use of armed force between two organized armed groups, neither of which is the government of a state, which results in at least 25 battle-related deaths in a year.’” (codebook, p. 4)

I/2/ Conflict domain

| State-based conflict | Non-state conflict | One-sided violence | TO BE DETERMINED |
| | X | | |
“A non-state conflict is defined by the Uppsala Conflict Data Program (UCDP) as ‘the use of armed force between two organized armed groups, neither of which is the government of a state, which results in at least 25 battle-related deaths in a year.’” (codebook, p. 4)

II/ Time variable
II/1/ Unit of time

| Day | Year |
X
“start_date
The first time there is a recorded event in a given dyad that results in at least one fatality. This date is the same for all years in which the conflict has been active, regardless of whether the conflict has been active in several episodes or not.
The start_date is coded as precisely as possible. For certain conflicts we can pinpoint the start of the conflict down to a single event, taking place on a specific day. For other conflicts, this is not possible, due to lack of precise information.
Date (YYYY-MM-DD)” (codebook, p. 9)

II/2/ Time domain

Time domain
1989 – 2021
“year
The year of observation (1989-2021)” (codebook, p. 10)

III/ Space variable
III/1/ Unit of space

| Coordinates | Country | Region | TO BE DETERMINED |
| | X | | |
“location
The countries where fighting took place in the conflict-year.
Comma-separated if multiple.
This variable should never be used for any geographical or spatial analyses of conflict as the distribution of violence as well as the relative magnitude of violence by country is not captured. In effect, a country is listed here if even one dead in the given conflict has occurred in that country.
In fact, UCDP provides much better geographic coverage of conflict (including distribution of violence for each conflict and each country) in the UCDP Georeferenced Event Dataset (GED).” (codebook, p. 11)
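
As a minimal sketch of how the comma-separated `location` value quoted above might be read, the helper below splits it into individual country names. The function name `split_locations` and the sample values are hypothetical, and the sketch assumes that no country name itself contains a comma.

```python
def split_locations(location: str) -> list[str]:
    """Split a comma-separated `location` value into country names.

    Assumption: no country name contains a comma.
    """
    return [part.strip() for part in location.split(",") if part.strip()]


# Invented sample values, for illustration only.
countries = split_locations("Mali, Niger")
single = split_locations("Somalia")
```

In line with the codebook's warning, such a list only indicates which countries appear in a conflict-year; it carries no information on how violence is distributed among them.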

III/2/ Space domain

| Global | Mediterranean Sea and Sahel | TO BE DETERMINED |
| X | | |
“Like the UCDP Non-State Conflict Dataset, GED is global and covers the same period (1989-2021).” (codebook, p. 11)

IV/ Data structure
IV/1/ Unit of observation

| Unit of conflict (UC) | UC-year | UC-actor | Country-year |
| Actor-year | Dyad-year | OTHER | TO BE DETERMINED |
| | X | | |
“an automatic filtering and aggregation of the UCDP Georeferenced Event Dataset from incident/event level to the conflict/dyad-year level.” (codebook, p. 13)

(ucdp-nonstate-221.xlsx, 21/08/2022)
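
The aggregation from event level to dyad-year level described in the quotation can be illustrated roughly as follows. This is a sketch under stated assumptions, not UCDP's actual pipeline: the event tuples are invented, the function name `aggregate_dyad_years` is hypothetical, and the threshold of 25 deaths is taken from the definition of a non-state conflict quoted in section I.

```python
from collections import defaultdict

# Invented event records: (dyad_id, year, deaths). Illustration only.
events = [
    (1, 2020, 10),
    (1, 2020, 20),
    (1, 2021, 5),
    (2, 2020, 30),
]


def aggregate_dyad_years(events, threshold=25):
    """Sum deaths per (dyad_id, year) and keep dyad-years at or above the threshold."""
    totals = defaultdict(int)
    for dyad_id, year, deaths in events:
        totals[(dyad_id, year)] += deaths
    return {key: total for key, total in totals.items() if total >= threshold}


dyad_years = aggregate_dyad_years(events)
```

Here dyad 1 in 2021 is dropped because its 5 deaths fall below the threshold, while the two 2020 dyad-years are kept.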

IV/2/ Number of observations

Number of observations
1 442

(ucdp-nonstate-221.xlsx, 18/05/2022)

V/ All variables

| Conflict name | Conflict type | Intensity |
X
| Outcome | Time | Space |
| X | X | X |
| Actor | Type of actor | Dyad |
| X | X | X |
| Coalition | Deaths | Non-conflict variables |
XX
“conflict_id
The unique identifier of the non-state conflict.” (codebook, p. 7)
“ep_end
ep_end is a binary variable that codes whether the conflict is inactive the following year and an episode of the conflict thus ends. If the conflict is inactive the following year(s), this variable is coded as 1. If not, a 0 is coded. For the latest year in the dataset, it is unknown whether the conflict will be recorded as active or inactive in the following year, and the variable is always given the code 0.” (codebook, p. 10)
“start_date
The first time there is a recorded event in a given dyad that results in at least one fatality.” (codebook, p. 9)
“location
The countries where fighting took place in the conflict-year.” (codebook, p. 11)
“side_a_name
The party that constitute Side A in the conflict. For each conflict the parties are listed in alphabetical order, using the latest known names of the parties involved.” (codebook, p. 8)
“org
This variable indicates the organizational level of the warring sides. The level of organization is determined according to the following categories:
Organizational level 1 (formally organized groups): […]
Organizational level 2 (informally organized groups): […]
Organizational level 3 (informally organized groups):” (codebook, p. 7)
“dyad_id
The unique identifier of the non-state dyad (a pair of two opposing actors).”
“side_a_components
For conflicts with multiple actors fighting together as a joint (temporary) coalition, the components of the coalition (in the form of a string of actor IDs) are listed here. Comma separated. […]
side_a_2nd
side_a_2nd lists all states that enter a non-state conflict with troops to actively support side A in the dyad. See section 2.2 for information on under which conditions this is applicable. This variable is not part of the API version of the dataset.” (codebook, p. 8)
“best_fatality_estimate
The best fatality estimate for the given conflict-year.” (codebook, p. 11)
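
The coding rule for `ep_end` quoted above can be sketched as a small function. The name `code_ep_end`, its parameters, and the sample years are hypothetical; the logic follows only the quoted description (1 if the conflict is inactive the following year, 0 otherwise, and always 0 for the latest year in the dataset).

```python
def code_ep_end(active_years, last_year):
    """Return {year: ep_end} for the active years of one conflict."""
    active = set(active_years)
    flags = {}
    for year in sorted(active):
        if year == last_year:
            # Latest year in the dataset: the following year is unknown, so code 0.
            flags[year] = 0
        else:
            # 1 if the conflict is inactive the following year, otherwise 0.
            flags[year] = 0 if (year + 1) in active else 1
    return flags


# Invented activity pattern, for illustration only.
flags = code_ep_end([2015, 2016, 2019, 2021], last_year=2021)
```

In this invented example, episodes end in 2016 and 2019, while 2021 is coded 0 because it is the final year covered.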

VI/ Transparency
VI/1/ Sources

| Intergovernmental organizations | Governmental organizations | Non-governmental organizations |
XX
| Research organizations | Press media | Social media |
X
| Other databases | OTHER | TO BE DETERMINED |
XX
“This dataset is the result of:
1. an automatic filtering and aggregation of the UCDP Georeferenced Event Dataset from incident/event level to the conflict/dyad-year level.
2. information gathering and coding of a number of extra variables at the aggregate conflict or actor level (such as organization type).
The original reporting underlying the dataset is collected from three sets of sources:
1. global newswire reporting
2. global monitoring and translation of local news performed by the BBC
3. secondary sources such as local media, NGO and IGO reports, field reports, books etc.” (codebook, p. 13)

VI/2/ Codebook

| Date | Version |
| 2022 | 22.1 |
“When appropriate, also cite this codebook: Pettersson, Therese (2022) UCDP Non-state Conflict Codebook v 22.1 (https://ucdp.uu.se/downloads/).” (codebook, p. 1)

VII/ Update
VII/1/ Current version

| Date | Version |
| 2022 | 22.1 |
“This codebook corresponds to Version 22.1 of the UCDP Non-state Conflict Dataset. […]
The version number is a combination of a year and a number. The year refers to when the dataset is updated with new observations. If there are changes in the data between yearly updates, or if there are substantial changes in the structure of the dataset, the number behind the year is incremented.” (codebook, p. 13)
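
To make the version scheme concrete, the sketch below splits a version string such as "22.1" into its year and its increment. The function name `parse_version` is hypothetical, and the mapping of the two-digit year to 2000 plus that value is an assumption that fits version 22.1 (dated 2022) but is not stated in the codebook.

```python
def parse_version(version: str) -> tuple[int, int]:
    """Split a 'YY.N' version string into (year, increment).

    Assumption: the two-digit year refers to 2000 + YY.
    """
    year_part, number_part = version.split(".")
    return 2000 + int(year_part), int(number_part)


year, increment = parse_version("22.1")
```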

VII/2/ Regularly updated?

| Real-time | Month | Quarter | Annually |
| | | | X |
| YES, PERIOD TO BE DETERMINED | TO BE DETERMINED |
“The version number is a combination of a year and a number. The year refers to when the dataset is updated with new observations. If there are changes in the data between yearly updates, or if there are substantial changes in the structure of the dataset, the number behind the year is incremented.” (codebook, p. 13)

VIII/ Access
VIII/1/ Registration?

| YES | NO |
X

VIII/2/ Formats

| .XLS/.XLSX | .CSV | .DTA (STATA) | .RDATA |
| X | X | X | X |
“The data is available in CSV (respecting the RFC 4180 specification), Excel (XLSX), Rdata (3.x version) and STATA (2010 format).” (codebook, p. 14)
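
As a rough illustration of reading the CSV version, the sketch below parses a tiny in-memory sample with Python's standard `csv` module. The column names come from the variables listed in section V; the row values are invented, and the real file may contain further columns. RFC 4180 quoting matters here because the `location` field can itself contain commas.

```python
import csv
import io

# Invented one-row sample; the real file would be opened with open(path, newline="").
sample = io.StringIO(
    "conflict_id,dyad_id,year,best_fatality_estimate,location\n"
    '1,1,2020,30,"Mali, Niger"\n'
)

rows = list(csv.DictReader(sample))

# csv yields strings, so numeric fields need explicit conversion.
first = rows[0]
first_year = int(first["year"])
first_deaths = int(first["best_fatality_estimate"])
```

The quoted `location` value stays a single field ("Mali, Niger") rather than being split into two columns.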

VIII/3/ API?

| YES | NO | TO BE DETERMINED |
| X | | |
“The data is available for machine-to-machine interaction through a public API.
Documentation for how to use the API is available at http://ucdp.uu.se/apidocs.” (codebook, p. 14)