Diverse Data Hub
  • Data Sets
  • Citation
  • Collaborate

    On this page

    • Income inequality: Gini coefficient - Data package
      • CSV Structure
      • Metadata.json structure
      • About the data
        • How we process data at Our World In Data
      • Detailed information about each time series
      • Gini Coefficient – World Bank
        • How to cite this data
        • What you should know about this data
        • Source

    Income inequality: Gini coefficient - Data package

    This data package contains the data that powers the chart “Income inequality: Gini coefficient” on the Our World in Data website.

    CSV Structure

    The high level structure of the CSV file is that each row is an observation for an entity (usually a country or region) and a timepoint (usually a year).

    The first two columns in the CSV file are “Entity” and “Code”. “Entity” is the name of the entity (e.g. “United States”). “Code” is the OWID internal entity code that we use if the entity is a country or region. For normal countries, this is the same as the iso alpha-3 code of the entity (e.g. “USA”) - for non-standard countries like historical countries these are custom codes.

    The third column is either “Year” or “Day”. If the data is annual, this is “Year” and contains only the year as an integer. If the column is “Day”, the column contains a date string in the form “YYYY-MM-DD”.

    The remaining columns are the data columns, each of which is a time series. If the CSV data is downloaded using the “full data” option, then each column corresponds to one time series below. If the CSV data is downloaded using the “only selected data visible in the chart” option then the data columns are transformed depending on the chart type and thus the association with the time series might not be as straightforward.

    Metadata.json structure

    The .metadata.json file contains metadata about the data package. The “charts” key contains information to recreate the chart, like the title, subtitle etc.. The “columns” key contains information about each of the columns in the csv, like the unit, timespan covered, citation for the data etc..

    About the data

    Our World in Data is almost never the original producer of the data - almost all of the data we use has been compiled by others. If you want to re-use data, it is your responsibility to ensure that you adhere to the sources’ license and to credit them correctly. Please note that a single time series may have more than one source - e.g. when we stich together data from different time periods by different producers or when we calculate per capita metrics using population data from a second source.

    How we process data at Our World In Data

    All data and visualizations on Our World in Data rely on data sourced from one or several original data providers. Preparing this original data involves several processing steps. Depending on the data, this can include standardizing country names and world region definitions, converting units, calculating derived indicators such as per capita measures, as well as adding or adapting metadata such as the name or the description given to an indicator. Read about our data pipeline

    Detailed information about each time series

    Gini Coefficient – World Bank

    The Gini coefficient measures inequality on a scale from 0 to 1. Higher values indicate higher inequality. Last updated: October 7, 2024
    Next update: May 2025
    Date range: 1963–2023

    How to cite this data

    In-line citation

    If you have limited space (e.g. in data visualizations), you can use this abbreviated in-line citation:
    World Bank Poverty and Inequality Platform (2024) – with major processing by Our World in Data

    Full citation

    World Bank Poverty and Inequality Platform (2024) – with major processing by Our World in Data. “Gini Coefficient – World Bank” [dataset]. World Bank Poverty and Inequality Platform, “World Bank Poverty and Inequality Platform (PIP) 20240627_2017, 20240627_2011” [original data]. Source: World Bank Poverty and Inequality Platform (2024) – with major processing by Our World In Data

    What you should know about this data

    • Depending on the country and year, the data relates to income measured after taxes and benefits, or to consumption, per capita. ‘Per capita’ means that the income of each household is attributed equally to each member of the household (including children).
    • Non-market sources of income, including food grown by subsistence farmers for their own consumption, are taken into account.

    Source

    World Bank Poverty and Inequality Platform – World Bank Poverty and Inequality Platform (PIP)

    Retrieved on: 2024-10-07
    Retrieved from: https://pip.worldbank.org

    Notes on our processing step for this indicator

    For most countries in the PIP dataset, estimates relate to either disposable income or consumption, for all available years. A number of countries, however, have a mix of income and consumption data points, with both data types sometimes available for particular years.

    In most of our charts, we present the data with some data points dropped in order to present single series for each country. This allows us to make readable visualizations that combine multiple countries and metrics. In choosing which data points to drop, we try to strike a balance between maintaining comparability over time and showing as long a time series as possible. As such, the exact approach varies somewhat across countries.

    If you would like to see the original data with all available income and consumption data points shown separately, you can do so in our Poverty Data Explorer. You can also download this data in our complete dataset of the World Bank PIP data.

     
     

    This page is built with Quarto.