Type: | Package |
Title: | Access Colombian Data via APIs and Curated Datasets |
Version: | 0.3.0 |
Maintainer: | Renzo Caceres Rossi <arenzocaceresrossi@gmail.com> |
Description: | Provides a comprehensive interface to access diverse public data about Colombia through multiple APIs and curated datasets. The package integrates four different APIs: 'API-Colombia' for Colombian-specific data including geography, culture, tourism, and government information; 'World Bank API' for economic and demographic indicators; 'Nager.Date' for public holidays; and 'REST Countries API' for general country information. The package enables users to explore various aspects of Colombia such as geographic locations, cultural attractions, economic indicators, demographic data, and public holidays. Additionally, 'ColombiAPI' includes curated datasets covering Bogota air stations, business and holiday dates, public schools, Colombian coffee exports, cannabis licenses, Medellin rainfall, malls in Bogota, as well as datasets on indigenous languages, student admissions and school statistics, forest liana mortality, municipal and regional data, connectivity and digital infrastructure, program graduates, vehicle counts, international visitors, and GDP projections. These datasets provide users with a rich and multifaceted view of Colombian social, economic, environmental, and technological information, making 'ColombiAPI' a comprehensive tool for exploring Colombia's diverse data landscape. For more information on the APIs, see: 'API-Colombia' https://api-colombia.com/, 'Nager.Date' https://date.nager.at/Api, 'World Bank API' https://datahelpdesk.worldbank.org/knowledgebase/articles/889392, and 'REST Countries API' https://restcountries.com/. |
License: | GPL-3 |
Depends: | R (≥ 4.1.0) |
URL: | https://github.com/lightbluetitan/colombiapi, https://lightbluetitan.github.io/colombiapi/ |
BugReports: | https://github.com/lightbluetitan/colombiapi/issues |
Encoding: | UTF-8 |
Suggests: | ggplot2, testthat (≥ 3.0.0), knitr, rmarkdown |
Imports: | httr, jsonlite, dplyr, scales, tibble |
RoxygenNote: | 7.3.2 |
Config/testthat/edition: | 3 |
VignetteBuilder: | knitr |
NeedsCompilation: | no |
Packaged: | 2025-09-29 19:32:57 UTC; Renzo |
Author: | Renzo Caceres Rossi
|
Repository: | CRAN |
Date/Publication: | 2025-09-29 22:00:16 UTC |
ColombiAPI: Access Colombian Data via APIs and Curated Datasets
Description
This package provides a comprehensive interface to access diverse public data about Colombia through multiple APIs and curated datasets. The package integrates four different APIs: API-Colombia for Colombian-specific data including geography, culture, tourism, and government information; World Bank API for economic and demographic indicators; Nager.Date for public holidays; and REST Countries API for general country information. Additionally, ColombiAPI includes curated datasets covering Bogota air stations, business and holiday dates, public schools, Colombian coffee exports, cannabis licenses, Medellin rainfall, malls in Bogota, as well as datasets on indigenous languages, student admissions and school statistics, forest liana mortality and more.
Details
ColombiAPI: Access Colombian Data via APIs and Curated Datasets
Access Colombian Data via APIs and Curated Datasets.
Author(s)
Maintainer: Renzo Caceres Rossi arenzocaceresrossi@gmail.com
See Also
Useful links:
Bogota Air Stations Coordinates
Description
This dataset, Bogota_airstations_df, is a data frame containing the coordinates of air quality measurement stations in Bogota, Colombia.
Usage
data(Bogota_airstations_df)
Format
A data frame with 10 observations and 3 variables:
- ESTACION
Character string representing the name of the station.
- X
Numeric value representing the X coordinate of the station.
- Y
Numeric value representing the Y coordinate of the station.
Details
The dataset name has been kept as Bogota_airstations_df to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the ColombiAPI package and assists users in identifying its specific characteristics. The suffix '_df' indicates that the dataset is a data frame. The original content has not been modified in any way.
Source
Originally taken from the SpatFD package version 0.0.1
Bogota Business Dates
Description
This dataset, Bogota_business_Date, is a Date object containing the business dates for Bogota, Colombia.
Usage
data(Bogota_business_Date)
Format
A Date object with 27,173 observations:
- Bogota_business_Date
Date object representing the business dates in Bogota, Colombia, ranging between "1990-01-02" and "2100-12-31".
Details
The dataset name has been kept as Bogota_business_Date to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the ColombiAPI package and assists users in identifying its specific characteristics. The suffix '_Date' indicates that the dataset is an object of the Date class. The original content has not been modified in any way.
Source
Originally taken from the quantdates package version 2.0.4.
Bogota Holidays Dates
Description
This dataset, Bogota_holidays_Date, is a Date object containing the official holidays of Bogota, Colombia.
Usage
data(Bogota_holidays_Date)
Format
A Date object with 1787 observations:
- Bogota_holidays_Date
Date object representing the official holidays in Bogota, Colombia, ranging between "1990-01-01" and "2100-12-08".
Details
The dataset name has been kept as Bogota_holidays_Date to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the ColombiAPI package and assists users in identifying its specific characteristics. The suffix '_Date' indicates that the dataset is an object of the Date class. The original content has not been modified in any way.
Source
Originally taken from the quantdates package version 2.0.4.
Holidays were created using the bizdays
package.
Bogota Malls Information
Description
This dataset, Bogota_malls_tbl_df, is a tibble containing detailed information about shopping malls located in Bogota, Colombia. It includes the mall names, physical addresses, web URLs, geographical coordinates, and the locality to which each mall belongs.
Usage
data(Bogota_malls_tbl_df)
Format
A tibble with 42 observations and 6 variables:
- NAME
Character string representing the name of the shopping mall.
- ADDRESS
Character string representing the physical address of the mall.
- URL
Character string representing the web URL of the mall.
- LATITUD
Numeric value representing the latitude of the mall.
- LONGITUD
Numeric value representing the longitude of the mall.
- LOCALIDAD
Character string representing the locality the mall belongs to.
Details
The dataset name has been kept as Bogota_malls_tbl_df to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the ColombiAPI package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
Originally taken from Kaggle, url: https://www.kaggle.com/datasets/erik172/bogota-shopping-malls.
WiFi Connectivity Zones in Bucaramanga (Spanish speaking)
Description
This dataset, Bucaramanga_wifi_tbl_df, is a tibble containing information about community WiFi connection zones in the municipality of Bucaramanga. It includes details such as the zone name, address, neighborhood, commune, urban or rural classification, department, municipality, and geographic coordinates. This dataset was taken from GOV.CO Datos Abiertos, last update on September 20, 2025.
Usage
data(Bucaramanga_wifi_tbl_df)
Format
A tibble with 82 observations and 9 variables:
- NOMBRE ZONA WIFI
Name of the WiFi zone (character)
- DIRECCION
Address of the WiFi zone (character)
- BARRIO
Neighborhood of the WiFi zone (character)
- COMUNA
Commune of the WiFi zone (character)
- ZONA (URBANA/RURAL)
Urban or rural classification (character)
- DEPARTAMENTO
Department where the zone is located (character)
- MUNICIPIO
Municipality where the zone is located (character)
- LATITUD
Latitude coordinate of the WiFi zone (character)
- LONGITUD
Longitude coordinate of the WiFi zone (character)
Details
The dataset name has been kept as 'Bucaramanga_wifi_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the ColombiAPI package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble object. The original content has not been modified in any way.
Source
Data obtained from GOV.CO Datos Abiertos: https://www.datos.gov.co/Ciencia-Tecnolog-a-e-Innovaci-n/SERVICIO-DE-CONECTIVIDAD-ZONAS-WIFI/9n9m-2dqq/about_data
Cannabis Licenses Information
Description
This dataset, Cannabis_Licenses_tbl_df, is a tibble containing detailed information about cannabis-related licenses issued by the Ministry of Justice and Law in Colombia. It includes data on the number of received, denied, archived, modified, and granted licenses for cannabis cultivation, as well as the licenses issued for both psychoactive and non-psychoactive cannabis plants.
Usage
data(Cannabis_Licenses_tbl_df)
Format
A tibble with 92 observations and 10 variables:
- Mes
Character string representing the month in which the data was recorded.
- Solicitudes de licencias recibidas
Numeric value representing the number of license requests received.
- Solicitudes Negadas
Numeric value representing the number of denied license requests.
- Solicitudes Archivadas
Numeric value representing the number of archived license requests.
- Solicitudes Modificadas
Numeric value representing the number of modified license requests.
- Licencias Otorgadas
Numeric value representing the number of licenses granted.
- Licencias expedidas de semillas para siembra
Numeric value representing the number of seed cultivation licenses issued.
- Licencias expedidas de plantas cannabis psicoactivo
Numeric value representing the number of psychoactive cannabis plant cultivation licenses issued.
- Licencias expedidas de plantas cannabis no psicoactivo
Numeric value representing the number of non-psychoactive cannabis plant cultivation licenses issued.
- Año
Numeric value representing the year in which the data was recorded.
Details
The dataset name has been kept as Cannabis_Licenses_tbl_df to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the ColombiAPI package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
Originally taken from Kaggle, url: https://www.kaggle.com/datasets/josetroyatoscano/licencias-cannabis-colombia-full-dataset-feb-2024
Connected Devices in WiFi Zones of Cartagena (Spanish speaking)
Description
This dataset, Cartagena_wifi_tbl_df, is a tibble containing information about the number of devices connected in the public WiFi zones of the District of Cartagena. It includes the year, month, corregimiento or neighborhood, WiFi zone, and counts of connected devices by type (Other, Smartphone, Tablet, PC). The dataset was obtained from GOV.CO Datos Abiertos, last update on September 23, 2025.
Usage
data(Cartagena_wifi_tbl_df)
Format
A tibble with 468 observations and 8 variables:
- AÑO
Year of the record (numeric)
- MES
Month of the record (character)
- CORREGIMIENTO/BARRIO
Name of the corregimiento or neighborhood (character)
- ZONA WIFI
Name of the WiFi zone (character)
- OTRO
Number of connected devices categorized as 'Other' (numeric)
- SMARTPHONE
Number of connected smartphones (numeric)
- TABLET
Number of connected tablets (numeric)
- PC
Number of connected PCs (numeric)
Details
The dataset name has been kept as 'Cartagena_wifi_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the ColombiAPI package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble object. The original content has not been modified in any way.
Source
Data obtained from GOV.CO Datos Abiertos: https://www.datos.gov.co/Ciencia-Tecnolog-a-e-Innovaci-n/Dispositivos-conectados-en-las-zonas-wifi-del-dist/94zk-4tpe/about_data
Colombian Coffee 2016 Export/Import
Description
This dataset, Colombia_coffee_tbl_df, is a tibble containing detailed information about Colombian coffee exports and imports in 2016. It includes data on various factors such as trade flow, commodity codes, trade value, weight in kilograms, and information about the countries involved in the trade. The dataset is sourced from Kaggle and provides a comprehensive analysis of the foreign trade of Colombian coffee.
Usage
data(Colombia_coffee_tbl_df)
Format
A tibble with 106 observations and 35 variables:
- Classification
Character string representing the classification of the coffee product.
- Year
Numeric value representing the year of the data entry.
- Period
Numeric value representing the period of the trade.
- Period Desc.
Numeric value representing the period description.
- Aggregate Level
Numeric value representing the aggregate level of the trade data.
- Is Leaf Code
Numeric value indicating whether the trade involves a leaf product.
- Trade Flow Code
Numeric value representing the trade flow code.
- Trade Flow
Character string representing the flow of the trade, either export or import.
- Reporter Code
Numeric value representing the code of the reporting country.
- Reporter
Character string representing the reporting country.
- Reporter ISO
Character string representing the ISO code of the reporting country.
- Partner Code
Numeric value representing the code of the partner country.
- Partner
Character string representing the partner country.
- Partner ISO
Character string representing the ISO code of the partner country.
- 2nd Partner Code
Logical value indicating if there is a second partner country involved in the trade.
- 2nd Partner
Logical value indicating if there is a second partner country.
- 2nd Partner ISO
Logical value indicating if there is a second partner country's ISO code.
- Customs Proc. Code
Logical value representing the customs procedure code.
- Customs
Logical value indicating whether customs procedure information is available.
- Mode of Transport Code
Logical value representing the mode of transport code.
- Mode of Transport
Logical value indicating the mode of transport used in the trade.
- Commodity Code
Character string representing the commodity code for coffee.
- Commodity
Character string representing the commodity (coffee).
- Qty Unit Code
Numeric value representing the quantity unit code.
- Qty Unit
Character string representing the unit of quantity for the trade.
- Qty
Logical value representing the quantity of coffee in the trade.
- Alt Qty Unit Code
Logical value representing an alternative quantity unit code.
- Alt Qty Unit
Numeric value representing an alternative quantity unit for the trade.
- Alt Qty
Logical value representing an alternative quantity for the trade.
- Netweight (kg)
Numeric value representing the net weight of the coffee in kilograms.
- Gross weight (kg)
Logical value indicating the gross weight of the coffee in kilograms.
- Trade Value (US)
Numeric value representing the trade value in US dollars.
- CIF Trade Value (US)
Logical value representing the CIF trade value in US dollars.
- FOB Trade Value (US)
Logical value representing the FOB trade value in US dollars.
- Flag
Numeric value representing a flag for the dataset entry.
Details
The dataset name has been kept as Colombia_coffee_tbl_df to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the ColombiAPI package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
Originally taken from Kaggle, url: https://www.kaggle.com/datasets/carlosbeltranv/colombian-coffee-2016
Medellin Rainfall Information
Description
This dataset, Medellin_rain_tbl_df, is a tibble containing information about rainfall measurements in Medellin, Colombia, including station identifiers, geographical coordinates, date of observation, and rainfall values.
Usage
data(Medellin_rain_tbl_df)
Format
A tibble with 185,705 observations and 8 variables:
- station_id
Numeric identifier for the rain observation station.
- lat
Numeric value representing the latitude of the station.
- lon
Numeric value representing the longitude of the station.
- date
Date of the rainfall observation.
- year
Numeric year of the observation.
- month
Numeric month of the observation.
- day
Numeric day of the observation.
- rainfall
Numeric value representing the amount of rainfall in millimeters.
Details
The dataset name has been kept as Medellin_rain_tbl_df to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the ColombiAPI package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
Originally taken from the tidychangepoint package version 0.0.1.
Tulua Public Schools Information
Description
This dataset, Tulua_Public_Schools_tbl_df, is a tibble containing detailed information about public schools located in the municipality of Tulua, Valle del Cauca, Colombia. It includes the name, location, contact information, and the geographical coordinates of each institution, as well as specific details about each school and its facilities.
Usage
data(Tulua_Public_Schools_tbl_df)
Format
A tibble with 105 observations and 8 variables:
- COD_DANE_INS
Numeric code representing the educational institution's DANE code.
- INSTITUCION
Character string representing the name of the public school.
- SEDE
Character string representing the name of the specific school location or branch.
- COD_DANE_SEDE
Numeric code representing the DANE code of the specific school branch.
- TELEFONO
Character string representing the phone number of the school.
- CORREOS
Character string representing the email addresses of the school.
- DIRECCIÓN
Character string representing the physical address of the school.
- CORDE_GEO
Character string representing the geographical coordinates of the school.
Details
The dataset name has been kept as Tulua_Public_Schools_tbl_df to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the ColombiAPI package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
Originally taken from GOV.CO, url: https://www.datos.gov.co/Educaci-n/Listado-de-colegios-p-blicos-del-municipio-de-Tulu/scz9-f8ud/about_data
Admitted Students to the Science Faculty, UNAL (2013-I)
Description
This dataset, admitted_students_df, is a data frame containing the admission scores and socio-demographic information of 445 students admitted to the seven careers of the Facultad de Ciencias at the Universidad Nacional de Colombia, Bogotá, for the first semester of 2013. It includes exam component scores, gender, socio-economic background, and other demographic attributes.
Usage
data(admitted_students_df)
Format
A data frame with 445 observations and 15 variables:
- carr
Career or program admitted to (factor with 7 levels)
- mate
Mathematics exam score (numeric)
- cien
Science exam score (numeric)
- soci
Social sciences exam score (numeric)
- text
Textual reasoning exam score (numeric)
- imag
Abstract/imagery reasoning exam score (numeric)
- exam
Total exam score (numeric)
- gene
Gender of the student (factor with 2 levels)
- estr
Socio-economic stratum (factor with 3 levels)
- orig
Origin or background classification (factor with 3 levels)
- edad
Age group (factor with 4 levels)
- niLE
Mother tongue indicator (factor with 2 levels)
- niMa
Maternal education indicator (factor with 2 levels)
- stra
Admission stratum or classification (factor with 7 levels)
- age
Age of the student in years (numeric)
Details
The dataset name has been kept as 'admitted_students_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the ColombiAPI package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame object. The original content has not been modified in any way.
Source
Data taken from the FactoClass package version 1.2.9
Mortality of lianas (vines) in tropical forests
Description
This dataset, 'dead_lianas_df', is a data frame containing information on the mortality of vascular epiphytes (lianas) in tropical forests of the Colombian Andes. The study is part of the project "Diversity and dynamics of vascular epiphytes in Colombian Andes" supported by COLCIENCIAS (contract 2115-2013). The data corresponds to the first large-scale assessment of vascular epiphyte mortality in the neotropics.
Usage
data(dead_lianas_df)
Format
A data frame with 4,247 observations and 52 variables:
- PlotSite
Plot site identifier (character)
- Y.Plot
Y coordinate of the plot (numeric)
- X.Plot
X coordinate of the plot (numeric)
- PhoroNo
Phorophyte number (integer)
- EpiFam
Epiphyte family (character)
- EpiGen
Epiphyte genus (character)
- cf.aff
Taxonomic uncertainty or affiliation (character)
- Species
Epiphyte species name (character)
- Author
Species author (character)
- EpiAzi
Epiphyte azimuth (character)
- BraAzi
Branch azimuth (integer)
- EpiDisTru
Epiphyte distance from trunk (numeric)
- EpiSize
Epiphyte size (numeric)
- EpiAttHei
Epiphyte attachment height (numeric)
- Date0
Initial survey date (Date)
- Date1
Follow-up survey date (Date)
- Location
Location name (character)
- Mortality
Mortality status (factor with 2 levels)
- MorCat
Mortality category (character)
- Elevation
Elevation in meters (integer)
- AP_bio12
Annual precipitation (integer)
- PDM_bio14
Precipitation of driest month (integer)
- PS_bio15
Precipitation seasonality (integer)
- MDT_bio2
Mean diurnal temperature range (integer)
- TS_bio4
Temperature seasonality (integer)
- ATR_bio7
Annual temperature range (integer)
- AET
Actual evapotranspiration (integer)
- BasAre
Basal area (numeric)
- BasAre5_10
Basal area of trees 5–10 cm DBH (numeric)
- BasAre10
Basal area of trees
\ge
10 cm DBH (numeric)- Ind10
Number of individuals
\ge
10 cm DBH (integer)- Ind5
Number of individuals
\ge
5 cm DBH (numeric)- Ind5_10
Number of individuals 5–10 cm DBH (numeric)
- Ind10_15
Number of individuals 10–15 cm DBH (integer)
- Ind15_20
Number of individuals 15–20 cm DBH (integer)
- Ind20_25
Number of individuals 20–25 cm DBH (integer)
- Ind25_30
Number of individuals 25–30 cm DBH (integer)
- Ind30
Number of individuals
\ge
30 cm DBH (integer)- TreeHei
Tree height (numeric)
- MedHei
Median tree height (numeric)
- MaxHei
Maximum tree height (numeric)
- BranchNumb
Number of branches (integer)
- MEAH
Mean epiphyte attachment height (numeric)
- MEAH_sd
Standard deviation of mean epiphyte attachment height (numeric)
- MSP
Mean species per phorophyte (integer)
- MSP_sd
Standard deviation of mean species per phorophyte (integer)
- Htree
Mean tree height (numeric)
- Htree_sd
Standard deviation of mean tree height (numeric)
- BranNum
Mean branch number (numeric)
- BranNUm_sd
Standard deviation of branch number (numeric)
- tree.id
Tree identifier (character)
- dead
Tree mortality status (factor with 2 levels)
Details
The dataset name has been kept as 'dead_lianas_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the ColombiAPI package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame object. The original content has not been modified in any way.
Source
Data taken from the biometrics package version 1.0.2
Digital Centers in Sucre (Spanish speaking)
Description
This dataset, digital_centers_tbl_df, is a tibble containing information about Digital Centers located in the department of Sucre. It includes details such as department name, municipality name, PDET status, name of the populated center, educational center name, start date of operation, end date of initial operation, and current status. The dataset was obtained from GOV.CO Datos Abiertos, last update on April 20, 2024.
Usage
data(digital_centers_tbl_df)
Format
A tibble with 475 observations and 8 variables:
- NOMBRE DEPARTAMENTO
Name of the department (character)
- NOMBRE MUNICIPIO
Name of the municipality (character)
- PDET
PDET status (character)
- NOMBRE CENTRO POBLADO
Name of the populated center (character)
- NOMBRE SEDE EDUCATIVA
Name of the educational center (character)
- FECHA INICIO DE OPERACIÓN
Start date of operation (character)
- FIN DE OPERACIÓN INICIAL
End date of initial operation (character)
- ESTADO
Current status of the center (character)
Details
The dataset name has been kept as 'digital_centers_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the ColombiAPI package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble object. The original content has not been modified in any way.
Source
Data obtained from GOV.CO Datos Abiertos: https://www.datos.gov.co/Ciencia-Tecnolog-a-e-Innovaci-n/Centros-Digitales-Sucre-/pwyn-md8n/about_data
"Advancing in Digital" Program Graduates (Spanish speaking)
Description
This dataset, digital_graduates_tbl_df, is a tibble containing information about graduates of the first cohort in the various courses of the "Avanzando en lo Digital" program, an initiative aimed at developing digital skills. It includes data such as year, age, gender, courses enrolled, zone, ethnicity, and socioeconomic stratum. The dataset was obtained from GOV.CO Datos Abiertos, last update on October 3, 2024.
Usage
data(digital_graduates_tbl_df)
Format
A tibble with 230 observations and 7 variables:
- AÑO
Year of graduation (numeric)
- EDAD
Age of the graduate (numeric)
- GENERO
Gender of the graduate (character)
- CURSOS INSCRITOS
Courses in which the graduate was enrolled (character)
- ZONA
Geographical zone of the graduate (character)
- ETNIA
Ethnicity of the graduate (character)
- ESTRATO
Socioeconomic stratum of the graduate (numeric)
Details
The dataset name has been kept as 'digital_graduates_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the ColombiAPI package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble object. The original content has not been modified in any way.
Source
Data obtained from GOV.CO Datos Abiertos: https://www.datos.gov.co/Ciencia-Tecnolog-a-e-Innovaci-n/GRADUADOS-AVANZANDO-EN-LO-DIGITAL-PRIMERA-PROMOCIO/yfz9-fwqr/about_data
Non-Resident Foreign Visitors to Colombia (Spanish speaking)
Description
This dataset, foreign_visitors_tbl_df, is a tibble containing information about non-resident foreigners entering Colombia through various migration points. It includes data on the year, month, destination department, destination city, country of residence of the foreign visitor, and the number of non-resident foreigners. The dataset was obtained from GOV.CO Datos Abiertos, last update on September 25, 2025.
Usage
data(foreign_visitors_tbl_df)
Format
A tibble with 466,915 observations and 6 variables:
- Año
Year of the record (numeric)
- Mes
Month of the record (character)
- Departamento
Destination department in Colombia (character)
- Ciudad
Destination city in Colombia (character)
- PaisOEEResidencia
Country of residence of the foreign visitor (character)
- Cant Extranjeros no Residentes
Number of non-resident foreign visitors (numeric)
Details
The dataset name has been kept as 'foreign_visitors_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the ColombiAPI package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble object. The original content has not been modified in any way.
Source
Data obtained from GOV.CO Datos Abiertos: https://www.datos.gov.co/Comercio-Industria-y-Turismo/Extranjeros-No-Residentes/7wm8-w5ad/about_data
Departmental GDP Projection (Spanish speaking)
Description
This dataset, gdp_departments_tbl_df, is a tibble containing the projected Gross Domestic Product (GDP) at constant prices for the departments of Colombia, expressed in 2015 prices. It includes data such as year, economic activity, sector, price type, DIVIPOLA department code, department name, and GDP value in thousands of millions of pesos. The dataset was obtained from GOV.CO Datos Abiertos, last update on November 1, 2024.
Usage
data(gdp_departments_tbl_df)
Format
A tibble with 16,302 observations and 7 variables:
- Año
Year of the record (numeric)
- Actividad
Economic activity (character)
- Sector
Sector of the economy (character)
- Tipo de precios
Price type (character)
- Código Departamento (DIVIPOLA)
DIVIPOLA code of the department (numeric)
- Departamento
Name of the department (character)
- Valor (miles de millones de pesos)
GDP value in thousands of millions of pesos (numeric)
Details
The dataset name has been kept as 'gdp_departments_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the ColombiAPI package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble object. The original content has not been modified in any way.
Source
Data obtained from GOV.CO Datos Abiertos: https://www.datos.gov.co/Econom-a-y-Finanzas/PIB-Departamental-con-proyecci-n/kgyi-qc7j/about_data
Get Colombia's Airports from API Colombia
Description
Retrieves comprehensive information about all Colombian airports from the API Colombia service. This function fetches data including airport names, codes, types, locations, and geographical coordinates for all airports across Colombia's departments.
Usage
get_colombia_airports()
Details
This function sends a GET request to the API Colombia service.
If the API request fails or returns an error status code,
the function returns NULL
with an informative message.
Note: The API returns latitude and longitude values swapped, so this function
corrects the coordinates by swapping them back to their correct positions.
Value
A tibble with the following columns:
-
id
: Airport identifier (integer) -
name
: Airport name (character) -
iata_code
: IATA airport code (character) -
oaci_code
: ICAO airport code (character) -
type
: Airport type (character) -
city
: City where airport is located (character) -
department
: Department where airport is located (character) -
latitude
: Latitude coordinates (numeric) -
longitude
: Longitude coordinates (numeric)
Note
Requires internet connection.
Source
API Colombia: https://api-colombia.com/api/v1/Airport
See Also
Examples
if (interactive()) {
get_colombia_airports()
}
Get All Touristic Attractions of Colombia
Description
Retrieves selected information about touristic attractions in Colombia from the API Colombia. This function fetches the attraction ID, name, description, latitude, longitude, and city ID.
Usage
get_colombia_attractions()
Value
A tibble with multiple rows, each representing a touristic attraction in Colombia, or NULL if the API is unavailable.
Examples
attractions <- get_colombia_attractions()
print(attractions)
Get Colombia's Under-5 Mortality Rate from World Bank
Description
Retrieves Colombia's under-5 mortality rate, measured as the number of deaths
of children under five years of age per 1,000 live births, for the years 2010 to 2022
using the World Bank Open Data API. The indicator used is SH.DYN.MORT
.
Usage
get_colombia_child_mortality()
Details
This function sends a GET request to the World Bank API.
If the API request fails or returns an error status code,
the function returns NULL
with an informative message.
Value
A tibble with the following columns:
-
indicator
: Indicator name (e.g., "Mortality rate, under-5 (per 1,000 live births)") -
country
: Country name ("Colombia") -
year
: Year of the data (integer) -
value
: Mortality rate (per 1,000 live births)
Note
Requires internet connection.
Source
World Bank Open Data API: https://data.worldbank.org/indicator/SH.DYN.MORT
See Also
Examples
if (interactive()) {
get_colombia_child_mortality()
}
Get All Cities of Colombia
Description
Retrieves basic information about all cities in Colombia from the API Colombia. This function fetches only the city ID and name to provide a clean and concise dataset.
Usage
get_colombia_cities()
Value
A tibble with two columns: id
and name
, or NULL if the API is unavailable.
Examples
cities <- get_colombia_cities()
print(cities)
Get Colombia's Consumer Price Index (2010 = 100) from World Bank
Description
Retrieves Colombia's Consumer Price Index (CPI), with 2010 as the base year (index = 100),
for the years 2010 to 2022 using the World Bank Open Data API.
The indicator used is FP.CPI.TOTL
.
Usage
get_colombia_cpi()
Details
This function sends a GET request to the World Bank API.
If the API request fails or returns an error status code,
the function returns NULL
with an informative message.
Value
A tibble with the following columns:
-
indicator
: Indicator name (e.g., "Consumer price index (2010 = 100)") -
country
: Country name ("Colombia") -
year
: Year of the data (integer) -
value
: Consumer Price Index (numeric, base year 2010 = 100)
Note
Requires internet connection.
Source
World Bank Open Data API: https://data.worldbank.org/indicator/FP.CPI.TOTL
See Also
Examples
if (interactive()) {
get_colombia_cpi()
}
Get Colombia's Departments from API Colombia
Description
Retrieves comprehensive information about all Colombian departments from the API Colombia service. This function fetches data including department names, capitals, geographical information, area, population, and administrative details for all 32 departments plus Bogotá D.C.
Usage
get_colombia_departments()
Details
This function sends a GET request to the API Colombia service.
If the API request fails or returns an error status code,
the function returns NULL
with an informative message.
Value
A tibble with the following columns:
-
id
: Department identifier (integer) -
name
: Department name (character) -
capital
: Capital city name (character) -
surface
: Department area in km² (integer) -
population
: Department population (integer) -
municipalities
: Number of municipalities (integer) -
phone_prefix
: Phone area code (character) -
region_id
: Geographic region identifier (integer)
Note
Requires internet connection.
Source
API Colombia: https://api-colombia.com/api/v1/Department
See Also
Examples
if (interactive()) {
get_colombia_departments()
}
Get Colombia's Energy Use (kg of oil equivalent per capita) from World Bank
Description
Retrieves Colombia's energy use per capita, measured in kilograms of oil equivalent,
for the years 2010 to 2022 using the World Bank Open Data API.
The indicator used is EG.USE.PCAP.KG.OE
.
Usage
get_colombia_energy_use()
Details
This function sends a GET request to the World Bank API.
If the API request fails or returns an error status code,
the function returns NULL
with an informative message.
Value
A tibble with the following columns:
-
indicator
: Indicator name (e.g., "Energy use (kg of oil equivalent per capita)") -
country
: Country name ("Colombia") -
year
: Year of the data (integer) -
value
: Energy use in kilograms of oil equivalent per capita
Note
Requires internet connection.
Source
World Bank Open Data API: https://data.worldbank.org/indicator/EG.USE.PCAP.KG.OE
See Also
Examples
if (interactive()) {
get_colombia_energy_use()
}
Get Colombia's GDP (current US$) from World Bank
Description
Retrieves Colombia's Gross Domestic Product (GDP) in current US dollars
for the years 2010 to 2022 using the World Bank Open Data API.
The indicator used is NY.GDP.MKTP.CD
.
Usage
get_colombia_gdp()
Details
This function sends a GET request to the World Bank API.
If the API request fails or returns an error status code,
the function returns NULL
with an informative message.
Value
A tibble with the following columns:
-
indicator
: Indicator name (e.g., "GDP (current US$)") -
country
: Country name ("Colombia") -
year
: Year of the data (integer) -
value
: GDP in current US dollars -
value_label
: Formatted GDP with commas (e.g., "1,800,000,000,000")
Note
Requires internet connection.
Source
World Bank Open Data API: https://data.worldbank.org/indicator/NY.GDP.MKTP.CD
See Also
GET
, fromJSON
, as_tibble
, comma
Examples
if (interactive()) {
get_colombia_gdp()
}
Get Official Public Holidays in Colombia for a Given Year
Description
Retrieves the list of official public holidays in Colombia for a specific year using the Nager.Date public holidays API. This function returns a tibble containing the date of the holiday, the name in the local language (Spanish), and the English name. It is useful for academic, planning, and data analysis purposes. The information is retrieved directly from the Nager.Date API and reflects the current status of holidays for the requested year. The field names returned are consistent with the API structure.
Usage
get_colombia_holidays(year)
Arguments
year |
An integer indicating the year (e.g., 2024 or 2025). |
Value
A tibble with the following columns:
-
date
: Date of the holiday (classDate
) -
local_name
: Holiday name in the local language (Spanish) -
name
: Holiday name in English
Source
Data obtained from the Nager.Date API: https://date.nager.at/
Examples
get_colombia_holidays(2024)
get_colombia_holidays(2025)
Get Colombia's Hospital Beds (per 1,000 people) from World Bank
Description
Retrieves Colombia's number of hospital beds per 1,000 people
for the years 2010 to 2022 using the World Bank Open Data API.
The indicator used is SH.MED.BEDS.ZS
.
Usage
get_colombia_hospital_beds()
Details
This function sends a GET request to the World Bank API.
If the API request fails or returns an error status code,
the function returns NULL
with an informative message.
Value
A tibble with the following columns:
-
indicator
: Indicator name (e.g., "Hospital beds (per 1,000 people)") -
country
: Country name ("Colombia") -
year
: Year of the data (integer) -
value
: Number of hospital beds per 1,000 people
Note
Requires internet connection.
Source
World Bank Open Data API: https://data.worldbank.org/indicator/SH.MED.BEDS.ZS
See Also
Examples
if (interactive()) {
get_colombia_hospital_beds()
}
Get Invasive Species of Colombia
Description
Retrieves information about invasive species in Colombia from the API Colombia. This function fetches only selected fields: id, name, scientificName, commonNames, impact, manage, and riskLevel.
Usage
get_colombia_invasive_species()
Value
A tibble with multiple rows and the selected columns, or NULL if the API is unavailable.
Examples
invasive_species <- get_colombia_invasive_species()
print(invasive_species)
Get Colombia's Life Expectancy at Birth (Total, Years) from World Bank
Description
Retrieves Colombia's life expectancy at birth (total, years)
for the years 2010 to 2022 using the World Bank Open Data API.
The indicator used is SP.DYN.LE00.IN
.
Usage
get_colombia_life_expectancy()
Details
This function sends a GET request to the World Bank API.
If the API request fails or returns an error status code,
the function returns NULL
with an informative message.
Value
A tibble with the following columns:
-
indicator
: Indicator name (e.g., "Life expectancy at birth, total (years)") -
country
: Country name ("Colombia") -
year
: Year of the data (integer) -
value
: Life expectancy at birth in years
Note
Requires internet connection.
Source
World Bank Open Data API: https://data.worldbank.org/indicator/SP.DYN.LE00.IN
See Also
Examples
if (interactive()) {
get_colombia_life_expectancy()
}
Get Colombia's Adult Literacy Rate
Description
Retrieves Colombia's adult literacy rate (
for the years 2010 to 2022 using the World Bank Open Data API.
The indicator used is SE.ADT.LITR.ZS
.
Usage
get_colombia_literacy_rate()
Details
This function sends a GET request to the World Bank API.
If the API request fails or returns an error status code,
the function returns NULL
with an informative message.
Value
A tibble with the following columns:
-
indicator
: Indicator name -
country
: Country name ("Colombia") -
year
: Year of the data (integer) -
value
: Literacy rate as a percentage
Note
Requires internet connection.
Source
World Bank Open Data API: https://data.worldbank.org/indicator/SE.ADT.LITR.ZS
See Also
Examples
literacy_data <- get_colombia_literacy_rate()
head(literacy_data)
Get Native Communities of Colombia
Description
Retrieves information about Colombia's native communities from the API Colombia. This function fetches only selected fields: id, name, description, and languages.
Usage
get_colombia_native_communities()
Value
A tibble with multiple rows and the selected columns, or NULL if the API is unavailable.
Examples
native_communities <- get_colombia_native_communities()
print(native_communities)
Get Natural Areas of Colombia
Description
Retrieves information about natural areas in Colombia from the API Colombia. This function fetches only selected fields: id, areaGroupId, categoryNaturalAreaId, name, departmentId, daneCode, and landArea.
Usage
get_colombia_natural_areas()
Value
A tibble with multiple rows and the selected columns, or NULL if the API is unavailable.
Examples
natural_areas <- get_colombia_natural_areas()
print(natural_areas)
Get Colombia's Total Population from World Bank
Description
Retrieves Colombia's total population for the years 2010 to 2022
using the World Bank Open Data API. The indicator used is SP.POP.TOTL
.
Usage
get_colombia_population()
Details
The function sends a GET request to the World Bank API.
If the API request fails or returns an error status code, the function returns NULL
with an informative message.
Value
A tibble with the following columns:
-
indicator
: Indicator name (e.g., "Population, total") -
country
: Country name ("Colombia") -
year
: Year of the data (integer) -
value
: Population as a numeric value -
value_label
: Formatted population with commas (e.g., "51,000,000")
Note
Requires internet connection. The data is retrieved in real time from the World Bank API.
Source
World Bank Open Data API: https://data.worldbank.org/indicator/SP.POP.TOTL
See Also
GET
, fromJSON
, as_tibble
, comma
Examples
if (interactive()) {
get_colombia_population()
}
Get Colombia's Presidents from API Colombia
Description
Retrieves comprehensive information about all Colombian presidents from 1886 to present from the API Colombia service. This function fetches data including names, political parties, government periods, descriptions, and biographical information for each president.
Usage
get_colombia_presidents()
Details
This function sends a GET request to the API Colombia service.
If the API request fails or returns an error status code,
the function returns NULL
with an informative message.
Value
A tibble with the following columns:
-
id
: President identifier (integer) -
name
: President's first name(s) (character) -
last_name
: President's last name(s) (character) -
political_party
: Political party affiliation (character) -
start_period_date
: Start date of presidency (character, YYYY-MM-DD format) -
end_period_date
: End date of presidency (character, YYYY-MM-DD format or NULL for current)
Note
Requires internet connection.
Source
API Colombia: https://api-colombia.com/api/v1/President
See Also
Examples
if (interactive()) {
get_colombia_presidents()
}
Get Radios of Colombia
Description
Retrieves information about radio stations in Colombia from the API Colombia. This function fetches only the following fields: id, name, frequency, and band.
Usage
get_colombia_radios()
Value
A tibble with multiple rows and the selected columns, or NULL if the API is unavailable.
Examples
radios <- get_colombia_radios()
print(radios)
Get Colombia's Regions from API Colombia
Description
Retrieves comprehensive information about Colombia's six natural regions from the API Colombia service. This function fetches data including region names, descriptions, and identifiers for all Colombian regions: Caribe, Pacífico, Orinoquía, Amazonía, Andina, and Insular.
Usage
get_colombia_regions()
Details
This function sends a GET request to the API Colombia service.
If the API request fails or returns an error status code,
the function returns NULL
with an informative message.
Value
A tibble with the following columns:
-
id
: Region identifier (integer) -
name
: Region name (character) -
description
: Detailed description of the region (character)
Note
Requires internet connection.
Source
API Colombia: https://api-colombia.com/api/v1/Region
See Also
Examples
if (interactive()) {
get_colombia_regions()
}
Get Typical Dishes of Colombia
Description
Retrieves information about Colombia's typical dishes from the API Colombia. This function fetches only selected fields: id, name, description, and region.
Usage
get_colombia_typical_dishes()
Value
A tibble with multiple rows and the selected columns, or NULL if the API is unavailable.
Examples
dishes <- get_colombia_typical_dishes()
print(dishes)
Get Colombia's Unemployment Rate (Total) from World Bank
Description
Retrieves Colombia's total unemployment rate, measured as a percentage of the total labor force,
for the years 2010 to 2022 using the World Bank Open Data API.
The indicator used is SL.UEM.TOTL.ZS
.
Usage
get_colombia_unemployment()
Details
This function sends a GET request to the World Bank API.
If the API request fails or returns an error status code,
the function returns NULL
with an informative message.
Value
A tibble with the following columns:
-
indicator
: Indicator name (e.g., "Unemployment, total ( -
country
: Country name ("Colombia") -
year
: Year of the data (integer) -
value
: Unemployment rate as a numeric value (percentage)
Note
Requires internet connection.
Source
World Bank Open Data API: https://data.worldbank.org/indicator/SL.UEM.TOTL.ZS
See Also
Examples
if (interactive()) {
get_colombia_unemployment()
}
Get Country Information for Colombia
Description
Retrieves comprehensive country information for Colombia from the REST Countries API. This function fetches data including official and common names, geographical information, capital, area, population, and languages.
Usage
get_country_info_co()
Value
A tibble with one row containing Colombia's country information, or NULL if the API is unavailable.
Examples
co_info <- get_country_info_co()
print(co_info)
Comparative Vocabulary for Colombia's Indigenous Languages
Description
This dataset, indigenous_vocabulary_df, is a data frame containing a comparative vocabulary (a "wordlist") for 69 indigenous languages of Colombia, originally compiled by Huber & Reed (1992). The dataset provides lexical correspondences across multiple languages, supporting linguistic, anthropological, and historical research.
Usage
data(indigenous_vocabulary_df)
Format
A data frame with 27,521 observations and 4 variables:
- CONCEPT
Gloss or concept represented in the wordlist (factor with 366 levels)
- COUNTERPART
Word corresponding to the concept in the given language (character)
- DOCULECT
Name of the language or variety (factor with 71 levels)
- TOKENS
Tokenized form of the counterpart (character)
Details
The dataset name has been kept as 'indigenous_vocabulary_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the ColombiAPI package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame object. The original content has not been modified in any way.
Source
Data taken from the qlcMatrix package version 0.9.9
Municipalities of Colombia (Spanish speaking)
Description
This dataset, municipalities_tbl_df, is a tibble containing the list of municipalities of Colombia and their corresponding departments, ordered alphabetically.
Usage
data(municipalities_tbl_df)
Format
A tibble with 1,141 observations and 2 variables:
- Departamento
Name of the department (character)
- Municipio
Name of the municipality (character)
Details
The dataset name has been kept as 'municipalities_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the ColombiAPI package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble object. The original content has not been modified in any way.
Source
Data obtained from Kaggle: https://www.kaggle.com/datasets/cavp95/municipios-colombia
Departments by Levels of Schools in Colombia
Description
This dataset, school_levels_df, is a data frame containing a contingency table that classifies schools in Colombia by department and level of performance of their students. The dataset summarizes counts of schools across different performance categories in three main subjects: communication, mathematics, and science.
Usage
data(school_levels_df)
Format
A data frame with 29 observations and 12 variables:
- Cinferior
Number of schools with communication results at the inferior level (numeric)
- Cbajo
Number of schools with communication results at the low level (numeric)
- Cmedio
Number of schools with communication results at the medium level (numeric)
- Calto
Number of schools with communication results at the high level (numeric)
- Minferior
Number of schools with mathematics results at the inferior level (numeric)
- Mbajo
Number of schools with mathematics results at the low level (numeric)
- Mmedio
Number of schools with mathematics results at the medium level (numeric)
- Malto
Number of schools with mathematics results at the high level (numeric)
- Tinferior
Number of schools with science results at the inferior level (numeric)
- Tbajo
Number of schools with science results at the low level (numeric)
- Tmedio
Number of schools with science results at the medium level (numeric)
- Talto
Number of schools with science results at the high level (numeric)
Details
The dataset name has been kept as 'school_levels_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the ColombiAPI package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame object. The original content has not been modified in any way.
Source
Data taken from the FactoClass package version 1.2.9
Vehicle Count: Cars and Motorcycles (Spanish speaking)
Description
This dataset, vehicle_count_tbl_df, is a tibble containing information from a license plate recognition camera system designed to generate trends and metrics for traffic management decision-making. It includes data on the date, sensor name, type of vehicle, and the number of vehicles counted. The dataset was obtained from GOV.CO Datos Abiertos, last update on March 26, 2024.
Usage
data(vehicle_count_tbl_df)
Format
A tibble with 262,114 observations and 4 variables:
- Fecha
Date of the record (character)
- Nombre Sensor
Name of the sensor capturing the data (character)
- Tipo Vehiculo
Type of vehicle (Cars, Motorcycles, etc.) (character)
- Cantidad
Number of vehicles counted (numeric)
Details
The dataset name has been kept as 'vehicle_count_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the ColombiAPI package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble object. The original content has not been modified in any way.
Source
Data obtained from GOV.CO Datos Abiertos: https://www.datos.gov.co/Ciencia-Tecnolog-a-e-Innovaci-n/Conteo-de-Autos-y-Motos/s53k-ztax/about_data
View Available Datasets in ColombiAPI
Description
This function lists all datasets available in the 'ColombiAPI' package. If the 'ColombiAPI' package is not loaded, it stops and shows an error message. If no datasets are available, it returns a message and an empty vector.
Usage
view_datasets_ColombiAPI()
Value
A character vector with the names of the available datasets. If no datasets are found, it returns an empty character vector.
Examples
if (requireNamespace("ColombiAPI", quietly = TRUE)) {
library(ColombiAPI)
view_datasets_ColombiAPI()
}