Code Samples
Explore my GitHub repositories for code samples and project analyses:
- China Financial Flow: Analysis of Chinese financial flows and their impact on economic development.
This repository contains replication files and documentation for the project “Chinese Financial Flows: A Growth Lever for Recipient Economies?” The analysis evaluates the impact of Chinese financial flows on the economic development of recipient countries using econometric and statistical tools.
Tools and Methods Used:
- R: Data preprocessing, visualization, and descriptive analysis.
- Stata: Econometric modeling and statistical analysis.
- Python: Advanced network graph creation and geographic mapping.
Click to View Details
- Data Handling and Cleaning
- Extensive processing of raw data from various sources.
- Integration of control variables for robust analysis.
- Creation of intermediate and final datasets for econometric models.
- Econometric Analysis
- Estimation of impacts using advanced econometric models.
- Sub-sample analyses for Africa, Americas, and Asia to capture regional differences.
- Examination of gross fixed capital formation as a key development outcome.
- Use of within-country, instrumental variable, and dynamic panel methods.
- Visualization
- Graphs and maps illustrating Chinese financial flows, investments, and project distributions.
- Network graphs showing connections between China and recipient countries.
- Heatmaps and time-series visuals for trends over time.
- Mapping and Cartography
- Geospatial mapping of projects using precise geographic coordinates.
- Country-level mapping to highlight the dispersion of Chinese investments globally.
- Interactive maps for user exploration of investment patterns.
- Tax Data Analysis: Statistical analysis and insights into regional tax trends and revenue mobilization.
This repository contains scripts and replication files for the Tax Data Analysis project, which explores regional trends in taxation, focusing on direct taxes, VAT, property taxes, and international trade taxes as a percentage of GDP. The project uses Stata for data processing, econometric analysis, and visualization of results.
Tools and Methods Used:
- Stata: Data cleaning, panel setup, and graphical visualizations.
- Excel: Input data preparation and formatting.
Click to View Details
- Data Preparation and Cleaning
- Merging raw tax data with country-region mapping for comprehensive analysis.
- Removing duplicates to create a clean panel dataset.
- Generating unique country-year identifiers for accurate tracking.
- Variable Creation
- Creation of normalized variables showing tax components as a percentage of GDP.
- Key variables include:
- Direct Taxes (`DirectTaxesIncludingSCIncRe_pct`).
- Taxes on Income, Profits, and Capital (`TaxesonIncomeProfitsCapita_pct`).
- Property Taxes (`PropertyTaxes_pct`).
- Indirect Taxes (`IndirectTaxesTotal_pct`).
- VAT (`TaxesonGoodsandServicesVAT_pct`).
- Taxes on International Trade (Total, Imports, Exports).
- Visualization
- Line graphs showcasing trends in tax contributions across regions.
- Visualizations for specific tax categories (e.g., VAT, property taxes, international trade taxes).
- Regional breakdowns (Africa, Americas, Asia, Europe, Oceania).
- Regional Analysis of Tax Trends
- Identification of differences in tax trends by region over time.
- Comparison of tax contributions to GDP across continents.
- Insights into revenue mobilization strategies in different regions.
This repository contains Python scripts and datasets for the ACLED Data Analysis project. The project visualizes armed conflict events by region and year, highlighting event typ s, their geographic distribution, and associated fatalities. Maps are generated to provide insights into conflict patterns and severity.
Tools and Methods Used:
- Python:
pandas
for data handling and preprocessing.matplotlib
for plotting and visualization.geopandas
for geographic shapefile manipulation and mapping.
Click to View Details
- Data Preparation
- Loading and preprocessing ACLED datasets.
- Converting event dates to datetime format and extracting year information.
- Filtering data by year (e.g., 2024) for focused analysis.
- Mapping and Geographic Analysis
- Loading and filtering geographic shapefiles for countries within specific regions.
- Highlighting countries affected by conflicts within the selected region.
- Overlaying event points with geographic data for enhanced visualization.
- Visualization
- Plotting conflict events by type using scatter points differentiated by colors.
- Scaling point sizes based on fatalities to indicate event severity.
- Annotating country names at centroids for improved readability.
- Including custom legends to scale fatalities for easier interpretation.
- Regional Insights
- Exploring conflict patterns and event types for each region.
- Identifying geographic hotspots of conflict activity within regions.
- Providing visual insights into the scale and severity of events across regions.
- African Educational Data Analysis (PASEC): PASEC Data Visualization: Analysis and visual exploration of educational test scores from PASEC 2019, focusing on gender and country-level trends in mathematics and reading. This repository contains two Python scripts for analyzing and visualizing educational test score data for Grade 6 students. The project uses box plots, KDE plots, and faceted visualizations to highlight gender and country-specific disparities.
Tools and Methods Used: Python: pandas: For data preprocessing and analysis. matplotlib and seaborn: For creating detailed visualizations. Pillow (PIL): For resizing and optimizing output images.
Click to View Details
- Box Plot Visualization (`Box plot.py`)
- Generated box plots for mathematics and reading scores by country and gender.
- Included custom legends showing the percentage of male and female students.
- Created a combined visualization for mathematics and reading scores with a shared legend.
- Added author signature and software details to all visualizations.
- Score Distribution Visualization (`Distribution.py`)
- Created KDE plots for overall score distributions by gender in mathematics and reading.
- Generated faceted KDE grids to visualize score distributions by country and gender.
- Optimized visuals for high-quality outputs suitable for publications.
- Sample Visuals
- Box Plot for Mathematics Scores: `BoxPlot_MATHS_PV5.png`
- KDE Plot for Overall Mathematics Scores: `Overall_Distribution_MATHS_PV5.png`
- Faceted KDE for Reading Scores by Country: `Facet_Distribution_LECT_PV5.png`
For more projects, visit my GitHub profile.