Lucas Santos

Data Analyst

Madrid, Community of Madrid, Spain5 yrs 9 mos experience

Key Highlights

  • Expert in ETL processes and data automation.
  • Proficient in Python for data analysis and reporting.
  • Successful in generating high-quality client leads.
Stackforce AI infers this person is a Data Analyst specializing in ETL processes and data-driven insights for the Energy and Payments industries.

Contact

Skills

Core Skills

PythonEtl (extract, Transform, Load)Geospatial AnalysisAutomationData AnalysisKpi Development

Other Skills

AWS AthenaAirflowAmazon AthenaAnálise de dadosApache AirflowBanco de dadosBigQueryDashboardsData ScienceDatabasesETL (Extração, transformação e carregamento)GCPGeoPandasGoogle Data StudioLinux

About

Data Analyst with 3+ years of experience working with data. Proficient in creating and managing ETL processes with Python, Airflow, PostgreSQL, and SQL Server. Relevant projects include generating a list of 100+ potential clients for the sales team of a large energy broker, far outperforming third-party leads, using Python, GeoPandas, GCP, BigQuery, and OpenStreetMap geocoding API for geospatial analysis; also automated pipelines for high-level report creation for a contactless payments client, optimizing the process and reducing times by 50%, using Python, Airflow, and Looker Studio to compile data from S3 and PostgreSQL.

Experience

Factored

Data Analyst

Dec 2024Present · 1 yr 3 mos · Palo Alto, California, United States · Remote

Dashboards

Bayer

Data analyst (outsourced)

Jun 2024Oct 2024 · 4 mos · São Paulo, Brazil

Dashboards

Alupar

Data Analyst

Mar 2023May 2024 · 1 yr 2 mos · São Paulo, Brazil · Hybrid

  • Built ETL pipelines using Python, Airflow, SQL Server and Looker Studio to automate the ETL process for the data platform gathering data to monitor the sector scenario with up-to-date data on weather, price forecasting and generation.
  • Built and maintained an Airflow instance to automate DAGs that executed various tasks such as ETL and data replication processes, allowing the automatization of hourly tasks, to schedule and monitor data workflows efficiently.
  • Developed a pipeline to provide business teams with information about energy measurements for clients in the energy sector and their own generator plants. The main goal was to automate the process of retrieving and processing the data from an API, and then making it available to the business teams through dashboards and Excel files, using SQL Server database, Power Query, SOAP API, XML.
  • Determined the requirements and generated a list of potential clients for the sales team to contact, by combining data from different sources and using geospatial analysis to match the data using Python, GeoPandas, Google Cloud Platform (GCP), BigQuery, OpenStreetMap Geocoding API. This resulted in a list of over 100 potential suitable clients from different sectors, outperforming the quality of the leads provided by a third party.
PythonSQLLinuxETL (Extração, transformação e carregamento)Apache AirflowMicrosoft SQL Server+5

Stone

Data Analyst

Dec 2021Mar 2023 · 1 yr 3 mos · Remote

  • Developed and evaluated strategic KPIs such as Retention, CAC, TPV, and daily active users using AWS Athena, PostgreSQL, Metabase and Looker Studio to assist the decision-making process by the product team of a client in the contactless payments industry, in areas such as marketing campaigns, client segmentation and product performance
  • Automated pipelines that fed dashboards using Python, Airflow and Looker Studio to compile data from various data sources such as an S3 data lake, PostgreSQL and generate reports for the product team, decreasing the required time to create reports by 50%, going from taking two weeks to one.
  • Analyzed credit card transactions using Python and SQL to identify trends in client behavior.
  • All reports were compliant with brazil’s data protection law, applying data governance and data management best practices.
  • Developed an extensive study in conjunction with the Product team on the monthly retention rates and trends for the customers of a client in the payments industry, using Excel for prototyping and Python for automation, to understand their behavior over time and how it differed amongst different client categories based on RFM (Recency, Frequency and Monetary Value); this allowed, among others, to forecast the retention for the next twelve months in advance.
PythonSQLApache AirflowAmazon AthenaGoogle Data StudioModelagem estatística+3

Cnpq - conselho nacional de desenvolvimento científico e tecnológico

Graduate Research Student

Mar 2020Dec 2021 · 1 yr 9 mos · Recife, Pernambuco, Brazil

  • Developed a crawler using Python’s Scrapy to download historical (from 2010 up to 2021) crime data from the crime statistics department in São Paulo state relative to two types of crime occurrences: homicides, car thefts/carjackings and their respective locations. Used Pandas to convert the files from Excel to parquet format in order to optimize the speed in the analysis.
  • Leveraged Python’s GeoPandas jointly with seaborn packages to analyze spatial crime data in the city of São Paulo through the creation of maps and plots that gave a better understanding of the behavior of criminal activities in the city as well as its temporal behavior.
  • Created dashboards to display relevant information on statistics and trends on PowerBI.
RMicrosoft ExcelAnálise de dados

Superintendência do desenvolvimento do nordeste - sudene

Intern

Mar 2019Jan 2020 · 10 mos · Recife, Pernambuco, Brazil

  • Responsible for analysis of the PAM (Pesquisa Agropecuária Municipal) data in order to generate insights that were published in a large study.
  • Support on the IDEB educational data analysis project for the cities in the Sudene region.
  • Support on more intensive tasks such as handling and treating large datasets/files with R.
  • Support on data analysis using Excel and R.
RMicrosoft ExcelAnálise de dados

Education

Universidade Federal de Pernambuco

Political Science

Apr 2014Mar 2018

Universidade Federal de Pernambuco

Statistics

Apr 2020Apr 2022

MBA USP/Esalq

MBA — Data Science e Analytics

Nov 2020Nov 2022

Stackforce found 100+ more professionals with Python & Etl (extract, Transform, Load)

Explore similar profiles based on matching skills and experience