Tools Installation¶
This guide will walk you through the installation of all the tools needed for the course, step by step.
System Requirements¶
Minimum Requirements
- RAM: 8GB
- Disk space: 20GB
- Processor: i5 or equivalent
- Operating System: Windows 10+, macOS 10.14+, or Linux (Ubuntu 20.04+)
Recommended Requirements
- RAM: 16GB
- Disk space: 50GB SSD
- Processor: i7 or equivalent
- Operating System: Latest available operating system
Step 1: Install Git¶
Git is the version control system we will use to manage our code.
Option A: With winget (Recommended)¶
Option B: Graphical installer¶
- Download from git-scm.com
- Run the installer
- Use the default settings (Next, Next, Next...)
Verify installation¶
Step 2: Configure Git¶
Once Git is installed, we need to configure it with your information:
# Set your name (use your real name)
git config --global user.name "Your Full Name"
# Set your email (use the same email as GitHub)
git config --global user.email "your@email.com"
# Verify configuration
git config --list
Tip
The name and email you set will appear on all your commits, so use your real name and the email you will use on GitHub.
Step 3: Create a GitHub Account¶
GitHub is the platform where we will host our code.
- Go to github.com
- Click on "Sign Up"
- Fill in the form:
- Username: Choose a professional name (e.g.:
juan-garcia, notkitty123) - Email: Use the same one you configured in Git
- Password: Use a strong password
- Username: Choose a professional name (e.g.:
- Verify your email
- Complete your profile (photo, bio optional)
Important
Use the same email you configured in Git. This links your commits with your GitHub account.
Step 4: Install Python¶
We need Python 3.11 or higher.
Note for macOS/Linux
On macOS and Linux, use python3 and pip3 instead of python and pip.
Step 5: Install PyCharm (Optional but Recommended)¶
PyCharm is the IDE we recommend for the course.
PyCharm Community Edition (Free)¶
- Download from jetbrains.com/pycharm
- Choose "Community Edition" (free)
- Run the installer
- Follow the installer steps
Or download from jetbrains.com/pycharm
Alternatives to PyCharm¶
If you prefer another editor:
- Visual Studio Code: Lightweight and extensible (code.visualstudio.com)
- Jupyter Lab: For working with notebooks (jupyter.org)
- Sublime Text: Advanced text editor (sublimetext.com)
Step 6: Clone the Repository¶
Now that you have everything installed, clone your fork of the repository:
Important
First you must Fork the repository on GitHub. Go to the Fork and Clone guide for more details.
# Navigate to the folder where you want to save the project
cd Documents # or whichever folder you prefer
# Clone YOUR fork (replace YOUR_USERNAME)
git clone https://github.com/YOUR_USERNAME/ejercicios-bigdata.git
# Enter the folder
cd ejercicios-bigdata
# Connect to the original repository (upstream)
git remote add upstream https://github.com/TodoEconometria/ejercicios-bigdata.git
# Verify that everything is set up correctly
git remote -v
You should see something like this:
origin https://github.com/YOUR_USERNAME/ejercicios-bigdata.git (fetch)
origin https://github.com/YOUR_USERNAME/ejercicios-bigdata.git (push)
upstream https://github.com/TodoEconometria/ejercicios-bigdata.git (fetch)
upstream https://github.com/TodoEconometria/ejercicios-bigdata.git (push)
Step 7: Create a Virtual Environment¶
It is a good practice to use virtual environments for each project:
# Make sure you are in the project folder
cd ejercicios-bigdata
# Create virtual environment
python -m venv .venv
# Activate virtual environment
# Windows:
.venv\Scripts\activate
# macOS/Linux:
source .venv/bin/activate
# You should see (.venv) at the beginning of your terminal
Tip
Always activate the virtual environment before working on the project.
Step 8: Install Dependencies¶
With the virtual environment activated, install the project dependencies:
# Upgrade pip
pip install --upgrade pip
# Install project dependencies
pip install -r requirements.txt
# Verify that everything was installed correctly
python -c "import pandas, dask, sqlite3; print('All OK!')"
If you see "All OK!", you are ready to get started.
Final Verification¶
Run these commands to verify that everything is installed correctly:
# Git
git --version
# Python
python --version
# Pip
pip --version
# Verify libraries
python -c "import pandas; print(f'Pandas {pandas.__version__}')"
python -c "import dask; print(f'Dask {dask.__version__}')"
Installation Complete
If all the commands above worked, you are ready to start with Your First Exercise!
Common Issues¶
Error: 'python' is not recognized as a command
Windows: Python is not in the PATH.
Solution:
- Reinstall Python
- Check the "Add Python to PATH" option
- Restart the terminal
macOS/Linux: Use python3 instead of python
Error: Permission denied when installing with pip
Cause: Attempting to install packages globally without permissions.
Solution: Use a virtual environment:
Git says 'fatal: not a git repository'
Cause: You are not in the project folder.
Solution:
PyCharm does not detect the Python interpreter
Solution:
- Open PyCharm
- File → Settings (Windows/Linux) or PyCharm → Preferences (macOS)
- Project → Python Interpreter
- Click on the gear icon → Add
- Select "Existing environment"
- Browse to
.venv/Scripts/python.exe(Windows) or.venv/bin/python(macOS/Linux)
---
Next Steps¶
Now that you have everything installed, continue with:
- Your First Exercise - Learn the basic workflow
- Fork and Clone - Understand how to work with Git and GitHub
- Course Roadmap - See all available exercises