Building a Real-Time GPS Data Processing System on AWS: A Step-by-Step Guide

In today's interconnected world, real-time location tracking plays a crucial role in many industries. These include logistics and fleet management. It is also vital to…

Set Up a Hadoop Cluster on AWS EMR: A Step-by-Step Guide

Hadoop is a powerful framework that enables distributed processing of large datasets. It follows the MapReduce paradigm. Computation is broken down into independent map and…

How to run an external program from a Python script

This is a demonstration of how a Python script can automate running an external program that takes a data file as input. The data files can be spread over different folders on the machine. The script goes to each data file location, extracts the name of each file in the folder, and then passes each data file to the external binary.
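
The workflow described above could be sketched roughly as follows. This is a minimal illustration, not the original script: the function name `run_on_data_files`, the `*.dat` pattern, and the idea of passing the file path as the last command-line argument are all assumptions made for the example. It uses only the standard library (`subprocess.run` and `pathlib.Path.glob`).

```python
import subprocess
from pathlib import Path


def run_on_data_files(command, folders, pattern="*.dat"):
    """Run `command` once per matching data file.

    `command` is a list such as ["./my_program"] (hypothetical binary);
    the data file path is appended as the final argument.
    Returns a dict mapping each file path to the program's exit code.
    """
    results = {}
    for folder in folders:
        # sorted() gives a deterministic processing order within each folder
        for data_file in sorted(Path(folder).glob(pattern)):
            if not data_file.is_file():
                continue  # skip directories that happen to match the pattern
            # Passing a list (no shell) keeps file names with spaces safe.
            completed = subprocess.run(
                [*command, str(data_file)],
                capture_output=True,
                text=True,
            )
            results[str(data_file)] = completed.returncode
    return results
```

A call such as `run_on_data_files(["./my_program"], ["data/run1", "data/run2"])` (placeholder paths) would run the binary on every `.dat` file in both folders. If the external program expects to be started from inside the data folder, passing `cwd=folder` to `subprocess.run` is one way to arrange that.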