Graph Databases: Cosmos DB Graph API – Key Concepts and Best Practices

The purpose of this post is to recap the most important points from recent Big Data in 30 hours Lecture 5. What is a graph? Vertices – Vertices denote discrete objects, such as a person, a place, or an event. Edges – Edges denote relationships between vertices. For example, a person might know another person, be involved in an event, and recently been at a … Continue reading Graph Databases: Cosmos DB Graph API – Key Concepts and Best Practices

Lecture notes: first steps in Hadoop

In Lecture 6 of our Big Data in 30 hours class, we talk about Hadoop. The purpose of this memo is to summarize the terms and ideas presented. About Hadoop Hadoop by Apache Software Foundation is a software used to run other software in parallel. It is a distributed batch processing system that comes together with a distributed filesystem. It scales well over commodity hardware and … Continue reading Lecture notes: first steps in Hadoop