Author Archives: Brahmini Ratnam

PostgreSQL OID Datatype: Impacts and Remedies

Introduction: PostgreSQL, The World’s Most Advanced Open Source Relational Database, introduced a significant change with version 12: the removal of table-level Object Identifier (OID) columns created with WITH OIDS (the oid datatype itself remains available). This move has left many developers and database administrators pondering over its implications and …

Postgres Materialized Views: A Practical 101 – Part 2

Introduction: Welcome back to the second installment of our deep dive into materialized views! In our previous post, we looked at the basics of materialized views, showcasing how they can help query performance and simplify complex data aggregations with …

Postgres Materialized Views: A Practical 101 – Part 1

Introduction: In the realm of database management, efficiency and speed are king. PostgreSQL, The World’s Most Advanced Open Source Relational Database, offers a feature that is nothing short of magical for data analysts and developers alike: materialized views. Unlike standard …

Performance tuning with Apache Spark – Introduction

Introduction: Welcome back to our ongoing series on data transformation with Apache Spark! In our previous posts, we’ve covered essential topics like setting up Apache Spark on Ubuntu, integrating data with Spark, and querying datasets using Apache Drill. Now, we’re …

Open-source Data Engineering with PostgreSQL

Blog-4: Apache Drill Magic across PostgreSQL, Local Parquet, and S3

Introduction: Welcome back! Following our exploration of data movement between PostgreSQL and Amazon S3 in the previous blog, we now venture into the realm of querying with Apache Drill. In …

Open-source Data Engineering with PostgreSQL

Blog-3: Data Loading with Apache Spark

Introduction: Welcome to the next installment of our series on Open-source Data Engineering with PostgreSQL. In this blog, we’ll delve into the practicalities of transforming table data from PostgreSQL into the Parquet format and …

Open-source Data Engineering with PostgreSQL

Blog-2: Installation and Setup on Ubuntu

Introduction: Welcome back to the series on Open-source Data Engineering with PostgreSQL. In this post, we shall delve into the installation and configuration of Apache Spark and Apache Drill on an Ubuntu environment. Our …

Open-source Data Engineering with PostgreSQL

Overview – A Curtain Raiser

Introduction: In the ever-evolving landscape of data management, organizations are constantly seeking efficient ways to handle, transform, and query massive datasets. Data archiving has become an important component of data engineering in the ever-evolving landscape …

Building an Efficient Data Pipeline with PostgreSQL and Talend Open Studio

Introduction: In the rapidly evolving landscape of data management, creating a robust data pipeline is essential for organizations to derive meaningful insights and drive informed decision-making. In this blog, we’ll explore the integration of PostgreSQL, a powerful open-source relational database, …

Bulk loading with COPY in PostgreSQL 16

Introduction: At OpenSource DB, we work with customers on various data engineering projects, and among the many critical aspects of data warehousing, the efficient and rapid movement of data is paramount. This is where PostgreSQL’s COPY feature shines. COPY is …