Sloppy XML Parser is a Python library designed to handle and parse malformed XML files. XML is a widely used format for data exchange, but sometimes, the XML you receive is not well-formed. This ...
Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame ...