Big Data

Convert your hive output to Json format

Using CONCAT and string manipulation functions: SELECT CONCAT(‘{ “column1”: “‘, column1, ‘”, “column2”: “‘, column2, ‘”, “column3”: “‘, column3, ‘” }’) AS json_data FROM your_table; Using a custom UDF (User-Defined Function): If the built-in...

“NoSQL 101: Navigating the World of Non-Relational Databases”

Different type of NOSQL Databases Type of NoSQL Database Examples Key-value Redis, Riak Document MongoDB, Couchbase Column-family Apache Cassandra, Hbase Graph Neo4j, Amazon Neptune Time-series InfluxDB, OpenTSDB Object db4o, ZopeDB Search Engine Elasticsearch, Solr...

examples of transformations in PySpark

Here are some examples of transformations in PySpark: Map map(func) – Applies the given function to each element of the dataset and returns a new dataset with the results. For example: from pyspark.sql.functions import...

Transformations and Actions in Spark with example

In Apache Spark, transformation and action are two types of operations that can be performed on RDDs (Resilient Distributed Datasets) or DataFrames/Datasets. Transformations are operations that are performed on an RDD or DataFrame/Dataset that...