apache-spark
2024
2 post(s)
2 post(s)
2023
4 post(s)
4 post(s)
- Exploring Apache Spark's RDDs, DataFrames, and Datasets: Usage and Performance Comparison 1575 word(s) Aug 7
- Understanding Apache Spark's mapPartitions function 1483 word(s) Aug 6
- Harnessing the Power of Apache Spark's flatMap and reduceByKey Functions 1338 word(s) Aug 5
- Flatten Map Spark Python 1427 word(s) May 14
2022
17 post(s)
17 post(s)
- PySpark: Clean Map With UDF 153 word(s) Dec 21
- Spark Standalone Server Hosted in Ubuntu Linux 216 word(s) Dec 14
- How to Get Spark Scala Version 63 word(s) Jun 22
- Spark does NOT Support Azure Append Blobs and there's no workaround 67 word(s) Jun 16
- Flatten Spark Dataframe in Scala 102 word(s) Jun 11
- Install Maven .jar Dependencies from Jupyter Python Spark Notebook 343 word(s) Jun 10
- Writing to Delta Table with Spark 3 (much easier!) 31 word(s) May 5
- Access DBUtils from Scala app running under databricks-connect 63 word(s) Apr 8
- Where to get Hadoop Winutils 702 word(s) Mar 16
- Reading Files Recursively in Spark/Databricks 203 word(s) Mar 7
- Get S3 filesystem details using PySpark 125 word(s) Jan 26
- Spark: Add or Remove Struct Member/Field 168 word(s) Jan 6
- Spark: Union Incompatible Dataframes 135 word(s) Jan 5
- Spark Union in Pythonic Way 61 word(s) Jan 4
- Spark: Check Table Exists Quickly 43 word(s) Jan 3
- Spark DataFrame: Display Last Columns 49 word(s) Jan 2
- Reshuffle Schema Columns in Spark DataFrame 196 word(s) Jan 1
2021
16 post(s)
16 post(s)
- Set up Standalone Scala SBT Application with Delta Lake 220 word(s) Dec 14
- Spark - Export DataFrame Schema, and then Import it Later. 64 word(s) Nov 5
- Spark: Print DataFrame Schema Metadata 42 word(s) Nov 3
- Create Apache Spark DataFrame in memory 95 word(s) Oct 22
- Databricks - Setting up Custom External Hive Metastore in Azure using MSSQL Server 4022 word(s) Oct 21
- Set Table Property (Metadata) in Spark or Databricks 105 word(s) Sep 17
- Spark: Add or Remove a Column from a DataFrame 204 word(s) Jul 30
- Getting Info About Spark Partitions 690 word(s) Jun 28
- Connect from Spark to AWS S3 via Assume Role credential 247 word(s) Mar 10
- Creating Scala Uber JAR with Spark 3.1 Included 398 word(s) Mar 5
- Listing Spark Databases and Tables Fast 93 word(s) Mar 4
- Parsing Array of Strings in Spark 501 word(s) Feb 18
- Adding Any Constraint you want with AWS Deequ 322 word(s) Feb 10
- Using Deequ 1.1 with Spark 3 129 word(s) Feb 10
- Spark Host Configuration is Rejected in Spark 3 44 word(s) Feb 9
- Change Timestamp Year in Spark 115 word(s) Feb 3