  1. PySpark Overview — PySpark 4.0.1 documentation - Apache Spark

    Spark Connect is a client-server architecture within Apache Spark that enables remote connectivity to Spark clusters from any application. PySpark provides the client for the Spark …

  2. Configuration - Spark 4.0.1 Documentation

    Spark provides three locations to configure the system: Spark properties control most application parameters and can be set by using a SparkConf object, or through Java system properties. …

  3. Building Spark - Spark 4.0.1 Documentation

    Spark now comes packaged with a self-contained Maven installation to ease building and deployment of Spark from source located under the build/ directory. This script will …

  4. Installation — PySpark 4.0.1 documentation - Apache Spark

    PySpark is included in the official releases of Spark available in the Apache Spark website. For Python users, PySpark also provides pip installation from PyPI.

  5. Spark Release 3.5.0 - Apache Spark

    Refactoring of the sql module into sql and sql-api to produce a minimum set of dependencies that can be shared between the Scala Spark Connect client and Spark and avoids pulling all of the …

  6. Overview - Spark 4.0.1 Documentation

    If you’d like to build Spark from source, visit Building Spark. Spark runs on both Windows and UNIX-like systems (e.g. Linux, Mac OS), and it should run on any platform that runs a …

  7. Spark Release 2.4.0 - Apache Spark

    Apache Spark 2.4.0 is the fifth release in the 2.x line. This release adds Barrier Execution Mode for better integration with deep learning frameworks, introduces 30+ built-in and higher-order …

  8. Spark Release 3.0.0 - Apache Spark

    Spark SQL is the top active component in this release. 46% of the resolved tickets are for Spark SQL. These enhancements benefit all the higher-level libraries, including structured streaming …

  9. Feature Extraction and Transformation - RDD-based API - Apache …

    Find full example code at "examples/src/main/scala/org/apache/spark/examples/mllib/NormalizerExample.scala" in the …

  10. Spark Release 3.5.5 - Apache Spark

    Dependency changes: While being a maintenance release, we did still upgrade some dependencies in this release. They are: [SPARK-50886]: Upgrade Avro to 1.11.4. You can …