Apache Spark Assembly
search
Ctrlk
  • Apache Spark Assembly
  • Ways to Read This Book
  • Core - Operation related
    • SparkContext
    • SparkConf
    • SparkEnv
    • Heartbeatchevron-right
    • Scheduler
  • CORE - Execution Related
    • RDDchevron-right
      • RDD Design
      • Principles of Overriding RDD
      • Default RDDschevron-right
      • Transformations and Their Designchevron-right
        • map / flatMap
        • filter
        • repartition / coalesce
        • sample / randomSplit / takeSample
        • union / ++ / intersection
        • sortBy
        • glom
        • cartesian
        • groupBy
        • pipe
        • mapPartitions / mapPartitionsWithIndex
        • zip / zipPartitions
        • Extra
      • Actions and Their Designchevron-right
      • Cache & Persist
      • RDD Operation Scope
      • RDD Checkpointing
    • Shuffle
    • Serializer
    • Partitioner
    • Broadcast
    • Aggregator
    • Memory
    • Storage
  • Running Spark App
    • Starting point
    • Mastering SparkConf
    • Web UI
  • Untitled
  • Programming Spark
    • Debugging
gitbookPowered by GitBook
block-quoteOn this pagechevron-down
  1. CORE - Execution Relatedchevron-right
  2. RDD

Transformations and Their Design

map / flatMapchevron-rightfilterchevron-rightrepartition / coalescechevron-rightsample / randomSplit / takeSamplechevron-rightunion / ++ / intersectionchevron-rightsortBychevron-rightglomchevron-rightcartesianchevron-rightgroupBychevron-rightpipechevron-rightmapPartitions / mapPartitionsWithIndexchevron-rightzip / zipPartitionschevron-rightExtrachevron-right
PreviousShuffledRDDchevron-leftNextmap / flatMapchevron-right

Last updated 4 years ago