- [ ] All code compiles/run on Spark 2.x (no 3.x‑only APIs). - [ ] Comments are present for every non‑obvious line. - [ ] You’ve referenced at least **one** Spark concept (lazy eval, shuffle, broadcast, etc.). - [ ] Edge cases are discussed. - [ ] The answer is written **in your own words** (no copy‑pasting from the internet).
# 3️⃣ Keep only unique words distinct_words = words.distinct() spark 2 workbook answers
Add a short paragraph for each stage, explaining why you chose that API. - [ ] All code compiles/run on Spark 2
– bulk HTTP calls: