I am working on a PySpark script that performs a simple word count. The script runs fine until I try to save the results with saveAsTextFile, at which point it fails (I am on Ubuntu). Here is the error I get:
```
py4j.protocol.Py4JJavaError: An error occurred while calling o48.saveAsTextFile.
org.apache.hadoop.mapred.FileAlreadyExistsException: Output directory file:/home/pyspark_python/wordcount/output_new already exists
```

Here are the steps I have taken so far:
- Verified that the output directory does not contain any data (`ls` shows it is empty).
- Deleted and recreated the directory using `rm -r` and `mkdir -p`.
- Ensured no other Spark jobs are running (`ps aux | grep spark`).
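Consolidated, the cleanup I ran looks roughly like this (sketched with a stand-in path under `/tmp` so it is reproducible; my real directory is the `output_new` path shown in the error):

```shell
# Stand-in for /home/othniel/pyspark_python/wordcount/output_new
OUT="${TMPDIR:-/tmp}/wordcount_output_new"

rm -rf "$OUT"                  # delete the old output directory
mkdir -p "$OUT"                # recreate it, empty
ls -A "$OUT"                   # prints nothing: the directory is empty
ps aux | grep spark || true    # check for running Spark jobs (none in my case)
```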
Despite this, the error persists when I re-run the script.
Here is the code I am using:
```python
from pyspark import SparkConf, SparkContext
import os

def main(input_file, output_dir):
    # Spark configuration
    conf = SparkConf().setAppName("WordCountTask").setMaster("local[*]")
    sc = SparkContext(conf=conf)

    # Read the input file
    text_file = sc.textFile(input_file)

    # Count the words
    counts = (
        text_file.flatMap(lambda line: line.split(" "))
        .map(lambda word: (word, 1))
        .reduceByKey(lambda a, b: a + b)
    )

    # Save the results
    if not os.path.exists(output_dir):
        os.makedirs(output_dir)
    counts.saveAsTextFile(output_dir)
    print(f"Results saved to directory: {output_dir}")

if __name__ == "__main__":
    # Define the input and output paths
    input_file = r"/home/othniel/pyspark_python/wordcount/input/loremipsum.txt"
    output_dir = "/home/othniel/pyspark_python/wordcount/output_new"

    # Run the WordCount task
    main(input_file, output_dir)
```

How can I resolve this error and ensure PySpark successfully writes to the output directory? Is there something specific I need to configure in my script or environment?
Thank you for your help!