Skip to content

Instantly share code, notes, and snippets.

@GKarmakar
Created December 19, 2018 03:55
Show Gist options
  • Select an option

  • Save GKarmakar/4c21e03985bed01a515655f83c38e001 to your computer and use it in GitHub Desktop.

Select an option

Save GKarmakar/4c21e03985bed01a515655f83c38e001 to your computer and use it in GitHub Desktop.
from pyspark.sql.functions import lit
from pyspark.ml.evaluation import MulticlassClassificationEvaluator
predictionsTrainingDF = preprocessedStageTraining3.withColumn("baseline_predicton", lit(0.0))
predictionsTestDF = preprocessedStageTest3.withColumn("baseline_predicton", lit(0.0))
def display_train_and_test_f1_score(name,
predictionsTrainingDF,
predictionsTestDF,
predictionsColumn="prediction",
display_train=True):
evaluator = MulticlassClassificationEvaluator(
labelCol="Churn",
predictionCol = predictionsColumn,
metricName='f1'
)
if display_train:
print("{} Training F-score: {}".format(name, evaluator.evaluate(predictionsTrainingDF)))
print("{} Test F-score: {}".format(name, evaluator.evaluate(predictionsTestDF)))
display_train_and_test_f1_score("Baseline", predictionsTrainingDF, predictionsTestDF, "baseline_predicton")
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment