Data Analysis on Price Change and Rank Correlation with Invalid Data Handling

Apr 21, 2025, 5:41 AM

amazon-beer-ranking-data.xlsx

Missing file

Visualize the correlation between the price_change and rank

The dataset has been successfully loaded. It contains columns such as scraped_date, rank, product_name, price, rating, url, asin, previous_rank, rank_change, new_entry, previous_price, price_change, price_change_rate, and unnamed:_13.
To visualize the correlation between price_change and rank, we first need to ensure that the price_change column is numeric. Let's convert it to a numeric type and then plot the correlation.
It seems that the price_change column contains non-numeric values such as 'NEW', which have been converted to NaN. This means we don't have valid numeric data in the price_change column to analyze the correlation with rank.
To proceed, we would need valid numeric data in the price_change column. If you have any other data or questions, feel free to ask!

I clean the data where NEW is replaced with "0". Please try again.

It seems there was an error related to exporting the image using the kaleido package, even though it was installed. This might be due to an environment issue where the package is not being recognized correctly.
To resolve this, you can try the following steps:
  1. Ensure that the kaleido package is correctly installed in the environment where the code is being executed.
  2. Restart the Jupyter Notebook kernel and try running the code again.
  3. If the issue persists, consider using an alternative method for exporting images or displaying them directly in the notebook without exporting.
Let's proceed by displaying the plot directly without attempting to export it.
Get started with Vizly