Load the dataset “RoadAccident-2021_Surrey.csv” and conduct an exploratory data analysis to gain insights into its structure, content, and quality.
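A first pass at the structure and quality of the file could look like the sketch below. The module works in SAS, so Python's standard library is used here only as an illustration. The function name `profile_csv`, the `MISSING_TOKENS` set, and the `top_n` parameter are hypothetical. The tokens treated as missing are an assumption and should be checked against the codes defined in the data dictionary.

```python
import csv
from collections import Counter

# Assumption: these tokens mark a missing value. The real file may use other
# codes; confirm them in the data dictionary before relying on the counts.
MISSING_TOKENS = {"", "NA", "N/A", "null"}


def profile_csv(path, missing_tokens=MISSING_TOKENS, top_n=3):
    """Summarise a CSV file: row count, column names, and per-column
    missing-value count, distinct-value count, and most frequent values."""
    with open(path, newline="", encoding="utf-8-sig") as handle:
        reader = csv.DictReader(handle)
        columns = list(reader.fieldnames or [])
        missing = Counter()
        values = {name: Counter() for name in columns}
        n_rows = 0
        for row in reader:
            n_rows += 1
            for name in columns:
                # Short rows yield None for absent cells; treat as empty.
                cell = (row.get(name) or "").strip()
                if cell in missing_tokens:
                    missing[name] += 1
                else:
                    values[name][cell] += 1

    summary = {
        name: {
            "missing": missing[name],
            "distinct": len(values[name]),
            "top": values[name].most_common(top_n),
        }
        for name in columns
    }
    return {"rows": n_rows, "columns": columns, "summary": summary}
```

Calling `profile_csv("RoadAccident-2021_Surrey.csv")` returns a dictionary that can be printed column by column. Columns with many missing values or a single distinct value are natural candidates for closer inspection during cleaning.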
Assessment Information/Brief 2024-25
Data mining and text analytics
Exploring Road Traffic Accident Data and Text Analytics Insights
|
Module title |
Data mining and text analytics with application in SAS |
|
CRN |
MANM528 |
|
Level |
7 |
|
Assessment title |
Individual Assignment: Exploring Road Traffic Accident Data and Text Analytics Insights |
|
Weighting within module |
100%. This assessment is worth 100% of the overall module mark. |
|
Submission deadline date and time |
|
|
Module Leader/Assessment set by |
Module leader: |
|
How to submit |
Submit on SurreyLearn |
|
Assessment task details and instructions |
Module Overview: This module provides an in-depth introduction to the data mining process and its applications in the fields of business and management. Students will learn a range of techniques and tools for collecting, accessing, and analysing data. Special attention will be given to text mining and web analytics. Additionally, the module explores the practical use of data mining models in real-world scenarios.
Assignment Description: For this assignment, you will work with a comprehensive dataset comprising real data collected from road traffic accidents in the UK. This dataset includes detailed information about personal injury road collisions in the Surrey area during the year 2021. To assist you in your analysis, a data dictionary file, "RoadAccident-2021-Guide.xlsx", is provided, offering in-depth definitions for all fields.
Task 1 – Data Exploration and Cleaning [20 marks]
The objective of this task is to enhance your skills in data exploration, visualization, summary statistics generation, and data cleaning. You will:
Write a comprehensive report for this task, including clear explanations of the steps taken.
Task 2 – Predicting Accident Severity [30 marks]
In this task, you will apply machine learning techniques to predict accident severity using the dataset. You should:
Write a comprehensive report for this task, including clear explanations of the modelling process and results.
Task 3 – Text Analysis of Tweets [20 marks]
For this task, you will work with a dataset containing text data collected from tweets related to road traffic accidents in the Surrey area. Your tasks include:
Write a comprehensive report for this task, including clear explanations of the text analysis process and results.
Task 4 – Decision-Maker's Summary and Recommendations [20 marks]
Based on the results from the previous tasks, you will write a concise summary intended for decision-makers. This report should provide an explanation of the dataset, the insights gained, and offer recommendations or suggestions related to road traffic safety or public awareness. Ensure that the report is presented professionally, includes clear explanations, and incorporates visualizations to support your recommendations. Avoid technical jargon.
General Assessment Criteria [10 marks]
The overall layout, storytelling, professionalism, and Harvard Referencing will be assessed. Make sure your assignment adheres to appropriate formatting and citation standards. |
|
Knowledge and Understanding |
Assessed intended learning outcomes
On successful completion of this assessment, you will be able to:
|
|
Practical, Professional or Subject Specific Skills |
The assessment strategy is designed to provide students with the opportunity to demonstrate the ability to analyse a large batch of information to discern trends and patterns. |
|
Module Aims |
|
|
What to deliver / Word count (if applicable) |
You are required to submit:
|
|
Feedback arrangements |
Formative feedback is provided during the module; summative feedback will be provided for the assignment. |