Last Updated | 2025S1 |
Unit Name | Big Data Analytics in Electricity Grids |
Unit Code | BEE309 |
Unit Duration | 1 Semester |
Award |
Bachelor of Science (Engineering) Duration 3 years |
Year Level | Three |
Unit Coordinator BEE Course Coordinator |
Saeideh Sekhavat Dr Hossein Tafti |
Core/Sub-discipline | Sub-discipline |
Pre/Co-requisites | None |
Credit Points |
3 Total Course Credit Points 81 (27 x 3) |
Mode of Delivery | Online or on-campus. |
Unit Workload | (Total student workload including “contact hours” = 10 hours per week) Pre-recordings / Lecture – 1.5 hours Tutorial – 1.5 hours Guided labs / Group work / Assessments – 2 hours Personal Study recommended – 5 hours |
Unit Description and General Aims
The objective of this unit is to impart to students a detailed knowledge of the use of machine learning and data analytics in different applications related to the electricity grid. The unit provides a mathematical background and a description of the steps required to build and evaluate a machine learning system. Information presented in this unit also includes: an introduction to different algorithms used in machine learning, a description of typical applications, an overview of software tools commonly used in machine learning and different case studies. Students will complete a project covering the design of a machine learning system to solve a realistic problem related to the electricity grid.
Learning Outcomes
On successful completion of this Unit, students are expected to be able to:
- Explain the need/importance of machine learning and its applications to data analytics in electricity grid.
Bloom's Level 2 - Apply the basic mathematical concepts used in machine learning.
Bloom's Level 3 - Evaluate and compare different machine learning systems according to an application.
Bloom's Level 5 - Apply machine learning algorithms to solve supervised and unsupervised learning problems using a variety of tools.
Bloom's Level 3 - Design machine learning solutions to efficiently manage various electricity grid data.
Bloom's Level 6
Student Assessment
Assessment Type | When assessed | Weighting (% of total unit marks) | Learning Outcomes Assessed |
Assessment 1 Type: Weekly quizzes Description: Students will need to complete multiple-choice quiz questions to demonstrate a good understanding of the fundamental concepts. |
Weekly | 10% | All (Topics 2 to 11) |
Assessment 2 Type: Test (Invigilated) Description: Students will need to answer some short and/or long answer questions and/or solve some numerical problems. |
During Topic/Week 7 | 20% | 1, 2, 3 (Topics 1 to 6) |
Assessment 3 Type: Practical (Report) & Pre-recorded Presentation Description: Students will need to complete a practical project using software. It requires research on the state-of-art solutions in the field. |
End of Topic/Week 10 | 30% | 1, 2, 3, 4 (Topics 1 to 10) |
Assessment 4 Type: Exam (Invigilated) Description: An examination with a mix of theoretical short/detailed answer questions and some engineering problems. |
Exam Week | 40% | All (All topics) |
Overall Requirements: Students must achieve a result of 50% or above in the exam itself to pass the exam and must pass the exam to be able to pass the unit. An overall final unit score of 50% or above must be achieved to pass the unit once all assessment, including the exam, has been completed.
Prescribed and Recommended Readings
Required textbook(s)
- Rodolfo Bonnin. (2017). Machine Learning for Developers. Packt Publishing. ISBN: 978-1786469878
Available on Knovel - J. P. Han, H. Jian Tong, Data Mining Concepts and Techniques, 4th Edition. Elsevier, 2023
Reference Materials
- References from authentic websites on the Internet:
- Weka Wiki Documentation
- R Introduction Manual
- IEEE Transaction on Smart Grids journals
Unit Content
Topic 1
Introduction to Data Analytics and Data in Electricity Grids
- Basics of data analytics
- Machine Learning vs Artificial Intelligence
- Supervised, Unsupervised, Reinforcement Learning
- Types of data in electricity grids: electrical and non-electrical data
- Introduction of data analytics in electricity grids: event analytics, state analytics, customer analytics and operational analytics
Topic 2
Python in Data Analytics
- Essential libraries of Python for big data analytics: Pandas, Numpy, Matplotlib, Scikit-learn, Statsmodel, Tensorflow
- Examples of using the essential libraries to analyse electricity grids data
Topic 3
Data Flow and Feature Engineering
- Data sources in electricity grids
- Data preprocessing
- Data visualization
- Features and feature vectors
- Dimensionality reduction
- Data mining
- Application of feature engineering to big datasets in electricity grids
Topic 4
Commonly Used Supervised Learning Algorithms
- K-Nearest Neighbors
- Decision trees
- Linear regression
- Naïve Bayes
- Implementation of classification algorithms in electricity grids
- Implementation of regression algorithms in electricity grids
Topic 5
Unsupervised Machine Learning and Reinforcement Learning Algorithms
- Clustering: K-means, DBSCAN, Hierarchal clustering
- Association: Apriori and other algorithms
- Learning models of reinforcement: Markov decision process and Q learning
- Implementation of clustering algorithms using electricity grids data
- Implementation of association rules using electricity grids data
- Example of reinforcement learning in smart grids, e.g., energy trading
Topic 6
Other Algorithms (III)
- Genetic algorithms and other optimization algorithms
- Artificial Neural Networks:
- Feedforward Neural Networks
- Convolutional Neural Networks
- Recurrent Neural Networks
- Artificial neural networks in electricity demand forecasting
Topic 7
Introduction of other Tools in Data Analytics
- R
- MATLAB
- WEKA
- Cloud-based Solutions
- Examples of using R and other tools in analyzing electricity grids data
Topic 8
Event Analytics in Smart Grid s
- Fault and failure diagnosis
- Fraud detection
- Predictive outage management
- Statistical Process Control for event/anomaly detection
Topic 9
State Analytics in Smart Grids
- Equipment health and condition monitoring
- System and state identification
- Predictive control
- Stability enhancement
- Asset management
Topic 10
Customer Analytics in Smart Grids
- Demand response
- Consumer modelling and segmentation
- Customer behavior
- Peer-to-peer energy trading
Topic 11
Operational Analytics in Smart Grids
- Generation and load forecasting
- Real-time energy management
- Price forecasting
- Economic load dispatch
- Network constraint forecasting & dynamic network operating envelopes
- Risk analysis
Topic 12
Unit Review
In the final week, students will have an opportunity to review the contents covered so far. Opportunity will be provided for a review of student work and to clarify any outstanding issues. Instructors/facilitators may choose to cover a specialized topic if applicable to that cohort.
Software/Hardware Used
Software
- WEKA: It is an open-source software suite for machine learning and data mining, offering tools for data preprocessing, classification, regression, clustering, and visualization. It provides students with hands-on experience in applying machine learning algorithms and analyzing data, essential for data-driven decision-making in various fields.
- R: It is a statistical programming language designed for data analysis, statistical computing, and graphical representation. It is extensively used in industries for data mining, predictive modeling, and advanced statistical analysis. By studying R, students gain a strong foundation in statistical modeling and data visualization, which are essential for data-driven decision-making across various industry sectors.
- MATLAB: It is a high-level programming language and computing environment used for numerical computation, data visualization, and mathematical modeling. It is widely applied in industries such as engineering, finance, and data science for solving complex problems, performing simulations, and developing algorithms. For students, it helps build practical skills to analyze data efficiently, model real-world scenarios, and solve engineering problems.
- Python: It is a high-level, open-source programming language widely used for data analysis, machine learning, scientific computing, and automation. By learning Python, students gain practical skills in developing machine learning models, automating data workflows, and supporting data-driven decision-making in real-world applications across engineering, data science, artificial intelligence, and beyond.
Hardware
- N/A
Unit Changes Based on Student Feedback
- The assessment structure is updated based on the action items of the Learning and Teaching Committee (Dec 2024)