Sunday, March 3, 2024

Matrices and Vectors: The Mathematical Power Behind Machine Learning

With the advent of the era of big data, machine learning has become the core technology in many fields. Behind this, the two mathematical tools of matrix and vector play a crucial role. This article aims to explore the widespread use of matrices and vectors in machine learning and the mathematical principles behind them, so as to reveal how these two provide powerful support for machine learning.

I. Basic Concepts of Matrices and Vectors

matrix: A matrix is a two-dimensional array, usually consisting of the intersection of rows and columns. In machine learning, matrices are often used to represent datasets, weights, biases, etc.

vector: A vector is a one-dimensional array that can be viewed as a matrix with only one column. Vectors are often used to represent features, weights, etc., of data.


II. Application of Matrices and Vectors in Machine Learning

Data representation and processing: In machine learning, data is usually stored and processed in the form of matrices. Each row represents a sample and each column represents a feature. This representation facilitates various mathematical operations and model training.

Linear algebra operation: operations between matrices and vectors, such as matrix multiplication,Matrix transposeMatrix inversion, etc., provide a powerful mathematical tool for machine learning. For example, in theLogistic regressionThe predicted value can be calculated efficiently by matrix multiplication; In support vector machine, matrix inversion plays a key role.

Optimization algorithmMany machine learning algorithms involve optimization problems, such as gradient descent, least square method, etc. The core of these algorithms is based on matrix and vector operations, so as to find the optimal model parameters.

Feature Dimensionality Reduction and Selection: by matrix decomposition,Principle component analysisAnd other methods can reduce the dimension of high-dimensional data and extract the most important features. This not only reduces the amount of computation, but also improves the generalization ability of the model.

Neural Network: In deep learning, the core of a neural network is a weight matrix and a bias vector. By constantly adjusting these matrices and vectors, neural networks can learn complex patterns in the data, enabling highly accurate prediction and classification.


III. The Mathematical Principles Behind Matrices and Vectors

Fundamentals of linear algebra: The operation of matrices and vectors is based on the fundamentals of linear algebra, such as the system of linear equations, vector space, linear transformation, etc. These principles provide a solid mathematical foundation for machine learning.

Probability theory and statistics: Many machine learning algorithms involve probability theory and statistics, such as Gaussian distribution,Covariance matrixEtc. These concepts are closely related to matrices and vectors and together constitute the theoretical system of machine learning.

Matrices and vectors, as mathematical tools, play a pivotal role in machine learning. They not only facilitate the representation and processing of data, but also provide powerful support for the implementation of various machine learning algorithms. With the continuous development of machine learning technology, the application of matrix and vector will be more extensive and in-depth. In the future, we can expect more mathematical tools to be combined with machine learning to drive greater breakthroughs and development in this field.

Thursday, February 29, 2024

Saturday, December 9, 2023

Variable Numbering Techniques for Traffic Engineering Research

 The variable numbering techniques include binary, categorical, and ordinal variables. Which is different sometimes. Here is an example of variable numbering example.








Reference

1. Investigating the impact of influential factors on crash types for autonomous vehicles at intersections

2. Kim, Eui-Jin, Min-Ji Kang, and Shin Hyoung Park. "Deep survival analysis model for incident clearance time prediction." Journal of Intelligent Transportation Systems (2024): 1-14.

Wednesday, December 6, 2023

Programming and Data Science Fundamental Free Courses in Online by Stanford University

 ðŸ“£ðŸ“£ðŸ“£ Stanford University is offering online courses (no application or tuition fees required).


Here are 10 courses you don't want to miss. Note, some of the courses are FREE but others are also PAID courses to get certification.

1. Computer Science 101
A self-paced course that teaches the essential ideas of Computer Science for a zero-prior-experience audience.

🔴 Enroll here: https://lnkd.in/dBbz5Keh

2. Machine Learning Specialization
A beginner-friendly program that teaches you the fundamentals of Machine Learning.

🟤 Enroll here: https://lnkd.in/dFiQ5xrH

3. Algorithms: Design and Analysis
Learn several fundamental principles of algorithm design including the divide-and-conquer design paradigm.

⚪️ Enroll here: https://lnkd.in/eKsafebr

4. Databases: Advanced Topics in SQL
This course is broad and practical, covering indexes, transactions, and constraints, all in the context of relational database systems and the SQL language.

🟣 Enroll here: https://lnkd.in/gs2kHyB3

5. R Programming Fundamentals
Explore the basics of R: a free programming language and software environment used for statistical computing and graphics.

🔵 Enroll here: https://lnkd.in/dPzsu-rm

6. Statistical Learning
This is an introductory-level course in supervised learning, with a focus on regression and classification methods.

🟠 Enroll here: https://lnkd.in/ehv4iMeP

7. Advanced Cybersecurity Program
An introduction to fundamental concepts in information security and practical applications in cybersecurity and executive strategy.

🟢 Enroll here: https://lnkd.in/gqpDYEBu

8. Introduction to Internet of Things
This introductory course provides a taste of what to expect from courses that are part of the IoT Graduate Certificate program.

🟡 Enroll here: https://lnkd.in/dkaiEUNU

9. Game Theory
The course will provide the basics: representing games and strategies, the extensive form, Bayesian games, repeated and stochastic games, and more.

🔷️ Enroll here: https://lnkd.in/df4bAyJu

10. Designing Your Career
A design thinking approach to anyone develop a constructive and effective approach to designing their vocation.

♦️Enroll here: https://lnkd.in/dYQNuGbQ

Sunday, October 29, 2023

A Tale of Simulation Software for Traffic Engineering




PTV Vissim:
PTV Vissim is a powerful commercial microscopic traffic simulation software. It is crucial for traffic engineers and urban planners to model and analyze complex traffic scenarios. Vissim allows users to simulate detailed interactions between vehicles and pedestrians at intersections and networks. Its importance lies in its ability to optimize traffic flow, signal timings, and design of transportation systems, aiding in traffic management and infrastructure planning.
Programming Language: Vissim uses a proprietary scripting language called COM-interface and can be extended with the Python programming language.
Perspective: Vissim is used for microscopic traffic simulation. It's valuable for analyzing detailed vehicle and pedestrian interactions at the intersection and network levels.

SUMO (Simulation of Urban Mobility):
SUMO is an open-source microscopic traffic simulator used for modeling and analyzing large-scale traffic networks. It is essential for researchers, city planners, and traffic management agencies. SUMO helps in assessing traffic control strategies, testing autonomous vehicle algorithms, and understanding the impact of transportation policies, making it a valuable tool for urban mobility studies.
Programming Language: SUMO is primarily developed using C++ and offers Python and TraCI (Traffic Control Interface) for scripting and interfacing.
Perspective: SUMO is an open-source microscopic traffic simulator designed for simulating large-scale traffic scenarios. It's suitable for research, traffic management, and mobility studies.

MATSim (Multi-Agent Transport Simulation):
SUMO is an open-source microscopic traffic simulator used for modeling and analyzing large-scale traffic networks. It is essential for researchers, city planners, and traffic management agencies. SUMO helps in assessing traffic control strategies, testing autonomous vehicle algorithms, and understanding the impact of transportation policies, making it a valuable tool for urban mobility studies.
Programming Language: MATSim is written in Java and can be extended using Java programming.
Perspective: MATSim is used for simulating multi-agent traffic and transportation systems. It's useful for modeling individual travelers' decision-making in a transportation network.

Matlab/Simulink:
Matlab and Simulink provide a versatile environment for various domains, including traffic control and transportation. Their importance lies in their broad applicability. They are used for modeling and simulating control systems, developing traffic signal timing, and evaluating various dynamic systems within the context of traffic simulation.
Programming Language: Matlab/Simulink uses a proprietary scripting language that is a combination of MATLAB and Simulink's graphical modeling interface.
Perspective: Matlab/Simulink can be used for a wide range of purposes, including traffic control, control systems design, and modeling various dynamic systems.

PreScan:
Matlab and Simulink provide a versatile environment for various domains, including traffic control and transportation. Their importance lies in their broad applicability. They are used for modeling and simulating control systems, developing traffic signal timing, and evaluating various dynamic systems within the context of traffic simulation.
Programming Language: PreScan is a simulation platform used for the development and testing of Advanced Driver Assistance Systems (ADAS) and Autonomous Vehicles. It often uses a combination of C/C++, Python, and Simulink for developing and integrating models.
Perspective: PreScan is primarily focused on the automotive industry and is used for simulating and testing autonomous driving systems.

Aimsun:
Aimsun is a commercial software for microscopic and mesoscopic traffic modeling and simulation. It is crucial for traffic engineers and urban planners to design and analyze transportation systems, optimize traffic flow, and assess the impact of urban developments and policies on traffic and mobility.
Programming Language: Aimsun provides a Python API for extending its capabilities.
Perspective: Aimsun is used for microscopic and mesoscopic traffic modeling and simulation. It's suitable for traffic engineering, urban planning, and mobility management.

Carla:
Carla is a popular open-source simulator used for autonomous driving research. Its importance is in its ability to create diverse and realistic traffic scenarios, allowing researchers to test and develop autonomous vehicle algorithms and systems in a safe and controlled virtual environment.
Programming Language: Carla uses C++ and Python for its scripting and API extensions.
Perspective: Carla is a popular open-source simulator for autonomous driving research. It's used to simulate various traffic scenarios and test autonomous vehicle algorithms.

Limitations of these softwares:
PTV Vissim: Proprietary and can be expensive. Limited open-source extensibility.
SUMO: Open-source but may have a steeper learning curve for beginners.
MATSim: Requires good knowledge of Java programming for customization.
Matlab/Simulink: Commercial software, and it may not be as specialized for traffic simulation compared to dedicated traffic simulation tools.
PreScan: Focused on the automotive industry, may not be as suitable for broader traffic modeling.
Aimsun: Commercial software and may be costly. Limited open-source extensibility.
Carla: Mainly for autonomous driving research, not as suitable for traditional traffic engineering.

Reference:
1. Bakhtiari, Arsham, Ashraf Uz Zaman Patwary, and Francesco Ciari. "Electric Vehicle Charging Pricing Design for Agent-Based Traffic Microsimulation." Procedia Computer Science 220 (2023): 755-762.
2. Ma, Wanjing, et al. "Active traffic management strategies for expressways based on crash risk prediction of moving vehicle groups." Accident Analysis & Prevention 163 (2021): 106421.






Monday, October 16, 2023

The Key to an Impactful Research Paper's Abstract

Abstracts play a pivotal role in academic research, acting as the gateway to your study and offering readers a glimpse into the heart of your work. Although abstracts may vary in length and complexity, they generally consist of four essential components, each serving a distinct purpose in summarizing your research. In this blog, we will delve into the four key parts of an abstract and discuss how to craft each section effectively.



Part 1: Rationale, Aims, and Objectives

The first part of your abstract sets the stage for your research. It introduces readers to the background or rationale behind your study, highlighting the problem or question you aim to address. Your objective is to establish context and show why your research is significant. Consider including the following elements:

The Research Problem: Clearly state the issue or question you are investigating. What motivated your research in the first place? Define the problem concisely.

Relevance: Explain why your research matters. What gap in knowledge does it fill? How does it contribute to the field?

Aims and Objectives: Present the goals of your study. What do you intend to achieve? What are the specific objectives you've set out to accomplish?


Part 2: Methodology

This section of the abstract outlines your research methods and design. It gives readers a sense of how you conducted your study, allowing them to evaluate the reliability of your findings. Key elements to include are:

Research Approach: Specify your research methodology. Is your study quantitative, qualitative, or mixed methods?

Data Collection: Describe the data sources and collection methods you employed. This might involve surveys, interviews, experiments, or analysis of existing data.

Data Analysis: Briefly explain how you analyzed the data. Did you use statistical tests, content analysis, thematic coding, or other techniques?


Part 3: Results

Here, you summarize the key findings of your research. This section provides readers with a glimpse of what you've uncovered, which can pique their interest and encourage them to read the full paper. Make sure to include:

Main Findings: Highlight the most important results of your study. What did you discover, and how does it relate to your research objectives?

Significance: Explain the implications of your findings. What do they mean for the broader field or for practical applications?

Data and Statistics: If applicable, include specific data points or statistical results. This can help to quantify your findings.


Part 4: Conclusion

In the final part of your abstract, you wrap up your summary by emphasizing the significance of your research and suggesting potential next steps. This section is crucial in leaving a lasting impression on your readers. Consider including:

Summary of the Study: Reiterate the primary findings and their importance.

Broader Implications: Discuss the broader implications of your research. How does it contribute to the field, and what future research questions might it raise?

Recommendations: If appropriate, provide recommendations or actions that could result from your research.


A well-crafted abstract can make or break your research paper. By following this structure and offering a clear, concise summary of your work, you can capture the attention of your target audience and entice them to delve deeper into your research. So, the next time you sit down to write an abstract, keep these four essential components in mind to ensure that your research gets the recognition it deserves.


Example:



Reference:

1. Linden, A. (2018). A matching framework to improve causal inference in interrupted time‐series analysis. Journal of Evaluation in Clinical Practice24(2), 408-415.

Sunday, October 8, 2023

Graphical Abstract vs Traditional Abstract

In the realm of scientific literature, the means of conveying research findings have been continuously evolving. Two prominent methods of summarizing scientific research are Graphical Abstracts and Traditional Abstracts. This article explores the characteristics, advantages, and limitations of both approaches to help researchers and readers make informed choices in presenting and accessing scientific information.





Traditional Abstracts:

Traditional abstracts are the long-standing convention for summarizing research papers. They typically consist of a concise text-only paragraph, ranging from 150 to 250 words. The primary purpose of a traditional abstract is to provide a brief overview of the study's key components, such as the research question, methodology, results, and conclusions.


Advantages of Traditional Abstracts:

Concise Information: Traditional abstracts condense essential information into a compact format, allowing readers to quickly grasp the study's main points.

Compatibility: Traditional abstracts are universally accepted across various academic disciplines and publication platforms.

Search Engine Optimization: Traditional abstracts are search engine-friendly, making it easier for researchers to discover relevant studies online.


Limitations of Traditional Abstracts:

Limited Visual Elements: Traditional abstracts rely solely on text, which may not effectively convey complex data or visual information.

Potential for Information Overload: Striking a balance between brevity and completeness can be challenging, leading to either overly vague or excessively detailed abstracts.



Graphical Abstracts:

Graphical abstracts are a more recent innovation, gaining popularity in scientific journals. These visual representations condense the essence of a research paper into a single, visually appealing image. They incorporate icons, images, graphs, and concise text to convey the study's main findings and significance.


Advantages of Graphical Abstracts:

Visual Clarity: Graphical abstracts use visuals to present complex information intuitively, enhancing reader comprehension.

Eye-catching: The visual appeal of graphical abstracts can attract readers and make research more accessible to a broader audience.

Memorable: Visual elements tend to be more memorable than text alone, making the research more likely to stick in the reader's mind.


Limitations of Graphical Abstracts:

Discipline-specific: The effectiveness of graphical abstracts may vary depending on the field of study, as some topics are challenging to represent visually.

Limited Information: Due to space constraints, graphical abstracts may not provide as much detail as traditional abstracts.


In conclusion, the choice between Graphical Abstracts and Traditional Abstracts depends on the nature of the research and the intended audience. Traditional abstracts remain a robust and widely accepted means of summarizing research, while graphical abstracts offer a visually engaging alternative. Researchers should carefully consider which format best suits their work to effectively communicate their findings in the ever-evolving landscape of science communication.


Reference:

1. Spatiotemporal grid-based crash prediction—application of a transparent deep hybrid modeling framework

2. Exploring the travel behavior changes caused by the COVID-19 crisis: A case study for a developing country

3. Reaction characteristics of homogeneous and heterogenous reactions for  glucose gasification in supercritical water using ruthenium catalyst  supported on carbon nanotube 






Traffic Signal Optimization Techniques

Traffic signal optimization is the process of determining the most efficient timing and sequencing of traffic signals at intersections to improve overall traffic flow and reduce congestion. This involves analyzing traffic patterns, volumes, and other relevant data to develop optimal signal timings that prioritize the movement of vehicles, pedestrians, and cyclists.


Some techniques used in traffic signal optimization include:

1. Traffic Signal Coordination: This involves synchronizing the timings of adjacent traffic signals along a corridor to create a "green wave" that allows vehicles to travel without stopping at each intersection. This can significantly reduce delays and reduce travel times.


2. Adaptive Traffic Control Systems: These systems use real-time data from sensors and cameras to adjust signal timings dynamically based on current traffic conditions. They can automatically adapt to changes in demand throughout the day, improving efficiency and reducing congestion.


3. Queue Length Monitoring: By monitoring queue lengths at intersections, signal timings can be adjusted to minimize backups and prevent gridlock. This may involve extending green times for heavily congested approaches or shortening them for less busy ones.


4. Pedestrian and Cyclist Considerations: Optimizing traffic signals should also account for pedestrian and cyclist movements by providing appropriate crossing times or dedicated phases when needed. This helps enhance safety and encourages active transportation modes.




Saturday, October 7, 2023

A Tale of Traffic Conflict



Rear-end Conflict: A rear-end conflict refers to a situation where there is a risk of one vehicle colliding with the rear of another vehicle. This typically occurs when the following vehicle is moving at a higher speed than the leading vehicle1. Rear-end conflicts are often observed at signalized intersections and can be influenced by various factors such as signal timing, distance from stop lines, and driver behavior.

For example, at signalized intersections, the majority of conflicts happen in the first half of red and green time, with more severe conflicts occurring at the beginning of red time. Factors such as lateral movement of different vehicle types, dilemma behavior (uncertainty about whether to stop or proceed during a yellow light), and the stop-and-go movement of vehicles due to signal changes can contribute to rear-end conflicts.

In freeway work zones, traditional traffic conflict measures like Time to Collision (TTC) are used to identify rear-end conflicts. However, TTC may not be able to identify potential conflicts in the process of car-following due to the special road section environment of the freeway work zone. To address these limitations, an improved measure called Time to Collision in the Work Zone (WTTC) is proposed based on TTC1.

For instance, misleading signage or short sight distance could lead to a large number of rear-end crashes. But it could also be related to driver behavior. Drivers may be distracted by a mixture of signs, billboards, etc., outside the vehicle or multitasking inside the vehicle (e.g., talking on cell phones, eating and drinking, etc.).


Sideswipe Conflict: Sideswipe conflict refers to a situation where the sides of two vehicles come into contact with each other while traveling in the same or opposite directions. This typically occurs when vehicles are changing lanes or merging into traffic.

For example, consider a scenario on a highway where two lanes merge into one. If a vehicle in the left lane tries to merge into the right lane at the same time as a vehicle in the right lane tries to merge into the left lane, and neither vehicle yields to the other, this could result in a sideswipe conflict.

Moreover, surrogate safety measures (SSM) as indicators of accidents are useful tools in safety evaluations. A new framework is proposed to calculate the risk of sideswipe collisions at each instant based on SSM. For this purpose, time-to-collision (TTC) and post-encroachment time (PET), as the most important time-based indicators would be applied and a new method would be presented to calculate these indicators.


Angle Conflict: An angle conflict refers to a situation where two vehicles approach each other at an angle, creating a risk of collision. This typically occurs at intersections where vehicles are moving in different directions.

For example, consider a four-way intersection where one vehicle is trying to make a left turn while another vehicle is approaching from the opposite direction intending to go straight. If both vehicles enter the intersection at the same time, this could result in an angle conflict.

Another example could be found in roundabouts or rotary ways. Vehicles entering and exiting the roundabout can create angle conflicts as they cross paths at an angle. The weaving traffic movements in such scenarios can lead to potential conflict points.


History of Speed-Density Diagram

 Unlocking the Past: Greenshields' Traffic Theory in the 1930s

In the world of transportation and traffic engineering, understanding the fundamental relationship between speed and traffic density has always been a pivotal concern. The historical speed-density diagram, along with the groundbreaking experiment conducted by Bruce D. Greenshields in the 1930s, encapsulates this intricate connection. This diagram remains a cornerstone in the annals of traffic theory and continues to shape modern transportation planning.

The historical speed-density diagram provides a snapshot of traffic theory as it stood in the 1930s. It portrays the relationship between two critical variables: traffic speed and traffic density. In essence, this diagram illustrates how traffic flow changes as the number of vehicles on a road segment increases. Such fundamental insights have played a vital role in shaping our understanding of traffic dynamics and highway design.

Bruce D. Greenshields, a pioneer in traffic engineering, conducted the experiment that accompanies this diagram. His research, "A Study of Traffic Capacity," was published in the Proceedings of the Highway Research Board in 1935. This seminal work laid the groundwork for modern traffic theory and capacity analysis.

The experiment aimed to determine the relationship between traffic density and traffic speed under real-world conditions. Greenshields meticulously collected data on a four-kilometer stretch of road in the inner ring of Shanghai, a bustling city even in the 1930s. The road featured a variety of challenges, including a large-radius curve and six ramps, making it an ideal testbed for his study.

Greenshields' findings, as illustrated in the speed-density diagram, revolutionized traffic theory. He discovered that as traffic density increased, traffic speed decreased in a predictable and consistent manner. This phenomenon, known as the speed-density relationship, is still a fundamental concept in traffic engineering and highway design today.

The implications of Greenshields' work are profound. By establishing a quantitative link between traffic speed and density, his research provided a basis for designing roadways with optimal traffic flow. It has since become a cornerstone of traffic management strategies, influencing everything from signal timing at intersections to the design of highways and expressways.

Greenshields' experiment and the resulting speed-density diagram have stood the test of time. They have become foundational components of traffic theory, guiding the development of transportation infrastructure worldwide. In an era where traffic congestion is a global concern, Greenshields' work continues to provide valuable insights into how we can improve traffic flow, reduce congestion, and enhance road safety.

Moreover, the legacy of Bruce D. Greenshields extends beyond the world of traffic engineering. His dedication to empirical research and meticulous data collection serves as an enduring example for future generations of researchers and engineers. His work exemplifies the importance of rigorous experimentation in solving complex real-world problems.

In conclusion, the historical speed-density diagram and Bruce D. Greenshields' experiment from the 1930s offer a window into the historical roots of traffic theory and capacity analysis. They remind us of the enduring significance of understanding the relationship between speed and density in the context of traffic flow. As we continue to grapple with the challenges of modern urban transportation, Greenshields' pioneering work remains a guiding beacon, illuminating the path toward more efficient, safe, and sustainable traffic management.


Reference:

1. Traffic Flow Dynamics- Martin Treiber, Arne Kesting











Monday, October 2, 2023

How to read any Chinese document in English

Research Paper / Report Downaldoad Website

CNKY: https://www.cnki.net/index/

Step 1: log in in the system using your chinese univeristy id and password

Note: if you do no have univerity profile you can use credit card to buy



Read the Document in English

Software name: ZhiyunTrans Document Translation Reader (eng to cn and cn to eng)

Download link: https://amy.lanzouh.com/iaT76196l2vg

Step 1: Download and Install

Step 2: log in by scanning QR code of the soft

Finished