American Journal of Geographic Information System
p-ISSN: 2163-1131 e-ISSN: 2163-114X
2014; 3(2): 63-74
doi:10.5923/j.ajgis.20140302.01
Xun Li
School of Geographical Sciences and Urban Planning, Arizona State University, Tempe, Arizona, USA
Correspondence to: Xun Li, School of Geographical Sciences and Urban Planning, Arizona State University, Tempe, Arizona, USA.
| Email: | ![]() |
Copyright © 2014 Scientific & Academic Publishing. All Rights Reserved.
The application of trajectory classification to automatically detect movement types of unknown trajectories has been receiving increasing research attention in areas such as video surveillance, traffic management and location-based services. Existing research applies classic geometric shape-based classification approaches to classify trajectories by utilizing the geometric characteristics of movement to fulfill this task. However, this approach is limited to the geographic context of trajectory data. Classification methods based on movement parameters can overcome this problem but the accuracy of classification depends heavily on selecting appropriate movement features from trajectories. This research proposes an efficient trajectory classification model based on two types of complexity measures as new features for classifying movements: (1) the geometric complexity measures of trajectories based on Fractal Dimensions, and (2) structural complexity measures of movement parameters based on Approximate Entropy (ApEn). We suggest that ApEn, which provides complexity information about the subtle changes that occur in the structure of sequential movement parameters of trajectories, and Fractal Dimensions, which provide the overall description of geometric complexity, can be used together to improve the accuracy in trajectory classification. The feasibility of this proposed classification model is tested with 800 GPS trajectories that were shared and manually tagged with four movement types by Internet users on the website Openstreemap.org. The overall 85.4% average accuracy of prediction demonstrates the applicability of this classification model. By improving the quality of trajectory classification, the proposed approach in this research will benefit many applications of trajectory data analysis and mining.
Keywords: Trajectory classification, Complexity measure, Fractal dimension, Approximate entropy
Cite this paper: Xun Li, Using Complexity Measures of Movement for Automatically Detecting Movement Types of Unknown GPS Trajectories, American Journal of Geographic Information System, Vol. 3 No. 2, 2014, pp. 63-74. doi: 10.5923/j.ajgis.20140302.01.
![]() | Figure 1. 4 different randomly selected GPS trajectories (car, bike, run, walk) in 2D (upper) and in 3D (lower, with vertical axis representing time) |
![]() | (1) |
pairs, and the FD value is then calculated as (1 +α). One problem that may impact the precision of the FD value is a possible underestimation or truncation of the path length through different measuring scales. For the purpose of improving precision, Nams (2005) proposed a FMean method that computes an FD value twice by starting to measure total distance from two ends of a trajectory whereby the mean FD value is used as FMean. In this research, FMean will be used as a movement feature that describes the geometric complexity of trajectories.![]() | Figure 2. Plots of velocity against time for: car, bike, run, and walk (from top to bottom) of 4 randomly selected trajectories with 4 different movement types |
, which has N continuous observations, this research denotes a subsequence of m observations at location
, is a pattern
. If the difference between two patterns
and
is less than a predefined criterion r, we can conclude that these two patterns are similar. The approximate entropy value
can be computed with the following equation1:![]() | (2) |
is the prevalence of repetitive patterns of length m in
, which can be computed as:![]() | (3) |
is the frequency count of patterns in
that are similar to
. For a fixed number of N observations, large m will generate fewer patterns to measure the ApEn value than small m. As noted by the author of ApEn in (Pincus 1991), a small m (especially m=2) can distinguish a wide variety of systems, such as deterministic systems, chaotic system stochastic and mixed system, with relatively fewer points. For similarity criterion r, smaller r usually leads poor conditional probability with more similar patterns been identified, while larger r usually ignore detailed system information with less patterns been detected. As suggested by author in (Pincus 1991), choices of r ranging from 0.1 to 0.2 standard deviation of the sequence data
can avoid a significant contribution from noise in an ApEn calculation.
are attributed spatial and temporal information, e.g.
. Based on this, a trajectory can be represented as follows:![]() | (4) |
and
. The total cost in time is
and the approximate total length of the trajectory is ![]() | (5) |
, this trajectory has a fixed sampling interval. Usually, real-world trajectory data may not have been recorded at the same sampling rate and there may be some noise, such as incorrect locations that were recorded when location-aware devices (e.g. GPS) lost signals or were impacted by ionospheric and tropospheric errors in the trajectory data (Hoffmann-Wellenhof et al. 2001). The different sampling rate should be standardized for generating comparable Fractal Dimension and ApEn values. First, all trajectories that were recorded using latitude and longitude are simply transformed to a planar coordinate system with meters as the unit. To preprocess trajectories to contain the same fixed sampling interval, a linear interpolation approach is applied to resample trajectories at a fixed time interval. To check the noise in trajectory data, a simple rule that “moving velocity at each original point on the trajectory should be less than a predefined maximum velocity” is applied in the resample stage. The noise point will be simply removed once detected before resampling. If more than 10 noise points are detected, this trajectory will be ignored.
along trajectory
with total n sampling points, these movement parameters can be calculated as follows: ![]() | (6) |
![]() | (7) |
![]() | (8) |
![]() | (9) |
![]() | Figure 3. Illustration of computing general movement parameters such as: moving speed, turning angle, displacement etc |
) following by the explanation in section 2.2. Then, the mean, standard deviation and skewness of ApEn values are adopted as movement features.Then, a traditional and efficient approach, PCA is employed to reduce the dimensions of feature space by using an orthogonal transformation to reduce a set of possible correlated features to a smaller set of values of uncorrelated synthetic features (Smith 2002).
|
|
|
![]() | Figure 4. Receiver operating characteristic (ROC) and area under ROC curves comparison of “one-against-rest” binary classification results of Bike, Walk, Run and Car in experiment 1 and 2 |
| ||||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||||
|
| [1] | Batty M 1985 Fractals-geometry between dimensions. New scientist, 105: 31-5. |
| [2] | Buchin K, Buchin M, Gudmundsson J, Löffler M and Luo J 2011 Detecting commuting patterns by clustering subtrajectories. International Journal of Computational Geometry and Applications 21: 253-282. |
| [3] | Chatfield C 2004 The Analysis of Time Series: An Introduction. Florida, USA, Chapman & Hall/CRC. |
| [4] | Dodge S, Weibel R and Forootan E 2009 Revealing the physics of movement: Comparing the similarity of movement characteristics of different types of moving objects. Computers, Environment and Urban Systems: 419-434. |
| [5] | Eagle Nand Pentland A 2006 Reality mining: sensing complex social systems. Personal and Ubiquitous Computing 10: 268. |
| [6] | Feldman D Pand Crutchfield J 1998 A survey of "Complexity Measures". WWW documenthttp://www.santafe.edu/~cmg/compmech/tutorials/ComplexityMeasures.pdf. |
| [7] | Fritz H, Said S and Weimerskirch H 2003 Scale–dependent hierarchical adjustments of movement patterns in a long–range foraging seabird. Proceedings of the Royal Society of London. Series B: Biological Sciences 270: 1143. |
| [8] | Froehlich J and Krumm J 2008 Route prediction from trip observations. Society of Automotive Engineers 2193: 53. |
| [9] | Fu Z, Hu W and Tan T 2005 Similarity based vehicle trajectory clustering and anomaly detection. In IEEE International Conference on Image Processing. Genoa, Italy: IEEE: II-602-5. |
| [10] | Giannotti F, Nanni M, Pinelli F and Pedreschi D 2007 Trajectory pattern mining. In Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. San Jose, California, USA: ACM: 330-339. |
| [11] | Giannotti F and Pedreschi D 2008 Mobility, Data Mining and Privacy: Geographic Knowledge Discovery. Berlin Heidelberg: Springer-Verlag. |
| [12] | González M, Hidalgo C and Barabási A 2008 Understanding individual human mobility patterns. Nature, 453: 779-782. |
| [13] | Hanley J Aand McNeil B J 1983 A method of comparing the areas under receiver operating characteristic curves derived from the same cases. Radiology, 148: 839. |
| [14] | Hoffmann-Wellenhof B, Lichtenegger H and Collins J 2001GPS: Theory and Practice. Wien: Springer. |
| [15] | Johansson A, Helbing D, Al-Abideen H and Al-Bosta S 2008 From crowd dynamics to crowd safety: A video-based analysis. Advances in Complex Systems 11: 497–527. |
| [16] | Lee J G, Han J, Li X and Gonzalez H 2008 TraClass: trajectory classification using hierarchical region-based and trajectory-based clustering. In Proceedings of the 34th International Conference on Very Large Data Bases. Auckland, New Zealand, Endowment: 1081-1094. |
| [17] | Lin R and Shim H 1995 Fast similarity search in the presence of noise, scaling and translation in time-series databases. In Proceeding of the 21th International Conference on Very Large Data Bases. San Francisco, CA, USA, Morgan Kaufmann Publishers Inc.:490-501. |
| [18] | Makris D and Ellis T 2002 Spatial and probabilistic modelling of pedestrian behaviour. In Proceeding of British Machine Vision Conference. Cardiff, UK, BMVC: 557-566. |
| [19] | Mandelbrot B 1967 How long is the coast of Britain? Statistical self-similarity and fractional dimension. Science 156: 636. |
| [20] | Nams VO 1996 The VFractal: a new estimator for fractal dimension of animal movement paths. Landscape Ecology 11: 289-297. |
| [21] | Nams V O 2005 Using animal movement paths to measure response to spatial scale. Oecologia 143: 179-188. |
| [22] | Nara A and Torrens P M 2007 Spatial and temporal analysis of pedestrian egress behavior and efficiency. In Proceedings of the 15th Annual ACM International Symposium on Advances in Geographic Information System. New York, NY, USA, ACM:1. |
| [23] | Nguyen N T, Phung D Q, Venkatesh S and Bui H 2005 Learning and detecting activities from movement trajectories using the hierarchical hidden Markov model. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition. San Diego, California, USA: IEEE: 955-960. |
| [24] | Niu W, Long J, Han D and Wang Y F 2004 Human activity detection and recognition for video surveillance. In Proceedings of the 2004 IEEE International Conference on Multimedia and Expo.Taipei, Taiwan, IEEE: 719-722. |
| [25] | Pincus S M 1991 Approximate entropy as a measure of system complexity. Proceedings of the National Academy of Sciences 88: 2297-2301. |
| [26] | Pincus S M 2008 Approximate entropy as an irregularity measure for financial data. Econometric Reviews 27, 4: 329-362. |
| [27] | Shannon C E 1948 A mathematical theory of communications, I and II. Bell System Technical Journal 27: 379-423 and 623-656. |
| [28] | Smith L I 2002 A tutorial on principal components analysis. WWW documenthttp://www.cs.otago.ac.nz/cosc453/student_tutorials/principal_components.pdf. |
| [29] | Torrens P 2008 Wi-fi geographies. Annals of the Association of American Geographers 98: 59-84. |
| [30] | Torrens P, Li X and Griffin W A 2011 Building agent-based walking models by machine-learning on diverse databases of space-time trajectory samples. Transactions in GIS 15: 67-94. |
| [31] | Yazji S,Dick R P, Scheuermann P and Trajcevski G 2011 Protecting private data on mobile systems based on spatio-temporal analysis. In Proceedings of the International Conference on Pervasive and Embdeded Computing and Communication Systems. Vilamoura, Algarve, Portugal, PECCS:114-123. |