Emeritus Professor Peter Christen
PhD
Professor
ANU College of Engineering, Computing and Cybernetics
T:
n/a
Areas of expertise
- Pattern Recognition And Data Mining 080109
- Health Informatics 080702
- Information Retrieval And Web Search 080704
- Data Encryption 080402
Biography
Research interests:
Record Linkage, Entity Resolution, Data Mining, Privacy-Preserving Record Linkage and Data Sharing, Machine Learning, Geocoding, Data Pre-Processing, Health Data Mining, Parallel Computing
See my full CV for more details: http://users.cecs.anu.edu.au/~christen/
Publications
- Nanayakkara, V, Christen, P & Ranbaduge, T 2021, 'Active Learning Based Similarity Filtering for Efficient and Effective Record Linkage', 25th Pacific-Asia Conference, PAKDD 2021, ed. K Karlapalem, H Cheng, N Ramakrishnan, R K Agrawal, Springer Nature Switzerland AG, Switzerland, pp. 321-333.
- Christen, P, Schnell, R, Ranbaduge, T et al. 2021, 'A critique and attack on "Blockchain-based privacy-preserving record linkage"', Information Systems, vol. 108, p. 101930.
- Christen, V, Christen, P & Rahm, E 2020, 'Informativeness-Based Active Learning for Entity Resolution', Joint European Conference on Machine Learning and Knowledge Discovery in Databases ECML PKDD 2019, ed. P Cellier, K Driessens, Springer, Switzerland, pp. 125-141.
- Nanayakkara, V, Christen, P & Ranbaduge, T 2020, 'An anonymiser tool for sensitive graph data', 29th ACM International Conference on Information and Knowledge Management, CIKM2020, ed. D'Aquin, Matthieu, Dietze, Stefan, Association for Computing Machinery (ACM), New York, NY, United States, pp. 1-5.
- Ranbaduge, T & Christen, P 2020, 'A scalable privacy-preserving framework for temporal record linkage', Knowledge and Information Systems, vol. 62, pp. 45-78.
- Vatsalan, D, Christen, P & Rahm, E 2020, 'Incremental clustering techniques for multi-party Privacy-Preserving Record Linkage', Data and Knowledge Engineering, vol. 128, pp. 1-19.
- Ranbaduge, T, Vatsalan, D & Christen, P 2020, 'Secure Multi-party Summation Protocols: Are They Secure Enough Under Collusion?', Transactions on Data Privacy, vol. 13, no. 1888-5063, pp. 25-60.
- Akgun, O, Dearle, A, Kirby, G et al. 2020, 'Linking Scottish vital event records using family groups', Historical Methods, vol. 53, no. 2, pp. 130-146.
- Vidanage, A, Ranbaduge, T, Christen, P et al. 2020, 'A privacy attack on multiple dynamic match-key based privacy-preserving record linkage', International Journal of Population Data Science, vol. 5, no. 1, pp. 1-13.
- Vidanage, A, Christen, P, Ranbaduge, T et al. 2020, 'A Graph Matching Attack on Privacy-Preserving Record Linkage', 29th ACM International Conference on Information & Knowledge Management, CIKM2020, Association for Computing Machinery (ACM), New York, NY, United States, pp. 1485-1494.
- Draisbach, U, Christen, P & Naumann, F 2019, 'Transforming pairwise duplicates to entity clusters for high-quality duplicate detection', Journal of Data and Information Quality, vol. 12, no. 1, pp. 1-30.
- Christen, P, Ranbaduge, T, Vatsalan, D et al. 2019, 'Precise and Fast Cryptanalysis for Bloom Filter Based Privacy-Preserving Record Linkage', IEEE Transactions on Knowledge and Data Engineering, vol. 31, no. 11, pp. 2164-2177.
- Nanayakkara, S, Christen, P, Ranbaduge, T et al. 2019, 'Evaluation measure for group-based record linkage', International Journal of Population Data Science, vol. 4, no. 27, pp. 1-12.
- Vidanage, A, Ranbaduge, T, Christen, P et al. 2019, 'Efficient pattern mining based cryptanalysis for privacy-preserving record linkage', 35th IEEE International Conference on Data Engineering, ICDE 2019, IEEE Computer Society, United States, pp. 1698-1701.
- Kirielle Arachchillage, D, Christen, P & Ranbaduge, T 2019, 'Outlier detection based accurate geocoding of historical addresses', 17th Australasian Conference on Data Mining, AusDM 2019, ed. T D Le, K-L Ong, Y Zhao, W H Jin, S Wong, L Jiu, G Williams, Springer, Singapore, pp. 41-53.
- Christen, P, Ranbaduge, T & Vatsalan, D 2018, 'An Introduction to DLforum – An online discussion forum for data linkage researchers and practitioners', International Journal of Population Data Science, vol. 3, no. 1.
- Ranbaduge, T, Vatsalan, D & Christen, P 2018, 'A scalable and efficient subgroup blocking scheme for multidatabase record linkage', 22nd Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining, PAKDD 2018, ed. D Phung, V S Tseng, G I Webb, B Ho, M Ganji, L Rashidi, Springer Verlag, Switzerland, pp. 15-27.
- Manning, M, Wong, G, Graham, T et al. 2018, 'Towards a 'smart' cost-benefit tool: using machine learning to predict the costs of criminal justice policy interventions', Crime Science, vol. 7, no. 12, pp. 1-13.
- Gladbach, M, Sehili, Z, Kudrass, T, Christen, P, Rahm, E 2018, 'Distributed privacy-preserving record linkage using pivot-based filter techniques', 34th IEEE International Conference on Data Engineering Workshops, ICDEW 2018, IEEE, To be checked, pp. 33-38.
- Ranbaduge, T & Christen, P 2018, 'Privacy-Preserving Temporal Record Linkage', 18th IEEE International Conference on Data Mining, ICDM 2018, IEEE, Singapore, pp. 377-386pp.
- Wijenayake, S, Graham, T & Christen, P 2018, 'A Decision Tree Approach to Predicting Recidivism in Domestic Violence', 2018 Big Data Analytics for Social Computing (BDASC).
- Zhang, Y, Churchill, T, Ng, K, Christen, P et al 2018, 'Scalable entity resolution using probabilistic signatures on parallel databases', 27th ACM International Conference on Information and Knowledge Management, CIKM 2018, ed. N Paton, S Candan, H Wan, J Allan, R Agrawal, A Labrinidis, Association for Computing Machinery (ACM), TBC, pp. 2213-2222pp.
- Hand, D & Christen, P 2017, 'A note on using the F-measure for evaluating record linkage algorithms', Statistics and Computing, pp. 1-9.
- Ranbaduge, T, Vatsalan, D, Randall, S et al 2017, 'Evaluation of Advanced Techniques for Multi-Party Privacy-Preserving Record Linkage on Real-World Health Databases', International Population Data Linkage Conference, International Population Data Linkage Network, Swansea University Wales United Kingdom.
- Christen, V, Gross, A, Fisher, J, Wang, Q, Christen, P, Rahm, E 2017, 'Temporal group linkage and evolution analysis for census data', International Conference on Extending Database Technology (EDBT 2017), ed. V. Markl, S. Orlando, et al, pp. 620-631.
- Vatsalan, D, Sehili, Z, Christen, P and Rahm, E. 2017, 'Privacy-Preserving Record Linkage for Big Data: Current Approaches and Research Challenges', in (ed.), Handbook of Big Data Technologies, Springer.
- Christen, P, Gayler, R, Tran, K et al 2016, 'Automatic Discovery of Abnormal Values in Large Textual Databases', Journal of Data and Information Quality, vol. 7, no. 1-2, pp. 7:1-7:31.
- Fisher, J, Christen, P & Wang, Q. 2016, 'Active Learning Based Entity Resolution Using Markov Logic', 20th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2016, ed. J. Bailey et al, Springer International Publishing, pp. 338-349.
- Karapiperis, D, Vatsalan, D, Verykios, Christen, P. 2016, 'Efficient Record Linkage Using a Compact Hamming Space', 19th International Conference on Extending Database Technology EDBT 2016, OpenProceedings.org, pp. 209-220.
- Kim, M, Newth, D & Christen, P 2016, 'Macro-level information transfer in social media:Reflections of crowd phenomena', Neurocomputing, vol. 172, pp. 84-99.
- Ranbaduge, T, Vatsalan, D & Christen, P 2016, 'Scalable block scheduling for efficient multi-database record linkage', 16th IEEE International Conference on Data Mining, ICDM 2016, ed. Bonchi F.Wu X.Baeza-Yates, Barcelona. 1161-1166.
- Ranbaduge, T, Vatsalan, D, Christen, P. 2016, 'Hashing-Based Distributed Multi-party Blocking for Privacy-Preserving Record Linkage', 20th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2016, ed. J. Bailey et al, Springer International Publishing, pp. 415-427.
- Vatsalan, D & Christen, P 2015, 'Privacy-preserving matching of similar patients', Journal of Biomedical Informatics, vol. 59, pp. 285-298.
- Vatsalan, D, Christen, P & Rahm, E 2015, 'Scalable Privacy-Preserving Linking of Multiple Databases Using Counting Bloom Filters', 16th IEEE International Conference on Data Mining Workshops, ICDMW 2016, Barcelona, Spain, pp. 882 - 889.
- Wang, Q, Gao, J, & Christen, P. 2016, 'A Clustering-Based Framework for Incrementally Repairing Entity Resolution', 20th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2016, ed. J. Bailey et al, Springer International Publishing pp. 283-295.
- Wang, Q, Vatsalan, D & Christen, P 2016, 'Regression Classification for Improved Temporal Record Linkage', Australasian Data Mining Conference (AusDM 2016), Canberra, Australia, CRPIT.
- Bloothooft, G, Mandemakers, K, Christen, P et al, eds, 2015, Population Reconstruction, Springer International Publishing AG, Cham, Switzerland.
- Christen, P & Gayler, R 2015, 'Context-Aware Approximate String Matching for Large-Scale Real-Time Entity Resolution', IEEE International Conference on Data Mining Workshop 2015 ICDMW, ed. P. Cui, J. Dy, C. Aggarwal, Zhi-Hua Zhou, A. Tuzhilin, H. Xi, IEEE Computer Society, New York, USA, pp. 211-217.
- Tran, K & Christen, P 2015, 'Cross-Language Learning from Bots and Users to Detect Vandalism on Wikipedia', IEEE Transactions on Knowledge and Data Engineering, vol. 27, no. 3, pp. 673-685.
- Christen, P, Vatsalan, D & Fu, Z 2015, 'Advanced Record Linkage Methods and Privacy Aspects for Population Reconstruction - A Survey and Case Studies', in Gerrit Bloothooft, Peter Christen, Kees Mandemakers, Marijn Schraagen (ed.), Population Reconstruction, Springer International Publishing AG, Switzerland, pp. 87-110.
- Christen, P, Vatsalan, D & Wang, Q 2015, 'Efficient Entity Resolution with Adaptive and Interactive Training Data Selection', 2015 IEEE International Conference on Data Mining, IEEE Computer Society, USA, pp. 727-732.
- Fisher, J, Christen, P, Wang, Q et al 2015, 'A Clustering-Based Framework to Control Block Sizes for Entity Resolution', 21st ACM SIGKDD Conference on Knowledge Discovery and Data Mining KDD 2015, pp. 279-288. Sydney
- Karapiperis, D, Vatsalan, D, Verykios, V et al 2015, 'Large-scale multi-party counting set intersection usig a space efficient global synopsis', 20th International Conference on Database Systems for Advanced Applications, ed. M. Renz et al, Springer LNCS 9050, pp. 329-345, Hanoi.
- Ramadan, B & Christen, P 2015, 'Unsupervised Blocking Key Selection for Real-Time Entity Resolution', 19th Pacific-Asia Conference, PAKDD 2015, ed. Tru Cao, Ee-Peng Lim, Zhi-Hua Zhou, Tu-Bao Ho, David Cheung, Hiroshi Motoda, Springer, pp. 574-585.
- Ramadan, B, Christen, P, Liang, H et al 2015, 'Dynamic Sorted Neighborhood Indexing for Real-Time Entity Resolution', ACM Journal of Data and Information Quality, vol. 6, no. 4.
- Ranbaduge, T, Vatsalan, D & Christen, P 2015, 'Clustering-Based Scalable Indexing for Multi-party Privacy', 19th Pacific-Asia Conference, PAKDD 2015, ed. Tru Cao, Ee-Peng Lim, Zhi-Hua Zhou, Tu-Bao Ho, David Cheung, Hiroshi Motoda, Springer, pp. 549-561.
- Ranbaduge, T, Vatsalan, D & Christen, P 2015, 'MERLIN - A Tool for Multi-party Privacy-preserving Record Linkage', 2015 IEEE International Conference on Data Mining, IEEE Computer Society, USA, pp. 1640-1643.
- Tran, K, Christen, P, Sanner, S et al 2015, 'Context-Aware Detection of Sneaky Vandalism on Wikipedia Across Multiple Languages', 19th Pacific-Asia Conference, PAKDD 2015, ed. Tru Cao, Ee-Peng Lim, Zhi-Hua Zhou, Tu-Bao Ho, David Cheung, Hiroshi Motoda, Springer, pp. 380-391.
- Wang, Q, Vatsalan, D & Christen, P 2015, 'Efficient Interactive Training Selection for Large-Scale Entity Resolution', 19th Pacific-Asia Conference, PAKDD 2015, ed. Tru Cao, Ee-Peng Lim, Zhi-Hua Zhou, Tu-Bao Ho, David Cheung, Hiroshi Motoda, Springer, pp. 562-573.
- Perera, C, Jayaraman, P, Zaslavsky, A et al 2014, 'Sensor discovery and configuration framework for the Internet of Things paradigm', 2014 IEEE World Forum on Internet of Things, WF-IoT 2014, IEEE, Seoul South Korea, pp. 94-99.
- Vatsalan, D, Christen, P, O'Keefe, C et al 2014, 'An Evaluation Framework for Privacy-Preserving Record Linkage', Journal of Privacy and Confidentiality, vol. 6, no. 1, pp. 35-75.
- Kim, M, Newth, D & Christen, P 2014, 'Uncovering Diffusion in Academic Publications using Model-Driven and Model-Free Approaches', 4th IEEE International Conference on Big Data and Cloud Computing (BDCloud 2014), IEEE Computer Society, USA, pp. 564-571.
- Perera, C, Zaslavsky, A, Liu, C et al 2014, 'Sensor Search Techniques for Sensing as a Service Architecture for the Internet of Things', IEEE Sensors Journal, vol. 14, no. 2, pp. 406-420.
- Fu, Z, Christen, P & Zhou, J 2014, 'A graph matching method for historical census household linkage', 18th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining, PAKDD 2014, ed. V S Tseng, T B Ho, Z-H Zhou, A L P Chen, H-Y Kao, Springer Verlag, Tainan China, pp. 485-496.
- Fu, Z, Boot, M, Christen, P et al. 2014, 'Automatic record linkage of individuals and households in historical census data', International Journal of Humanities and Arts Computing, vol. 8, no. 2, pp. 204-225.
- Perera, C, Jayaraman, P, Zaslavsky, A et al 2014, 'MOSDEN: An internet of things middleware for resource constrained mobile devices', 47th Hawaii International Conference on System Sciences, HICSS 2014, IEEE, Waikoloa USA, pp. 1053-1062.
- Liang, H, Wang, Y, Christen, P et al. 2014, 'Noise-tolerant approximate blocking for dynamic real-time entity resolution', 18th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining, PAKDD 2014, ed. V S Tseng, T B Ho, Z-H Zhou, A L P Chen, H-Y Kao, Springer Verlag, Tainan China, pp. 449-460.
- Christen, P, Vatsalan, D & Verykios, V 2014, 'Challenges for Privacy Preservation in Data Integration', Journal of Data and Information Quality, vol. 5, no. 1-2, pp. 4.1-4.3.
- Perera, C, Zaslavsky, A, Christen, P et al 2014, 'Context aware computing for the internet of things: A survey', IEEE Communications Surveys and Tutorials, vol. 16, no. 1, pp. 414-454.
- Perera, C, Zaslavsky, A, Christen, P et al 2014, 'Sensing as a service model for smart cities supported by Internet of Things', European Transactions on Telecommunications, vol. 25, no. 1, pp. 81-93.
- Perera, C, Jayaraman, P, Zaslavsky, A et al 2014, 'Context-Aware Dynamic Discovery and Configuration of 'Things' in Smart Environments', in Nik Bessis, Ciprian Dobre (ed.), Studies in Computational Intelligence Big Data and Internet of Things: A Roadmap for Smart Environments, Springer, Cham, Switzerland, pp. 215-241.
- Christen, P 2014, 'Advanced record linkage methods and privacy aspects for population reconstruction', Workshop 'Population Reconstruction', Amsterdam.
- Christen, P 2014, 'Privacy Aspects in Big Data Integration:Challenges and Opportunities', Privacy and Secuirty of Big Data 2014, ACM CIKM, Shanghai.
- Kim, M, Newth, D & Christen, P 2014, 'Macro-Level Information Transfer across Social Networks', 23rd International World Wide Web Conference WWW2014, Association for Computing Machinery (ACM), pp. 321-322.
- Kim, M, Newth, D & Christen, P 2014, 'Trends of News Diffusion in Social Media based on Crowd Phenomena', 23rd International World Wide Web Conference WWW2014, Association for Computing Machinery (ACM), pp. 753-758.
- Ramadan, B & Christen, P 2014, 'Forest-Based Dynamic Sorted Neighborhood Indexing for Real-Time Entity Resolution', 23rd ACM International Conference on Information and Knowledge Management, Association for Computing Machinery (ACM), Shanghai, pp. 1787-1790.
- Ramadan, B, Christen, P & Liang, H 2014, 'Dynamic Sorted Neighborhood Indexing for Real-Time Entity Resolution', 25th Australasian Database Conference 2014, ed. Hua Wang, Mohamed A. Sharaf, Springer, pp. 1-12. Brisbane.
- Ranbaduge, T, Christen, P & Vatsalan, D 2014, 'Tree Based Scalable Indexing for Multi-Party Privacy-Preserving Record Linkage', Australasian Data Mining Conference (AusDM 2014), Australian Computer Society Inc., Sydney Australia.
- Vatsalan, D & Christen, P 2014, 'Scalable Privacy-Preserving Record Linkage for Multiple Databases', 23rd ACM International Conference on Information and Knowledge Management, Association for Computing Machinery (ACM), Shanghai, pp. 1795-1798.
- Verykios, V & Christen, P 2013, 'Privacy-preserving record linkage', Data Mining and Knowledge Discovery, vol. 3, no. 5, pp. 321-332.
- Perera, C, Zaslavsky, A, Compton, M et al. 2013, 'Context aware sensor configuration model for internet of things', 12th International Semantic Web Conference, ISWC 2013, Elsevier, USA, pp. 253-256.
- Perera, C, Zaslavsky, A, Compton, M et al 2013, 'Semantic-driven configuration of internet of things middleware', 9th International Conference on Semantics, Knowledge and Grids, SKG 2013, IEEE, Beijing, pp. 66-73.
- McNamara, D, Wong, P, Christen, P et al 2013, 'Predicting High Impact Academic Papers Using Citation Network Features', Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2013), ed. Jiuyong Li, Longbing Cao (et al), Springer, Springer Heidelberg Dordrecht, pp. 14-25.
- Vatsalan, D, Christen, P & Verykios, V 2013, 'A Taxonomy of Privacy-Preserving Record Linkage Techniques', Information Systems, vol. 38, no. 6, pp. 946-969.
- Perera, C, Zaslavsky, A, Christen, P et al 2013, 'Context-aware Sensor Search, Selection and Ranking Model for Internet of Things Middleware', Conference on Mobile Data Management (MDM 2013), IEEE Computer Society, unknown, pp. 314-322.
- Perera, C, Jayaraman, P, Zaslavsky, A et al 2013, 'Dynamic Configuration of Sensors Using Mobile Sensor Hub in Internet of Things Paradigm', International Conference on Intelligent Sensors, Sensor Networks and Information Processing (IEEE ISSNIP 2013), IEEE ISSNIP, USA, pp. 473-478.
- Tran, K & Christen, P 2013, 'Cross Language Prediction of Vandalism on Wikipedia Using Article Views and Revisions', Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2013), ed. Jiuyong Li, Longbing Cao (et al), Springer, Springer Heidelberg Dordrecht, pp. 268-279.
- Kim, M, Newth, D & Christen, P 2013, 'Modeling Direct and Indirect Influence across Heterogeneous Social Networks', SIGKDD Workshop on Social Network Mining and Analysis (SNA-KDD 2013), Conference Organising Committee, unkown, pp. 1-9.
- Sedhain, S, Sanner, S, Xie, L et al 2013, 'Social Affinity Filtering:Recommendation through Fine-grained Analysis of User Interactions and Activities', Conference on Online Social Networks (COSN 2013), Association for Computing Machinery Inc (ACM), unkown.
- Christen, P & Vatsalan, D 2013, 'Flexible and Extensible Generation and Corruption of Personal Data', ACM Conference on Information and Knowledge Management (CIKM 2013), Association for Computing Machinery Inc (ACM), New York USA, pp. 1165-1168.
- Tran, K & Christen, P 2013, 'Identifying Multilingual Wikipedia Articles based on Cross Language Similarity and Activity', ACM Conference on Information and Knowledge Management (CIKM 2013), Association for Computing Machinery Inc (ACM), New York USA, pp. 1485-1488.
- Christen, P & Gayler, R 2013, 'Adaptive Temporal Entity Resolution on Dynamic Databases', Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2013), ed. Jiuyong Li, Longbing Cao (et al), Springer, Springer Heidelberg Dordrecht, pp. 558-569.
- Ramadan, B, Christen, P, Liang, H et al. 2013, 'Dynamic Similarity-Aware Inverted Indexing for Real-Time Entity Resolution', Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2013), ed. Jiuyong Li, Longbing Cao (et al), Springer, Springer Heidelberg Dordrecht, pp. 47-58.
- Vatsalan, D & Christen, P 2013, 'Sorted Nearest Neighborhood Clustering for Efficient Private Blocking', Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2013), ed. Jiuyong Li, Longbing Cao (et al), Springer, Springer Heidelberg Dordrecht, pp. 341-353.
- Tran, K, Vatsalan, D & Christen, P 2013, 'GeCo - An online personal data Generator and Corruptor', ACM Conference on Information and Knowledge Management (CIKM 2013), Association for Computing Machinery Inc (ACM), New York USA, pp. 2473-2476.
- Vatsalan, D, Christen, P & Verykios, V 2013, 'Efficient Two-Party Private Blocking based on Sorted Nearest Neighborhood Clustering', ACM Conference on Information and Knowledge Management (CIKM 2013), Association for Computing Machinery Inc (ACM), New York USA.
- Kim, M, Newth, D & Christen, P 2013, 'Modeling Dynamics of Diffusion Across Heterogeneous Social Networks: News Diffusion in Social Media', Entropy, vol. 15, no. 10, pp. 4215-4242.
- Kim, M, Newth, D & Christen, P 2013, 'Modeling dynamics of meta-populations with a probabilistic approach: global diffusion in social media', ACM Conference on Information and Knowledge Management (CIKM 2013), Association for Computing Machinery Inc (ACM), New York USA, pp. 489-498.
- Fisher, J, Wang, Q, Wong, P, Christen, P, 'Data Cleaning and Matching of Institutions in Bibliographic Databases', Australasian Data Mining Conference (AusDM 2013), CRPIT Vol 146.
- Perera, C, Zaslavsky, A, Christen, P et al 2012, 'CA4IOT: Context Awareness for Internet of Things', Green Computing and Communications (GreenCom 2012), IEEE and ACM (USA), Fance, pp. 775-782.
- Denny, D, Christen, P & Williams, G 2012, 'Analysis of Cluster Migrations Using Self-Organizing Maps', Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2011), Conference Organising Committee, Shenzhen China, pp. 171-182.
- Christen, P 2012, 'A Survey of Indexing Techniques for Scalable Record Linkage and Deduplication', IEEE Transactions on Knowledge and Data Engineering, vol. 24, no. 9, pp. 1537-1555.
- Kim, M, Xie, L & Christen, P 2012, 'Event Diffusion Patterns in Social Media', International Conference on Weblogs and Social Media ICWSM 2012, AAAI Press, unknown, pp. 178-185.
- Liang, H, Xu, Y, Tjondronegoro, D et al. 2012, 'Time-aware Topic Recommendation Based on Micro-blogs', ACM international conference on Information and knowledge management (CIKM 2012), Association for Computing Machinery Inc (ACM), New York USA, pp. 1657-1661.
- Perera, C, Zaslavsky, A, Christen, P et al 2012, 'Connecting Mobile Things to Global Sensor Network Middleware using System-generated Wrappers', ACM International Workshop on Data Engineering for Wireless and Mobile Access in Conjunction with ACM SIGMOD/PODS International Conference on Management of Data 2012, Conference Organising Committee, Scottsdale USA, pp. 23-30.
- Perera, C, Zaslavsky, A, Christen, P et al 2012, 'Capturing Sensor Data from Mobile Phones usingGlobal Sensor Network Middleware', IEEE International Symposium on Personal, Indoor and Mobile Radio Communications PIMRC 2012, IEEE, USA, pp. 24-29.
- Noel, J, Sanner, S, Tran, K et al. 2012, 'New Objective Functions for Social Collaborative Filtering', Annual Conference on World Wide Web (WWW 2012), Association for Computing Machinery Inc (ACM), Lyon, pp. 859-868.
- Christen, P 2012, Data Matching - Concepts and Techniques for Record Linkage, Entity Resolution, and Duplicate Detection, Springer.
- Fu, Z, Zhou, J, Christen, P, and Boot, M 2012, 'Multiple Instance Learning for Group Record Linkage', 16th Pacific-Asia Conference on Knowledge Discovery and Data Mining, Springer LNAI.
- Fu, Z, Zhou, J, Furong, P, Christen, P 2012, 'A Bag Reconstruction Method for Multiple Instance Classification and Group Record Linkage', ADMA 2012, ed. S. Zhou, S Zhang, G. Karypis, Springer-Verlag, Berlin, Heidelberg, pp. 247-259.
- Karakasidis, A, Verykios, V & Christen, P 2011, 'Fake Injection Strategies for Private Phonetic Matching', DPM International Workshop on Data Privacy Management, ed. J. Garcia-Alfaro et al, Springer-Verlag, Berlin Germany, p. 16.
- Vatsalan, D & Christen, P 2012, 'An Iterative Two-Party Protocol for Scalable Privacy-Preserving Record Linkage', Australasian Data Mining Conference (AusDM 2012), Sydney.
- Vatsalan, D, Christen, P & Verykios, V 2011, 'An Efficient Two-Party Protocol for Approximate Matching in Private Record Linkage', Australasian Data Mining Conference 2011, ed. Peter Vamplew, Andrew Stranieri, KL Ong, Peter Christen and Paul J. Ken, Australian Computer Society Inc., Sydney Australia, p. 12.
- De Vries, T, Ke, H, Chawla, S et al 2011, 'Robust record linkage blocking using suffix arrays and bloom filters', ACM Transactions on Knowledge Discovery from Data, vol. 5, no. 2, pp. A9-27.
- Fu, Z, Christen, P & Boot, M 2011, 'A Supervised Learning and Group Linking Method for Historical Census Household Linkage', Ninth Australasian Data Mining Conference, Ballarat, December 2011.
- Fu, Z, Christen, P & Boot, M 2011, 'Automatic Cleaning and Linking of Historical Census Data using Household Information', Fifth International Workshop on Domain Driven Data Mining, IEEE ICDM, Vancouver, December 2012.
- Christen, P 2009, 'Development and User Experiences of an Open Source Data Cleaning, Deduplication and Record Linkage System', SIGKDD explorations: newsletter of the Special Interest Group on Knowledge Discovery and Data Mining, vol. 11, no. 1, pp. 39-48.
- Christen, P & Pudjijono, A 2009, 'Accurate Synthetic Generation of Realistic Personal Information', in T. Theeramunkong, B. Kijsirikul, N. Cercone, Tu-Bao Ho (ed.), Advances in Knowledge Discovery and Data Mining, Springer, Berlin, Heidelberg, Germany, pp. 507-514.
- De Vries, T, Ke, H, Chawla, S et al 2009, 'Robust Record Linkage Blocking using Suffix Arrays', ACM Conference on Information and Knowledge Management (CIKM 2009), ed. Conference Program Committee, Association for Computing Machinery Inc (ACM), USA, pp. 305-314.
- Denny, D, Williams, G & Christen, P 2010, 'Visualizing temporal cluster changes using Relative Density Self-Organizing Maps', Knowledge and Information Systems, vol. 25, no. 2, pp. 281-302.
- Christen, P, Gayler, R & Hawking, D 2009, 'Similarity-Aware Indexing for Real-Time Resolution', ACM Conference on Information and Knowledge Management (CIKM 2009), ed. Conference Program Committee, Association for Computing Machinery Inc (ACM), USA, pp. 1565-1568.
- Christen, P 2009, 'Geocode Matching and Privacy Preservation', in Francesco Bonchi, Elena Ferrari, Wei Jiang, Bradley Malin (ed.), Privacy, Security, and Trust in KDD (5456), Springer, Germany, pp. 7-24.
- Denny, D, Williams, G & Christen, P 2008, 'ReDSOM: Relative Density Visualization of Temporal Changes in Cluster Structures Using Self-organizing Maps', IEEE International Conference on Data Mining (ICDM 2008), ed. F. Giannotti, D. Gunopulos, F. Turini C. Zaniolo, N. Ramakrishnan, X. Wu, IEEE Computer Society, Los Alamitos, California, pp. 173-182.
- Christen, P 2008, 'Automatic Training Example Selection for Scalable Unsupervised Record Linkage', Pacific Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2008), ed. Takashi Washio, Einoshin Suzuki, Kai Ming Ting, Akihiro Inokuchi, Springer, New York, pp. 511-528.
- Christen, P 2008, 'Febrl - An open source data cleaning, deduplication and record linkage system with a graphical user interface', ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2008), ed. Conference Program Committee, Association for Computing Machinery Inc (ACM), New York USA, pp. 1065-1068.
- Denny, D, Williams, G & Christen, P 2008, 'Exploratory Hot Spot Profile Analysis Using Interactive Visual Drill-Down Self-Organizing Maps', Pacific Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2008), ed. Takashi Washio, Einoshin Suzuki, Kai Ming Ting, Akihiro Inokuchi, Springer, New York, pp. 536-543.
- Christen, P 2008, 'Febrl - A Freely Available Record Linkage System with a Graphical User Interface', Australasian Workshop on Health Data and Knowledge Management (HDKM 2008), ed. James R Warren, Ping Yu, John Yearwood, Jon D Patrick, Australian Computer Society Inc., Australia, pp. 17-25.
- Christen, P 2008, 'Automatic Record Linkage using Seeded Nearest Neighbour and Support Vector Machine Classification', ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2008), ed. Conference Program Committee, Association for Computing Machinery Inc (ACM), New York USA, pp. 151-160.
- Christen, P & Gayler, R 2008, 'Towards Scalable Real-Time Entity Resolution using a Similarity-Aware Inverted Index Approach', Australasian Data Mining Conference (AusDM 2008), ed. John F Roddick, Jinyoung Li, Peter Christen, Paul Kennedy, Association for Computing Machinery Inc (ACM), Australia, p. 10.
- Christen, P & Goiser, K 2007, 'Quality and Complexity Measures for Data Linkage and Deduplication', in F. Guillet and H. Hamilton (ed.), Quality Measures in Data Mining: Studies in Computational Intelligence, Springer, USA, pp. 127-151.
- Christen, P 2007, 'Evaluation of a Graduate Level Data Mining Coursewith Industry Participants', Conferences in Research and Practice in Information Technology - CRPIT, vol. 70, pp. 233-241.
- Christen, P 2007, 'A Two-Step Classification Approach to Unsupervised Record Linkage', Conferences in Research and Practice in Information Technology - CRPIT, vol. 70, pp. 111-119.
- Denny, D, Williams, G & Christen, P 2007, 'Exploratory Multilevel Hot Spot Analysis: Australian Taxation Office Case Study', Conferences in Research and Practice in Information Technology - CRPIT, vol. 70, pp. 73-80.
- Christen, P & Kennedy, P, eds, 2007, Data Mining and Analytics 2007, Australian Computer Society Inc., Australia.
- Christen, P 2007, 'Febrl (Freely Extensible Biomedical Record Linkage)'.
- Goiser, K & Christen, P 2006, 'Towards automated record linkage', Australasian Data Mining Conference (AusDM 2006), ed. Peter Christen et al, Australian Computer Society Inc., Sydney, pp. 23-31.
- Summerhayes, R, Holder, P, Beard, J et al 2006, 'Automated geocoding of routinely collected health data in New South Wales', NSW Public Health Bulletin, vol. 17, no. 3-4, pp. 33-37.
- Christen, P, Willmore, A & Churches, T 2006, 'A probabilistic geocoding system utilising a parcel based address file', Lecture Notes in Computer Science (LNCS), vol. 3755, pp. 130-145.
- Christen, P 2006, 'Privacy-preserving data linkage and geocoding: current approaches and research directions', IEEE International Conference on Data Mining (ICDM 2006), ed. Conference Program Committee, Institute of Electrical and Electronics Engineers (IEEE Inc), USA, pp. 497-501.
- Christen, P 2006, 'A comparison of personal name matching: techniques and practical issues', IEEE International Conference on Data Mining (ICDM 2006), ed. Conference Program Committee, Institute of Electrical and Electronics Engineers (IEEE Inc), USA, pp. 290-294.
- Christen, P & Churches, T 2006, 'Secure Health Data Linkage and Geocoding: Current Approaches and Research Directions', National e-Health Privacy and Security Symposium 2006, ed. Peter Croll, Queensland University of Technology, Queensland, pp. 89 - 99.
- Armstrong, W, Christen, P, McCreath, E et al 2006, 'Dynamic algorithm selection using reinforcement learning', International Workshop on Integrating AI and Data Mining (AIDM 2006), ed. K-L Ong, K smith-Miles, V Lee, W-K Ng, Institute of Electrical and Electronics Engineers (IEEE Inc), Los Alamitos, p. 8.
- Christen, P & Belacic, D 2005, 'Automated Probabilistic Address Standardisation and Verification', Australasian Data Mining Conference (AusDM 2005), ed. Simeon J Simoff, Graham J Williams, John Galloway, Inna Kolyshkina, University of Technology Sydney, Sydney, Australia, p. 15.
- Christen, P 2005, 'Probabilistic Data Generation for Deduplication and Data Linkage', International Conference on Intelligent Data Engineering and Automated Learning (IDEAL 2005), ed. Marcu Gallagher, James Hogan, Frederic Maire, Springer, Berlin/Heidelberg, pp. 109-116.
- Christen, P & Goiser, K 2005, 'Assessing Deduplication and Data Linkage Quality: What to Measure?', Australasian Data Mining Conference (AusDM 2005), ed. Simeon J Simoff, Graham J Williams, John Galloway, Inna Kolyshkina, University of Technology Sydney, Sydney, Australia, p. 16.
- Churches, T & Christen, P 2004, 'Blind Data Linkage Using n-gram Similarity Comparisons', Pacific Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2004), ed. Honghua Dai, Ramakrishnan Srikant & Chengqi Zhang, Springer, Germany, pp. 121-126.
- Christen, P, Churches, T & Willmore, A 2004, 'A Probabilistic Geocoding System based on a National Address File', Australasian Data Mining Conference (AusDM 2004), ed. Simeon J. Mimoff and Graham J. Williams, Unknown, Sydney, pp. 111-231.
- Christen, P, Churches, T & Hegland, M 2004, 'Febrl - A Parallel Open Source Data Linkage System', Pacific Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2004), ed. Honghua Dai, Ramakrishnan Srikant & Chengqi Zhang, Springer, Germany, pp. 638-647.
- Churches, T & Christen, P 2004, 'Some methods for blindfolded record linkage', BMC Medical Informatics and Decision Making, vol. 4, no. 9, p. 17.
- Baxter, R, Christen, P & Churches, T 2003, 'A Comparison of Fast Blocking Methods for Record Linkage', Workshop on Data Cleaning, Record Linkage and Object Consolidation 2003, Unknown, USA, p. 6.
- Churches, T, Christen, P & Lim, K 2002, 'Preparation of name and address data for record linkage using hidden Markov models', BMC Medical Informatics and Decision Making, vol. 2, no. 1472-6947/2/9.
- Christen, P, Hegland, M, Nielsen, O et al. 2001, 'Towards a Parallel Data Mining Toolbox', International Workshop on Parallel and Distributed Data Mining in conjunction with International Parallel and Distributed Processing Symposium (IPDPS 2001), ed. Pradip Srimani, Institute of Electrical and Electronics Engineers (IEEE Inc), Piscataway, USA.
- Hawkins, S, Williams, G, Baxter, R et al. 2001, 'Data Mining of Administrative Claims Data of Pathology Services', Hawaii International Conference on System Sciences 2001, ed. Ralph H Sprague, Institute of Electrical and Electronics Engineers (IEEE Inc), online http://www.computer.org/proceedings/hicss/0981/0981toc.html.
- Nielsen, O, Christen, P, Hegland, M et al. 2001, 'A Toolbox Approach to Flexible and Efficient Data Mining', in D. Cheung, G. Williams, Li Q (ed.), Advances in Knowledge Discovery and Data Mining (2001), Springer, Berlin, pp. 124-135.
- Nielsen, O, Christen, P, Hegland, M et al. 2001, 'Data Mining with Python', 9th International Python Conference, ed. van Rossum, Foretec Seminars, Restn, USA, pp. 225-232.
- Christen, P, Nielsen, O, Hegland, M et al. 2001, 'Parallel Data Mining on a Beowulf Cluster', International Conference and Exhibition on High Performance Computing in the Asia-Pacific Region HPC Asia 2001, ed. Joseph Young, Australian Partnership for Advanced Computing, Australia.
- Christen, P, Nielsen, O & Hegland, M 2001, 'DMtools - Open Source Software for Database Mining', PKDD-Workshop on Database Support for KDD 2001, ed. Gunter Saake, Kai-Uwe Sattler, Springer, Germany, pp. 27-38.
- Christen, P, Hegland, M, Nielsen, O et al. 2001, 'Scalable parallel algorithms for surface fitting and data mining', Parallel Computing, vol. 27, pp. 941-961.
- Christen, P, Hegland, M, Nielsen, O et al. 2000, 'Algorithms for Predictive Modelling', IEEE International Conference on Data Mining (ICDM 2000), ed. Ebecken, N and Brebbia, CA, WIT Press, Southampton, UK, pp. 423-434.
Projects and Grants
Grants information is drawn from ARIES. To add or update Projects or Grants information please contact your College Research Office.
- Stage 1: Developing the Scottish Historical Population Platform (SHiPP) (Primary Investigator)
- Development of secure and accurate binary encodings for multi-domain privacy-preserving record linkage (Primary Investigator)
- Bloom filters for Privacy Preserving Record Linkage (Primary Investigator)
- Creating the social genome: Advanced techniques for linking dynamic data (Primary Investigator)
- Advancing data integration: Privacy and semantics for record linkage (Primary Investigator)
- Administrative Data Research Centre - Scotland (Primary Investigator)
- Privacy-preserving record linkage on multiple large databases (Primary Investigator)
- A Flexible Data Generator for Privacy-Preserving Data Mining and Record Linkage (Primary Investigator)
- Exposing the anonymous attacker: Detecting identity crimes using real-time resolution on large dynamic databases (Primary Investigator)
- Preference Elicitation for Social Recommendation (Primary Investigator)