I am currently a lecturer at the University of Queensland, where I conduct research in the field of Information Retrieval. My research focuses on efficient and effective representations for large-scale search engines, including indexing, compression, and retrieval. I am also interested in understanding how to measure improvements in the end-to-end search pipeline, including system-oriented effectiveness measurements and user behaviour analysis. I have a broad interest in empirical experimentation, operating systems, data structures, and algorithms.
Projects
Publications (64)
2026
5 publications- Fast, Compact, Immediate-Access Indexing for Learned Sparse Retrieval Systems
Lecture notes in computer science · 2026
- Practical, Efficient, In-Memory Inverted Indexes
Lecture notes in computer science · 2026
- Revisiting Human-vs-LLM judgments using the TREC Podcast Track
arXiv (Cornell University) · 2026
- Simple Techniques for Efficient Top-<i>k </i>Batch Query Processing
Information Retrieval Research · 2026
- When LLM Judges Inflate Scores: Exploring Overrating in Relevance Assessment
ArXiv.org · 2026
2025
9 publications- Natural language processing for the legal domain: A survey of tasks, datasets, models, and challenges
ACM Computing Surveys 58 (6), 1-37 · 2025
- Report from the 4th Strategic Workshop on Information Retrieval in Lorne (SWIRL 2025)
ACM SIGIR Forum 59 (1), 1-68 · 2025
- Examining the Impact of Transcript Variation on Podcast Search and Re-ranking
Lecture notes in computer science · 2025
- A Flexible Resource for Top-Weighted Comparisons Between Sets and Rankings
2025
- Approximate Bag-of-Words Top-k Corpus Graphs
Lecture notes in computer science · 2025
- Batched k-Mer Lookup on the Spectral Burrows-Wheeler Transform
Society for Industrial and Applied Mathematics eBooks · 2025
- Empirical Asymptotic Growth of Dynamic Pruning Mechanisms
Proceedings of the 2025 Annual International ACM SIGIR Conference on · 2025
- Efficient In-Memory Inverted Indexes: Theory and Practice
2025
- Reassessing Collaborative Writing Theories and Frameworks in the Age of LLMs: What Still Applies and What We Must Leave Behind
ArXiv.org · 2025
2024
7 publications- What do Users Really Ask Large Language Models? An Initial Log Analysis of Google Bard Interactions in the Wild
2024
- Rank-Biased Quality Measurement for Sets and Rankings
2024
- ReNeuIR at SIGIR 2024: The Third Workshop on Reaching Efficiency in Neural Information Retrieval
2024
- Revisiting Document Expansion and Filtering for Effective First-Stage Retrieval
2024
- Re-evaluating the Command-and-Control Paradigm in Conversational Search Interactions
2024
- What do Users Really Ask Large Language Models?
Proceedings of the SIGIR’24, July 14–18 · 2024
- How much freedom does an effectiveness metric really have?
Journal of the Association for Information Science and Technology · 2024
2023
8 publications- ReNeuIR at SIGIR 2023: The Second Workshop on Reaching Efficiency in Neural Information Retrieval
2023
- Efficient immediate-access dynamic indexing
Information Processing & Management · 2023
- A proposed efficiency benchmark for modern information retrieval systems
ReNeuIR at SIGIR 2024: the third workshop on reaching efficiency in neural · 2023
- Index-Based Batch Query Processing Revisited
Lecture notes in computer science · 2023
- ADCS’22: Proceedings of the 26th Australasian Document Computing Symposium
2023
- Exploring the Representation Power of SPLADE Models
2023
- Lossy Compression Options for Dense Index Retention
2023
- Profiling and Visualizing Dynamic Pruning Algorithms
2023
2022
12 publications- Faster Learned Sparse Retrieval with Guided Traversal
Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval · 2022
- A Flexible Framework for Offline Effectiveness Metrics
Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval · 2022
- Accelerating Learned Sparse Indexes Via Term Impact Decomposition
2022
- Efficient Document-at-a-Time and Score-at-a-Time Query Evaluation for Learned Sparse Representations
ACM Transactions on Information Systems · 2022
- Ioqp: A simple impact-ordered query processor written in rust
CEUR Workshop Proceedings 3480, 22-34 · 2022
- Tradeoff Options for Bipartite Graph Partitioning
IEEE Transactions on Knowledge and Data Engineering · 2022
- BUM at CheckThat! 2022: a composite deep learning approach to fake news detection using evidence retrieval
CEUR Workshop Proceedings 3180, 564-572 · 2022
- Efficient query processing techniques for next-page retrieval
Information Retrieval · 2022
- Immediate-Access Indexing Using Space-Efficient Extensible Arrays
2022
- A Common Framework for Exploring Document-at-a-Time and Score-at-a-Time Retrieval Methods
Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval · 2022
- Greetings from the ADCS 2022 Chairs
ACM International Conference Proceeding Series, III-IV · 2022
- Report on the 25th Australasian Document Computing Symposium (ADCS 2021)
ACM SIGIR Forum · 2022
2021
12 publications- Wacky Weights in Learned Sparse Representations and the Revenge of Score-at-a-Time Query Evaluation
arXiv (Cornell University) · 2021
- ERR is not C/W/L: Exploring the Relationship Between Expected Reciprocal Rank and Other Metrics
2021
- Faster Index Reordering with Bipartite Graph Partitioning
2021
- Different keystrokes for different folks: visualizing crowdworker querying behavior
Proceedings of the 2021 Conference on Human Information Interaction and · 2021
- Modality Effects When Simulating User Querying Tasks
2021
- On the Separation of Logical and Physical Ranking Models for Text Retrieval Applications.
2021
- A Sensitivity Analysis of the MSMARCO Passage Collection
arXiv (Cornell University) · 2021
- Anytime Ranking on Document-Ordered Indexes
ACM Transactions on Information Systems · 2021
- Conferences, journals, preprints, and reviewer expectations
ACM SIGIR Forum · 2021
- Cost-Effective Updating of Distributed Reordered Indexes
Australasian Document Computing Symposium · 2021
- Greetings from the ADCS 2021 General Chairs
ACM International Conference Proceeding Series · 2021
- Proceedings of the 25th Australasian Document Computing Symposium
ACM · 2021
2020
6 publications- CC-News-En: A large English news corpus
Proceedings of the 29th ACM international conference on information · 2020
- CC-News-En
2020
- Supporting Interoperability Between Open-Source Search Engines with the Common Index File Format
2020
- Efficiency Implications of Term Weighting for Passage Retrieval
2020
- Examining the Additivity of Top-k Query Processing Innovations
2020
- Managing tail latency in large scale information retrieval systems
ACM SIGIR Forum · 2020
2019
5 publications- Boosting search performance using query variations
ACM Transactions on Information Systems (TOIS) 37 (4), 1-25 · 2019
- Exploring User Behavior in Email Re-Finding Tasks
2019
- PISA: Performant indexes and search for academia
RMIT Research Repository (RMIT University Library) · 2019
- Compressing Inverted Indexes with Recursive Graph Bisection: A Reproducibility Study
Lecture notes in computer science · 2019
- Accelerated Query Processing Via Similarity Score Prediction
2019
