Publications (92)
2026
12 publications
- Beyond chunk-then-embed: A comprehensive taxonomy and evaluation of document chunking strategies for information retrieval
arXiv preprint arXiv:2602.16974 · 2026
- The Vulnerability of LLM Rankers to Prompt Injection Attacks
arXiv preprint arXiv:2602.16752 · 2026
- AutoBool: Reinforcement-Learned LLM for Effective Automatic Systematic Reviews Boolean Query Generation
Proceedings of the 19th Conference of the European Chapter of the · 2026
- CXRMate-2: Structured Multimodal Temporal Embeddings and Tractable Reinforcement Learning for Clinically Acceptable Chest X-ray Radiology Report Generation
arXiv preprint arXiv:2604.18967 · 2026
- Can It Reach the Generator? Investigating the Survival of Prompt-Injection Attacks in Realistic RAG Settings
arXiv preprint arXiv:2605.28017 · 2026
- DiffRetriever: Parallel Representative Tokens for Retrieval with Diffusion Language Models
arXiv preprint arXiv:2605.07210 · 2026
- On the impact of retrieved content representations in RAG Pipelines
arXiv preprint arXiv:2605.30790 · 2026
- Rapid, Agile Development and Evaluation of Retrieval Augmented Generation Systems Without Labels
Lecture notes in computer science · 2026
- Starbucks: Improved Training for 2D Matryoshka Embeddings
European Conference on Information Retrieval, 67-82 · 2026
- The drawing is outlined in …
European Conference on Information Retrieval, 67-82 · 2026
- Toward Clinically Acceptable Chest X-ray Report Generation: A Qualitative Retrospective Pilot Study of CXRMate-2
arXiv e-prints, arXiv:2604.18967 · 2026
- Whole-Pool Setwise Reranking with Long-Context Language Models
arXiv preprint arXiv:2606.01782 · 2026
2025
19 publications
- Rank-R1: Enhancing reasoning in LLM-based document rerankers via reinforcement learning
arXiv preprint arXiv:2503.06034 · 2025
- Report from the 4th Strategic Workshop on Information Retrieval in Lorne (SWIRL 2025)
ACM SIGIR Forum 59 (1), 1-68 · 2025
- LLM-VPRF: Large Language Model Based Vector Pseudo Relevance Feedback
arXiv preprint arXiv:2504.01448 · 2025
- Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-ranking
Lecture notes in computer science · 2025
- VISA: Retrieval Augmented Generation with Visual Source Attribution
2025
- Set-Encoder: Permutation-Invariant Inter-passage Attention for Listwise Passage Re-ranking with Cross-Encoders
Lecture notes in computer science · 2025
- AutoBool: An Reinforcement-Learning trained LLM for Effective Automated Boolean Query Generation for Systematic Reviews
arXiv preprint arXiv:2602.00005 · 2025
- Document Screenshot Retrievers are Vulnerable to Pixel Poisoning Attacks
2025
- ReSLLM: Large Language Models are Strong Resource Selectors for Federated Search
2025
- Reassessing Large Language Model Boolean Query Generation for Systematic Reviews
2025
- 2D Matryoshka Training for Information Retrieval
2025
- Pseudo-Relevance Feedback Can Improve Zero-Shot LLM-Based Dense Retrieval
arXiv e-prints, arXiv:2503.14887 · 2025
- RARR Unraveled: Component-Level Insights into Hallucination Detection and Mitigation
2025
- AEHRC at BioLaySumm 2025: Leveraging T5 for Lay Summarisation of Radiology Reports
2025
- Automated chest X-ray report generation remains unsolved
Faculty of 1000 Research Ltd · 2025
- Humans are more gullible than LLMs in believing common psychological myths
ArXiv.org · 2025
- Pseudo Relevance Feedback is Enough to Close the Gap Between Small and Large Dense Retrieval Models
ArXiv.org · 2025
- SIGIR-AP 2025 Tutorial on Retrieval and Ranking with LLMs (R2LLMs)
Proceedings of the 2025 Annual International ACM SIGIR Conference on · 2025
- The Impact of Auxiliary Patient Data on Automated Chest X-Ray Report Generation and How to Incorporate It
2025
2024
15 publications
- A Setwise Approach for Effective and Highly Efficient Zero-shot Ranking with Large Language Models
2024
- PromptReps: Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval
2024
- Zero-Shot Generative Large Language Models for Systematic Review Screening Automation
Lecture notes in computer science · 2024
- Longitudinal data and a semantic similarity reward for chest X-ray report generation
Informatics in Medicine Unlocked · 2024
- Generating synthetic clinical text with local large language models to identify misdiagnosed limb fractures in radiology reports
Artificial Intelligence in Medicine · 2024
- Understanding and Mitigating the Threat of Vec2Text to Dense Retrieval Systems
2024
- A Reproducibility Study of Goldilocks: Just-Right Tuning of BERT for TAR
Lecture notes in computer science · 2024
- e-Health CSIRO at RRG24: Entropy-Augmented Self-Critical Sequence Training for Radiology Report Generation
2024
- Dense Retrieval with Continuous Explicit Feedback for Systematic Review Screening Prioritisation
2024
- Team IELAB at TREC Clinical Trial Track 2023: Enhancing Clinical Trial Retrieval with Neural Rankers and Large Language Models
arXiv (Cornell University) · 2024
- e-Health CSIRO at “Discharge Me!” 2024: Generating Discharge Summary Sections with Fine-tuned Language Models
2024
- Does Vec2Text Pose a New Corpus Poisoning Threat?
arXiv (Cornell University) · 2024
- Searching in Professional Instant Messaging Applications: User Behaviour, Intent, and Pain-points
2024
- Starbucks-v2: Improved Training for 2D Matryoshka Embeddings
arXiv (Cornell University) · 2024
- TPRF: A Transformer-based Pseudo-Relevance Feedback Model for Efficient and Effective Retrieval
arXiv (Cornell University) · 2024
2023
13 publications
- Can ChatGPT Write a Good Boolean Query for Systematic Review Literature Search?
2023
- Improving chest X-ray report generation by leveraging warm starting
Artificial Intelligence in Medicine · 2023
- Dr ChatGPT tell me what I want to hear: How different prompts impact health answer correctness
2023
- ChatGPT Hallucinates when Attributing Answers
2023
- Open-source Large Language Models are Strong Zero-shot Query Likelihood Models for Document Ranking
2023
- Pseudo Relevance Feedback with Deep Language Models and Dense Retrievers: Successes and Pitfalls
ACM Transactions on Information Systems · 2023
- A concise model for medical image captioning
CLEF 2023: Conference and Labs of the Evaluation Forum · 2023
- AgAsk: an agent to help answer farmer’s questions from scientific documents
International Journal on Digital Libraries · 2023
- Generating Natural Language Queries for More Effective Systematic Review Screening Prioritisation
2023
- Catching misdiagnosed limb fractures in the emergency department using cross-institution transfer learning
Proceedings of the 21st Annual Workshop of the Australasian Language · 2023
- e-Health CSIRO at RadSum23: Adapting a Chest X-Ray Report Generator to Multimodal Radiology Report Summarisation
2023
- From Free-text Drug Labels to Structured Medication Terminology with BERT and GPT
PubMed · 2023
- AgAsk: A Conversational Search Agent for Answering Agricultural Questions
2023
2022
12 publications
- From little things big things grow: A collection with seed studies for medical systematic review literature search
Proceedings of the 45th International ACM SIGIR Conference on Research and · 2022
- Automated MeSH term suggestion for effective query formulation in systematic reviews literature search
Intelligent Systems with Applications · 2022
- Neural Rankers for Effective Screening Prioritisation in Medical Systematic Review Literature Search
2022
- The impact of query refinement on systematic review literature search: A query log analysis
Proceedings of the 2022 ACM SIGIR International Conference on Theory of · 2022
- CSIRO at ImageCLEFmedical caption 2022
Proceedings of the Working Notes of CLEF 2022-Conference and Labs of the · 2022
- How Does Feedback Signal Quality Impact Effectiveness of Pseudo Relevance Feedback for Passage Retrieval
Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval · 2022
- CSIRO at the ImageCLEFmedical 2022 Tuberculosis Caverns Detection Challenge: A 2D and 3D Deep Learning Detection Network Approach
CLEF 2022: Conference and Labs of the Evaluation Forum · 2022
- ImageCLEF 2021 Best of Labs: The Curious Case of Caption Generation for Medical Images
Lecture notes in computer science · 2022
- Agvaluate
The University of Queensland · 2022
- ImageCLEF 2021 Best of Labs: The Curious Case of Caption Generation
Experimental IR Meets Multilinguality, Multimodality, and Interaction: 13th · 2022
- Self-learning ontological concept representation for searching and matching tasks
OM@ ISWC, 73-78 · 2022
- Semantic Search for Large Scale Clinical Ontologies
arXiv (Cornell University) · 2022
2021
4 publications
- Search Engines vs. Symptom Checkers: A Comparison of their Effectiveness for Online Health Advice
2021
- Precision Medicine Search for Paediatric Oncology
2021
- AEHRC CSIRO at ImageCLEFmed Caption 2021
Griffith Research Online (Griffith University, Queensland, Australia) · 2021
- Cohort-based Clinical Trial Retrieval
Australasian Document Computing Symposium · 2021
2020
8 publications
- Automatic Boolean Query Formulation for Systematic Review Literature Search
2020
- A comparison of automatic Boolean query formulation for systematic reviews
Information Retrieval · 2020
- Do better search engines really equate to better clinical decisions? If not, why not?
Journal of the Association for Information Science and Technology · 2020
- A Computational Approach for Objectively Derived Systematic Review Search Strategies
Lecture notes in computer science · 2020
- How searching under time pressure impacts clinical decision making
Journal of the Medical Library Association JMLA · 2020
- You Can Teach an Old Dog New Tricks: Rank Fusion applied to Coordination Level Matching for Ranking in Systematic Reviews
Lecture notes in computer science · 2020
- How a Conversational Agent Might Help Farmers in the Field
ACM Reference Format · 2020
- Sampling Query Variations for Learning to Rank to Improve Automatic Boolean Query Generation in Systematic Reviews
2020
2019
9 publications
- Automatic Boolean Query Refinement for Systematic Review Literature Search
2019
- Health Cards for Consumer Health Search
2019
- Payoffs and pitfalls in using knowledge-bases for consumer health search
Information Retrieval Journal 22 (3), 350-394 · 2019
- Health card retrieval for consumer health search: An empirical investigation of methods
Proceedings of the 28th ACM International Conference on Information and · 2019
- WSDM 2019 Tutorial on Health Search (HS2019) A Full-Day from Consumers to Clinicians
Proceedings of the Twelfth ACM International Conference on Web Search and · 2019
- Impact of a Search Engine on Clinical Decisions Under Time and System Effectiveness Constraints: Research Protocol
JMIR Research Protocols · 2019
- Health Cards to Assist Decision Making in Consumer Health Search
PubMed Central · 2019
- Learning Inter-Sentence, Disorder-Centric, Biomedical Relationships from Medical Literature
PubMed · 2019
- Taskiir_study_data
UQ eSpace · 2019
