| Afzal, M. Z., Ali, S. A., Stricker, D., Eisert, P., Hilsmann, A., Perez-Marcos, D., … & Cuadros, M. (2025). Next generation xr systems-large language models meet augmented and virtual reality. IEEE computer graphics and applications. |
| Sinha, S., Khan, M. S., Usama, M., Sam, S., Stricker, D., Ali, S. A., & Afzal, M. Z. (2025). MARVEL-40M+: Multi-Level Visual Elaboration for High-Fidelity Text-to-3D Content Creation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 8105-8116). |
| Khan, M. S., Sinha, S., Sheikh, T. U., Stricker, D., Ali, S. A., & Afzal, M. Z. (2024). Text2CAD: Generating Sequential CAD Models from Beginner-to-Expert Level Text Prompts. Advances in Neural Information Processing Systems, 37, 7552-7579. |
| Shehzadi, T., Hashmi, K. A., Stricker, D., & Afzal, M. Z. (2024). Sparse Semi-DETR: Sparse Learnable Queries for Semi-Supervised Object Detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 5840-5850). |
| Khan, M. S. U., Afzal, M. Z., & Stricker, D. (2025). SituationalLLM: Proactive Language Models with Scene Awareness for Dynamic, Contextual Task Guidance. Open Research Europe, 5, 61. |
| Catinari et al., “Breaking Barriers in Neurorehabilitation: Exploiting the Potential of Immersive Virtual Reality solutions” |
| Aguirre et al., “Conversational Tutoring in VR Training: The Role of Game Context and State Variables” |
| Alonso et al., “Vision-Language Models Struggle to Align Entities across Modalities” |
| Miranda et al., “BIVLC: Extending Vision-Language Compositionality Evaluation with Text-to-Image Retrieval” in Neurips 2024 |
| Alonso et al., “PixT3: Pixel-based Table-To-Text Generation” |
| G. Grubert et al., “Improving Adaptive Density Control for 3D Gaussian Splatting” International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications |
| D. Moreno et al., “Multi-Resolution Generative Modeling of Human Motion from Limited Data” in ACM SIGGRAPH Conference on Visual Media Production (CVMP 2024) |
| W. Morgenstern et al., “Compact 3D Scene Representation via Self-Organizing Gaussian Grids” European Conference on Computer Vision (ECCV 2024) |
| F. T. Barthel et al., “Gaussian Splatting Decoder for 3D-aware Generative Adversarial Networks” IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW 2024) |
| K. L. Krause et al., “Realtime-Rendering of Dynamic Scenes with Neural Radiance Fields” in IEEE Conference on Virtual Reality and 3D User Interfaces (IEEE VR 2025) |
| Bagdasarian et al., “3DGS.zip: A survey on 3D Gaussian Splatting Compression Methods” in Eurographics 2025 |
| Ethics of Language-Augmented Extended Reality: A Scoping Review of Trustworthy AI Practices in LLM-Driven XR Systems in “Elsevier: Journal of Responsible Technology https://www.sciencedirect.com/journal/journal-of-responsible-technology” |
| Python, G., Salaberria, A., Ferro, M., Lopez de Lacalle, O. & Perez-Marcos, D. (2025). A chatbot to enhance digital anomia therapies by artificial intelligence and large language models: a preliminary report. Stem-, Spraak- en Taalpathologie, Vol. 30 (24th International Science of Aphasia Conference, Copenhagen). in Science of Aphasia Conference 2025 |
| Khan, M. S. U., & Stricker, D. (2026). SIMSPINE: A Biomechanics-Aware Simulation Framework for 3D Spine Motion Annotation and Benchmarking. arXiv preprint arXiv:2602.20792. |