Publications – Luminous

Scientific Publications

Afzal, M. Z., Ali, S. A., Stricker, D., Eisert, P., Hilsmann, A., Perez-Marcos, D., … & Cuadros, M. (2025). Next generation xr systems-large language models meet augmented and virtual reality. IEEE computer graphics and applications.

Sinha, S., Khan, M. S., Usama, M., Sam, S., Stricker, D., Ali, S. A., & Afzal, M. Z. (2025). MARVEL-40M+: Multi-Level Visual Elaboration for High-Fidelity Text-to-3D Content Creation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 8105-8116).

Khan, M. S., Sinha, S., Sheikh, T. U., Stricker, D., Ali, S. A., & Afzal, M. Z. (2024). Text2CAD: Generating Sequential CAD Models from Beginner-to-Expert Level Text Prompts. Advances in Neural Information Processing Systems, 37, 7552-7579.

Shehzadi, T., Hashmi, K. A., Stricker, D., & Afzal, M. Z. (2024). Sparse Semi-DETR: Sparse Learnable Queries for Semi-Supervised Object Detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 5840-5850).

Khan, M. S. U., Afzal, M. Z., & Stricker, D. (2025). SituationalLLM: Proactive Language Models with Scene Awareness for Dynamic, Contextual Task Guidance. Open Research Europe, 5, 61.

Catinari et al., “Breaking Barriers in Neurorehabilitation: Exploiting the Potential of Immersive Virtual Reality solutions”

Aguirre et al., “Conversational Tutoring in VR Training: The Role of Game Context and State Variables”

Alonso et al., “Vision-Language Models Struggle to Align Entities across Modalities”

Miranda et al., “BIVLC: Extending Vision-Language Compositionality Evaluation with Text-to-Image Retrieval” in Neurips 2024

Alonso et al., “PixT3: Pixel-based Table-To-Text Generation”

G. Grubert et al., “Improving Adaptive Density Control for 3D Gaussian Splatting” International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications

D. Moreno et al., “Multi-Resolution Generative Modeling of Human Motion from Limited Data” in ACM SIGGRAPH Conference on Visual Media Production (CVMP 2024)

W. Morgenstern et al., “Compact 3D Scene Representation via Self-Organizing Gaussian Grids” European Conference on Computer Vision (ECCV 2024)

F. T. Barthel et al., “Gaussian Splatting Decoder for 3D-aware Generative Adversarial Networks” IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW 2024)

K. L. Krause et al., “Realtime-Rendering of Dynamic Scenes with Neural Radiance Fields” in IEEE Conference on Virtual Reality and 3D User Interfaces (IEEE VR 2025)

Bagdasarian et al., “3DGS.zip: A survey on 3D Gaussian Splatting Compression Methods” in Eurographics 2025

Ethics of Language-Augmented Extended Reality: A Scoping Review of Trustworthy AI Practices in LLM-Driven XR Systems in “Elsevier: Journal of Responsible Technology https://www.sciencedirect.com/journal/journal-of-responsible-technology”

Python, G., Salaberria, A., Ferro, M., Lopez de Lacalle, O. & Perez-Marcos, D. (2025). A chatbot to enhance digital anomia therapies by artificial intelligence and large language models: a preliminary report. Stem-, Spraak- en Taalpathologie, Vol. 30 (24th International Science of Aphasia Conference, Copenhagen). in Science of Aphasia Conference 2025

Khan, M. S. U., & Stricker, D. (2026). SIMSPINE: A Biomechanics-Aware Simulation Framework for 3D Spine Motion Annotation and Benchmarking. arXiv preprint arXiv:2602.20792.

Public Deliverables

D5.1	Humans(H) – Requirement No. 1
D5.2	POPD – Requirement No. 2
D5.3	Trustworthy AI – Requirement No. 3