Practical rules for running reliable LLM experiments | Raisolo