Not all LLMs are created equal
Have you compared finetuning performance for llama on a medical dataset?
Example in another domain:
https://arxiv.org/abs/2309.06126
Have you compared finetuning performance for llama on a medical dataset?
Example in another domain:
https://arxiv.org/abs/2309.06126