
Ask for code, or ask any question and VerifAI's MultiLLM will invoke multiple LLM's in parallel and rank the results. VerifAI's Python open-source MultiLLM framework calls LLMs in parallel and ranks their outputs to find the best results (ground truth). The first use case is comparing code produced by GPT3,5 and Google-Bard. MultiLLM can be extended to support new LLMs and custom ranking function to evaluate a variety of outputs from LLMs.