You'd need a lot more than that to make the comparison meaningful. Compare phone configurations (with some configurations, phone A may use less battery, with other configurations phone B may), different battery distributers (is Anker/Hyperion/OEM the better battery), etc. Just Phone A out of the box uses less battery than phone B out of the box doesn't mean anything to people who frequent forums like these. Most of us consider "out of the box" the point from which we make modifications to the ROMs, settings, apps, launchers, etc. My Note 3, same usage pattern (you can't guarantee exactly the same usage on two phones at the same time, let alone on a phone on different days) shows a MARKED difference in battery usage from ROM to ROM. Turn off all the radios and I can get over 4 days to 90% on some. Run with 2G on and I can barely get 10 hours on others. Another phone may have different sensitivity to radios being on, etc.
And the comparison won't show how the phones will run with YOUR usage pattern, only with the usage pattern of the person conducting the test. Your usage pattern may make the best phone in the comparison perform worse (battery-life-wise) than the worst one. We're not comparing apples to apples like that, we're saying that my grapefruit is larger than your grape. We know that, but so what? It's a meaningless comparison. You have to compare different batteries against different configurations that YOU'RE going to use against different patterns of YOUR usage - against different phones. And that would be a 4-dimensjional graph - that would show that some phones are better in some situations.
If you want great battery life, use an iPhone. They're better than Android in battery usage. (And a lawnmower uses a lot less gas than a truck.)