Skip to Main Content
Article navigation

Article Type: Editorial From: Journal of Children’s Services, Volume 7, Issue 3

In this edition of the journal we feature two papers involving evaluations with contrasting methods. One measures impact by looking at parents receiving an intervention and comparing their well-being before and after (Palmer et al.,2012). The second is based on a randomised controlled trial (RCT), meaning the progress of children receiving the intervention was contrasted with a control group who did not get it (Kjøbli et al., 2012). Which evaluation should we trust more?

Standards of evidence help to answer this question. There has been a growing interest in standards in the last few years. In the USA, this culminated in May this year in the Director of Office of Management and Budget Jeffrey D. Zients announcing that he will rigorously examine the evidence base of all proposals for funding by the US Federal Government. In the UK, this has been mirrored in interest from the UK Permanent Secretary Sir Jeremy Heywood in “kite marking” evidence-based social policy.

All very good, but which standards should we use to determine if something is evidence-based? In the USA, there are already over 20 sets of standards that are relevant to commissioners of children’s services. Although the UK has come late to the game, several options are available. The Government’s so-called Magenta Book (HM Treasury, 2011) sets a broad direction of travel but is not intended for application to specific questions. The National Institute for Health and Clinical Effectiveness addresses a wide range of questions– including, but extending beyond, impact on outcomes – and therefore takes a less prescriptive approach to standards. The Social Care Institute for Excellence, which looks at what works in social care, eschews standards altogether, preferring an approach that makes best use of various types of evidence from a range of sources.

The National Academy for Parenting Practitioners (http://education.gov.uk/commissioning-toolkit/Programme/Index)has standards that cover a reasonably wide spectrum of evidence of impact,ranking evaluations from simple pre-post test without a control group to the use of two or more RCTs, but its focus is limited to parenting programmes. The Institute for Effective Education’s Best Evidence Encyclopaedia (www.bestevidence.org.uk/)takes a similar approach to education programmes, while the Education Endowment Foundation (http://educationendowmentfoundation.org.uk/)is developing standards that help to identify the best teaching practices.

Organisations like the Centre for Excellence and Outcomes (C4EO) (www.c4eo.org.uk/)and the Centre for Analysis of Youth Transitions (CAYT) (www.ifs.org.uk/centres/cayt)focus on rating the wide variety of innovation in children and youth services,and are therefore less concerned with robust impact evaluations of well established programmes (while accepting their importance).

As editors, a significant part of our day job is taken up with standards of evidence, since we are co-authors of those that underpin Project Oracle (www.london.gov.uk/priorities/crime-community-safety/time-action/project-oracle),the Allen (2011) review of Early Intervention and Blueprints for Healthy Youth Development (www.colorado.edu/cspv/blueprints/). Our specialism has been to collaborate with world experts to produce a high standard (we only rate interventions evaluated with a control group) applied to all aspects of children’s health and development.

If the USA is any guide it will not be long before there are spats, with claims that “my standards are better than yours”. We urge a more considered response.

Plurality is welcome. Our colleague and collaborator on Investing in Children (www.dartington.org.uk/investinginchildren)at the Washington State Institute for Public Policy (www.wsipp.wa.gov/),Steve Aos, calls himself an “independent investment advisor”, but rather than providing counsel on stocks and shares to private investors he explains the likely costs and benefits of competing policy approaches and programmes to politicians and decision-makers. Just as there are many investment advisors in the stock market from which to choose, he feels that there should be many options in the public sector.

From our perspective it all depends on the question. If the focus is parenting programmes, the National Academy’s list is arguably the place to start. For education programmes go to the Best Evidence Encyclopaedia. If the challenge is which children’s services innovations deserve financial backing then the approaches advocated by C4EO or CAYT are strongest. What matters is that each option has clearly articulated and defensible standards.

Dynamism is essential. It is unhelpful to think of standards as fixed and unchanging. The Social Research Unit and the Institute of Effective Education focus on high standards, but we anticipate them getting higher still as practice changes: in the future there will be more evidence-based programmes, practices,policies and processes from which to choose, and therefore a greater need to prioritise which ones to invest in. Each year will bring new methods, new measures and new understandings about the frailties of what is now considered“robust” evaluation. The standards will therefore need to become more nuanced and, no doubt, tougher.

We still depend on human judgement. Standards will generally comprise clearly articulated dimensions – in our case intervention specificity, system readiness, evaluation quality and impact – underpinned by criteria that can be rated by skilled researchers and practitioners. But few cases are clear-cut. What happens when the sample size is on the margins of having sufficient statistical power to demonstrate impact, but the study is otherwise extremely robust? What happens when several evaluations undertaken in collaboration with the programme originator produce positive results, but one without the programme originator’s involvement suggests negative results?In theory it is possible to code all of these eventualities but, in practice,sound, skilled human judgement will add more value.

A “mother of all standards”?

All of this speaks to the need for diversity allied to rigour, and strong collaboration and sharing between those helping to articulate better standards of evidence. At the Social Research Unit, for example, we know that the success of our high standards of evidence depends on the success of organisations like C4EO and CAYT in promoting good innovation: we simply will not be able to build the number of evidence-based programmes and approaches unless more people embark on the journey from innovation to proven impact. We would also like to think that by encouraging children’s services innovators to aspire to meet our high standards in the future we will help C4EO and CAYT in their work.

Inevitably, if spats occur, there will be pressure to bring all of this work together in a “mother of all standards”, an idea that we know from experience would be unmanageable even if it were in any way desirable.

Finally, we continue to argue for humility. We know so little in our field that we are not in a position to speak with absolute authority. There are benefits to drawing successive lines in the sands of changing policy and practice, learning as we go, continually getting better at the difficult task of improving the health and development of our children.

Michael Little, Nick Axford

Allen, G. (2011), Early Intervention: Next Steps, Cabinet Office,London
HM Treasury (2011), The Magenta Book: Guidance for Evaluation,HM Treasury, London
Kjøbli, K., Bjørknes, R. and Askeland, E.(2012), “Adherence to brief parent training as a predictor of parent and child outcomes in real-world settings”, Journal of Children’s Services, Vol. 7 No. 3
Palmer, D., Pittaway, S., Cook, L., Garner, S.,Holtum, S., Sansom, J. and Bassi, C. (2012), “An exploration into the effectiveness of self-help CBT for mothers with mild to moderate depression and/or anxiety in the London Borough of Bexley”, Journal of Children’s Services, Vol. 7 No. 3

or Create an Account

Close Modal
Close Modal