Almost two-thirds of organisations suffer from ‘data drift’

Data for computer vision systems becomes out of date within a few months for the majority of organisations.

Tuesday, 22nd November 2022, by Phil Alsop

Almost two-thirds (64%) of organisations suffer from ‘data drift’, where data for computer vision systems becomes out of date after a few months. That’s according to a new study by Mindtech, the developer of the world’s leading platform for the creation of synthetic data for training AI, which surveyed 250 data scientists, AI and machine learning engineers, and computer researchers across the UK.

When it comes to attitudes towards synthetic data, 85% of organisations already use it to train computer vision systems, and cite quality (65%), simplicity (61%), scalability (58%), faster training times (55%), and cost (52%) as its main strengths.

Among those that don’t currently use synthetic data, approximately one in five (21%) say their biggest obstacle is a lack of experience, while 26% name cost as a key barrier. When all respondents were asked whether they trust synthetic data compared with real-world data, 73% said yes.

Respondents also have concerns about real-world data. Some 89% of AI and computer vision professionals are concerned that it will be affected by changing privacy laws and regulations, and 39% are concerned that it slows down computer vision training processes.

Steve Harris, CEO at Mindtech, commented: “Data drift is an ongoing problem for organisations everywhere, which can be a costly issue to solve. Embracing synthetic data can help to overcome these challenges. It is not only faster than real world data for training computer vision systems, but it is also more cost-effective.”

Looking to 2023 and beyond, the outlook for synthetic data adoption is positive. Of those that don’t already use synthetic data, the Mindtech survey revealed that almost a third (29%) anticipate their organisation will start using it in 2023. In addition, the majority (56%) predict that up to 50% of training data will be synthetic within the next three years, while fewer than one in ten (9%) say it will be less than 10%.
