Cloud Wars
Data

How to Handle Data, the Real Asset, in AI Projects

By Pablo Moreno | October 31, 2022 | Updated: December 1, 2022 | 4 Mins Read

In Part One of the “Initiating, Executing, and Managing Successful AI Projects in Any Organization” series, we identified the key elements of data projects and showed why projects should be treated as assets that must return value. In Part Two, we discussed how to set up and organize a team to execute artificial intelligence (AI) and data projects successfully. In this third part, we’ll talk about the raw material: the data.

Fundamentals – the Why

Perhaps the most fundamental question is: Why do we need data? If there is no data, there is no project. But let’s be cautious with the converse idea: “If data is there, we do have a project.” The fact that data is available doesn’t mean that it is useful or that an AI project can be executed. Along the same lines, the size of the data does not guarantee an AI project’s feasibility: neither a large nor a small amount of available data secures it.

What does determine feasibility is the “state of data” available, which is going to serve the ultimate goal as described in Part One. By “state of data,” I mean the data’s quality. I’m sure that you have heard about Garbage-In-Garbage-Out (GIGO), and that’s exactly what I mean. Having a large amount of data that is not curated, staged, cleaned, stored, processed, or maintained, is simply data with low-to-no value for an AI project.

In my experience as a data scientist and artificial intelligence developer, I have found that a smaller data size that’s better in quality delivers much better results than having a large amount of low-quality data. I have also seen how great AI projects developed with good-quality data have been ruined because large quantities of low-quality data were added later to retrain the model. I like to tell my partners: Good data is great, but more (i.e., “unknown quality”) data is not.
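One way to act on this is to measure a new batch before it is added to the training data. The sketch below is only an illustration, not a prescribed method: the function names (`quality_report`, `passes_gate`), the field names, and the thresholds are all hypothetical and would need to be chosen per project.

```python
from typing import Any


def quality_report(rows: list[dict[str, Any]], required: list[str]) -> dict[str, float]:
    """Share of rows with a missing required field, and share of exact duplicate rows."""
    if not rows:
        # Assumption: an empty batch is treated as fully missing so it fails the gate.
        return {"missing_rate": 1.0, "duplicate_rate": 0.0}

    missing = sum(
        1 for row in rows
        if any(row.get(field) in (None, "") for field in required)
    )

    # A row counts as a duplicate if an identical row was already seen.
    seen: set[tuple] = set()
    duplicates = 0
    for row in rows:
        key = tuple(sorted((k, repr(v)) for k, v in row.items()))
        if key in seen:
            duplicates += 1
        else:
            seen.add(key)

    return {
        "missing_rate": missing / len(rows),
        "duplicate_rate": duplicates / len(rows),
    }


def passes_gate(report: dict[str, float],
                max_missing: float = 0.05,
                max_duplicates: float = 0.01) -> bool:
    """Hypothetical thresholds: reject batches that are too incomplete or too repetitive."""
    return (report["missing_rate"] <= max_missing
            and report["duplicate_rate"] <= max_duplicates)


# Hypothetical data: a small curated set and a larger batch of unknown quality.
curated = [
    {"customer_id": 1, "amount": 120.0},
    {"customer_id": 2, "amount": 75.5},
]
new_batch = [
    {"customer_id": 3, "amount": None},
    {"customer_id": 4, "amount": 10.0},
    {"customer_id": 4, "amount": 10.0},
    {"customer_id": 5, "amount": ""},
]

curated_report = quality_report(curated, ["customer_id", "amount"])
new_report = quality_report(new_batch, ["customer_id", "amount"])
print(curated_report, passes_gate(curated_report))
print(new_report, passes_gate(new_report))
```

In this example the larger batch fails the gate on both counts, which is the situation described above: more data of unknown quality would dilute a smaller, well-curated set if it were added for retraining without being checked.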

Process – the How

Here are some recommendations for leaders and managers implementing AI projects. If possible, address the following concerns before handing data to the AI project team:


Focus on how data is generated, stored, and maintained: Understand how data is generated, whether from a system, an application, a website, forms, human entry, and so on. Depending on how the data is generated, the project team will face different challenges. If it is system- or machine-generated, make sure you understand the data generation rules: quantity, format, size, latency, system version, etc. If it is human-generated, understand how it is entered, who inputs it, and under what conditions. Context is especially important in this case. The same scrutiny applies to data storage and data maintenance.
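For machine-generated data, the generation rules mentioned above (format, latency, and so on) can be written down and checked record by record. The following is a minimal sketch under stated assumptions: the field names, the five-minute latency limit, and the `check_record` helper are hypothetical and stand in for whatever rules the source system actually follows.

```python
from datetime import datetime, timedelta

# Hypothetical generation rules for a sensor feed.
EXPECTED_FIELDS = {"sensor_id", "event_time", "ingest_time", "reading"}
MAX_LATENCY = timedelta(minutes=5)


def check_record(record: dict) -> list[str]:
    """Return a list of rule violations for one record (empty list means it conforms)."""
    missing = EXPECTED_FIELDS - set(record)
    if missing:
        return [f"missing fields: {sorted(missing)}"]

    try:
        event = datetime.fromisoformat(record["event_time"])
        ingest = datetime.fromisoformat(record["ingest_time"])
        latency = ingest - event
    except (TypeError, ValueError):
        return ["timestamps are not comparable ISO 8601 values"]

    problems = []
    if latency > MAX_LATENCY:
        problems.append("latency above limit")
    if not isinstance(record["reading"], (int, float)):
        problems.append("reading is not numeric")
    return problems


# Hypothetical records illustrating a conforming row and two kinds of violation.
good = {"sensor_id": "A1", "event_time": "2022-10-31T10:00:00",
        "ingest_time": "2022-10-31T10:01:00", "reading": 21.5}
late = {"sensor_id": "A1", "event_time": "2022-10-31T10:00:00",
        "ingest_time": "2022-10-31T10:30:00", "reading": 21.7}
incomplete = {"sensor_id": "A1", "event_time": "2022-10-31T10:00:00",
              "ingest_time": "2022-10-31T10:01:00"}

for name, record in [("good", good), ("late", late), ("incomplete", incomplete)]:
    print(name, check_record(record))
```

Human-entered data usually needs different checks (who entered it, under what conditions), which are harder to encode and typically call for a conversation with the people doing the entry.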

Data needs to be prepared for modeling, beyond what information technology (IT) has already done: This is often not well understood by managers and leaders. Two common situations limit their view of data preparation.

  1. Having an IT department that runs and maintains all databases, data warehouses, etc.
  2. The “Excel” assumption

Data has usually been stored and prepared for historical analysis at an aggregated level; it has never been prepared for disaggregated statistical analysis. Preparing it for modeling means that every single record and every single feature column needs to be carefully analyzed, because that is the level at which artificial intelligence models work.

On the other hand, the assumption that “this can be done easily,” based on how you would do it in Excel, does not hold for large quantities of data. Excel works only with relatively small amounts of data held locally on your computer. Developing a large-scale data solution is a very different situation.
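To illustrate the gap between aggregated reporting and the record-level analysis that modeling requires, here is a small sketch. The sales records, the `aggregated_view` and `record_level_issues` helpers, and the "ten times the median" rule are all hypothetical choices made for the example.

```python
from statistics import mean, median

# Hypothetical records; None marks a missing entry.
records = [
    {"store": "north", "sales": 100.0},
    {"store": "north", "sales": 110.0},
    {"store": "north", "sales": None},
    {"store": "south", "sales": 95.0},
    {"store": "south", "sales": 9500.0},  # possibly a data-entry error
]


def aggregated_view(rows):
    """Average sales per store, silently skipping missing values, as a report might."""
    by_store = {}
    for row in rows:
        if row["sales"] is not None:
            by_store.setdefault(row["store"], []).append(row["sales"])
    return {store: mean(values) for store, values in by_store.items()}


def record_level_issues(rows, factor=10.0):
    """Flag individual records that are missing or far above the median valid value."""
    valid = [row["sales"] for row in rows if row["sales"] is not None]
    typical = median(valid)
    issues = []
    for index, row in enumerate(rows):
        if row["sales"] is None:
            issues.append((index, "missing value"))
        elif row["sales"] > factor * typical:
            issues.append((index, "far above the median"))
    return issues


summary = aggregated_view(records)
issues = record_level_issues(records)
print(summary)
print(issues)
```

The aggregated view returns one tidy number per store and gives no hint that one record is missing and another is suspicious. The record-level pass surfaces both, and a model trained on these rows would be exposed to both.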

Listen first, and let data talk to you: My recommendation is to analyze data with your North Star in mind as the ultimate goal, but “listen” first to determine if the data is capable of delivering what is expected. Perhaps you may discover that it can deliver something else, something that hadn’t been thought of before.

Final Thoughts

To summarize, data is the real asset. It is the foundation upon which artificial intelligence is built. It is critical to understand if the foundation is solid enough to sustain a solid, scalable artificial intelligence solution.

More data is not always better; in fact, it’s much worse if it is not curated and is lacking in quality.

Most importantly, analyze data from a statistical perspective and let it tell you what is possible and what is not.


Pablo Moreno

Business Data Scientist and Project Manager (Waterfall & Agile) with experience in Business Intelligence, Robotics Process Automation, Artificial Intelligence, Advanced Analytics, and Machine Learning across multiple business fields, gained in global business environments over the last 20 years. University Professor of ML and AI, international speaker, and author. Active supporter of open-source software development. Looking to grow with the next challenge.
