BAD DATA CANNOT BE FIXED BY A GOOD ALGORITHM: WHY DATA QUALITY WILL DETERMINE GHANA'S AI FUTURE

INTRODUCTION

Ghana's National Artificial Intelligence Strategy (2025–2035) reflects an ambitious vision for the country's digital future. It seeks to harness artificial intelligence to drive economic growth, improve public service delivery, stimulate innovation, and position Ghana as a leading participant in Africa's emerging AI ecosystem. The strategy rightly recognises that achieving these objectives will require investments in skills, infrastructure, research, governance, and institutional capacity.

Among its most important insights is the recognition that data is a strategic asset. The strategy places considerable emphasis on data governance, digital public infrastructure, local language datasets, and the broader data ecosystems required to support AI development and deployment. This reflects an important reality: artificial intelligence cannot function without data. Every AI system, whether deployed in healthcare, agriculture, education, finance, or government, depends on data as its foundation.

Yet it is precisely at this point that a critical question emerges. Discussions about artificial intelligence often focus on algorithms, computing power, and technological innovation. While these are important considerations, they can obscure a more fundamental issue. The effectiveness of an AI system depends not only on the sophistication of the technology, but also on the quality of the information from which it learns.

This distinction matters because weaknesses in data do not remain confined to the datasets from which they originate. They are reproduced through the systems built upon them. Inaccuracies become inaccurate outputs. Historical biases become automated biases. Gaps in data become gaps in decision-making. The consequences are amplified rather than corrected.

This is why a fundamental principle has emerged across the field of artificial intelligence: bad data cannot be fixed by a good algorithm. No matter how sophisticated the model, the quality of its outputs will ultimately be constrained by the quality of its inputs.

For Ghana, this raises an important question. If data is the foundation upon which artificial intelligence is built, then attention must extend beyond the availability of data to its quality, representativeness, integrity, and governance. Whether these dimensions receive sufficient attention may ultimately determine whether the ambitions contained in the National Artificial Intelligence Strategy are realised in practice.

This article argues that while Ghana's AI strategy correctly identifies data as a strategic resource, the country's long-term success in artificial intelligence will depend less on access to algorithms and more on the quality of the data on which those algorithms are trained. In this respect, data quality is not simply a technical concern. It is a governance issue, a strategic issue, and ultimately a prerequisite for achieving Ghana's AI ambitions.

DATA, NOT ALGORITHMS

Artificial intelligence is often discussed through the language of algorithms, models, and computing power. This has contributed to a common perception that the intelligence of an AI system resides primarily in the algorithm itself. The more sophisticated the algorithm, the assumption goes, the better the outcome. While algorithms are undeniably important, this view can be misleading.

Algorithms do not create knowledge independently. They do not possess an inherent understanding of the world. Rather, they identify patterns within data and use those patterns to generate predictions, recommendations, classifications, and decisions. Their effectiveness therefore depends fundamentally on the quality of the information from which they learn. An algorithm can only work with the data it is given.

This distinction is important because it shifts attention from technology alone to the information that makes technology useful. A highly sophisticated algorithm trained on incomplete, inaccurate, or unrepresentative data may produce unreliable outcomes. Conversely, a relatively simple model trained on high-quality data may generate results that are accurate, useful, and trustworthy. The intelligence often attributed to artificial intelligence is therefore inseparable from the quality of the data that underpins it.

One of the strengths of Ghana's National Artificial Intelligence Strategy is its recognition that data is not merely a supporting resource for artificial intelligence, but a strategic asset in its own right. While public discussions about AI often focus on algorithms, applications, and technological breakthroughs, the strategy acknowledges that none of these can exist without the data ecosystems that sustain them.

This recognition is reflected throughout the strategy. Significant attention is given to data governance, digital public infrastructure, local language datasets, and the broader information environment required to support AI development and deployment. The strategy also identifies data as a critical enabler of innovation, research, public service transformation, and economic growth. In doing so, it acknowledges an important reality: the value of artificial intelligence is inseparable from the quality of the information on which it depends.

This reality is particularly important because access to AI technology is becoming less of a differentiating factor. Powerful models are increasingly available through commercial platforms, open-source initiatives, and global technology ecosystems. The challenge facing many countries is therefore not whether they can access artificial intelligence, but whether they possess the data required to use it effectively.

Particularly noteworthy in Ghana's strategy is the emphasis placed on context-specific and locally relevant data. The ambition to develop AI systems that understand Ghanaian languages, operate effectively within local contexts, and address domestic challenges recognises that artificial intelligence cannot simply be imported and expected to perform optimally. AI systems learn from the data on which they are trained. Where that data fails to reflect local realities, the usefulness and reliability of the resulting systems may be limited.

A sophisticated algorithm trained on incomplete, inaccurate, or unrepresentative data will often produce unreliable outcomes. Conversely, a relatively simple model trained on high-quality data may generate results that are more accurate, useful, and trustworthy. In practice, the effectiveness of artificial intelligence is shaped as much by the quality of its data as by the sophistication of its technology.

Indeed, Ghana's National Artificial Intelligence Strategy itself identifies insufficient local datasets that are accurate, updated, and representative as a weakness within the country's AI ecosystem, while also recognising the risks associated with AI models trained on data that is not fit for the local context.

For countries seeking to build AI-enabled economies, this distinction matters. The strategic question is no longer simply whether artificial intelligence works. In many respects, it already does. The more important question is whether the data on which it depends is sufficiently accurate, representative, and trustworthy to support meaningful decision-making. It is here that Ghana's AI ambitions encounter their most important challenge.

THE DATA CHALLENGE BEHIND GHANA'S AI AMBITIONS

The importance of data becomes even more apparent when viewed through the ambitions set out in Ghana's National Artificial Intelligence Strategy. The strategy envisions artificial intelligence supporting transformation across sectors including healthcare, agriculture, education, finance, and public administration. It also seeks to promote local language technologies, strengthen research and innovation, and expand the use of AI within government services.

As access to artificial intelligence becomes increasingly widespread, the strategic challenge for Ghana may no longer be the technology itself. Powerful AI models are becoming more accessible through global technology ecosystems, commercial platforms, and open-source initiatives. Increasingly, the more important question is whether the country possesses the quality of data required to use that technology effectively. In many respects, Ghana's AI challenge is becoming a data challenge.

Consider, for example, the aspiration to develop AI systems capable of understanding and operating within Ghana's linguistic and cultural context. Such systems require large volumes of accurate, representative, and well-structured language data. The challenge is not simply the existence of local languages, but the availability of datasets that capture their diversity, usage, and evolving forms. Without such datasets, locally relevant AI remains difficult to achieve regardless of the sophistication of the underlying technology.

The same principle applies across other sectors. AI applications in healthcare depend on accurate and consistent health records. AI systems in agriculture rely on reliable information about weather patterns, soil conditions, crop performance, and farming practices. AI-enabled public services require administrative records that are complete, interoperable, and maintained to consistent standards. In each case, the effectiveness of artificial intelligence is directly linked to the quality of the underlying data.

The challenge therefore extends beyond data availability. Large quantities of data do not automatically translate into useful datasets. Information may be fragmented across institutions, collected according to different standards, contain gaps or inconsistencies, or fail to adequately represent the populations and contexts they are intended to serve. Indeed, the Strategy's own diagnostic assessment identifies insufficient local datasets that are accurate, updated, and representative, while also highlighting the fragmentation of data across sectors and the absence of consistent practices for data collection, digitisation, and sharing.

This distinction is important because the success of Ghana's AI strategy will depend not only on technological capability, but also on the robustness of the data ecosystems supporting that capability. The challenge facing Ghana is therefore not simply one of technological readiness. It is also one of data readiness.

The implications are significant. Poor-quality data does not merely create isolated inaccuracies. Artificial intelligence can amplify those weaknesses at scale. Incomplete or inconsistent data can produce unreliable outputs. Historical patterns of exclusion or underrepresentation can be learned and reproduced by AI systems. The apparent sophistication of AI can also create a false sense of confidence, obscuring weaknesses in the underlying information on which decisions are based. The COVID-19 pandemic illustrates this principle starkly. Health systems with high-quality, representative patient data were able to deploy AI diagnostic and prediction tools rapidly. These systems could identify patterns, predict hospitalisation needs, and guide resource allocation. By contrast, health systems with fragmented patient records, incomplete demographic data, or inconsistent health metrics struggled to deploy AI effectively—not because they lacked access to sophisticated algorithms, but because their underlying data was not trustworthy enough to train and deploy AI systems safely. The bottleneck was not technological. It was data quality and governance.

These concerns are reflected in Ghana's National Artificial Intelligence Strategy, which recognises the risks associated with bias, underrepresentation, and weaknesses in data governance when AI systems are deployed at scale. As artificial intelligence becomes integrated into more sectors of the economy, weaknesses in data can become systemic rather than isolated. Errors are repeated more quickly. Biases affect more people. Inconsistencies are reproduced across interconnected systems. The scale that makes artificial intelligence valuable can also magnify its shortcomings.

Artificial intelligence does not eliminate weaknesses in data. It amplifies them. Where the data is strong, AI can generate significant value. Where the data is weak, the technology may simply automate and scale existing problems. The question, therefore, is not whether artificial intelligence can transform decision-making, but whether the information driving those decisions is sufficiently trustworthy to justify that transformation.

DATA GOVERNANCE IS AI GOVERNANCE

If Ghana's AI ambitions are increasingly dependent on the quality of the data on which AI systems rely, then questions about data quality inevitably become questions about governance.

Discussions about AI governance often focus on ethics, regulation, transparency, accountability, and risk management. These are important considerations. However, they can create the impression that governance begins once an AI system has been developed and deployed. In reality, governance begins much earlier.

It begins with data. Every artificial intelligence system is shaped by decisions about what data is collected, how it is collected, who collects it, how it is stored, who has access to it, and how it is used. These decisions influence not only the performance of AI systems, but also the fairness, reliability, and legitimacy of the outcomes they produce. Data governance is therefore not a separate issue from AI governance. It is its foundation.

This perspective is reflected in Ghana's National Artificial Intelligence Strategy, which identifies data access and governance as a dedicated strategic pillar, recognising that the effectiveness of AI depends on the quality, availability, and stewardship of the data on which it relies.

This distinction is important because data quality is not determined solely by technical processes. It is influenced by institutional arrangements, governance frameworks, and policy choices. Questions about data quality inevitably become questions about responsibility and control. Who determines what data is collected? What standards govern its quality? Who verifies its accuracy? Who is accountable when errors exist? And who decides whether a dataset is sufficiently representative to support decisions affecting citizens?

Data governance is not fundamentally a technology project; it is a governance and organisational discipline. Decisions about data quality, ownership, accountability, stewardship, and trust are ultimately institutional decisions that technology alone cannot resolve.

These questions are particularly relevant in the context of Ghana's AI ambitions. As artificial intelligence becomes embedded within public services, economic systems, and social institutions, the quality of data can no longer be viewed as a technical matter delegated solely to information technology professionals. It becomes a matter of public policy and governance. Decisions about data shape decisions made by AI. In this sense, the governance of data directly influences the governance of artificial intelligence itself.

The issue is not simply whether data exists, but whether it can be trusted. Trustworthy data requires more than volume. It requires standards, oversight, quality assurance, accountability mechanisms, and institutional responsibility. Without these foundations, the effectiveness of even the most advanced AI systems may be compromised.

International and continental frameworks increasingly recognise this principle. UNESCO's Data Governance Toolkit provides detailed guidance for structuring data quality and governance across the full data lifecycle—from planning and collection through processing, analysis, and use. The African Union's AI Strategy, directly relevant to Ghana as an African nation, emphasises that data quality, sovereignty, and governance are foundational to Africa's AI development. These frameworks reflect a critical consensus: data quality is not optional. It is foundational to responsible AI deployment.

If data governance is indeed foundational to AI governance, then the critical question for Ghana is not merely whether the country understands this principle. It is whether Ghana can translate this understanding into institutional practice. Recognising that data governance matters is the first step. Building the governance mechanisms that deliver trustworthy data is the next.

Viewed from this perspective, several priorities emerge. One is the need for stronger institutional arrangements to assure data quality within sectors where artificial intelligence is expected to play a significant role, particularly healthcare, agriculture, education, and finance. As AI systems become increasingly dependent on large and complex datasets, ensuring accuracy, completeness, representativeness, and consistency becomes a matter of governance rather than technical administration alone.

A second priority concerns interoperability. The value of artificial intelligence is often constrained where data remains fragmented across institutions and systems. As Ghana's digital ecosystem continues to evolve, greater attention may need to be given to the standards, governance arrangements, and accountability mechanisms that enable information to be shared and used effectively across organisational boundaries while maintaining privacy and security.

A third consideration is the question of data readiness. The effectiveness of AI systems ultimately depends on the condition of the datasets on which they are built. Understanding the strengths, weaknesses, gaps, and limitations of existing datasets may therefore become increasingly important in determining where investments in data quality and governance should be directed.

These priorities are not separate from Ghana's broader AI ambitions. They are part of the institutional foundation upon which those ambitions depend. If artificial intelligence is to generate reliable and trustworthy outcomes, the governance of data cannot remain an afterthought. It must become an integral component of AI readiness itself.

This has important implications for Ghana's broader digital transformation agenda. Over the past decade, considerable effort has been invested in digitising records, services, and transactions across both the public and private sectors. These initiatives have created valuable digital foundations. Artificial intelligence, however, introduces a different requirement. The question is no longer simply whether information has been digitised, but whether the underlying data is sufficiently accurate, complete, representative, and trustworthy to support intelligent systems.

The long-term success of Ghana's AI strategy may therefore depend less on the collection of new data and more on the institutionalisation of governance mechanisms capable of improving, maintaining, and assuring the quality of existing datasets. Artificial intelligence rewards not the quantity of data that has been digitised, but the quality of the information from which it learns.

This is why the conversation about Ghana's AI future cannot be limited to algorithms, computing infrastructure, or technological innovation. It must also address the governance arrangements that determine the quality and integrity of the data on which those systems depend. The first governance decision in any AI system is not the decision made by the algorithm. It is the decision about the data from which the algorithm learns.

FROM DIGITALISATION TO INTELLIGENCE

Over the past decade, Ghana has made significant investments in digital transformation. Public services have been digitised, digital identity systems have been introduced, payment platforms have expanded, and increasing volumes of information have been moved from paper-based processes into digital environments. These developments have rightly been regarded as important milestones in the country's digital journey.

Yet artificial intelligence introduces a different challenge. The primary objective of digitalisation is to make information accessible, searchable, and usable within digital systems. Artificial intelligence, however, operates on a different logic. It does not simply store information. It learns from it.

This distinction changes how success is measured. During the digitalisation era, the central question was whether information could be digitised and made available through digital systems. Success was often measured by the number of services digitised, the volume of records converted into digital formats, and the extent to which transactions could be conducted electronically.

The AI era introduces a different question: can the digitised information be trusted? Success is no longer determined primarily by the quantity of data available, but by its quality. Accuracy, completeness, representativeness, and governance become increasingly important because artificial intelligence relies on data not merely for storage and retrieval, but for learning, prediction, and decision-making. Digitalisation rewarded data availability. Artificial intelligence rewards data trustworthiness.

This shift has important implications for Ghana's AI ambitions. The transition from digitalisation to artificial intelligence requires a corresponding shift in priorities. The challenge is no longer simply to create digital records and systems, but to ensure that the data within those systems is capable of supporting reliable intelligence. Data that may be adequate for storage, retrieval, or transaction processing may not necessarily be adequate for training, informing, or guiding artificial intelligence.

In this sense, artificial intelligence shifts the conversation from digitalisation to intelligence. Digitalisation remains an important achievement. But in the age of AI, it becomes the starting point rather than the destination. The value of Ghana's AI ambitions will depend not only on the technologies it adopts, but on the quality of the digital foundations from which those technologies learn.

CONCLUSION

Ghana's National Artificial Intelligence Strategy reflects a clear recognition of the transformative potential of artificial intelligence. Its emphasis on innovation, digital infrastructure, research, skills development, public service transformation, and data ecosystems demonstrates a serious commitment to positioning the country for participation in the emerging AI economy. Importantly, the strategy recognises that data is not merely a supporting resource, but a strategic asset and a necessary foundation for achieving these ambitions.

Yet the long-term success of artificial intelligence will depend on more than access to technology. As AI becomes increasingly integrated into economic and social life, the more fundamental question concerns the quality of the data on which it depends. The challenge is not simply whether artificial intelligence works, but whether the information from which it learns is sufficiently accurate, representative, and trustworthy to support meaningful decisions.

Viewed from this perspective, data quality is not merely a technical concern. It is a governance concern. Decisions about how data is collected, managed, verified, shared, and governed will shape the effectiveness, fairness, and legitimacy of the AI systems built upon it. The future of artificial intelligence will therefore be influenced as much by the quality of data governance as by the sophistication of the technology itself.

For Ghana, the challenge ahead is not simply to build AI systems. It is to build the trustworthy, representative, and well-governed data foundations upon which those systems depend. Realising the ambitions contained in the National Artificial Intelligence Strategy will require recognising that the path to effective artificial intelligence begins long before an algorithm is deployed.

The next phase of Ghana’s AI journey may depend less on acquiring new technologies and more on strengthening the institutional foundations of data governance. That means improving data quality, enhancing interoperability across systems, assessing the readiness of existing datasets, and embedding accountability for data stewardship across institutions. In practical terms, Ghana must institutionalise the governance mechanisms that produce trustworthy data. Digitalisation brought Ghana into the digital age; data governance will determine whether it succeeds in the AI age. Without trustworthy data, AI simply accelerates confusion.

Search This Blog

KOFI ANOKYE OWUSU-DARKO (BLOG 1)

BAD DATA CANNOT BE FIXED BY A GOOD ALGORITHM: WHY DATA QUALITY WILL DETERMINE GHANA'S AI FUTURE

Comments

Post a Comment

Popular posts from this blog

LEGAL ISSUES IN E-COMMERCE WEBSITE DEVELOPING IN GHANA: OWNER BEWARE

“LEARNED” NO MORE?: AI AND THE QUIET REVOLUTION IN LEGAL PRACTICE

ELECTRONIC SIGNATURES AND DIGITAL SIGNATURES: HAS GHANA GOTTEN IT MIXED UP UNDER THE ELECTRONIC TRANSACTIONS ACT 2008 (ACT772)?