Miracle KONEKSIn kumppanina tekoälystrategian määrittelyssä ja toteutuksessa

Pekka Kanerva

2.2.2026

Knowledge Partnership Program (KONEKSI) on tiedon ja innovaatioiden alalla toimiva yhteistyöohjelma, joka tukee australialaisten ja indonesialaisten organisaatioiden välisiä kumppanuuksia. Ohjelman tavoitteena on edistää osallistavaa ja kestävää politiikkaa ja teknologian hyödyntämistä sekä vastata yhteiskunnallisiin ja taloudellisiin haasteisiin paikallista osaamista hyödyntäen. KONEKSIa tukevat Australian ja Indonesian hallitukset, ja ohjelmaa operoi Cowater International.

Miracle toimi KONEKSI-hankkeessa asiantuntijakumppanina tekoälykyvykkyyksien suunnittelussa ja rakentamisessa osaksi ohjelman kokonaisvaltaista tiedonhallintajärjestelmää. Yhteistyö käynnistyi KONEKSIn tekoälystrategian määrittelyllä, jossa tunnistettiin keskeiset käyttötapaukset, tiedonlähteet sekä tavoitteet tekoälyn hyödyntämiselle osana päivittäistä toimintaa ja päätöksentekoa.

Strategiatyön pohjalta Miracle toteutti useita käytännön ratkaisuja hyödyntäen Oracle APEX -sovelluskehitysalustaa sekä Oracle AI Database 26ai -tietokantaa. Ratkaisuihin sisältyi muun muassa modulaarisia toiminnallisuuksia ja tekoälypohjaisia chatbotteja, jotka mahdollistavat tiedon tehokkaan haun, yhdistämisen ja analysoinnin eri lähteistä. Näiden avulla sekä rakenteista että rakenteetonta tietoa — kuten dokumentteja, ohjeita ja operatiivisen tietokannan tietoja — voidaan hyödyntää yhtenäisesti ja käyttäjälähtöisesti.

Toteutettujen ratkaisujen ansiosta KONEKSIn tiedonhallinta on kehittynyt entistä ajantasaisemmaksi, läpinäkyvämmäksi ja helposti saavutettavaksi, mikä tukee ohjelman tavoitteita toiminnan tehostamisessa ja tiedolla johtamisessa. KONEKSI on ollut erittäin tyytyväinen yhteistyön sujuvuuteen, Miraclen asiantuntemukseen sekä saavutettuihin tuloksiin. Yhteistyö Miraclen kanssa nähdään pitkäjänteisenä kumppanuutena, ja se jatkuu myös tulevaisuudessa uusien tekoälyä ja tiedonhallintaa kehittävien kokonaisuuksien parissa.

Lohjan kaupunki tuo asiakirjatiedon tehokkaaseen käyttöön Oracle Database AI Vector Searchin ja Property Graph -teknologian avulla

Pekka Kanerva

8.1.2026

Lohjan kaupunki on yhteistyössä Miraclen kanssa kehittänyt edistyksellisen dokumenttichatbotin, joka uudistaa tapaa, jolla kaupungin työntekijät hyödyntävät laajaa sisäistä asiakirja-arkistoa.

”Hankkeessa kehitetyn chatbotin avulla parannamme organisaatiomme sisäistä viestintää sekä helpotamme ja nopeutamme merkittävästi tiedonhakua niin esihenkilöille kuin työntekijöille. Nopea pääsy ajantasaiseen ohjeistukseen sujuvoittaa päivittäistä työtä, yhdenmukaistaa toimintatapoja yksiköiden välillä ja vähentää tukipalveluiden (esim. HR-tuki ja ICT-helpdesk) sisäistä työkuormaa, vapauttaen työaikaa muihin tehtäviin ja toiminnan jatkuvaan kehittämiseen.”

Pasi Perämäki Tietohallinto- ja kehitysjohtaja Lohjan kaupunki

Kasvavan strukturoimattoman dokumenttimäärän myötä kaupunki tarvitsi ratkaisun, joka menee perinteistä avainsanahakua pidemmälle. Älykäs avustaja toimii kokonaisuudessaan Oracle AI Database 26ai tietokannan sisällä ja mahdollistaa asiakirjatiedon kontekstuaalisen haun ja hyödyntämisen luonnollisen kielen avulla.

Ratkaisun käyttöliittymä on toteutettu Oracle APEX -alustalla, joka mahdollistaa turvallisen, responsiivisen ja helppokäyttöisen käyttökokemuksen. APEXin ansiosta kehitystiimi pystyi nopeasti luomaan prototyypin ja ottamaan sovelluksen käyttöön ilman monimutkaista arkkitehtuuria. Taustalogiikka on toteutettu tehokkaalla PL/SQL:llä, mikä varmistaa korkean suorituskyvyn ja tietojen eheyden suoraan tietokannassa.

Ratkaisun ytimessä on Graph RAG (Retrieval-Augmented Generation) -lähestymistapa, joka ylittää perinteisten hakumenetelmien suorituskyvyn. Järjestelmä hyödyntää suuria kielimalleja (LLM) dokumenttien analysointiin, poimien keskeiset entiteetit sekä niiden keskinäiset suhteet. Nämä suhteet tallennetaan tietokantaan ja mallinnetaan Oracle Property Graph -teknologian avulla. Näin chatbot kykenee ”ymmärtämään” yhteydet eri kunnallisten hankkeiden, päätösten ja sidosryhmien välillä sen sijaan, että se käsittelisi dokumentteja toisistaan irrallisena tekstinä.

Samanaikaisesti ratkaisu hyödyntää Oracle AI Vector Search -toiminnallisuutta semanttiseen samankaltaisuushakuun. Muuntamalla dokumenttien tekstisisällön vektoreiksi tietokanta pystyy tunnistamaan olennaisen tiedon kysymyksen merkityksen perusteella, vaikka täsmällisiä hakusanoja ei esiintyisi.

Yhdistämällä kehittyneet graafi- ja vektorikyselyt chatbot pystyy tarjoamaan kontekstuaalista tietoa. Se ei ainoastaan löydä oikeita dokumentteja, vaan myös paljastaa niihin liittyviä oivalluksia, jotka jäisivät perinteisellä RAG-ratkaisulla havaitsematta. Lopputuloksena on tehokas työkalu, joka säästää aikaa ja tukee parempaa päätöksentekoa tarjoamalla kattavia ja täsmällisiä vastauksia koko kaupungin tietopohjaa hyödyntäen.

Oracle Vectors and Similarity Search for Christmas

Heli Helskyaho

9.12.2025

As the holiday season approaches, many of us find ourselves searching for the perfect playlist, gift recommendations, family photos, or recipes that capture the right feeling. But, how can one do those kinds of “feeling searches”? The “feeling” can be captured by a vector embedding, and these vector embeddings can be compared to each other using a similarity search to find similar feelings.

The Oracle AI Database supports vector embeddings and similarity search natively. These capabilities allow applications to move beyond exact matches and rigid rules, and instead find content based on semantic or perceptual similarity.

What is a vector embedding?

A vector embedding is a numerical representation of the essential characteristics of text, images, audio, or any other data in a form that machines can compare mathematically. Items that are similar will have embeddings that are numerically close to each other. For example, you can compare two classical Christmas songs, two red Nordic-style sweaters, or two warmly lit living-room photos.

How does it work in an Oracle Database?

Oracle Database 23ai introduced a native VECTOR data type, vector indexes for an efficient approximate similarity search, and built-in SQL functions for generating vector embeddings (VECTOR_EMBEDDING), distance functions (VECTOR_DISTANCE, and its equivalents of different distance metrics, for example COSINE_DISTANCE and L1_DISTANCE) for comparing the vector embeddings, as well as several PL/SQL packages for advanced operations with vectors (DBMS_VECTOR, DBMS_VECTOR_CHAIN, DBMS_HYBRID_VECTOR).

Let’s see a couple of examples of how they work.

Example 1: Building the Ideal Christmas Playlist

Suppose a table CHRISTMAS_SONGS contains a column EMBEDDING of type VECTOR for each track. To find songs that are most similar to Michael Bublé’s “It’s Beginning to Look A Lot Like Christmas”:

SELECT song_name, artist, year
FROM christmas_songs
ORDER BY VECTOR_DISTANCE(
 embedding,
 (SELECT embedding FROM christmas_songs 
   WHERE song_name = 'It’s Beginning to Look a Lot Like Christmas')
)
FETCH FIRST 20 ROWS ONLY;

The result naturally surfaces songs by Bing Crosby, Dean Martin, Frank Sinatra, and Ella Fitzgerald, all sharing the same warm, crooner-era holiday atmosphere, without any explicit genre or year filters, and returning the 20 closest songs to Michael Bublé’s Christmas classic.

Example 2: Intelligent Christmas Gift Recommendations

An e-commerce table GIFT_CATALOG stores product data, including their images. The IMAGE_EMBEDDING column stores the vector embedding of each image.

After a customer purchases a hand-knitted Scandinavian reindeer sweater, the recommendation engine runs:

SELECT product_name, price, image_url
FROM gift_catalog
WHERE category = 'Christmas'
ORDER BY VECTOR_DISTANCE(image_embedding, :purchased_item_embedding)
FETCH FIRST 10 ROWS ONLY;

The system returns 10 visually and stylistically coherent suggestions of similar products as gift recommendations.

Example 3: Searching Family Christmas Photos by Mood

Let's say you have 10,000 family Christmas photos over the years. You want to find 50 pictures that “feel like” the one where everyone is wearing ugly sweaters around the tree in 2017. A personal photo archive table FAMILY_PHOTOS contains embeddings generated from each image. To retrieve 50 photos from the years 2000-2025 that feel like the iconic 2017 photo referred as :reference_photo_embedding, simply query:

SELECT photo_id, taken_date, thumbnail_url
FROM family_photos
WHERE EXTRACT(YEAR FROM taken_date) BETWEEN 2000 AND 2025
ORDER BY VECTOR_DISTANCE(photo_embedding, :reference_photo_embedding)
FETCH FIRST 50 ROWS ONLY;

The query instantly assembles a heartfelt montage of similar cozy, festive moments across decades.

Example 4: Finding Better Alternatives to Traditional Fruitcake

A recipe database stores text embeddings of ingredient lists and descriptions. A user searching for “something like Grandma’s rum-soaked fruitcake but less dense” can have their natural language query converted to an embedding and matched to the whole database to find the eight best-matching recipes:

SELECT recipe_name, rating
FROM holiday_recipes
ORDER BY VECTOR_DISTANCE(text_embedding, :query_embedding)
FETCH FIRST 8 ROWS ONLY;

Relevant, highly rated alternatives such as Jamaican black cake, German stollen, or panettone rise to the top.

Why This Matters

Traditional relational queries excel at precise conditions, for example, customer_no = 42, price < 50, color = red, rating ≥ 4. Vector similarity search excels at capturing nuance, mood, style, and intent.

Oracle has made the technology accessible and production-ready:

Vectors live alongside regular columns in the same table
Embeddings can be created either inside the database or outside of it
Indexing and querying require only a few lines of SQL
Performance scales to billions of vectors with reasonable response times
Integration with existing security, backup, and high-availability features is seamless
Integration with existing multimodal features of an Oracle Database is seamless

Conclusion

This holiday season, while we enjoy the lights, music, and traditions that feel just right, it’s worth noting that modern databases can now understand “feelings” in a surprisingly human way. Oracle’s vector capabilities bring semantic and perceptual search into the mainstream enterprise applications; no separate specialized engine required. Whether you’re curating the perfect Christmas playlist, recommending thoughtful gifts, rediscovering cherished memories, or rescuing dessert, vector similarity search delivers results that simply feel right.

To explore these features yourself, Oracle’s Always Free Autonomous AI Database includes full vector search support.

Happy holidays, and happy querying. 🎄

Leveraging Oracle APEX and Generative AI to Unify Project Reporting for Siemens Energy

Elmer Nickels

12.12.2025

Managing large-scale engineering projects is complex enough without the administrative burden of reporting on them. At Siemens Energy, a global leader in energy technology, Project Managers (PMs) were spending valuable hours every week manually compiling reports. This process was disconnected, labor-intensive, and inconsistent.

Miracle (miracleoy.fi) partnered with Siemens Energy to change that. By moving from scattered Word templates to a unified Oracle APEX application enhanced by Generative AI, we transformed reporting from a chore into a streamlined operation.

The Challenge: The "Template Trap"

Before the transformation, Siemens Energy faced a common enterprise hurdle: the "Template Trap."

While the project data existed in robust databases, the reporting mechanism was manual. PMs had to hunt down data from various systems and manually copy-paste it into individual Word documents. Because these documents lived on local drives or disjointed share points, there was no "single source of truth."

This created three distinct friction points:

High Manual Effort: PMs were acting as data scribes rather than managers.
Inconsistent Narratives: One PM might write a detailed essay on a technical hiccup, while another might write three vague bullet points.
Lack of Portfolio Visibility: For leadership, aggregating these disparate Word docs to get a clear view of project health was nearly impossible.

The Solution: A Unified, Intelligent Hub

To solve this, we re-engineered the process. We built a custom reporting application using Oracle APEX, chosen for its rapid development capabilities and seamless integration with Siemens Energy’s existing Oracle database infrastructure.

The new system automates the heavy lifting. Instead of typing out fields, the APEX app pulls live data—financials, timelines, and milestones—directly from the source database. The report structure is now enforced programmatically, ensuring every project report looks and feels the same.

Still, the true game-changer was addressing the qualitative side of reporting. How do you standardize the written explanation of project status, technical issues, or product bulletins?

We integrated Large Language Models (LLMs) directly into the reporting workflow. Here is how it works:

Data Ingestion: The system aggregates product bulletins and status entries associated with a project.
AI Summarization: The LLM processes this data and generates concise, natural-language executive summaries. It translates complex technical data into clear business context.
Human-in-the-Loop: The AI generates a draft, but the PM retains the controls. They review the generated summaries, make necessary edits, and approve the final text.

This approach ensures the speed of automation with the accuracy of human oversight.

Comparison: Manual vs. Automated Reporting

To understand the business impact, let’s look at the shift in methodology:

Feature	The Manual Approach	The New APEX + AI Solution
Data Source	Manual copy/paste from multiple systems	Automated real-time pull from Database
Report Structure	Varied Word templates per PM	Unified, enforce structure globally
Summary Creation	PM writes from scratch	LLM generates drafts from raw inputs
Consistency	Highly variable quality and depth	Standardized tone and format
Visibility	Siloed in documents	Aggregated and queryable data

Business Value Delivered

The shift to an APEX-based solution with AI integration delivered immediate value to Siemens Energy:

Reclaiming PM Time: By automating data entry and drafting narratives, PMs reduced the time spent on reporting significantly, freeing them up to focus on project delivery and risk mitigation.
Standardized "One Truth": Leadership now has a consistent view across diverse projects. Because the structure is unified, comparing project status across the portfolio is seamless.
Data Integrity: By removing the manual copy-paste step, human error was virtually eliminated from the quantitative data.

Conclusion

At Siemens Energy, we proved that project reporting doesn't have to be a manual burden. By combining the data-handling power of Oracle APEX with the summarization capabilities of Generative AI, we turned a fragmented documentation process into a streamlined, intelligent system.

The result? Reports that write themselves (almost), and Project Managers who can get back to managing projects.

Oracle APEX and Oracle AI Vector Search - Ask from your own PDF documents

Use Oracle APEX, Vector Embedding Models in Database, Oracle AI Vector Search and Oracle Generative AI to ask questions from your own PDF documents

Pekka Kanerva

27.3.2025

4 min read

There are several public AI services into which you can load your own documents and ask questions about their contents. But what if the documents are confidential and should not be sent outside the company? And what if there are tens or hundreds of documents, making it impossible to ask questions from all of them at the same time using the public services?

Linkki blogipostaukseen

Defining Data Model Quality Metrics for Data Vault 2.0 Model Evaluation

By Heli Helskyaho, Laura Ruotsalainen, Tomi Männistö

Published: 9 February 2024, Inventions

Designing a database is a crucial step in providing businesses with high-quality data for decision making. The quality of a data model is the key to the quality of its data. Evaluating the quality of a data model is a complex and time-consuming task. Having suitable metrics for evaluating the quality of a data model is an essential requirement for automating the design process of a data model. While there are metrics available for evaluating data warehouse data models to some degree, there is a distinct lack of metrics specifically designed to assess how well a data model conforms to the rules and best practices of Data Vault 2.0. The quality of a Data Vault 2.0 data model is considered suboptimal if it fails to adhere to these principles. In this paper, we introduce new metrics that can be used for evaluating the quality of a Data Vault 2.0 data model, either manually or automatically. This methodology involves defining a set of metrics based on the best practices of Data Vault 2.0, evaluating five representative data models using both metrics and manual assessments made by a human expert. Finally, a comparative analysis of both evaluations was conducted to validate the consistency of the metrics with the judgments made by a human expert.

Keywords: data warehouse; Data Vault 2.0; data model; metrics

Linkki artikkeliin

Towards Automating Database Designing

By Heli Helskyaho

Published: 2023 34th Conference of Open Innovations Association (FRUCT)

Database designing is an important process for enabling good quality data. Without designing the database correctly, the database might contain the same data several times, or it might contain data that is not usable for decision making. The evolution of software development, programming languages, increasing amount of data, different data models, different data sources and many more have increased the importance of designing databases to provide accurate data for decision making. Designing databases manually is time consuming. If the process can be automated, it would allow faster creation of good quality databases. The goal of this study is to investigate whether large language models could be used for designing a Data Vault 2.0 raw database to automate the designing process. In this study we introduce database designing as a process, and describe the main principles of Data Vault 2.0. We create an example data source, an example Data Vault 2.0 raw database based on the source database for reference, and then test the ChatGPTs capabilities for creating a Data Vault 2.0 raw database based on instructions given in a prompt. Finally, we analyze the results and discuss future works.

Keywords: data warehouse; Technological innovation, Computer languages, Databases, Soft sensors, Decision making, Chatbots, Data models

Linkki artikkeliin (pdf)

Introduction to AI Services in the Oracle Cloud Infrastructure

By Heli Helskyaho

Published WINTER 2023, NL.OUG Visei

Machine Learning is often seen as a complicated process with model training, feature engineering, model evaluations, deployments, and so much more. The Oracle Cloud Infrastructure (OCI) offers an easy option: AI Services. These services are pre-trained models that you can use with your own data: no training, evaluation or any of the complicated machine learning steps needed. (Page 10)

Linkki artikkeliin (pdf)

LLMs, GPTs, and All That Jazz

By Heli Helskyaho

Published July 2023, Edition #30 e-Magazine for Oracle Users published by the EOUC

Everybody is talking about ChatGPT and other similar tools. What are they and how can they be used? ChatGPT, as well as Bard, Bing, DALL-E, Midjourney, Codex and many more, belong to a machine learning category called Generative AI (GenAI). The idea of a GenAI is to generate something, for example text, images, videos, audio, and 3D models. GenAI learns patterns from existing data to generate new and unique outputs. It does not really “know” things, it just uses those patterns and combines them. A technology called transformer neural network was first
introduced in 2017. Large Language Models (LLMs), that for example ChatGPT uses, are based on this transformer architecture and have made significant advancements in natural language processing. The acronym GPT comes from words Generative Pre-trained Transformer. We will discuss the technology in later issues of ORAWORLD. In this article we will talk about how a GPT tool can be used and what are the risks and limitation you should be aware of. We will use ChatGPT as an example. (Page 12)

Linkki artikkeliin (pdf)

Machine Learning For Beginners

By Heli Helskyaho

Published: September 2021, Edition #26 e-Magazine for Oracle Users published by the EOUC

Oracle offers several tools for machine learning. You can, for example, use the in-database machine learning with models built in SQL, R, or Python. Or you can connect to Oracle Database with different libraries, such as cx_Oracle, and use the data from the Oracle Database with different IDEs for machine learning. Or you can use Oracle Data Science Cloud that has an environment for Python machine learning including special Oracle libraries, and the possibility to pip install any Python libraries. (Page 15)

Linkki artikkeliin (pdf)

Developer Strategies: How to Use Free Cloud Services

Published: September 16th, 2021

Here fishy, fishy. To entice developers to their platforms, cloud providers all offer free versions of a selection of their cloud services. The goal, of course, is to hook them with tasty functionality and keep them as paying customers for the long haul.

Free services from Oracle, Amazon, Azure, Google and others usually break down something like this: New customers can get a few hundred dollars of free credits to use full versions of cloud services until they burn through those credits. Existing customers can also get free short-term access to a smaller number of services to test and train on before deciding whether to buy.

Linkki artikkeliin

The story behind a COVID-19 exposure-tracking application in Finland

By Heli Helskyaho

Published: September 1, 2021

In September 2020, COVID-19 was spreading fast and was extremely dangerous, with people globally afraid of becoming infected. Before vaccinations became available, avoiding exposure was the only way to keep safe and minimize the spread.

In Finland, a group of passionate volunteers made it their mission to collect all available exposure data in a blog and report it on Twitter. Although the blog was a great asset to the public, maintaining it became very time-consuming. Data needed to be copied into Microsoft Excel spreadsheets for further analysis, and the volunteers needed to create new charts and reports continually.

Linkki artikkeliin

Miracle KONEKSIn kumppanina tekoälystrategian määrittelyssä ja toteutuksessa

Lohjan kaupunki tuo asiakirjatiedon tehokkaaseen käyttöön Oracle Database AI Vector Searchin ja Property Graph -teknologian avulla

Oracle Vectors and Similarity Search for Christmas

What is a vector embedding?

How does it work in an Oracle Database?

Example 1: Building the Ideal Christmas Playlist

Example 2: Intelligent Christmas Gift Recommendations

Example 3: Searching Family Christmas Photos by Mood

Example 4: Finding Better Alternatives to Traditional Fruitcake

Why This Matters

Conclusion

Leveraging Oracle APEX and Generative AI to Unify Project Reporting for Siemens Energy

The Challenge: The "Template Trap"

The Solution: A Unified, Intelligent Hub

Comparison: Manual vs. Automated Reporting

Business Value Delivered

Conclusion

Oracle APEX and Oracle AI Vector Search - Ask from your own PDF documents

Use Oracle APEX, Vector Embedding Models in Database, Oracle AI Vector Search and Oracle Generative AI to ask questions from your own PDF documents

Oracle tools for Machine Learning

Defining Data Model Quality Metrics for Data Vault 2.0 Model Evaluation

Towards Automating Database Designing

Introduction to AI Services in the Oracle Cloud Infrastructure

LLMs, GPTs, and All That Jazz

Machine Learning For Beginners

Developer Strategies: How to Use Free Cloud Services

The story behind a COVID-19 exposure-tracking application in Finland

Autonomous Databases Give You Time for Data Modeling