1 of 10

Assignment 2

Ryan Bell

July 18th, 2022

Tailored Learning for Cost Modeling:

An Open Source RAG-Based Tool

Ryan Bell

Ryan Longshore

Dr. Raymond Madachy


2 of 10

Agenda

  • Abstract
  • Large Language Model (LLM) Overview
  • Improving LLM Responses
  • Building the RAG Pipeline
  • Creating the GUI
  • Demonstration


3 of 10

Abstract

For professionals entering the specialized field of cost modeling, there is often a significant gap in domain-specific knowledge. To address this, we present an AI-driven approach that uses Python’s Haystack package to build a Retrieval-Augmented Generation (RAG) pipeline offering customized interactions with foundational cost modeling resources, including Barry Boehm’s books and other prominent cost modeling documents. This presentation explores the integration of this AI tool to deliver on-demand, conversational access to cost modeling knowledge, as well as a secondary feature for generating multiple-choice questions to reinforce learning.

The RAG pipeline enables engineers to query specific documents and receive relevant information from established cost modeling tools, simulating a mentor-mentee interaction and providing a much-needed knowledge repository. By dynamically generating both responses to complex inquiries and formative assessment items, this system aims to facilitate the understanding and application of core cost modeling principles for those new to the field. Initial deployment results will be presented, demonstrating its potential to fill knowledge gaps and improve the ability of engineers to contribute effectively to cost estimation projects.


4 of 10

Large Language Model (LLM) Overview

  • Large Language Models (LLMs): Neural networks that have transformed natural language processing (NLP) by demonstrating the capability to generate relevant text in response to a human user’s prompt.
  • Common LLMs
    • Proprietary: Anthropic Claude, OpenAI ChatGPT, Google Gemini
    • Open Source: Mistral, LLaMa, Falcon
  • Characterizing Models
    • Size is the number of weights, also known as parameters
    • Quantization converts weights to lower precision data types to reduce memory and compute requirements
  • Challenges
    • Hallucinations
    • Access to Unique Training Data

There are other challenges, but these two are the most prominent for this presentation.
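To make the quantization bullet above concrete, here is a minimal sketch of symmetric integer quantization in plain Python; the function names and values are illustrative, not from any particular library:

```python
# Minimal sketch of symmetric integer quantization: map float weights
# onto a small signed-integer range, trading precision for memory.
def quantize(weights, bits=8):
    qmax = 2 ** (bits - 1) - 1               # e.g. 127 for int8
    scale = max(abs(w) for w in weights) / qmax
    return [round(w / scale) for w in weights], scale

def dequantize(codes, scale):
    # Recover approximate float weights from the integer codes
    return [c * scale for c in codes]

weights = [0.52, -1.04, 0.26]
codes, scale = quantize(weights)             # integers in [-127, 127]
recovered = dequantize(codes, scale)         # close to, but not exactly, the originals
```

Each weight is stored as a small integer plus one shared scale factor, which is why quantized models need less memory and compute at the cost of some precision.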


5 of 10

Improving LLM Responses

  • Re-Training: Re-train the base model with a new dataset
  • Fine-Tuning: Teach the base model specialized knowledge by adjusting a small number of internal parameters, offering significant time savings compared to re-training the entire model
  • Retrieval-Augmented Generation (RAG): Supply external data to augment the LLM’s knowledge without changing model weights
  • Prompt Engineering: Craft prompts to elicit the best possible outputs
    • Few-shot prompts teach the LLM to perform a task through in-prompt examples
    • Zero-shot prompts ask the LLM to perform a task with no examples, relying only on its pretrained knowledge
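The few-shot vs. zero-shot distinction can be shown with a small prompt-construction sketch; the sentiment task and wording are our own illustration, not part of the tool:

```python
# Zero-shot: the instruction alone, with no examples.
zero_shot = (
    "Classify the sentiment of this review as positive or negative.\n"
    "Review: The cost estimate was delivered on time and under budget."
)

# Few-shot: the same instruction preceded by worked examples that
# demonstrate the expected input/output format.
examples = [
    ("The schedule slipped by six months.", "negative"),
    ("The model's estimates matched actuals closely.", "positive"),
]
few_shot = "Classify the sentiment of this review as positive or negative.\n"
for review, label in examples:
    few_shot += f"Review: {review}\nSentiment: {label}\n"
few_shot += (
    "Review: The cost estimate was delivered on time and under budget.\nSentiment:"
)
```

The few-shot prompt ends mid-pattern, so the model's most likely continuation is a label in the same format as the examples.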

Trade-offs: compute / complexity / cost

Training:

The process of teaching an LLM to perform tasks by exposing it to large datasets and optimizing it to recognize patterns, relationships, and context within the data.

Inferencing:

The LLM’s process of generating responses or predictions based on new inputs, using the knowledge it gained during training.

Context Optimization

What the model needs to know

LLM Optimization

How the model needs to act


6 of 10

Building the RAG Pipeline

  • The Haystack Python package was utilized to build the RAG pipeline
  • Pipeline documents included Barry Boehm’s Software Engineering and Software Engineering Economics, the Government Accountability Office’s Cost Estimating Guide, and NASA’s Cost Estimating Handbook
  • Language model used: meta-llama/Meta-Llama-3.1-8B-Instruct
    • Modular, can use other LLMs here
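Haystack wires the embedder, retriever, prompt builder, and generator together; independent of Haystack, the underlying retrieve-then-prompt flow can be sketched in plain Python. Word-overlap scoring stands in for real embedding similarity, and the documents below are placeholders, not the actual pipeline corpus:

```python
# Toy RAG retrieval step: score documents against the query, keep the
# top-k, and splice them into the prompt sent to the LLM.
documents = [
    "COCOMO II estimates effort from size, scale factors, and cost drivers.",
    "The GAO guide describes a 12-step cost estimating process.",
    "Gradio builds web interfaces for Python functions.",
]

def score(query, doc):
    # Word-overlap similarity; a real pipeline compares embeddings instead.
    return len(set(query.lower().split()) & set(doc.lower().split()))

def retrieve(query, docs, k=2):
    return sorted(docs, key=lambda d: score(query, d), reverse=True)[:k]

def build_prompt(query, docs):
    context = "\n".join(retrieve(query, docs))
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"

prompt = build_prompt("What drives effort in COCOMO II cost estimates?", documents)
```

Because only the retrieved context is placed in the prompt, the generator is grounded in the source documents, which is what reduces hallucinations relative to querying the bare LLM.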

Base image from Haystack.deepset.ai


7 of 10

Creating the GUI

  • The Python Gradio (https://www.gradio.app/) library was used to create a chatbot interface.

[Chatbot interface screenshot: QUERY, RESPONSE, and HISTORY panels]

import gradio as gr

# The function called when the user submits a prompt
def get_answer(prompt, history):
    # Run the RAG pipeline to get the response
    results = query_pipeline.run({
        "text_embedder": {"text": prompt},
        "prompt_builder": {"query": prompt},
    })

    # Extract the generated reply and its reference section
    response = results["generator"]["replies"][0]
    reference = "Cost Modeling Document - Section X.Y"  # Example reference format

    # Add the new interaction to the history
    history.append((prompt, response, reference))

    # Format the history for display, most recent first
    history_display = "\n".join(
        f"**Q:** {q}\n\n**A:** {a}\n\n**Reference:** {ref}\n\n"
        for q, a, ref in reversed(history)
    )
    return history_display, history


8 of 10

Demonstration


9 of 10

Contact Information

  • Ryan Bell: ryan.bell@nps.edu
  • Ryan Longshore: ryan.longshore@nps.edu
  • Raymond Madachy, PhD: rjmadach@nps.edu

Some of our other research:

  • “Leveraging Generative AI to Create, Modify, and Query MBSE Models”
  • “Benchmarking Systems Engineering LLM Performance with SysEngBench”
  • “Baselining AI Performance in Cost Modeling”
  • “Systems Modeling with Large Language Models”
  • “Systems and Software Engineering Cost Modeling of AI Assistance”
  • “LLM Optimization for Systems Modeling”


10 of 10

Questions?
