-
Notifications
You must be signed in to change notification settings - Fork 100
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
RFE: Skeleton for DMA layer #306
base: main
Are you sure you want to change the base?
Changes from all commits
57c366b
d991f9e
2f4ad18
3163d08
728ac70
eb7efbc
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,105 @@ | ||
use std::sync::{Arc, Weak}; | ||
use crate::dma_manager::GlobalDmaManager; | ||
use user_driver::dma; | ||
|
||
// DMA Client structure, representing a specific client instance | ||
pub struct DmaClient { | ||
manager: Weak<GlobalDmaManager>, | ||
} | ||
|
||
impl DmaClient { | ||
pub fn new(manager: Weak<GlobalDmaManager>) -> Self { | ||
Self { manager } | ||
} | ||
|
||
fn pin_memory(&self, range: &MemoryRange) -> Result<usize, DmaError> { | ||
let manager = self.manager.upgrade().ok_or(DmaError::InitializationFailed)?; | ||
let threshold = manager.get_client_threshold(self).ok_or(DmaError::InitializationFailed)?; | ||
|
||
if range.size <= threshold && manager.is_pinned(range) { | ||
Ok(range.start) | ||
} else { | ||
Err(DmaError::PinFailed) | ||
} | ||
} | ||
|
||
pub fn map_dma_ranges( | ||
&self, | ||
ranges: &[MemoryRange], | ||
options: Option<&DmaMapOptions> | ||
) -> Result<DmaTransactionHandler, DmaError> { | ||
let manager = self.manager.upgrade().ok_or(DmaError::InitializationFailed)?; | ||
let mut dma_transactions = Vec::new(); | ||
let force_bounce_buffer = options.map_or(false, |opts| opts.force_bounce_buffer); | ||
|
||
let threshold = manager.get_client_threshold(self).ok_or(DmaError::InitializationFailed)?; | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. This line can be moved before defining dma_transactions to fail earlier. |
||
|
||
for range in ranges { | ||
let use_bounce_buffer = force_bounce_buffer || range_size > threshold || !self.can_pin(range); | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. If can_pin returns true, would it grantee the later pin_memory succeed? |
||
|
||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. extra empty line |
||
|
||
if use_bounce_buffer { | ||
// Allocate a bounce buffer for this range | ||
let bounce_buffer_addr = self | ||
.dma_manager | ||
.allocate_bounce_buffer(range_size) | ||
.map_err(|_| DmaError::BounceBufferFailed)?; | ||
|
||
dma_transactions.push(DmaTransaction { | ||
original_addr: range.start_addr(), | ||
dma_addr: bounce_buffer_addr, | ||
size: range_size, | ||
is_pinned: false, | ||
is_bounce_buffer: true, | ||
is_physical: false, | ||
is_prepinned: false, | ||
}); | ||
|
||
self.copy_to_bounce_buffer(range, bounce_buffer_addr)?; | ||
} else { | ||
// Use direct pinning | ||
let dma_addr = self | ||
.dma_manager | ||
.pin_memory(range) | ||
.map_err(|_| DmaError::PinFailed)?; | ||
|
||
dma_transactions.push(DmaTransaction { | ||
original_addr: range.start_addr(), | ||
dma_addr, | ||
size: range_size, | ||
is_pinned: true, | ||
is_bounce_buffer: false, | ||
is_physical: true, | ||
is_prepinned: false, | ||
}); | ||
} | ||
} | ||
|
||
Ok(DmaTransactionHandler { transactions }) | ||
} | ||
|
||
pub fn unmap_dma_ranges(&self, dma_transactions: &[DmaTransaction]) -> Result<(), DmaError> { | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Will this need to be an &mut reference to the dma_transactions? |
||
let manager = self.manager.upgrade().ok_or(DmaError::InitializationFailed)?; | ||
|
||
for transaction in dma_transactions { | ||
if transaction.is_bounce_buffer { | ||
// Code to release bounce buffer | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Do we need copy out from bounce buffer here? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The caller may not know if it's bounce buffer or not, so I think we need handle it here. And need pass the Memory range to copy data out. Actually, I think we need know the IO direction to decide which copy is needed (including the one in map_dma_ranges). |
||
} else if transaction.is_pinned && !transaction.is_prepinned { | ||
// Code to unpin memory | ||
} | ||
} | ||
Ok(()) | ||
} | ||
} | ||
|
||
|
||
// Implementation of the DMA interface for `DmaClient` | ||
impl DmaInterface for DmaClient { | ||
fn map_dma_ranges(&self, ranges: &[MemoryRange]) -> Result<DmaTransactionHandler, DmaError> { | ||
self.map_dma_ranges(ranges) | ||
} | ||
|
||
fn unmap_dma_ranges(&self, dma_transactions: &[DmaTransaction]) -> Result<(), DmaError> { | ||
self.unmap_dma_ranges(dma_transactions) | ||
} | ||
} |
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,85 @@ | ||
use std::sync::{Arc, Mutex, Weak}; | ||
use memory_range::MemoryRange; | ||
use once_cell::sync::OnceCell; | ||
|
||
pub use dma_client::{DmaClient, DmaInterface, DmaTransaction, DmaTransactionHandler}; | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I think either clippy or fmt will want you to split these out. Doesn't hurt to run |
||
|
||
pub enum DmaError { | ||
InitializationFailed, | ||
MapFailed, | ||
UnmapFailed, | ||
PinFailed, | ||
BounceBufferFailed, | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Please use source attributes so that we don't lose error origination. E.g.
|
||
} | ||
|
||
static GLOBAL_DMA_MANAGER: OnceCell<Arc<GlobalDmaManager>> = OnceCell::new(); | ||
|
||
/// Global DMA Manager to handle resources and manage clients | ||
pub struct GlobalDmaManager { | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. What settings will this manager have? Which of those do you expect to expose in Vtl2Settings? |
||
physical_ranges: Vec<MemoryRange>, | ||
bounce_buffers: Vec<MemoryRange>, | ||
clients: Mutex<Vec<Weak<DmaClient>>>, | ||
client_thresholds: Mutex<Vec<(Weak<DmaClient>, usize)>>, | ||
} | ||
|
||
impl GlobalDmaManager { | ||
/// Initializes the global DMA manager with physical ranges and bounce buffers | ||
pub fn initialize( | ||
physical_ranges: Vec<MemoryRange>, | ||
bounce_buffers: Vec<MemoryRange>, | ||
) -> Result<(), DmaError> { | ||
let manager = Arc::new(Self { | ||
physical_ranges, | ||
bounce_buffers, | ||
clients: Mutex::new(Vec::new()), | ||
client_thresholds: Mutex::new(Vec::new()), | ||
}); | ||
|
||
GLOBAL_DMA_MANAGER.set(manager).map_err(|_| DmaError::InitializationFailed) | ||
} | ||
|
||
/// Accesses the singleton instance of the global manager | ||
pub fn get_instance() -> Arc<GlobalDmaManager> { | ||
GLOBAL_DMA_MANAGER | ||
.get() | ||
.expect("GlobalDmaManager has not been initialized") | ||
.clone() | ||
} | ||
|
||
/// Creates a new `DmaClient` and registers it with the global manager, along with its threshold | ||
pub fn create_client(&self, pinning_threshold: usize) -> Arc<DmaClient> { | ||
let client = Arc::new(DmaClient::new(Arc::downgrade(&self.get_instance()))); | ||
self.register_client(&client, pinning_threshold); | ||
client | ||
} | ||
|
||
/// Adds a new client to the list and stores its pinning threshold | ||
fn register_client(&self, client: &Arc<DmaClient>, threshold: usize) { | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Is there a need to have per-client bounce buffers? |
||
let mut clients = self.clients.lock().unwrap(); | ||
clients.push(Arc::downgrade(client)); | ||
|
||
let mut thresholds = self.client_thresholds.lock().unwrap(); | ||
thresholds.push((Arc::downgrade(client), threshold)); | ||
} | ||
|
||
/// Retrieves the pinning threshold for a given client | ||
pub fn get_client_threshold(&self, client: &Arc<DmaClient>) -> Option<usize> { | ||
let thresholds = self.client_thresholds.lock().unwrap(); | ||
thresholds.iter().find_map(|(weak_client, threshold)| { | ||
weak_client | ||
.upgrade() | ||
.filter(|c| Arc::ptr_eq(c, client)) | ||
.map(|_| *threshold) | ||
}) | ||
} | ||
|
||
/// Checks if the given memory range is already pinned | ||
pub fn is_pinned(&self, range: &MemoryRange) -> bool { | ||
false // Placeholder | ||
} | ||
|
||
/// Allocates a bounce buffer if available, otherwise returns an error | ||
pub fn allocate_bounce_buffer(&self, size: usize) -> Result<usize, DmaError> { | ||
Err(DmaError::BounceBufferFailed) // Placeholder | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Do we need ensure the bounce buffer is page aligned? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I envision that bounce buffer management will be aligned. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. And also note it: our current bounce buffer allocation function has infinite loop issue which we want to avoid it in the new implementation. |
||
} | ||
} |
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -404,68 +404,46 @@ impl Issuer { | |
guest_memory: &GuestMemory, | ||
mem: PagedRange<'_>, | ||
) -> Result<spec::Completion, RequestError> { | ||
let mut double_buffer_pages = None; | ||
let opcode = spec::Opcode(command.cdw0.opcode()); | ||
assert!( | ||
opcode.transfer_controller_to_host() | ||
|| opcode.transfer_host_to_controller() | ||
|| mem.is_empty() | ||
); | ||
|
||
// Ensure the memory is currently mapped. | ||
guest_memory | ||
.probe_gpns(mem.gpns()) | ||
.map_err(RequestError::Memory)?; | ||
|
||
let prp = if mem | ||
.gpns() | ||
.iter() | ||
.all(|&gpn| guest_memory.iova(gpn * PAGE_SIZE64).is_some()) | ||
{ | ||
// Guest memory is available to the device, so issue the IO directly. | ||
self.make_prp( | ||
mem.offset() as u64, | ||
mem.gpns() | ||
.iter() | ||
.map(|&gpn| guest_memory.iova(gpn * PAGE_SIZE64).unwrap()), | ||
) | ||
.await | ||
} else { | ||
tracing::debug!(opcode = opcode.0, size = mem.len(), "double buffering"); | ||
|
||
// Guest memory is not accessible by the device. Double buffer | ||
// through an allocation. | ||
let double_buffer_pages = double_buffer_pages.insert( | ||
self.alloc | ||
.alloc_bytes(mem.len()) | ||
.await | ||
.ok_or(RequestError::TooLarge)?, | ||
); | ||
let dma_manager = GlobalDmaManager::get_instance(); | ||
let dma_client = dma_manager.create_client(self.pinning_threshold); | ||
|
||
if opcode.transfer_host_to_controller() { | ||
double_buffer_pages | ||
.copy_from_guest_memory(guest_memory, mem) | ||
.map_err(RequestError::Memory)?; | ||
} | ||
// Force bounce buffer for all transactions in this example | ||
let options = DmaMapOptions { | ||
force_bounce_buffer: true, | ||
}; | ||
|
||
self.make_prp( | ||
0, | ||
(0..double_buffer_pages.page_count()) | ||
.map(|i| double_buffer_pages.physical_address(i)), | ||
let dma_transactions = dma_client | ||
.map_dma_ranges( | ||
mem.gpns().iter().map(|&gpn| guest_memory.range(gpn)).collect::<Vec<_>>(), | ||
Some(&options), | ||
) | ||
.await | ||
}; | ||
.map_err(|_| RequestError::Memory)?; | ||
|
||
command.dptr = prp.dptr; | ||
let r = self.issue_raw(command).await; | ||
if let Some(double_buffer_pages) = double_buffer_pages { | ||
if r.is_ok() && opcode.transfer_controller_to_host() { | ||
double_buffer_pages | ||
.copy_to_guest_memory(guest_memory, mem) | ||
.map_err(RequestError::Memory)?; | ||
} | ||
} | ||
r | ||
command.dptr = self | ||
.make_prp( | ||
dma_transactions.transactions[0].dma_addr as u64, | ||
dma_transactions.transactions.iter().map(|t| t.dma_addr), | ||
) | ||
.await; | ||
|
||
let result = self.issue_raw(command).await; | ||
|
||
dma_client | ||
.unmap_dma_ranges(&dma_transactions.transactions) | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. So we need handle the same functionality as copy_to_guest_memory in unmap_dma_ranges if opcode.transfer_controller_to_host() There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. To simplify usage, can we just pass dma_transactions to unmap_dma_ranges? So caller needn't know details of DmaTransactionHandler? |
||
.map_err(|_| RequestError::Memory)?; | ||
|
||
result | ||
} | ||
|
||
async fn make_prp( | ||
|
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,36 @@ | ||
|
||
#[derive(Debug)] | ||
pub enum DmaError { | ||
InitializationFailed, | ||
MapFailed, | ||
UnmapFailed, | ||
PinFailed, | ||
BounceBufferFailed, | ||
} | ||
|
||
#[derive(Debug, Clone)] | ||
pub struct DmaMapOptions { | ||
pub force_bounce_buffer: bool, // Always use bounce buffers, even if pinning succeeds | ||
} | ||
|
||
// Structure encapsulating the result of a DMA mapping operation | ||
pub struct DmaTransactionHandler { | ||
pub transactions: Vec<DmaTransaction>, | ||
} | ||
|
||
// Structure representing a DMA transaction with address and metadata | ||
pub struct DmaTransaction { | ||
pub original_addr: usize, | ||
pub dma_addr: usize, | ||
pub size: usize, | ||
pub is_pinned: bool, | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Can we have comments explaining these fields? like is_pinned and is_bounce_buffer cannot be both true, why do we need keep both? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. yes, i will add it. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Agree with Juan. It seems you can go further, and do something like: pub enum MemoryBacking {
Pinned(prepinned: bool),
InBounceBuffer
} And rather than doing There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. if dma mapping options are disjoint (ie pinned or is bounce buffer), then they should be represented with an enum like Matt suggested. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Yes, will change this to enum. |
||
pub is_bounce_buffer: bool, | ||
pub is_physical: bool, | ||
pub is_prepinned: bool, | ||
} | ||
|
||
// Trait for the DMA interface | ||
pub trait DmaInterface { | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Do you envision that this would replace other uses of bounce buffering? (for example, copying from private memory into shared memory for isolated VMs, or when the block disk bounces for arm64 guests)? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. yes, i do envision that. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Got it. How do you envision handling the case:
(I'm thinking about the block device driver here, where it would never want to pin memory - the kernel doesn't know about the VTL0 addresses.) There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. We discussed this offline. |
||
fn map_dma_ranges(&self, ranges: &[MemoryRange], options: Option<&DmaMapOptions>,) -> Result<DmaTransactionHandler, DmaError>; | ||
fn unmap_dma_ranges(&self, dma_transactions: &[DmaTransaction]) -> Result<(), DmaError>; | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Not returning an opaque handle but asking the caller to provide some pub struct fields seems a bit odd to me, but we can always iterate on this api later since all users will be in-tree. At least, it seems like perhaps There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
|
||
} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I understand this is a draft ... could you add a doc comment so that folks know the intended contract and usage here?