


default search action
Bita Darvish Rouhani
Person information
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
[c24]Neusha Javidnia, Bita Darvish Rouhani, Farinaz Koushanfar:
Key, Value, Compress: A Systematic Exploration of KV Cache Compression Techniques. CICC 2025: 1-3
[c23]Mengting Ai
, Tianxin Wei
, Yifan Chen
, Zhichen Zeng
, Ritchie Zhao
, Girish Varatkar
, Bita Darvish Rouhani
, Xianfeng Tang
, Hanghang Tong
, Jingrui He
:
ResMoE: Space-efficient Compression of Mixture of Experts LLMs via Residual Restoration. KDD (1) 2025: 1-12
[i20]Mengting Ai, Tianxin Wei, Yifan Chen, Zhichen Zeng, Ritchie Zhao, Girish Varatkar, Bita Darvish Rouhani, Xianfeng Tang, Hanghang Tong, Jingrui He:
ResMoE: Space-efficient Compression of Mixture of Experts LLMs via Residual Restoration. CoRR abs/2503.06881 (2025)
[i19]Neusha Javidnia, Bita Darvish Rouhani, Farinaz Koushanfar:
Key, Value, Compress: A Systematic Exploration of KV Cache Compression Techniques. CoRR abs/2503.11816 (2025)
[i18]Tiyasa Mitra, Ritika Borkar, Nidhi Bhatia, Ramon Matas, Shivam Raj, Dheevatsa Mudigere, Ritchie Zhao, Maximilian Golub, Arpan Dutta, Sailaja Madduri, Dharmesh Jani, Brian Pharris, Bita Darvish Rouhani:
Beyond the Buzz: A Pragmatic Take on Inference Disaggregation. CoRR abs/2506.05508 (2025)
[i17]Nidhi Bhatia, Ankit More, Ritika Borkar, Tiyasa Mitra, Ramon Matas, Ritchie Zhao, Maximilian Golub, Dheevatsa Mudigere, Brian Pharris, Bita Darvish Rouhani:
Helix Parallelism: Rethinking Sharding Strategies for Interactive Multi-Million-Token LLM Decoding. CoRR abs/2507.07120 (2025)
[i16]Aarti Basant, Abhijit Khairnar, Abhijit Paithankar, Abhinav Khattar, Adithya Renduchintala, Aditya Malte, Akhiad Bercovich, Akshay Hazare, Alejandra Rico, Aleksander Ficek, Alex Kondratenko, Alex Shaposhnikov, Alexander Bukharin, Ali Taghibakhshi, Amelia Barton, Ameya Sunil Mahabaleshwarkar, Amy Shen, Andrew Tao, Ann Guan, Anna Shors, Anubhav Mandarwal, Arham Mehta, Arun Venkatesan, Ashton Sharabiani, Ashwath Aithal, Ashwin Poojary, Ayush Dattagupta, Balaram Buddharaju, Banghua Zhu, Barnaby Simkin, Bilal Kartal, Bita Darvish Rouhani, Bobby Chen, Boris Ginsburg, Brandon Norick, Brian Yu, Bryan Catanzaro, Charles Wang, Charlie Truong, Chetan Mungekar, Chintan Patel, Chris Alexiuk, Christian Munley, Christopher Parisien, Dan Su, Daniel Afrimi, Daniel Korzekwa, Daniel Rohrer, Daria Gitman, David Mosallanezhad, Deepak Narayanan, Dima Rekesh, Dina Yared, Dmytro Pykhtar, Dong Ahn, Duncan Riach, Eileen Long, Elliott Ning, Eric Chung, Erick Galinkin, Evelina Bakhturina, Gargi Prasad, Gerald Shen, Haifeng Qian, Haim Elisha, Harsh Sharma, Hayley Ross, Helen Ngo, Herman Sahota, Hexin Wang, Hoo Chang Shin, Hua Huang, Iain Cunningham, Igor Gitman, Ivan Moshkov, Jaehun Jung, Jan Kautz, Jane Polak Scowcroft, Jared Casper, Jian Zhang, Jiaqi Zeng, Jimmy Zhang, Jinze Xue, Jocelyn Huang, Joey Conway, John Kamalu, Jonathan M. Cohen, Joseph Jennings, Julien Veron Vialard, Junkeun Yi, Jupinder Parmar, Kari Briski, Katherine Cheung, Katherine Luna, Keith W. Ross, Keshav Santhanam, Kezhi Kong, Krzysztof Pawelec, Kumar Anik:
NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model. CoRR abs/2508.14444 (2025)
[i15]Felix Abecassis, Anjulie Agrusa, Dong Ahn, Jonah Alben, Stefania Alborghetti, Michael Andersch, Sivakumar Arayandi, Alexis Bjorlin, Aaron Blakeman, Evan Briones, Ian Buck, Bryan Catanzaro, Jinhang Choi, Mike Chrzanowski, Eric Chung, Victor Cui, Steve Dai, Bita Darvish Rouhani, Carlo del Mundo, Deena Donia, Sukru Burc Eryilmaz, Henry Estela, Abhinav Goel, Oleg Goncharov, Yugi Guvvala, Robert Hesse, Russell Hewett, Herbert Hum, Ujval J. Kapasi, Brucek Khailany, Mikail Khona, Nick Knight, Alex Kondratenko, Ronny Krashinsky, Ben Lanir, Simon Layton, Michael Lightstone, Daniel Lo, Paulius Micikevicius, Asit K. Mishra, Tim Moon, Deepak Narayanan, Chao Ni, Abhijit Paithankar, Satish Pasumarthi, Ankit Patel, Mostofa Patwary, Ashwin Poojary, Gargi Prasad, Sweta Priyadarshi, Yigong Qin, Xiaowei Ren, Oleg Rybakov, Charbel Sakr, Sanjeev Satheesh, Stas Sergienko, Pavel Shamis, Kirthi Shankar, Nishant Sharma, Mohammad Shoeybi, Michael Siu, Misha Smelyanskiy, Darko Stosic, Dusan Stosic, Bor-Yiing Su, Frank Sun, Nima Tajbakhsh, Shelby Thomas, Przemek Tredak, Evgeny Tsykunov, Gandhi Vaithilingam, Aditya Vavre, Rangharajan Venkatesan, Roger Waleffe, Qiyu Wan, Hexin Wang, Mengdi Wang, Lizzie Wei, Hao Wu, Evan Wu, Keith Wyss, Ning Xu, Jinze Xue, Charlene Yang, Yujia Zhai, Ruoxi Zhang, Jingyang Zhu, Zhongbo Zhu:
Pretraining Large Language Models with NVFP4. CoRR abs/2509.25149 (2025)- 2024
[j12]Huili Chen
, Cheng Fu, Bita Darvish Rouhani, Jishen Zhao
, Farinaz Koushanfar
:
Intellectual Property Protection of Deep-Learning Systems via Hardware/Software Co-Design. IEEE Des. Test 41(2): 23-31 (2024)- 2023
[c22]Bita Darvish Rouhani
, Ritchie Zhao
, Venmugil Elango
, Rasoul Shafipour
, Mathew Hall
, Maral Mesmakhosroshahi
, Ankit More
, Levi Melnick
, Maximilian Golub
, Girish Varatkar
, Lai Shao
, Gaurav Kolhe
, Dimitry Melts
, Jasmine Klar
, Renee L'Heureux
, Matt Perry
, Doug Burger
, Eric S. Chung
, Zhaoxia (Summer) Deng
, Sam Naghshineh
, Jongsoo Park
, Maxim Naumov
:
With Shared Microexponents, A Little Shifting Goes a Long Way. ISCA 2023: 83:1-83:13
[i14]Bita Rouhani, Ritchie Zhao, Venmugil Elango, Rasoul Shafipour, Mathew Hall, Maral Mesmakhosroshahi, Ankit More, Levi Melnick, Maximilian Golub, Girish Varatkar, Lei Shao, Gaurav Kolhe, Dimitry Melts, Jasmine Klar, Renee L'Heureux, Matt Perry, Doug Burger, Eric S. Chung, Zhaoxia Deng, Sam Naghshineh, Jongsoo Park, Maxim Naumov:
Shared Microexponents: A Little Shifting Goes a Long Way. CoRR abs/2302.08007 (2023)
[i13]Bita Darvish Rouhani, Ritchie Zhao, Ankit More, Mathew Hall, Alireza Khodamoradi, Summer Deng, Dhruv Choudhary, Marius Cornea, Eric Dellinger, Kristof Denolf, Dusan Stosic, Venmugil Elango, Maximilian Golub, Alexander Heinecke, Phil James-Roxby, Dharmesh Jani, Gaurav Kolhe, Martin Langhammer, Ada Li, Levi Melnick, Maral Mesmakhosroshahi, Andres Rodriguez, Michael Schulte, Rasoul Shafipour, Lei Shao, Michael Y. Siu, Pradeep Dubey, Paulius Micikevicius, Maxim Naumov, Colin Verilli, Ralph Wittig, Doug Burger, Eric S. Chung:
Microscaling Data Formats for Deep Learning. CoRR abs/2310.10537 (2023)- 2021
[j11]Mojan Javaheripi
, Mohammad Samragh
, Bita Darvish Rouhani, Tara Javidi
, Farinaz Koushanfar
:
Hardware/Algorithm Codesign for Adversarially Robust Deep Learning. IEEE Des. Test 38(3): 31-38 (2021)
[j10]Mojan Javaheripi
, Bita Darvish Rouhani, Farinaz Koushanfar
:
SWANN: Small-World Architecture for Fast Convergence of Neural Networks. IEEE J. Emerg. Sel. Topics Circuits Syst. 11(4): 575-585 (2021)
[j9]Mojan Javaheripi
, Mohammad Samragh
, Bita Darvish Rouhani
, Tara Javidi
, Farinaz Koushanfar
:
CuRTAIL: ChaRacterizing and Thwarting AdversarIal Deep Learning. IEEE Trans. Dependable Secur. Comput. 18(2): 736-752 (2021)- 2020
[c21]Huili Chen, Bita Darvish Rouhani, Farinaz Koushanfar
:
SpecMark: A Spectral Watermarking Framework for IP Protection of Speech Recognition Systems. INTERSPEECH 2020: 2312-2316
[c20]Bita Darvish Rouhani, Daniel Lo, Ritchie Zhao, Ming Liu, Jeremy Fowers, Kalin Ovtcharov, Anna Vinogradsky, Sarah Massengill, Lita Yang, Ray Bittner, Alessandro Forin, Haishan Zhu, Taesik Na, Prerak Patel, Shuai Che, Lok Chand Koppaka, Xia Song, Subhojit Som, Kaustav Das, Saurabh Tiwary, Steven K. Reinhardt, Sitaram Lanka, Eric S. Chung, Doug Burger:
Pushing the Limits of Narrow Precision Inferencing at Cloud Scale with Microsoft Floating Point. NeurIPS 2020
[i12]Behnaz Arzani, Bita Rouhani:
Towards A Domain-Customized Automated Machine Learning Framework For Networks and Systems. CoRR abs/2004.11931 (2020)
2010 – 2019
- 2019
[j8]Bita Darvish Rouhani
, Mohammad Samragh, Tara Javidi
, Farinaz Koushanfar
:
Safe Machine Learning and Defeating Adversarial Attacks. IEEE Secur. Priv. 17(2): 31-38 (2019)
[j7]M. Sadegh Riazi
, Bita Darvish Rouhani
, Farinaz Koushanfar
:
Deep Learning on Private Data. IEEE Secur. Priv. 17(6): 54-63 (2019)
[c19]Bita Darvish Rouhani, Huili Chen, Farinaz Koushanfar
:
DeepSigns: An End-to-End Watermarking Framework for Ownership Protection of Deep Neural Networks. ASPLOS 2019: 485-497
[c18]Mohsen Imani, Samuel Bosch
, Mojan Javaheripi, Bita Darvish Rouhani, Xinyu Wu, Farinaz Koushanfar
, Tajana Rosing:
SemiHD: Semi-Supervised Learning Using Hyperdimensional Computing. ICCAD 2019: 1-8
[c17]Huili Chen, Cheng Fu, Bita Darvish Rouhani, Jishen Zhao, Farinaz Koushanfar
:
DeepAttest: an end-to-end attestation framework for deep neural networks. ISCA 2019: 487-498
[c16]Huili Chen, Bita Darvish Rouhani, Cheng Fu, Jishen Zhao, Farinaz Koushanfar
:
DeepMarks: A Secure Fingerprinting Framework for Digital Rights Management of Deep Learning Models. ICMR 2019: 105-113
[i11]Huili Chen, Bita Darvish Rouhani, Farinaz Koushanfar:
BlackMarks: Blackbox Multibit Watermarking for Deep Neural Networks. CoRR abs/1904.00344 (2019)
[i10]Mojan Javaheripi, Bita Darvish Rouhani, Farinaz Koushanfar:
SWNet: Small-World Neural Networks and Rapid Convergence. CoRR abs/1904.04862 (2019)- 2018
[b1]Bita Darvish Rouhani:
Succinct and Assured Machine Learning: Training and Execution. University of California, San Diego, USA, 2018
[j6]Eric S. Chung, Jeremy Fowers, Kalin Ovtcharov, Michael Papamichael, Adrian M. Caulfield, Todd Massengill, Ming Liu, Daniel Lo, Shlomi Alkalay, Michael Haselman, Maleen Abeydeera, Logan Adams, Hari Angepat, Christian Boehn, Derek Chiou, Oren Firestein, Alessandro Forin, Kang Su Gatlin, Mahdi Ghandi, Stephen Heil, Kyle Holohan, Ahmad El Husseini, Tamás Juhász, Kara Kagi, Ratna Kovvuri, Sitaram Lanka, Friedel van Megen, Dima Mukhortov, Prerak Patel, Brandon Perez, Amanda Rapsang, Steven K. Reinhardt, Bita Rouhani, Adam Sapek, Raja Seera, Sangeetha Shekar, Balaji Sridharan, Gabriel Weisz, Lisa Woods, Phillip Yi Xiao, Dan Zhang, Ritchie Zhao, Doug Burger:
Serving DNNs in Real Time at Datacenter Scale with Project Brainwave. IEEE Micro 38(2): 8-20 (2018)
[j5]Bita Darvish Rouhani
, Siam Umar Hussain, Kristin E. Lauter, Farinaz Koushanfar
:
ReDCrypt: Real-Time Privacy-Preserving Deep Learning Inference in Clouds Using FPGAs. ACM Trans. Reconfigurable Technol. Syst. 11(3): 21:1-21:21 (2018)
[c15]Bita Darvish Rouhani, M. Sadegh Riazi, Farinaz Koushanfar
:
Deepsecure: scalable provably-secure deep learning. DAC 2018: 2:1-2:6
[c14]Siam U. Hussain, Bita Darvish Rouhani, Mohammad Ghasemzadeh, Farinaz Koushanfar
:
MAXelerator: FPGA accelerator for privacy preserving multiply-accumulate (MAC) on cloud servers. DAC 2018: 33:1-33:6
[c13]Bita Darvish Rouhani, Mohammad Ghasemzadeh, Farinaz Koushanfar
:
CausaLearn: Automated Framework for Scalable Streaming-based Causal Bayesian Learning using FPGAs. FPGA 2018: 1-10
[c12]Bita Darvish Rouhani, Mohammad Samragh, Mojan Javaheripi, Tara Javidi
, Farinaz Koushanfar
:
Assured deep learning: practical defense against adversarial attacks. ICCAD 2018: 20
[c11]Bita Darvish Rouhani, Mohammad Samragh, Mojan Javaheripi, Tara Javidi
, Farinaz Koushanfar
:
DeepFense: online accelerated defense against adversarial deep learning. ICCAD 2018: 134
[i9]Bita Darvish Rouhani, Huili Chen, Farinaz Koushanfar:
DeepSigns: A Generic Watermarking Framework for IP Protection of Deep Learning Models. CoRR abs/1804.00750 (2018)
[i8]Huili Chen, Bita Darvish Rouhani, Farinaz Koushanfar:
DeepMarks: A Digital Fingerprinting Framework for Deep Neural Networks. CoRR abs/1804.03648 (2018)
[i7]Mohammad Ghasemzadeh, Fang Lin, Bita Darvish Rouhani, Farinaz Koushanfar, Ke Huang:
AgileNet: Lightweight Dictionary-based Few-shot Learning. CoRR abs/1805.08311 (2018)
[i6]Huili Chen, Bita Darvish Rouhani, Xinwei Fan, Osman Cihan Kilinc, Farinaz Koushanfar:
Performance Comparison of Contemporary DNN Watermarking Techniques. CoRR abs/1811.03713 (2018)
[i5]Bita Darvish Rouhani, Huili Chen, Farinaz Koushanfar
:
DeepSigns: A Generic Watermarking Framework for IP Protection of Deep Learning Models. IACR Cryptol. ePrint Arch. 2018: 311 (2018)
[i4]Huili Chen, Bita Darvish Rohani, Farinaz Koushanfar
:
DeepMarks: A Digital Fingerprinting Framework for Deep Neural Networks. IACR Cryptol. ePrint Arch. 2018: 322 (2018)- 2017
[j4]Bita Darvish Rouhani, Azalia Mirhoseini, Farinaz Koushanfar
:
RISE: An Automated Framework for Real-Time Intelligent Video Surveillance on FPGA. ACM Trans. Embed. Comput. Syst. 16(5s): 158:1-158:18 (2017)
[c10]Bita Darvish Rouhani, Azalia Mirhoseini, Farinaz Koushanfar
:
Deep3: Leveraging Three Levels of Parallelism for Efficient Deep Learning. DAC 2017: 61:1-61:6
[c9]Azalia Mirhoseini, Bita Darvish Rouhani, Ebrahim M. Songhori, Farinaz Koushanfar
:
ExtDict: Extensible Dictionaries for Data- and Platform-Aware Large-Scale Learning. IPDPS Workshops 2017: 379-388
[c8]Bita Darvish Rouhani, Azalia Mirhoseini, Farinaz Koushanfar
:
TinyDL: Just-in-time deep learning solution for constrained embedded systems. ISCAS 2017: 1-4
[i3]Bita Darvish Rouhani, M. Sadegh Riazi, Farinaz Koushanfar:
DeepSecure: Scalable Provably-Secure Deep Learning. CoRR abs/1705.08963 (2017)
[i2]Bita Darvish Rouhani, Mohammad Samragh, Tara Javidi, Farinaz Koushanfar:
CuRTAIL: ChaRacterizing and Thwarting AdversarIal deep Learning. CoRR abs/1709.02538 (2017)
[i1]Bita Darvish Rouhani, M. Sadegh Riazi, Farinaz Koushanfar
:
DeepSecure: Scalable Provably-Secure Deep Learning. IACR Cryptol. ePrint Arch. 2017: 502 (2017)- 2016
[j3]Azalia Mirhoseini, Bita Darvish Rouhani, Ebrahim M. Songhori, Farinaz Koushanfar
:
Chime: Checkpointing Long Computations on Interm ittently Energized IoT Devices. IEEE Trans. Multi Scale Comput. Syst. 2(4): 277-290 (2016)
[j2]Bita Darvish Rouhani, Azalia Mirhoseini, Ebrahim M. Songhori, Farinaz Koushanfar
:
Automated Real-Time Analysis of Streaming Big and Dense Data on Reconfigurable Platforms. ACM Trans. Reconfigurable Technol. Syst. 10(1): 8:1-8:22 (2016)
[c7]Bita Darvish Rouhani, Azalia Mirhoseini, Farinaz Koushanfar
:
Going deeper than deep learning for massive data analytics under physical constraints. CODES+ISSS 2016: 17:1-17:3
[c6]Azalia Mirhoseini, Bita Darvish Rouhani, Ebrahim M. Songhori, Farinaz Koushanfar
:
Perform-ML: performance optimized machine learning by platform and content aware customization. DAC 2016: 20:1-20:6
[c5]Bita Darvish Rouhani, Azalia Mirhoseini, Farinaz Koushanfar
:
DeLight: Adding Energy Dimension To Deep Neural Networks. ISLPED 2016: 112-117- 2015
[j1]Babak Darvish Rouhani, Mohd Naz'ri Mahrin, Hossein Shirazi
, Fatemeh Nikpay, Bita Darvish Rouhani:
An Effectiveness Model for Enterprise Architecture Methodologies. Int. J. Enterp. Inf. Syst. 11(2): 50-64 (2015)
[c4]Bita Darvish Rouhani, Ebrahim M. Songhori
, Azalia Mirhoseini, Farinaz Koushanfar
:
SSketch: An Automated Framework for Streaming Sketch-Based Analysis of Big Data on FPGA. FCCM 2015: 187-194
[c3]Azalia Mirhoseini, Ebrahim M. Songhori
, Bita Darvish Rouhani, Farinaz Koushanfar
:
Flexible Transformations For Learning Big Data. SIGMETRICS 2015: 453-454
[c2]Babak Darvish Rouhani, Mohd Naz'ri Mahrin, Fatemeh Nikpay, Pourya Nikfard, Bita Darvish Rouhani:
Agent-Oriented Based Enterprise Architecture Implementation Methodology. WorldCIST (1) 2015: 411-419- 2014
[c1]Babak Darvish Rouhani, Mohd Naz'ri Mahrin
, Fatemeh Nikpay, Bita Darvish Rouhani:
Current Issues on Enterprise Architecture Implementation Methodology. WorldCIST (2) 2014: 239-246
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from
to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the
of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from
,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from
and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from
.
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-10-28 23:26 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID







