Skip to main content

Showing 1–21 of 21 results for author: Pezeshkpour, P

  1. arXiv:2406.05194  [pdf, other

    cs.CL cs.AI cs.LG

    LLMs Are Not Intelligent Thinkers: Introducing Mathematical Topic Tree Benchmark for Comprehensive Evaluation of LLMs

    Authors: Arash Gholami Davoodi, Seyed Pouyan Mousavi Davoudi, Pouya Pezeshkpour

    Abstract: Large language models (LLMs) demonstrate impressive capabilities in mathematical reasoning. However, despite these achievements, current evaluations are mostly limited to specific mathematical topics, and it remains unclear whether LLMs are genuinely engaging in reasoning. To address these gaps, we present the Mathematical Topics Tree (MaTT) benchmark, a challenging and structured benchmark that o… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  2. arXiv:2406.00584  [pdf, other

    cs.DB cs.AI

    A Blueprint Architecture of Compound AI Systems for Enterprise

    Authors: Eser Kandogan, Sajjadur Rahman, Nikita Bhutani, Dan Zhang, Rafael Li Chen, Kushan Mitra, Sairam Gurajada, Pouya Pezeshkpour, Hayate Iso, Yanlin Feng, Hannah Kim, Chen Shen, Jin Wang, Estevam Hruschka

    Abstract: Large Language Models (LLMs) have showcased remarkable capabilities surpassing conventional NLP challenges, creating opportunities for use in production use cases. Towards this goal, there is a notable shift to building compound AI systems, wherein LLMs are integrated into an expansive software infrastructure with many components like models, retrievers, databases and tools. In this paper, we intr… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: Compound AI Systems Workshop at the Data+AI Summit 2024

  3. arXiv:2404.00211  [pdf, other

    cs.CL cs.LG

    Multi-Conditional Ranking with Large Language Models

    Authors: Pouya Pezeshkpour, Estevam Hruschka

    Abstract: Utilizing large language models (LLMs) to rank a set of items has become a common approach in recommendation and retrieval systems. Typically, these systems focus on ordering a substantial number of documents in a monotonic order based on a given query. However, real-world scenarios often present a different challenge: ranking a comparatively smaller set of items, but according to a variety of div… ▽ More

    Submitted 29 March, 2024; originally announced April 2024.

  4. arXiv:2402.01108  [pdf, other

    cs.CL cs.LG

    Reasoning Capacity in Multi-Agent Systems: Limitations, Challenges and Human-Centered Solutions

    Authors: Pouya Pezeshkpour, Eser Kandogan, Nikita Bhutani, Sajjadur Rahman, Tom Mitchell, Estevam Hruschka

    Abstract: Remarkable performance of large language models (LLMs) in a variety of tasks brings forth many opportunities as well as challenges of utilizing them in production settings. Towards practical adoption of LLMs, multi-agent systems hold great promise to augment, integrate, and orchestrate LLMs in the larger context of enterprise platforms that use existing proprietary data and models to tackle comple… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

  5. arXiv:2311.06383  [pdf, other

    cs.CL cs.LG

    Distilling Large Language Models using Skill-Occupation Graph Context for HR-Related Tasks

    Authors: Pouya Pezeshkpour, Hayate Iso, Thom Lake, Nikita Bhutani, Estevam Hruschka

    Abstract: Numerous HR applications are centered around resumes and job descriptions. While they can benefit from advancements in NLP, particularly large language models, their real-world adoption faces challenges due to absence of comprehensive benchmarks for various HR tasks, and lack of smaller models with competitive capabilities. In this paper, we aim to bridge this gap by introducing the Resume-Job Des… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

  6. arXiv:2309.07382  [pdf, other

    cs.CL

    Less is More for Long Document Summary Evaluation by LLMs

    Authors: Yunshu Wu, Hayate Iso, Pouya Pezeshkpour, Nikita Bhutani, Estevam Hruschka

    Abstract: Large Language Models (LLMs) have shown promising performance in summary evaluation tasks, yet they face challenges such as high computational costs and the Lost-in-the-Middle problem where important information in the middle of long documents is often overlooked. To address these issues, this paper introduces a novel approach, Extract-then-Evaluate, which involves extracting key sentences from a… ▽ More

    Submitted 18 January, 2024; v1 submitted 13 September, 2023; originally announced September 2023.

    Comments: EACL (main)

  7. arXiv:2308.13676  [pdf, other

    cs.CL cs.AI cs.LG

    Rethinking Language Models as Symbolic Knowledge Graphs

    Authors: Vishwas Mruthyunjaya, Pouya Pezeshkpour, Estevam Hruschka, Nikita Bhutani

    Abstract: Symbolic knowledge graphs (KGs) play a pivotal role in knowledge-centric applications such as search, question answering and recommendation. As contemporary language models (LMs) trained on extensive textual data have gained prominence, researchers have extensively explored whether the parametric knowledge within these models can match up to that present in knowledge graphs. Various methodologies… ▽ More

    Submitted 25 August, 2023; originally announced August 2023.

  8. arXiv:2308.11483  [pdf, other

    cs.CL cs.AI cs.LG

    Large Language Models Sensitivity to The Order of Options in Multiple-Choice Questions

    Authors: Pouya Pezeshkpour, Estevam Hruschka

    Abstract: Large Language Models (LLMs) have demonstrated remarkable capabilities in various NLP tasks. However, previous works have shown these models are sensitive towards prompt wording, and few-shot demonstrations and their order, posing challenges to fair assessment of these models. As these models become more powerful, it becomes imperative to understand and address these limitations. In this paper, we… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

  9. arXiv:2306.06264  [pdf, other

    cs.CL cs.LG

    Measuring and Modifying Factual Knowledge in Large Language Models

    Authors: Pouya Pezeshkpour

    Abstract: Large Language Models (LLMs) store an extensive amount of factual knowledge obtained from vast collections of text. To effectively utilize these models for downstream tasks, it is crucial to have reliable methods for measuring their knowledge. However, existing approaches for knowledge measurement have certain limitations, and despite recent efforts, they fail to provide accurate measurements and… ▽ More

    Submitted 9 June, 2023; originally announced June 2023.

  10. arXiv:2210.04337  [pdf, other

    cs.CL cs.LG

    Quantifying Social Biases Using Templates is Unreliable

    Authors: Preethi Seshadri, Pouya Pezeshkpour, Sameer Singh

    Abstract: Recently, there has been an increase in efforts to understand how large language models (LLMs) propagate and amplify social biases. Several works have utilized templates for fairness evaluation, which allow researchers to quantify social biases in the absence of test sets with protected attribute labels. While template evaluation can be a convenient and helpful diagnostic tool to understand model… ▽ More

    Submitted 9 October, 2022; originally announced October 2022.

  11. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  12. arXiv:2205.02216  [pdf, other

    cs.IT

    The Extremal GDoF Gain of Optimal versus Binary Power Control in $K$ User Interference Networks Is $Θ(\sqrt{K})$

    Authors: Yao-Chia Chan, Pouya Pezeshkpour, Chunhua Geng, Syed A. Jafar

    Abstract: Using ideas from Generalized Degrees of Freedom (GDoF) analyses and extremal network theory, this work studies the extremal gain of optimal power control over binary (on/off) power control, especially in large interference networks, in search of new theoretical insights. Whereas numerical studies have already established that in most practical settings binary power control is close to optimal, the… ▽ More

    Submitted 5 May, 2022; v1 submitted 4 May, 2022; originally announced May 2022.

    Comments: 18 pages, 5 figures; Typo corrections

  13. arXiv:2107.00323  [pdf, other

    cs.CL cs.LG

    Combining Feature and Instance Attribution to Detect Artifacts

    Authors: Pouya Pezeshkpour, Sarthak Jain, Sameer Singh, Byron C. Wallace

    Abstract: Training the deep neural networks that dominate NLP requires large datasets. These are often collected automatically or via crowdsourcing, and may exhibit systematic biases or annotation artifacts. By the latter we mean spurious correlations between inputs and outputs that do not represent a generally held causal relationship between features and classes; models that exploit such correlations may… ▽ More

    Submitted 25 March, 2022; v1 submitted 1 July, 2021; originally announced July 2021.

    Comments: ACL Findings 2022

  14. arXiv:2104.04128  [pdf, other

    cs.CL cs.LG

    An Empirical Comparison of Instance Attribution Methods for NLP

    Authors: Pouya Pezeshkpour, Sarthak Jain, Byron C. Wallace, Sameer Singh

    Abstract: Widespread adoption of deep models has motivated a pressing need for approaches to interpret network outputs and to facilitate model debugging. Instance attribution methods constitute one means of accomplishing these goals by retrieving training instances that (may have) led to a particular prediction. Influence functions (IF; Koh and Liang 2017) provide machinery for doing this by quantifying the… ▽ More

    Submitted 8 April, 2021; originally announced April 2021.

    Comments: NAACL 2021

  15. arXiv:2012.06154  [pdf, other

    cs.CL cs.AI

    ParsiNLU: A Suite of Language Understanding Challenges for Persian

    Authors: Daniel Khashabi, Arman Cohan, Siamak Shakeri, Pedram Hosseini, Pouya Pezeshkpour, Malihe Alikhani, Moin Aminnaseri, Marzieh Bitaab, Faeze Brahman, Sarik Ghazarian, Mozhdeh Gheini, Arman Kabiri, Rabeeh Karimi Mahabadi, Omid Memarrast, Ahmadreza Mosallanezhad, Erfan Noury, Shahab Raji, Mohammad Sadegh Rasooli, Sepideh Sadeghi, Erfan Sadeqi Azer, Niloofar Safi Samghabadi, Mahsa Shafaei, Saber Sheybani, Ali Tazarv, Yadollah Yaghoobzadeh

    Abstract: Despite the progress made in recent years in addressing natural language understanding (NLU) challenges, the majority of this progress remains to be concentrated on resource-rich languages like English. This work focuses on Persian language, one of the widely spoken languages in the world, and yet there are few NLU datasets available for this rich language. The availability of high-quality evaluat… ▽ More

    Submitted 13 July, 2021; v1 submitted 11 December, 2020; originally announced December 2020.

    Comments: To appear on Transactions of the Association for Computational Linguistics (TACL), 2021

  16. arXiv:1906.10244  [pdf, other

    cs.LG cs.HC stat.ML

    Generating User-friendly Explanations for Loan Denials using GANs

    Authors: Ramya Srinivasan, Ajay Chander, Pouya Pezeshkpour

    Abstract: Financial decisions impact our lives, and thus everyone from the regulator to the consumer is interested in fair, sound, and explainable decisions. There is increasing competitive desire and regulatory incentive to deploy AI mindfully within financial services. An important mechanism towards that end is to explain AI decisions to various stakeholders. State-of-the-art explainable AI systems mostly… ▽ More

    Submitted 24 June, 2019; originally announced June 2019.

    Comments: Presented at the NeurIPS 2018 Workshop on Challenges and Opportunities for AI in Financial Services: the Impact of Fairness, Explainability, Accuracy, and Privacy, Montreal, Canada

  17. arXiv:1905.00563  [pdf, other

    cs.LG cs.CL stat.ML

    Investigating Robustness and Interpretability of Link Prediction via Adversarial Modifications

    Authors: Pouya Pezeshkpour, Yifan Tian, Sameer Singh

    Abstract: Representing entities and relations in an embedding space is a well-studied approach for machine learning on relational data. Existing approaches, however, primarily focus on improving accuracy and overlook other aspects such as robustness and interpretability. In this paper, we propose adversarial modifications for link prediction models: identifying the fact to add into or remove from the knowle… ▽ More

    Submitted 1 May, 2019; originally announced May 2019.

    Comments: Published at NAACL 2019

  18. arXiv:1809.01341  [pdf, other

    cs.AI cs.CL stat.ML

    Embedding Multimodal Relational Data for Knowledge Base Completion

    Authors: Pouya Pezeshkpour, Liyan Chen, Sameer Singh

    Abstract: Representing entities and relations in an embedding space is a well-studied approach for machine learning on relational data. Existing approaches, however, primarily focus on simple link structure between a finite set of entities, ignoring the variety of data types that are often used in knowledge bases, such as text, images, and numerical values. In this paper, we propose multimodal knowledge bas… ▽ More

    Submitted 7 September, 2018; v1 submitted 5 September, 2018; originally announced September 2018.

    Comments: Published at EMNLP 2018

  19. arXiv:1805.00184  [pdf, other

    stat.ML cs.AI cs.LG

    Compact Factorization of Matrices Using Generalized Round-Rank

    Authors: Pouya Pezeshkpour, Carlos Guestrin, Sameer Singh

    Abstract: Matrix factorization is a well-studied task in machine learning for compactly representing large, noisy data. In our approach, instead of using the traditional concept of matrix rank, we define a new notion of link-rank based on a non-linear link function used within factorization. In particular, by applying the round function on a factorization to obtain ordinal-valued matrices, we introduce gene… ▽ More

    Submitted 1 May, 2018; originally announced May 2018.

  20. arXiv:1504.07570  [pdf, ps, other

    cs.IT

    An Optimal Linear Coding for Index Coding Problem

    Authors: Pouya Pezeshkpour

    Abstract: An optimal linear coding solution for index coding problem is established. Instead of network coding approach by focus on graph theoric and algebraic methods a linear coding program for solving both unicast and groupcast index coding problem is presented. The coding is proved to be the optimal solution from the linear perspective and can be easily utilize for any number of messages. The importance… ▽ More

    Submitted 7 May, 2015; v1 submitted 28 April, 2015; originally announced April 2015.

    Comments: 5 pages, 3 figures

  21. arXiv:1502.02253  [pdf

    cs.IT

    Data Bits in Karnaugh Map and Increasing Map Capability in Error Correcting

    Authors: Pouya Pezeshkpour, Mahmoud Tabandeh

    Abstract: To provide reliable communication in data transmission, ability of correcting errors is of prime importance. This paper intends to suggest an easy algorithm to detect and correct errors in transmission codes using the well-known Karnaugh map. Referring to past research done and proving new theorems and also using a suggested simple technique taking advantage of the easy concept of Karnaugh map, we… ▽ More

    Submitted 30 September, 2018; v1 submitted 8 February, 2015; originally announced February 2015.

    Comments: 8 pages, 4 figures, 1 table