Showing 1–8 of 8 results for author: Kandogan, E

  1. arXiv:2406.00584  [pdf, other]

    cs.DB cs.AI

    A Blueprint Architecture of Compound AI Systems for Enterprise

    Authors: Eser Kandogan, Sajjadur Rahman, Nikita Bhutani, Dan Zhang, Rafael Li Chen, Kushan Mitra, Sairam Gurajada, Pouya Pezeshkpour, Hayate Iso, Yanlin Feng, Hannah Kim, Chen Shen, Jin Wang, Estevam Hruschka

    Abstract: Large Language Models (LLMs) have showcased remarkable capabilities surpassing conventional NLP challenges, creating opportunities for use in production use cases. Towards this goal, there is a notable shift to building compound AI systems, wherein LLMs are integrated into an expansive software infrastructure with many components like models, retrievers, databases and tools. In this paper, we intr…

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: Compound AI Systems Workshop at the Data+AI Summit 2024

  2. CMDBench: A Benchmark for Coarse-to-fine Multimodal Data Discovery in Compound AI Systems

    Authors: Yanlin Feng, Sajjadur Rahman, Aaron Feng, Vincent Chen, Eser Kandogan

    Abstract: Compound AI systems (CASs) that employ LLMs as agents to accomplish knowledge-intensive tasks via interactions with tools and data retrievers have garnered significant interest within database and AI communities. While these systems have the potential to supplement typical analysis workflows of data analysts in enterprise data platforms, unfortunately, CASs are subject to the same data discovery c…

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: Governance, Understanding and Integration of Data for Effective and Responsible AI (GUIDE-AI '24), June 14, 2024, Santiago, AA, Chile

  3. arXiv:2402.01108  [pdf, other]

    cs.CL cs.LG

    Reasoning Capacity in Multi-Agent Systems: Limitations, Challenges and Human-Centered Solutions

    Authors: Pouya Pezeshkpour, Eser Kandogan, Nikita Bhutani, Sajjadur Rahman, Tom Mitchell, Estevam Hruschka

    Abstract: Remarkable performance of large language models (LLMs) in a variety of tasks brings forth many opportunities as well as challenges of utilizing them in production settings. Towards practical adoption of LLMs, multi-agent systems hold great promise to augment, integrate, and orchestrate LLMs in the larger context of enterprise platforms that use existing proprietary data and models to tackle comple…

    Submitted 1 February, 2024; originally announced February 2024.

  4. arXiv:2301.03656  [pdf, other]

    cs.DB cs.AI cs.HC

    Towards Multifaceted Human-Centered AI

    Authors: Sajjadur Rahman, Hannah Kim, Dan Zhang, Estevam Hruschka, Eser Kandogan

    Abstract: Human-centered AI workflows involve stakeholders with multiple roles interacting with each other and automated agents to accomplish diverse tasks. In this paper, we call for a holistic view when designing support mechanisms, such as interaction paradigms, interfaces, and systems, for these multifaceted workflows.

    Submitted 9 January, 2023; originally announced January 2023.

    Comments: Workshop on Human-Centered AI at NeurIPS 2022

  5. arXiv:2301.03095  [pdf, other]

    cs.HC cs.CL

    MEGAnno: Exploratory Labeling for NLP in Computational Notebooks

    Authors: Dan Zhang, Hannah Kim, Rafael Li Chen, Eser Kandogan, Estevam Hruschka

    Abstract: We present MEGAnno, a novel exploratory annotation framework designed for NLP researchers and practitioners. Unlike existing labeling tools that focus on data labeling only, our framework aims to support a broader, iterative ML workflow including data exploration and model development. With MEGAnno's API, users can programmatically explore the data through sophisticated search and automated sugges…

    Submitted 8 January, 2023; originally announced January 2023.

    Comments: Data Science with Human-in-the-loop (DaSH) @ EMNLP 2022. Demo: https://meganno.github.io

  6. arXiv:2206.04853  [pdf, other]

    cs.DB

    Machop: an End-to-End Generalized Entity Matching Framework

    Authors: Jin Wang, Yuliang Li, Wataru Hirota, Eser Kandogan

    Abstract: Real-world applications frequently seek to solve a general form of the Entity Matching (EM) problem to find associated entities. Such scenarios include matching jobs to candidates in job targeting, matching students with courses in online education, matching products with user reviews on e-commercial websites, and beyond. These tasks impose new requirements such as matching data entries with diver…

    Submitted 9 June, 2022; originally announced June 2022.

    Comments: aiDM 2022

  7. arXiv:1911.02095  [pdf, other]

    q-bio.QM cs.DB

    IBM Functional Genomics Platform, A Cloud-Based Platform for Studying Microbial Life at Scale

    Authors: Edward E. Seabolt, Gowri Nayar, Harsha Krishnareddy, Akshay Agarwal, Kristen L. Beck, Ignacio Terrizzano, Eser Kandogan, Mary Roth, Vandana Mukherjee, James H. Kaufman

    Abstract: The rapid growth in biological sequence data is revolutionizing our understanding of genotypic diversity and challenging conventional approaches to informatics. With the increasing availability of genomic data, traditional bioinformatic tools require substantial computational time and the creation of ever-larger indices each time a researcher seeks to gain insight from the data. To address these c…

    Submitted 30 March, 2020; v1 submitted 5 November, 2019; originally announced November 2019.

  8. arXiv:1907.11184  [pdf, other]

    cs.CL cs.AI cs.HC cs.LG cs.LO

    HEIDL: Learning Linguistic Expressions with Deep Learning and Human-in-the-Loop

    Authors: Yiwei Yang, Eser Kandogan, Yunyao Li, Walter S. Lasecki, Prithviraj Sen

    Abstract: While the role of humans is increasingly recognized in machine learning community, representation of and interaction with models in current human-in-the-loop machine learning (HITL-ML) approaches are too low-level and far-removed from human's conceptual models. We demonstrate HEIDL, a prototype HITL-ML system that exposes the machine-learned model through high-level, explainable linguistic express…

    Submitted 25 July, 2019; originally announced July 2019.