Skip to main content

Showing 1–35 of 35 results for author: Nandi, A

  1. Simple Augmentations of Logical Rules for Neuro-Symbolic Knowledge Graph Completion

    Authors: Ananjan Nandi, Navdeep Kaur, Parag Singla, Mausam

    Abstract: High-quality and high-coverage rule sets are imperative to the success of Neuro-Symbolic Knowledge Graph Completion (NS-KGC) models, because they form the basis of all symbolic inferences. Recent literature builds neural models for generating rule sets, however, preliminary experiments show that they struggle with maintaining high coverage. In this work, we suggest three simple augmentations to ex… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 12 pages, 15 tables Published in ACL 2023

  2. arXiv:2407.00870  [pdf, other

    cs.CL cs.HC

    Roleplay-doh: Enabling Domain-Experts to Create LLM-simulated Patients via Eliciting and Adhering to Principles

    Authors: Ryan Louie, Ananjan Nandi, William Fang, Cheng Chang, Emma Brunskill, Diyi Yang

    Abstract: Recent works leverage LLMs to roleplay realistic social scenarios, aiding novices in practicing their social skills. However, simulating sensitive interactions, such as in mental health, is challenging. Privacy concerns restrict data access, and collecting expert feedback, although vital, is laborious. To address this, we develop Roleplay-doh, a novel human-LLM collaboration pipeline that elicits… ▽ More

    Submitted 14 July, 2024; v1 submitted 30 June, 2024; originally announced July 2024.

    Comments: 34 pages, 24 figures, 11 Tables

  3. arXiv:2404.00488  [pdf

    cs.CL cs.AI cs.LG

    Noise-Aware Training of Layout-Aware Language Models

    Authors: Ritesh Sarkhel, Xiaoqi Ren, Lauro Beltrao Costa, Guolong Su, Vincent Perot, Yanan Xie, Emmanouil Koukoumidis, Arnab Nandi

    Abstract: A visually rich document (VRD) utilizes visual features along with linguistic cues to disseminate information. Training a custom extractor that identifies named entities from a document requires a large number of instances of the target document type annotated at textual and visual modalities. This is an expensive bottleneck in enterprise scenarios, where we want to train custom extractors for tho… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

  4. arXiv:2402.04959  [pdf, other

    cs.IT eess.SP

    Margin Propagation based XOR-SAT Solvers for Decoding of LDPC Codes

    Authors: Ankita Nandi, Shantanu Chakrabartty, Chetan Singh Thakur

    Abstract: Decoding of Low-Density Parity Check (LDPC) codes can be viewed as a special case of XOR-SAT problems, for which low-computational complexity bit-flipping algorithms have been proposed in the literature. However, a performance gap exists between the bit-flipping LDPC decoding algorithms and the benchmark LDPC decoding algorithms, such as the Sum-Product Algorithm (SPA). In this paper, we propose a… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: 12 pages, 7 figures, Paper submitted to IEEE Transactions on Communications

  5. arXiv:2311.03780  [pdf, other

    cs.CL cs.AI cs.LG

    DynaSemble: Dynamic Ensembling of Textual and Structure-Based Models for Knowledge Graph Completion

    Authors: Ananjan Nandi, Navdeep Kaur, Parag Singla, Mausam

    Abstract: We consider two popular approaches to Knowledge Graph Completion (KGC): textual models that rely on textual entity descriptions, and structure-based models that exploit the connectivity structure of the Knowledge Graph (KG). Preliminary experiments show that these approaches have complementary strengths: structure-based models perform exceptionally well when the gold answer is easily reachable fro… ▽ More

    Submitted 2 July, 2024; v1 submitted 7 November, 2023; originally announced November 2023.

    Comments: 12 pages, 2 figures, 15 tables Accepted to ACL 2024

    ACM Class: I.2.7

  6. arXiv:2306.04086  [pdf, other

    eess.IV cs.CV

    TEC-Net: Vision Transformer Embrace Convolutional Neural Networks for Medical Image Segmentation

    Authors: Rui Sun, Tao Lei, Weichuan Zhang, Yong Wan, Yong Xia, Asoke K. Nandi

    Abstract: The hybrid architecture of convolution neural networks (CNN) and Transformer has been the most popular method for medical image segmentation. However, the existing networks based on the hybrid architecture suffer from two problems. First, although the CNN branch can capture image local features by using convolution operation, the vanilla convolution is unable to achieve adaptive extraction of imag… ▽ More

    Submitted 19 December, 2023; v1 submitted 6 June, 2023; originally announced June 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2306.03373

  7. CiT-Net: Convolutional Neural Networks Hand in Hand with Vision Transformers for Medical Image Segmentation

    Authors: Tao Lei, Rui Sun, Xuan Wang, Yingbo Wang, Xi He, Asoke Nandi

    Abstract: The hybrid architecture of convolutional neural networks (CNNs) and Transformer are very popular for medical image segmentation. However, it suffers from two challenges. First, although a CNNs branch can capture the local image features using vanilla convolution, it cannot achieve adaptive feature learning. Second, although a Transformer branch can capture the global features, it ignores the chann… ▽ More

    Submitted 19 December, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: 9 pages, 3 figures, 3 tables

    Journal ref: The 32nd International Joint Conference on Artificial Intelligence, IJCAI2023, MACAO

  8. arXiv:2306.01988  [pdf, other

    cs.CV

    Lightweight Structure-aware Transformer Network for VHR Remote Sensing Image Change Detection

    Authors: Tao Lei, Yetong Xu, Hailong Ning, Zhiyong Lv, Chongdan Min, Yaochu Jin, Asoke K. Nandi

    Abstract: Popular Transformer networks have been successfully applied to remote sensing (RS) image change detection (CD) identifications and achieve better results than most convolutional neural networks (CNNs), but they still suffer from two main problems. First, the computational complexity of the Transformer grows quadratically with the increase of image spatial resolution, which is unfavorable to very h… ▽ More

    Submitted 2 June, 2023; originally announced June 2023.

  9. arXiv:2303.00720  [pdf, ps, other

    cs.LG cs.DB cs.IR

    Cross-Modal Entity Matching for Visually Rich Documents

    Authors: Ritesh Sarkhel, Arnab Nandi

    Abstract: Visually rich documents (e.g. leaflets, banners, magazine articles) are physical or digital documents that utilize visual cues to augment their semantics. Information contained in these documents are ad-hoc and often incomplete. Existing works that enable structured querying on these documents do not take this into account. This makes it difficult to contextualize the information retrieved from qu… ▽ More

    Submitted 30 March, 2024; v1 submitted 1 March, 2023; originally announced March 2023.

  10. arXiv:2302.14269  [pdf, ps, other

    cs.HC

    Measuring arousal and stress physiology on Esports, a League of Legends case study

    Authors: David Berga, Alexandre Pereda, Eleonora De Filippi, Arijit Nandi, Eulalia Febrer, Marta Reverte, Lautaro Russo

    Abstract: Esports gaming is an area in which videogame players need to cooperate and compete with each other, influencing their cognitive load, processing, stress, and social skills. Here it is unknown to which extent competitive videogame play using a desktop setting can affect the physiological responses of players' autonomic nervous system. For such, we propose a study where we have measured distinct ele… ▽ More

    Submitted 22 May, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

    Comments: 10 pages, 6 tables

  11. A Dynamic Weighted Federated Learning for Android Malware Classification

    Authors: Ayushi Chaudhuri, Arijit Nandi, Buddhadeb Pradhan

    Abstract: Android malware attacks are increasing daily at a tremendous volume, making Android users more vulnerable to cyber-attacks. Researchers have developed many machine learning (ML)/ deep learning (DL) techniques to detect and mitigate android malware attacks. However, due to technological advancement, there is a rise in android mobile devices. Furthermore, the devices are geographically dispersed, re… ▽ More

    Submitted 23 November, 2022; originally announced November 2022.

    Comments: Accepted in SoCTA 2022

    Report number: Lecture Notes in Networks and Systems book series (LNNS,volume 627)-978-981-19-9857-7

    Journal ref: 25 April 2023

  12. arXiv:2207.06410  [pdf, other

    cs.HC cs.AI cs.LG

    MDEAW: A Multimodal Dataset for Emotion Analysis through EDA and PPG signals from wireless wearable low-cost off-the-shelf Devices

    Authors: Arijit Nandi, Fatos Xhafa, Laia Subirats, Santi Fort

    Abstract: We present MDEAW, a multimodal database consisting of Electrodermal Activity (EDA) and Photoplethysmography (PPG) signals recorded during the exams for the course taught by the teacher at Eurecat Academy, Sabadell, Barcelona in order to elicit the emotional reactions to the students in a classroom scenario. Signals from 10 students were recorded along with the students' self-assessment of their af… ▽ More

    Submitted 14 July, 2022; originally announced July 2022.

  13. arXiv:2205.05664  [pdf, other

    cs.AR cs.ET cs.LG eess.SP eess.SY

    Process, Bias and Temperature Scalable CMOS Analog Computing Circuits for Machine Learning

    Authors: Pratik Kumar, Ankita Nandi, Shantanu Chakrabartty, Chetan Singh Thakur

    Abstract: Analog computing is attractive compared to digital computing due to its potential for achieving higher computational density and higher energy efficiency. However, unlike digital circuits, conventional analog computing circuits cannot be easily mapped across different process nodes due to differences in transistor biasing regimes, temperature variations and limited dynamic range. In this work, we… ▽ More

    Submitted 4 January, 2023; v1 submitted 11 May, 2022; originally announced May 2022.

    Comments: 14 Pages, 15 Figures, 5 Tables. This work has been accepted in IEEE for publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  14. arXiv:2202.05022  [pdf, other

    cs.ET cs.AI cs.AR cs.LG eess.SY

    Bias-Scalable Near-Memory CMOS Analog Processor for Machine Learning

    Authors: Pratik Kumar, Ankita Nandi, Shantanu Chakrabartty, Chetan Singh Thakur

    Abstract: Bias-scalable analog computing is attractive for implementing machine learning (ML) processors with distinct power-performance specifications. For instance, ML implementations for server workloads are focused on higher computational throughput for faster training, whereas ML implementations for edge devices are focused on energy-efficient inference. In this paper, we demonstrate the implementation… ▽ More

    Submitted 4 January, 2023; v1 submitted 10 February, 2022; originally announced February 2022.

    Comments: 11 pages, 11 figures, 2 Tables

  15. arXiv:2103.01278  [pdf, ps, other

    cs.LG math.OC stat.ML

    Non-Euclidean Differentially Private Stochastic Convex Optimization: Optimal Rates in Linear Time

    Authors: Raef Bassily, Cristóbal Guzmán, Anupama Nandi

    Abstract: Differentially private (DP) stochastic convex optimization (SCO) is a fundamental problem, where the goal is to approximately minimize the population risk with respect to a convex loss function, given a dataset of $n$ i.i.d. samples from a distribution, while satisfying differential privacy with respect to the dataset. Most of the existing works in the literature of private convex optimization foc… ▽ More

    Submitted 4 May, 2022; v1 submitted 1 March, 2021; originally announced March 2021.

    Comments: This version contains several extensions to the conference paper that appeared at COLT 2021 (and to the earlier arXiv version: arXiv:2103.01278v1). This version contains new, linear-time constructions with optimal, high-probability risk guarantees

  16. arXiv:2009.13120  [pdf, other

    eess.IV cs.CV

    Medical Image Segmentation Using Deep Learning: A Survey

    Authors: Risheng Wang, Tao Lei, Ruixia Cui, Bingtao Zhang, Hongying Meng, Asoke K. Nandi

    Abstract: Deep learning has been widely used for medical image segmentation and a large number of papers has been presented recording the success of deep learning in the field. In this paper, we present a comprehensive thematic survey on medical image segmentation using deep learning techniques. This paper makes two original contributions. Firstly, compared to traditional surveys that directly divide litera… ▽ More

    Submitted 22 December, 2021; v1 submitted 28 September, 2020; originally announced September 2020.

  17. arXiv:2008.00331  [pdf, ps, other

    cs.LG stat.ML

    Learning from Mixtures of Private and Public Populations

    Authors: Raef Bassily, Shay Moran, Anupama Nandi

    Abstract: We initiate the study of a new model of supervised learning under privacy constraints. Imagine a medical study where a dataset is sampled from a population of both healthy and unhealthy individuals. Suppose healthy individuals have no privacy concerns (in such case, we call their data "public") while the unhealthy individuals desire stringent privacy protection for their data. In this example, the… ▽ More

    Submitted 1 August, 2020; originally announced August 2020.

  18. arXiv:2002.07845  [pdf, other

    cs.CL

    Interpretable Multi-Headed Attention for Abstractive Summarization at Controllable Lengths

    Authors: Ritesh Sarkhel, Moniba Keymanesh, Arnab Nandi, Srinivasan Parthasarathy

    Abstract: Abstractive summarization at controllable lengths is a challenging task in natural language processing. It is even more challenging for domains where limited training data is available or scenarios in which the length of the summary is not known beforehand. At the same time, when it comes to trusting machine-generated summaries, explaining how a summary was constructed in human-understandable term… ▽ More

    Submitted 27 November, 2020; v1 submitted 18 February, 2020; originally announced February 2020.

    Comments: 9 pages, 5 figures

    Journal ref: International Conference on Computational Linguistics (COLING) 2020

  19. arXiv:1907.13553  [pdf, ps, other

    cs.LG cs.CR stat.ML

    Privately Answering Classification Queries in the Agnostic PAC Model

    Authors: Anupama Nandi, Raef Bassily

    Abstract: We revisit the problem of differentially private release of classification queries. In this problem, the goal is to design an algorithm that can accurately answer a sequence of classification queries based on a private training set while ensuring differential privacy. We formally study this problem in the agnostic PAC model and derive a new upper bound on the private sample complexity. Our results… ▽ More

    Submitted 3 December, 2019; v1 submitted 31 July, 2019; originally announced July 2019.

    Comments: Made a a small tweak in the analysis to save a factor of $1/ε$

  20. arXiv:1905.04522  [pdf, other

    cs.LG cs.NE stat.ML

    Accuracy Improvement of Neural Network Training using Particle Swarm Optimization and its Stability Analysis for Classification

    Authors: Arijit Nandi, Nanda Dulal Jana

    Abstract: Supervised classification is the most active and emerging research trends in today's scenario. In this view, Artificial Neural Network (ANN) techniques have been widely employed and growing interest to the researchers day by day. ANN training aims to find the proper setting of parameters such as weights ($\textbf{W}$) and biases ($b$) to properly classify the given data samples. The training proce… ▽ More

    Submitted 15 May, 2019; v1 submitted 11 May, 2019; originally announced May 2019.

  21. Adaptive Morphological Reconstruction for Seeded Image Segmentation

    Authors: Tao Lei, Xiaohong Jia, Tongliang Liu, Shigang Liu, Hongying Meng, Asoke K. Nandi

    Abstract: Morphological reconstruction (MR) is often employed by seeded image segmentation algorithms such as watershed transform and power watershed as it is able to filter seeds (regional minima) to reduce over-segmentation. However, MR might mistakenly filter meaningful seeds that are required for generating accurate segmentation and it is also sensitive to the scale because a single-scale structuring el… ▽ More

    Submitted 8 April, 2019; originally announced April 2019.

  22. Short and Long-term Pattern Discovery Over Large-Scale Geo-Spatiotemporal Data

    Authors: Sobhan Moosavi, Mohammad Hossein Samavatian, Arnab Nandi, Srinivasan Parthasarathy, Rajiv Ramnath

    Abstract: Pattern discovery in geo-spatiotemporal data (such as traffic and weather data) is about finding patterns of collocation, co-occurrence, cascading, or cause and effect between geospatial entities. Using simplistic definitions of spatiotemporal neighborhood (a common characteristic of the existing general-purpose frameworks) is not semantically representative of geo-spatiotemporal data. We therefor… ▽ More

    Submitted 17 May, 2019; v1 submitted 13 February, 2019; originally announced February 2019.

    Comments: In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

  23. arXiv:1807.11149  [pdf, ps, other

    cs.DB

    To Ship or Not to (Function) Ship (Extended version)

    Authors: Feilong Liu, Niranjan Kamat, Spyros Blanas, Arnab Nandi

    Abstract: Sampling is often used to reduce query latency for interactive big data analytics. The established parallel data processing paradigm relies on function shipping, where a coordinator dispatches queries to worker nodes and then collects the results. The commoditization of high-performance networking makes data shipping possible, where the coordinator directly reads data in the workers' memory using… ▽ More

    Submitted 29 July, 2018; originally announced July 2018.

    Comments: 4 pages, 3 figures

  24. arXiv:1804.09477  [pdf

    physics.soc-ph cs.SI

    Bribery Games on Interdependent Complex Networks

    Authors: Prateek Verma, Anjan K. Nandi, Supratim Sengupta

    Abstract: Bribe demands present a social conflict scenario where decisions have wide-ranging economic and ethical consequences. Nevertheless, such incidents occur daily in many countries across the globe. Harassment bribery constitute a significant sub-set of such bribery incidents where a government official demands a bribe for providing a service to a citizen legally entitled to it. We employ an evolution… ▽ More

    Submitted 25 April, 2018; originally announced April 2018.

    Comments: 24 pages, 7 figures, 5 supplementary figures; version to appear in Journal of Theoretical Biology

  25. arXiv:1804.08748  [pdf, other

    cs.AI

    Discovery of Driving Patterns by Trajectory Segmentation

    Authors: Sobhan Moosavi, Arnab Nandi, Rajiv Ramnath

    Abstract: Telematics data is becoming increasingly available due to the ubiquity of devices that collect data during drives, for different purposes, such as usage based insurance (UBI), fleet management, navigation of connected vehicles, etc. Consequently, a variety of data-analytic applications have become feasible that extract valuable insights from the data. In this paper, we address the especially chall… ▽ More

    Submitted 3 April, 2020; v1 submitted 23 April, 2018; originally announced April 2018.

    Comments: Accepted in the 3rd PhD workshop, ACM SIGSPATIAL 2016

  26. Characterizing Driving Context from Driver Behavior

    Authors: Sobhan Moosavi, Behrooz Omidvar-Tehrani, R. Bruce Craig, Arnab Nandi, Rajiv Ramnath

    Abstract: Because of the increasing availability of spatiotemporal data, a variety of data-analytic applications have become possible. Characterizing driving context, where context may be thought of as a combination of location and time, is a new challenging application. An example of such a characterization is finding the correlation between driving behavior and traffic conditions. This contextual informat… ▽ More

    Submitted 17 November, 2017; v1 submitted 13 October, 2017; originally announced October 2017.

    Comments: Accepted to be published at The 25th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (ACM SIGSPATIAL 2017)

  27. arXiv:1710.01854  [pdf, other

    cs.DB

    InfiniViz: Interactive Visual Exploration using Progressive Bin Refinement

    Authors: Niranjan Kamat, Arnab Nandi

    Abstract: Interactive visualizations can accelerate the data analysis loop through near-instantaneous feedback. To achieve interactivity, techniques such as data cubes and sampling are typically employed. While data cubes can speedup querying for moderate-sized datasets, they are ineffective at doing so at a larger scales due to the size of the materialized data cubes. On the other hand, while sampling can… ▽ More

    Submitted 4 October, 2017; originally announced October 2017.

  28. arXiv:1702.06976  [pdf, other

    cs.LG stat.ML

    Heavy-Tailed Analogues of the Covariance Matrix for ICA

    Authors: Joseph Anderson, Navin Goyal, Anupama Nandi, Luis Rademacher

    Abstract: Independent Component Analysis (ICA) is the problem of learning a square matrix $A$, given samples of $X=AS$, where $S$ is a random vector with independent coordinates. Most existing algorithms are provably efficient only when each $S_i$ has finite and moderately valued fourth moment. However, there are practical applications where this assumption need not be true, such as speech and finance. Algo… ▽ More

    Submitted 22 February, 2017; originally announced February 2017.

    Comments: 16 Pages, 9 Figures, AAAI 2017

  29. arXiv:1604.00080  [pdf, other

    cs.HC

    Graphical Perception in Animated Bar Charts

    Authors: Eugene Wu, Lilong Jiang, Larry Xu, Arnab Nandi

    Abstract: Interactive visual applications create animations that encode changes in the data. For example, cross-filtering dynamically updates linked visualizations based on the user's continuous brushing actions. The animated effects resulting from these interactions depends both on how interaction (e.g., brushing speed) controls properties of the animation such as frame rate, as well as how the data that i… ▽ More

    Submitted 31 March, 2016; originally announced April 2016.

    Comments: 10 pages

    ACM Class: H.5.2; I.2.10; I.3.6

  30. arXiv:1601.05118  [pdf, other

    cs.DB

    Perfect and Maximum Randomness in Stratified Sampling over Joins

    Authors: Niranjan Kamat, Arnab Nandi

    Abstract: Supporting sampling in the presence of joins is an important problem in data analysis, but is inherently challenging due to the need to avoid correlation between output tuples. Current solutions provide either correlated or non-correlated samples. Sampling might not always be feasible in the non-correlated sampling-based approaches -- the sample size or intermediate data size might be exceedingly… ▽ More

    Submitted 14 February, 2017; v1 submitted 19 January, 2016; originally announced January 2016.

  31. arXiv:1601.00073  [pdf, other

    cs.DB cs.PL

    Mimir: Bringing CTables into Practice

    Authors: Arindam Nandi, Ying Yang, Oliver Kennedy, Boris Glavic, Ronny Fehling, Zhen Hua Liu, Dieter Gawlick

    Abstract: The present state of the art in analytics requires high upfront investment of human effort and computational resources to curate datasets, even before the first query is posed. So-called pay-as-you-go data curation techniques allow these high costs to be spread out, first by enabling queries over uncertain and incomplete data, and then by assessing the quality of the query results. We describe the… ▽ More

    Submitted 1 January, 2016; originally announced January 2016.

    Comments: Under submission; The first two authors should be considered a joint first-author

  32. arXiv:1509.04349  [pdf, other

    cs.DB

    A Closer Look at Variance Implementations in Modern Database Systems

    Authors: Niranjan Kamat, Arnab Nandi

    Abstract: Variance is a popular and often necessary component of sampled aggregation queries. It is typically used as a secondary measure to ascertain statistical properties of the result such as its error. Yet, it is more expensive to compute than simple, primary measures such as \texttt{SUM}, \texttt{MEAN}, and \texttt{COUNT}. There exist numerous techniques to compute variance. While the definition of… ▽ More

    Submitted 23 December, 2016; v1 submitted 14 September, 2015; originally announced September 2015.

  33. arXiv:1509.00727  [pdf, ps, other

    cs.LG math.ST stat.CO stat.ML

    Heavy-tailed Independent Component Analysis

    Authors: Joseph Anderson, Navin Goyal, Anupama Nandi, Luis Rademacher

    Abstract: Independent component analysis (ICA) is the problem of efficiently recovering a matrix $A \in \mathbb{R}^{n\times n}$ from i.i.d. observations of $X=AS$ where $S \in \mathbb{R}^n$ is a random vector with mutually independent coordinates. This problem has been intensively studied, but all existing efficient algorithms with provable guarantees require that the coordinates $S_i$ have finite fourth mo… ▽ More

    Submitted 2 September, 2015; originally announced September 2015.

    Comments: 30 pages

  34. arXiv:1306.1350  [pdf, other

    cs.CE cs.LG stat.ML

    Diffusion map for clustering fMRI spatial maps extracted by independent component analysis

    Authors: Tuomo Sipola, Fengyu Cong, Tapani Ristaniemi, Vinoo Alluri, Petri Toiviainen, Elvira Brattico, Asoke K. Nandi

    Abstract: Functional magnetic resonance imaging (fMRI) produces data about activity inside the brain, from which spatial maps can be extracted by independent component analysis (ICA). In datasets, there are n spatial maps that contain p voxels. The number of voxels is very high compared to the number of analyzed spatial maps. Clustering of the spatial maps is usually based on correlation matrices. This usua… ▽ More

    Submitted 27 September, 2013; v1 submitted 6 June, 2013; originally announced June 2013.

    Comments: 6 pages. 8 figures. Copyright (c) 2013 IEEE. Published at 2013 IEEE International Workshop on Machine Learning for Signal Processing

  35. arXiv:0909.1765  [pdf

    cs.DB cs.IR

    Qunits: queried units in database search

    Authors: Arnab Nandi, H V Jagadish

    Abstract: Keyword search against structured databases has become a popular topic of investigation, since many users find structured queries too hard to express, and enjoy the freedom of a ``Google-like'' query box into which search terms can be entered. Attempts to address this problem face a fundamental dilemma. Database querying is based on the logic of predicate evaluation, with a precisely defined ans… ▽ More

    Submitted 9 September, 2009; originally announced September 2009.

    Comments: CIDR 2009