Sailesh Krishnamurthy
University of California, Berkeley
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Sailesh Krishnamurthy.
international conference on management of data | 2003
Sirish Chandrasekaran; Owen Cooper; Amol Deshpande; Michael J. Franklin; Joseph M. Hellerstein; Wei Hong; Sailesh Krishnamurthy; Samuel Madden; Frederick Reiss; Mehul A. Shah
At Berkeley, we are developing TelegraphCQ [1, 2], a dataflow system for processing continuous queries over data streams. TelegraphCQ is based on a novel, highly-adaptive architecture supporting dynamic query workloads in volatile data streaming environments. In this demonstration we show our current version of TelegraphCQ, which we implemented by leveraging the code base of the open source PostgreSQL database system. Although TelegraphCQ differs significantly from a traditional database system, we found that a significant portion of the PostgreSQL code was easily reusable. We also found the extensibility features of PostgreSQL very useful, particularly its rich data types and the ability to load user-developed functions. Challenges: As discussed in [1], sharing and adaptivity are our main techniques for implementing a continuous query system. Doing this in the codebase of a conventional database posed a number of challenges:
international conference on management of data | 2006
Sailesh Krishnamurthy; Chung Wu; Michael J. Franklin
Data streaming systems are becoming essential for monitoring applications such as financial analysis and network intrusion detection. These systems often have to process many similar but different queries over common data. Since executing each query separately can lead to significant scalability and performance problems, it is vital to share resources by exploiting similarities in the queries. In this paper we present ways to efficiently share streaming aggregate queries with differing periodic windows and arbitrary selection predicates. A major contribution is our sharing technique that does not require any up-front multiple query optimization. This is a significant departure from existing techniques that rely on complex static analyses of fixed query workloads. Our approach is particularly vital in streaming systems where queries can join and leave the system at any point. We present a detailed performance study that evaluates our strategies with an implementation and real data. In these experiments, our approach gives us as much as an order of magnitude performance improvement over the state of the art.
international conference on management of data | 2010
Sailesh Krishnamurthy; Michael J. Franklin; Jeffrey Davis; Daniel Robert Farina; Pasha Golovko; Alan Li; Neil Thombre
Continuous analytics systems that enable query processing over steams of data have emerged as key solutions for dealing with massive data volumes and demands for low latency. These systems have been heavily influenced by an assumption that data streams can be viewed as sequences of data that arrived more or less in order. The reality, however, is that streams are not often so well behaved and disruptions of various sorts are endemic. We argue, therefore, that stream processing needs a fundamental rethink and advocate a unified approach toward continuous analytics over discontinuous streaming data. Our approach is based on a simple insight - using techniques inspired by data parallel query processing, queries can be performed over independent sub-streams with arbitrary time ranges in parallel, generating partial results. The consolidation of the partial results over each sub-stream can then be deferred to the time at which the results are actually used on an on-demand basis. In this paper, we describe how the Truviso Continuous Analytics system implements this type of order-independent processing. Not only does the approach provide the first real solution to the problem of processing streaming data that arrives arbitrarily late, it also serves as a critical building block for solutions to a host of hard problems such as parallelism, recovery, transactional consistency, high availability, failover, and replication.
international conference on management of data | 2005
Shariq Rizvi; Shawn R. Jeffery; Sailesh Krishnamurthy; Michael J. Franklin; Nathan Burkhart; Anil Edakkunni; Linus Liang
The emergence of large-scale receptor-based systems has enabled applications to execute complex business logic over data generated from monitoring the physical world. An important functionality required by these applications is the detection and response to complex events, often in real-time. Bridging the gap between low-level receptor technology and such high-level needs of applications remains a significant challenge.We demonstrate our solution to this problem in the context of HiFi, a system we are building to solve the data management problems of large-scale receptor-based systems. Specifically, we show how HiFi generates simple events out of receptor data at its edges and provides high-functionality complex event processing mechanisms for sophisticated event detection using a real-world library scenario.
international conference on management of data | 2003
Christof Bornhövd; Mehmet Altinel; Sailesh Krishnamurthy; C. Mohan; Hamid Pirahesh; Berthold Reinwald
Multi-tier infrastructures have become common practice for implementing high volume web sites. Such infrastructures typically contain TCP load balancers, HTTP servers, application servers, transaction-processing monitors, and databases. Caching has been widely used at different layers of the infrastructure stack to improve scalability and response time of e-business applications. The majority of existing caching mechanisms target only static HTML pages or page fragments. However, as web applications become more dynamic through increased personalization, these caching techniques turn out to be less useful. Consequently, as more application requests result in increased querying and updating of backend database servers, scalability limits are often reached.
international conference on management of data | 2002
Mehmet Altinel; Qiong Luo; Sailesh Krishnamurthy; C. Mohan; Hamid Pirahesh; Bruce G. Lindsay; Honguk Woo; Larry Brown
Many e-Business applications today are being developed and deployed on multi-tier environments involving browser-based clients, web application servers and backend databases. The dynamic nature of these applications necessitates generating web pages on-demand, making middle-tier database caching an effective approach to achieve high scalability and performance [3]. In the DBCache project, we are incorporating a database cache feature in DB2 UDB by modifying the engine code and leveraging existing federated database functionality. This allows us to take advantage of DB2s sophisticated distributed query processing power for database caching. As a result, the user queries can be executed at either the local database cache or the remote backend server, or more importantly, the query can be partitioned and then distributed to both databases for cost optimum execution.DBCache also includes a cache initialization component that takes a backend database schema and SQL queries in the workload, and generates a middle-tier database schema for the cache. We have implemented an initial prototype of the system that supports table level caching. As DB2s functionality is extended, we will be able to support subtable level caching, XML data caching and caching of execution results of web services.
very large data bases | 2004
Owen Cooper; Anil Edakkunni; Michael J. Franklin; Wei Hong; Shawn R. Jeffery; Sailesh Krishnamurthy; Fredrick Reiss; Shariq Rizvi; Eugene Wu
Advances in data acquisition and sensor technologies are leading towards the development of “High Fan-in” architectures: widely distributed systems whose edges consist of numerous receptors such as sensor networks and RFID readers and whose interior nodes consist of traditional host computers organized using the principle of successive aggregation. Such architectures pose significant new data management challenges. The HiFi system, under development at UC Berkeley, is aimed at addressing these challenges. We demonstrate an initial prototype of HiFi that uses data stream query processing to acquire, filter, and aggregate data from multiple devices including sensor motes, RFID readers, and low power gateways organized as a High Fan-in system.
conference on innovative data systems research | 2003
Sirish Chandrasekaran; Owen Cooper; Amol Deshpande; Michael J. Franklin; Joseph M. Hellerstein; Wei Hong; Sailesh Krishnamurthy; Samuel Madden; Vijayshankar Raman; Frederick Reiss; Mehul A. Shah
international conference on management of data | 2002
Qiong Luo; Sailesh Krishnamurthy; C. Mohan; Hamid Pirahesh; Honguk Woo; Bruce G. Lindsay; Jeffrey F. Naughton
conference on innovative data systems research | 2005
Michael J. Franklin; Shawn R. Jeffery; Sailesh Krishnamurthy; Frederick Reiss; Shariq Rizvi; Eugene Wu; Owen Cooper; Anil Edakkunni; Wei Hong