Documentation Index Fetch the complete documentation index at: https://firecrawl-mog-search-exclude-include-domains.mintlify.app/llms.txt
Use this file to discover all available pages before exploring further.
Gradle (Kotlin DSL)
Maven
dependencies {
implementation ( "com.firecrawl:firecrawl-java:1.2.0" )
}
< dependency >
< groupId > com.firecrawl </ groupId >
< artifactId > firecrawl-java </ artifactId >
< version > 1.2.0 </ version >
</ dependency >
import com.firecrawl.client.FirecrawlClient;
import com.firecrawl.models.SearchData;
import com.firecrawl.models.SearchOptions;
public class Main {
public static void main ( String [] args ) {
FirecrawlClient client = FirecrawlClient . builder ()
. apiKey ( "fc-YOUR-API-KEY" )
. build ();
SearchData results = client . search (
"firecrawl web scraping" ,
SearchOptions . builder (). limit ( 5 ). build ()
);
if ( results . getWeb () != null ) {
for ( var result : results . getWeb ()) {
System . out . println ( result . get ( "title" ) + " - " + result . get ( "url" ));
}
}
}
}
Hacer scraping de una página
import com.firecrawl.models.Document;
Document doc = client . scrape ( "https://example.com" );
System . out . println ( doc . getMarkdown ());
{
"markdown" : "# Example Domain \n\n This domain is for use in illustrative examples..." ,
"metadata" : {
"title" : "Example Domain" ,
"sourceURL" : "https://example.com"
}
}
Interactuar con una página
Abre una sesión de navegador, ejecuta código de Playwright en esa sesión y ciérrala al terminar:
import com.firecrawl.models.ScrapeOptions;
import com.firecrawl.models.BrowserExecuteResponse;
import java.util.List;
Document doc = client . scrape ( "https://www.amazon.com" ,
ScrapeOptions . builder (). formats ( List . of ((Object) "markdown" )). build ());
String scrapeId = (String) doc . getMetadata (). get ( "scrapeId" );
BrowserExecuteResponse run = client . interact (scrapeId,
"const title = await page.title(); console.log(title);" );
System . out . println ( run . getStdout ());
client . stopInteractiveBrowser (scrapeId);
En lugar de pasar apiKey directamente, establece la variable de entorno FIRECRAWL_API_KEY:
export FIRECRAWL_API_KEY = fc-YOUR-API-KEY
FirecrawlClient client = FirecrawlClient . fromEnv ();
Documentación de Search Buscar en la web y obtener el contenido completo de la página
Documentación de scraping Todas las opciones de scraping, incluidos formatos, acciones y proxies
Documentación de Interact Haz clic, completa formularios y extrae contenido dinámico
Referencia del SDK para Java Referencia completa del SDK con crawl, map, extracción por lotes y más